The statement on line 2 of input.xml (https://github.com/BibWiss/notebooks/blob/b6d2a638fe86513027c8ad3f7ba21e32c1551d36/ner4xml/data/input.xml#L2) is lost when the file is parsed with `ElementTree`. One possible fix is to parse with the `lxml` library instead.
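
A minimal sketch of the problem and the possible fix. It assumes the statement in question is a processing instruction placed before the root element. `SAMPLE` and both function names are hypothetical stand-ins, not the contents of input.xml or code from the notebook. `lxml` is a third-party package and has to be installed separately.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for input.xml; the real line 2 may differ.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<?xml-model href="schema.rng" type="application/xml"?>
<root><p>text</p></root>
"""


def roundtrip_elementtree(data: bytes) -> bytes:
    """Parse and re-serialize with the standard library's ElementTree."""
    # Only the root element is kept; nodes before it are not part of the tree.
    root = ET.fromstring(data)
    return ET.tostring(root, encoding="utf-8")


def roundtrip_lxml(data: bytes) -> bytes:
    """Parse and re-serialize with lxml (third-party)."""
    from lxml import etree

    root = etree.fromstring(data)
    # Serialize the whole document tree rather than the root element alone,
    # so that nodes preceding the root element are written out as well.
    return etree.tostring(
        root.getroottree(), encoding="UTF-8", xml_declaration=True
    )


if __name__ == "__main__":
    print(roundtrip_elementtree(SAMPLE).decode("utf-8"))
    print(roundtrip_lxml(SAMPLE).decode("utf-8"))
```

With this sample, the `ElementTree` output no longer contains the `xml-model` line, while the `lxml` output still does. Whether this covers the actual statement in input.xml depends on what kind of node it is.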