Skip to content

XML output option creates invalid characters #56

@solfeggietto

Description

@solfeggietto
  1. Character encoding is not set
    Used: ?xml version="1.0" ?
    Should be changed to: ?xml version="1.0" encoding="UTF-8"?

  2. Examples of the Noark 5-validation report with invalid xml-characters

With E5 for å:

> <test id="AST13" name="Record status" description="Tests whether non-finalized records exist in the archive. A record is considered non-finalized if a finalized date or a finalizing party is not specified, or if the record status is not 'Arkivert' or 'Utgår'." summary="0" info="1" warn="1301" error="0"></test>

Norwegian å is saved as ascii 1 character hex E5

I believe this could be saved as the character entity:
& aring; | Latin small letter a with ring above

With & aring; for å

> <test id="AST13" name="Record status" description="Tests whether non-finalized records exist in the archive. A record is considered non-finalized if a finalized date or a finalizing party is not specified, or if the record status is not 'Arkivert' or 'Utg&aring;r'." summary="0" info="1" warn="1301" error="0"></test>

But better to use UTF-8 declaration at top and save the file as UTF-8
å | LATIN SMALL LETTER A WITH RING ABOVE (U+00E5) | c3a5

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions