Skip to content

Invalid character in XML #3

@pnorman

Description

@pnorman

http://planet.osm.org/replication/changesets/001/659/607.osm.gz contains an invalid character value

$ curl -s http://planet.osm.org/replication/changesets/001/659/607.osm.gz | zcat | xmllint -
-:11: parser error : PCDATA invalid Char value 1
“beacon_special_purpose”, etc. Then you may specify its colour, shape, etc.
                                                                               ^

Viewing it with less shows line 11 is

        <text>The objects “light_minor” and “light_major” are simply lights without any details of the supporting structure. If you wish to specify the supporting structure, then you should use a seamark:type such as “beacon_special_purpose”, etc. Then you may specify its colour, shape, etc. ^AAlso, to specify a colour_pattern, there should be at least two colours.</text>

The ^A is highlighted as a control code.

The relevant part of the hexdump is

00000120  6f 75 72 2c 20 73 68 61  70 65 2c 20 65 74 63 2e  |our, shape, etc.|
00000130  20 01 41 6c 73 6f 2c 20  74 6f 20 73 70 65 63 69  | .Also, to speci|

which confirms there's a &x01; in the document not as an entity.

Cross-ref ToeBee/ChangesetMD#20

There is also probably a bug somewhere if this character got into the database

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions