-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
Description
Some URLs in the extracted TED dataset that are reused from the source XML data contain backslash characters (i.e. \), which are not allowed to be a part of URI unescaped. For example, backslashes appear in URL values of pc:buyerProfile (BTW, this should be changed to pc:profile). In most cases, this is a typo of replacing the regular forward slash with the backslash.
The solution to this issue could be to replace any backslashes in source URLs by forward slashes. A more conservative option would be to escape the backslashes.