Skip to content

HTML fact extractor does not support not marked single tags like <br> #30

@todeslord

Description

@todeslord

HTML fact extractor does not support not marked single tags like
.
The used parser cannot distinguish start tags from single tags.
(The used SAX-Parser is not supporting single-tags cor-
rectly. A <br> is leading to a wrong fragment file whereas <br/> is)

Possible solutions:
-Find another parser
-write a parser that finds single tags
-use a preprocessor that converts single tags to the <br/> style.
...

Issue from the Fact Extraction paper of June 22.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions