Use proper parser #3

@DavidLeoni

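Currently parsing seems to be done by means of regexes, which is a bad thing: wikitext is nested (templates inside templates, links inside templates), and regexes cannot reliably handle nesting.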
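To make the problem concrete, here is a minimal sketch, not taken from the project's code. The regex, the sample wikitext and the `outerTemplates` helper are all hypothetical and only illustrate why a lazy regex breaks on nested templates while even a trivial depth-aware scan does not.

```typescript
// Hypothetical naive regex for a wikitext template such as {{name|args}}.
const templateRegex = /\{\{(.*?)\}\}/g;

// Sample input (an assumption): a template nested inside another template.
const wikitext = "{{Infobox|name={{lang|en|Rome}}|country=Italy}}";

// The lazy match stops at the first "}}", so the outer template is cut short.
const regexMatches: string[] = wikitext.match(templateRegex) || [];
// regexMatches[0] is "{{Infobox|name={{lang|en|Rome}}"

// Depth-aware scan: counts "{{" and "}}" and returns only top-level templates.
function outerTemplates(text: string): string[] {
  const found: string[] = [];
  let depth = 0;
  let start = -1;
  let i = 0;
  while (i < text.length) {
    const pair = text.substring(i, i + 2);
    if (pair === "{{") {
      if (depth === 0) start = i; // remember where the outermost template begins
      depth++;
      i += 2;
    } else if (pair === "}}" && depth > 0) {
      depth--;
      i += 2;
      if (depth === 0) found.push(text.substring(start, i));
    } else {
      i++;
    }
  }
  return found;
}

// outerTemplates(wikitext)[0] is the whole input string.
```

A real parser has to handle far more than balanced braces (links, tables, `{{{parameters}}}`), which is the reason to adopt an existing one rather than grow this by hand.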

Instead, we should use a proper parser. Alternatives found so far:

parsoid

  • parser used for Visual editor of Wikimedia
  • works in NodeJS
  • TODO does it work in browser?
  • mainly parses to HTML

wtf_wikipedia

  • works both in NodeJS and browser
  • according to its author, it produces a proper JSON structure, while Parsoid doesn't

RemexHTML

  • new official PHP HTML5 parser used by MediaWiki; replaces the old Tidy-based HTML cleanup (it parses HTML, not wikitext)

Maybe we could use wtf_wikipedia to get structured JSON, which would give us an easy structure for mindmaps, and use Parsoid to create the HTML displayed in tooltips.
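As a rough sketch of the mindmap half of this idea: assuming the parser output can be reduced to a flat list of sections with a title and a nesting depth, turning it into a tree is straightforward. The `FlatSection` and `MindmapNode` shapes and the `toMindmap` function are hypothetical, not wtf_wikipedia's actual API; the real output would need to be mapped to this shape first.

```typescript
// Hypothetical input shape: one entry per section, depth 0 = top-level section.
interface FlatSection {
  title: string;
  depth: number;
}

// Hypothetical mindmap node.
interface MindmapNode {
  label: string;
  children: MindmapNode[];
}

// Builds a tree from sections listed in document order.
function toMindmap(rootLabel: string, sections: FlatSection[]): MindmapNode {
  const root: MindmapNode = { label: rootLabel, children: [] };
  // stack[0] is the root; stack[d + 1] is the latest node seen at depth d.
  const stack: MindmapNode[] = [root];
  for (const s of sections) {
    const node: MindmapNode = { label: s.title, children: [] };
    // The parent sits one level up; clamp in case a heading level is skipped.
    const parentIndex = Math.min(s.depth, stack.length - 1);
    stack.length = parentIndex + 1; // drop deeper nodes from earlier branches
    stack[parentIndex].children.push(node);
    stack.push(node);
  }
  return root;
}

const tree = toMindmap("Rome", [
  { title: "History", depth: 0 },
  { title: "Antiquity", depth: 1 },
  { title: "Geography", depth: 0 },
]);
// tree has two children, "History" (with child "Antiquity") and "Geography".
```

The tooltip HTML would come from a separate call to whichever HTML-producing parser is chosen, keyed by section title.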
