Skip to content

Improve SOM extraction for Wikipedia tables #6

@dbhurley

Description

@dbhurley

Wikipedia uses complex table markup that could be better represented in SOM.

Steps:

  1. Run plasmate fetch https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal)
  2. Check how tables are represented in the SOM output
  3. Improve extraction if data is lost or poorly structured

Good first issue - helps improve real-world coverage.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions