Skip to content

Sub-issue: Implement HTML File Parser #15

@OmarMGaber

Description

@OmarMGaber

Description

Add a parser for .hmtl/.htm files to extract text for indexing.


What to Do

  • Implement a HTMLParser struct that implements the Parser interface.
  • Ignore scripts, styles, and tags for now (could be useful then for ranking or term weight).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions