
Provide Articles in a machine-readable format #9

@lukasjuhrich

(This is a question / a feature request)

Is there any way to present the nlab articles in a different format than a complete HTML website with all its surroundings?

I wanted to write a chatbot which looks up or searches nlab articles, but realized that I would

  • either have to do a lot of ugly scraping of the HTML output[1], but even if that worked, such scraping is usually fragile: it either does not work for 100% of the articles or breaks under structural changes to the HTML page,
  • or fetch /source/<article-name> and try to parse the instiki syntax (if there is even a spec for it) myself. However, building a complete parser for this seems like overkill.
  1. Is it possible to provide the pages in a different format (whatever this may be), or
  2. is there a parser for the instiki syntax (or whatever derivative of it is used in the nlab), and
  3. would it even be possible to provide a REST API?

[1] For instance, it's not quite clear when the first actual part of the article begins – usually, we have some <h1 id="contents">, but that need not exist.
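
To illustrate the problem from footnote [1], here is a minimal sketch of the kind of scraping I would like to avoid. It only uses Python's standard `html.parser`; the names `ContentsFinder` and `find_contents_heading` are made up for this example, and the only thing it assumes about the page is the `<h1 id="contents">` heading mentioned above.

```python
from html.parser import HTMLParser


class ContentsFinder(HTMLParser):
    """Remember where the first <h1 id="contents"> starts, if there is one."""

    def __init__(self):
        super().__init__()
        self.position = None  # (line, column) of the heading, or None

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; only the first match is kept.
        if self.position is None and tag == "h1" and dict(attrs).get("id") == "contents":
            self.position = self.getpos()


def find_contents_heading(html):
    """Return the (line, column) of <h1 id="contents">, or None if it is missing."""
    finder = ContentsFinder()
    finder.feed(html)
    finder.close()
    return finder.position
```

Every caller has to handle the `None` case with some other heuristic, which is exactly the part that tends to break when the HTML structure changes.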
