Skip to content

dmoz parser / dumps from 2017 #3

@ghost

Description

Hi,

Hope you are all well !

Just wanted to share with you a dmoz dump parser in golang.
ref. https://github.com/lin11230/DMOZ/blob/master/dmoz_content_parser.go

I used it to build my own dump. But, I did not picked the description and title from the content.rdf.u8 as I wanted a fresh title/description for each link.

For sure, there is a space of improvement for this script.

Also, here is a dump of the latest tarballs from dmoz:

wget -r -l1 --no-check-certificate https://curlz.org/dmoz_rdf/

Cheers,
X

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions