extract_metadata.py

Extracts the contents of <mods:title>, <mods:subject>, and <mods:genre> from an OAI harvest XML file and writes them to a new output file.
The <mods:subject> and <mods:genre> values are filtered so that only unique values are output.
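
As a rough illustration of the extraction and filtering described above, here is a minimal sketch. It is not the script's actual code: the function names (`element_text`, `unique`, `extract_metadata`, `write_report`) and the layout of the output file are assumptions made for this example. It uses only the standard library and the standard MODS v3 namespace, and it collects every matching element anywhere in the file, so titles nested inside other elements would also be picked up.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"  # standard MODS v3 namespace


def element_text(elem):
    """Join all text inside an element, collapsing runs of whitespace."""
    return " ".join(" ".join(elem.itertext()).split())


def unique(values):
    """Drop empty strings and duplicates, keeping first-seen order."""
    return list(dict.fromkeys(v for v in values if v))


def extract_metadata(xml_source):
    """Return titles, unique subjects, and unique genres (hypothetical helper).

    xml_source may be a file path or an open file object.
    """
    root = ET.parse(xml_source).getroot()

    def texts(tag):
        # Match the tag in the MODS namespace at any depth in the tree.
        return [element_text(e) for e in root.iter(f"{{{MODS_NS}}}{tag}")]

    return {
        "title": [t for t in texts("title") if t],
        "subject": unique(texts("subject")),
        "genre": unique(texts("genre")),
    }


def write_report(metadata, output_path):
    """Write one section per field; this output layout is an assumption."""
    with open(output_path, "w", encoding="utf-8") as out:
        for field, values in metadata.items():
            out.write(f"{field}\n")
            for value in values:
                out.write(f"  {value}\n")
```

Using `dict.fromkeys` for de-duplication keeps the values in the order they first appear in the harvest, which a plain `set` would not guarantee.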

Set the input and output files, then run:
python extract_metadata.py

With a nod to Mark Jordan's metadata_reporter.