Skip to content

Harvesting

Kevin Worthington edited this page Mar 29, 2024 · 2 revisions

Command-line Harvesting

The Command-line is used to harvest data. With terminal path set to the code location and with the virtual environment activated Enter the following python manage.py harvest -e 1

Harvested records will download from the end_point and be saved to the in the /resources/harvester/data/ folder using the 'Org_name'

-e is the endpoint id from the management system. To harvest from mulitple endpoints at once, separated the ids with commas (e.g 1,2,3)

Other Arguments -d is for date, and is set to '' to use no date when using Git tracking omit this to have today's date appended to loaded data when using local matadata tracking

-r for reference ids to target from endpoint

-o for overwriting existing loaded data without this, subsequent harvests will not refresh existing data

-t to track_missing files. This will add a flag for resources that no longer listed in the endpoint.

GUI Harvesting

From the End_points admin page, you can select one or more End_points and use the 'Action' dropdown to harvest from them. The page will refresh when the harvest(s) is/are complete.

Clone this wiki locally