Conversation

@lthurston (Contributor)

No description provided.

@lthurston force-pushed the xml branch 2 times, most recently from 4ee0b6f to 9130b12 on July 19, 2023 at 18:00
@lthurston (Contributor, Author)

@amywieliczka, if you want to take a sneak peek at this XML fetcher, I invite you to do so. It works: I fetched collection 26935 (the reported 77k records) in about 20 seconds locally. It reported more than 110k records, though, so there might be an issue there, or there may actually be more records than reported.

I haven't written any mapping code yet, so I consider this to be a little naive, a little optimistic, but nevertheless it does what it's supposed to do. Let me know your thoughts!
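The fetcher code itself isn't shown in this thread, but the reported-vs-fetched count check behind the 77k/110k discrepancy is easy to sketch. A minimal illustration using the standard library XML parser; the element and attribute names here (`export`, `totalRecords`, `record`) are assumptions, not the real feed's schema:

```python
import xml.etree.ElementTree as ET

# Hypothetical sample of the kind of export the fetcher pages through.
SAMPLE_XML = """
<export totalRecords="3">
  <record><objectid>1</objectid></record>
  <record><objectid>2</objectid></record>
  <record><objectid>3</objectid></record>
</export>
"""

def count_records(xml_text: str):
    """Return (reported_total, actual_count) so a mismatch like the
    one described above can be flagged."""
    root = ET.fromstring(xml_text)
    reported = int(root.get("totalRecords", "0"))
    actual = len(root.findall("record"))
    return reported, actual

reported, actual = count_records(SAMPLE_XML)
if reported != actual:
    print(f"warning: feed reports {reported} records but contains {actual}")
```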

@aturner (Collaborator) commented Jul 19, 2023

@lthurston I think our legacy harvester code has some logic built in to leave out "metadata-only" records; the source collection has some items that don't have a digital image, just a metadata record. That may account for the count difference you're seeing.
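For context, that kind of filter is simple to express. A hedged sketch of the behavior described above; the field name is an assumption, not the legacy harvester's actual schema:

```python
def has_digital_image(record: dict) -> bool:
    # Assumed field name; the real source schema may differ.
    return bool(record.get("image_filename"))

def drop_metadata_only(records):
    """Keep only records that reference a digital image, mimicking
    the legacy-harvester behavior described in the comment above."""
    return [r for r in records if has_digital_image(r)]
```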

@lthurston (Contributor, Author)

@aturner That makes sense, thanks for the explanation. My instinct is to leave those records in our imported files in order to stay as true to the original source data as possible (despite the fact that we have to rewrite it to paginate), but I'm only too happy to be overruled.
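The pagination rewrite mentioned here could be as simple as chunking the fetched record list into fixed-size pages; a sketch under that assumption (the page size is made up):

```python
def paginate(records, page_size=100):
    """Split fetched records into fixed-size pages, preserving order
    and content so the output stays true to the source data."""
    return [records[i:i + page_size]
            for i in range(0, len(records), page_size)]
```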

@lthurston changed the title from "[WIP] Implement xml_file fetcher" to "Implement XML fetcher / PastPerfect mapper" on Aug 1, 2023


Development

Successfully merging this pull request may close these issues:

- PastPerfectXMLMapper(Mapper) -- paused
- Fetcher: XML -- paused
