Harvest USC

Description

None

Activity

Show:
Mark Breedlove
July 6, 2015, 1:15 PM

The re-ingest finished on Saturday, July 4th.

I'm leaving this open for final confirmation before closing it.

Mark Breedlove
June 30, 2015, 12:49 AM

USC is a provider for whom we only specify particular sets to include. See the "sets" property in https://github.com/dpla/ingestion/blob/develop/profiles/usc.pjs. As such, the list of blacklisted sets above only overlaps with three of the ones that we harvest: p15799coll117, p15799coll127, and p15799coll84. I'm removing those three from their profile and reharvesting.

Gretchen Gueguen
June 29, 2015, 8:27 PM

since fixing either the page-level record fix on USC's end, or the error with the roll-back code will be a considerable amount of effort, let's instead reharvest, but blacklist the following sets:

p15799coll3

p15799coll16

p15799coll17

p15799coll18

p15799coll20

p15799coll23

p15799coll24

p15799coll26

p15799coll29

p15799coll30

p15799coll127

p15799coll117

p15799coll70

p15799coll84

Gretchen Gueguen
June 29, 2015, 2:21 PM

roll back of USC records is needed b/c of page-level records in their feed. MB has already started this.

Mark Breedlove
June 28, 2015, 1:24 AM

The ingest is complete, but I'm leaving this ticket open for feedback in case there's anything wrong with the large count of added records.

Done

Assignee

Mark Breedlove

Reporter

Gretchen Gueguen

Labels

None

Priority

Medium