Harvest USC

Description

None

Activity

Show:
Mark Breedlove
June 28, 2015, 1:24 AM

The ingest is complete, but I'm leaving this ticket open for feedback in case there's anything wrong with the large count of added records.

Gretchen Gueguen
June 29, 2015, 2:21 PM

roll back of USC records is needed b/c of page-level records in their feed. MB has already started this.

Gretchen Gueguen
June 29, 2015, 8:27 PM

since fixing either the page-level record fix on USC's end, or the error with the roll-back code will be a considerable amount of effort, let's instead reharvest, but blacklist the following sets:

p15799coll3

p15799coll16

p15799coll17

p15799coll18

p15799coll20

p15799coll23

p15799coll24

p15799coll26

p15799coll29

p15799coll30

p15799coll127

p15799coll117

p15799coll70

p15799coll84

Mark Breedlove
June 30, 2015, 12:49 AM

USC is a provider for whom we only specify particular sets to include. See the "sets" property in https://github.com/dpla/ingestion/blob/develop/profiles/usc.pjs. As such, the list of blacklisted sets above only overlaps with three of the ones that we harvest: p15799coll117, p15799coll127, and p15799coll84. I'm removing those three from their profile and reharvesting.

Mark Breedlove
July 6, 2015, 1:15 PM

The re-ingest finished on Saturday, July 4th.

I'm leaving this open for final confirmation before closing it.

Assignee

Mark Breedlove

Reporter

Gretchen Gueguen

Labels

None

Priority

Medium
Configure