Standard Enrichments

(example enrichment profile)

| Name | Type | Description | Typical fields used | Ruby module |
| --- | --- | --- | --- | --- |
| Create prefLabel from providedLabel | RDF | Copy the value from a provided label to the preferred label of a property when no preferred label exists. | All | create_pref_label_from_provided.rb |
| DCMIType Map | RDF | Match string values within the type properties to DCMIType terms and add the matching term URI. Remove non-DCMIType values and move them to format or genre. | Type, Genre, Format | dcmi_enforcer.rb, dcmi_type_map.rb, move_non_dcmi_type.rb |
| De-Duplicate | | Look for identically matching values within instances of a property and remove the duplicate property. | All | dedup_nodes.rb |
| Genre matching | | Match string values to a controlled list of values for genre. | Genre, Format, Type | genre_filter.rb, move_non_dcmi_type.rb |
| Language Normalization | | Match a string to the corresponding ISO 639-3 code and add that value and the URI for the language to the record. Alternatively, match a three-character string to the corresponding language in the ISO 639-3 vocabulary and add the name string and the URI for the language to the record. | Language | language_to_lexvo.rb |
| Parse Date | | Parse information in date fields and normalize to EDTF format. When a single date is present, split it into begin and end dates for the Temporal Class. When only begin and end dates are present, create a date range label. | Date, Timespan | parse_date.rb, timespan_split.rb, timespan_label.rb |
| Remove Empty Fields | | Look for and remove any existing property with no value. | All | remove_empty_fields.rb |
| Geocoding | | Search the GeoNames vocabulary for matches to string values in spatial fields. Build out the rest of the spatial class properties based on the GeoNames data. | Spatial | |
| Split at Delimiter | String | Split values at a particular delimiter (usually a semicolon) and put each resulting value in its own property instance. | All | split_at_delimiter.rb, split_provided_label_at_delimiter.rb |
| Strip Ending Punctuation | String | Remove ending punctuation from property values, including quotation marks, colons, semicolons, and dashes, but excluding brackets and parentheses. Periods are typically removed except when the final word is a two-character string (such as “Jr.”) or two characters separated by periods (“P.A.”). | Title, Creator, Contributor, Publisher, Subject, Format, Genre | strip_ending_punctuation.rb |
| Strip HTML | String | Look for and remove any HTML code from a property value. | All | strip_html.rb |
| Strip Leading Colons | String | Remove the initial character from a value if it is a colon. | Title | strip_leading_colons.rb |
| Strip Leading Punctuation | String | Remove initial punctuation from property values, including quotation marks, colons, semicolons, and dashes, but excluding brackets and parentheses. | Creator, Contributor, Publisher, Subject, Format, Genre | strip_leading_punctuation.rb |
| Strip Whitespace | String | Look for runs of multiple whitespace characters within a value and reduce each to a single space. | All | strip_whitespace.rb |
| Web Resource URIs | | Remove Web Resources that are not http URIs. | isShownAt, Preview, Object | web_resource_uri.rb |
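
As a rough illustration of how three of the String enrichments listed above might behave, the sketch below chains a delimiter split, whitespace reduction, and ending-punctuation removal over one raw value. The method names, signatures, and regular expressions are assumptions made for this example; they are not taken from split_at_delimiter.rb, strip_whitespace.rb, or strip_ending_punctuation.rb, whose actual behavior may differ.

```ruby
# Hypothetical helpers for illustration only; not the code in the
# enrichment modules named in the table.

# Split a value at a delimiter (semicolon by default) and drop empty pieces.
def split_at_delimiter(value, delimiter = ';')
  value.split(delimiter).map(&:strip).reject(&:empty?)
end

# Reduce each run of whitespace to a single space.
def strip_whitespace(value)
  value.gsub(/\s+/, ' ')
end

# Remove trailing quotation marks, colons, semicolons, and dashes.
# Brackets and parentheses are left alone. A final period is kept when the
# last word is a two-letter abbreviation ("Jr.") or initials ("P.A.").
def strip_ending_punctuation(value)
  cleaned = value.sub(/[\s"':;-]+\z/, '')
  return cleaned unless cleaned.end_with?('.')

  last_word = cleaned.split(/\s+/).last
  return cleaned if last_word.match?(/\A[[:alpha:]]{2}\.\z/)      # "Jr."
  return cleaned if last_word.match?(/\A(?:[[:alpha:]]\.){2,}\z/) # "P.A."

  cleaned.sub(/\.+\z/, '')
end

raw = 'Maps ;  Atlases  of   Ohio: ; Globes.'
values = split_at_delimiter(raw).map do |v|
  strip_ending_punctuation(strip_whitespace(v))
end
# values => ["Maps", "Atlases of Ohio", "Globes"]
```

Running the split first means each resulting value is cleaned on its own, which matches the table's description of putting each split value in its own property instance.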

 

Specialized Enrichments

| Name | Description | Typical fields used | Ruby module | Hubs |
| --- | --- | --- | --- | --- |
| Convert to Sentence Case | Convert a string in all caps to sentence case. | Title | convert_to_sentence_case.rb | |
| Dates in Coverage fields | Parse coverage properties in original records for values that are dates and move them to the timespan property. | Spatial | | |
| Genre from MARC leader | Match specific codes in the MARC leader for a record to a controlled list of values for genre. | Genre | | HathiTrust, University of Florida, GPO |
| Limit Characters | Limit the value of a property to a specified number of characters. | Description | limit_characters.rb | University of Washington, Minnesota Digital Library |
| Remove Placeholder Values | Remove a given placeholder value such as "XYZ". | Subject | remove_placeholder.rb | University of Washington |
| Separate Coordinates | Split sets of geospatial coordinates in original record properties into appropriate latitude and longitude properties. | Spatial | split_coordinates.rb | Tennessee Digital Library, Minnesota Digital Library |
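
To make a few of the specialized enrichments above more concrete, here is a minimal sketch of sentence-casing, character limiting, placeholder removal, and coordinate splitting. All method names and signatures are hypothetical, and the coordinate format (decimal degrees, comma-separated, latitude first) is an assumption; the actual modules may accept other inputs and behave differently.

```ruby
# Hypothetical helpers for illustration only; not the code in the
# enrichment modules named in the table.

# Convert an all-caps string to sentence case; leave other strings untouched.
def convert_to_sentence_case(value)
  return value unless value.match?(/[[:upper:]]/) && value == value.upcase

  value.capitalize
end

# Keep at most `max` characters of a value.
def limit_characters(value, max)
  value[0, max]
end

# Drop every value that equals the given placeholder.
def remove_placeholder(values, placeholder)
  values.reject { |v| v == placeholder }
end

# Split "lat, long" into separate numeric properties.
# Returns nil when the value does not hold two parseable numbers.
def split_coordinates(value)
  lat, long = value.split(',').map(&:strip)
  return nil unless lat && long

  { lat: Float(lat), long: Float(long) }
rescue ArgumentError
  nil
end

title  = convert_to_sentence_case('ANNUAL REPORT OF THE BOARD')
short  = limit_characters('A long description of the item', 6)
kept   = remove_placeholder(%w[Maps XYZ Ohio], 'XYZ')
coords = split_coordinates('35.9606, -83.9207')
```

Returning nil from `split_coordinates` for unparseable input is one possible design choice; it lets a caller leave the original value in place rather than writing a partial latitude or longitude.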