_drafts

OpenRefine for Metadata

Outline:

  • Import metadata:
    • character encoding (UTF-8!)
    • import options (separator, batches)
    • CONTENTdm TSV parsing issues: uncheck the option Use character " to enclose cells containing column separators
  • Explore metadata:
    • facets (from column menu)
    • text filter (from column menu)
  • Clean up metadata:
    • editing facets / cells (hover on value)
    • trim whitespace (from column menu)
    • Star / Flag, and facet (all column menu)
    • remove rows (all column menu)
    • reorder columns (all column menu)
    • split into multiple columns (from column menu)
  • Clean up multi-valued cells (e.g. Subjects):
    • split multi-valued cells (from column menu)
    • trim (from column menu)
    • cluster (from column menu)
    • join multi-valued cells (from column menu)
  • Transform using GREL:
    • add text, value + " something new"
    • find & replace, value.replace("old","new")
    • get values from other cells, cells['column_name'].value
    • dates: GREL dates, value.toDate().toString('yyyy-MM-dd')
  • Export metadata:
    • CSV export
    • templating

Reference: