OpenRefine for Metadata
Outline:
- Import metadata:
- character encoding (UTF-8!)
- import options (separator, batches)
- CONTENTdm TSV parsing issues: uncheck the option
Use character " to enclose cells containing column separators
- Explore metadata:
- facets (from column menu)
- text filter (from column menu)
- Clean up metadata:
- editing facets / cells (hover on value)
- trim whitespace (from column menu)
- Star / Flag, and facet (all column menu)
- remove rows (all column menu)
- reorder columns (all column menu)
- split into multiple columns (from column menu)
- Clean up multi-valued cells (e.g. Subjects):
- split multi-valued cells (from column menu)
- trim (from column menu)
- cluster (from column menu)
- join multi-valued cells (from column menu)
- Transform using GREL:
- add text,
value + " something new"
- find & replace,
value.replace("old","new")
- get values from other cells,
cells['column_name'].value
- dates: GREL dates,
value.toDate().toString('yyyy-MM-dd')
- add text,
- Export metadata:
- CSV export
- templating
Reference: