diff --git a/README.md b/README.md index 7060b10..9e00491 100644 --- a/README.md +++ b/README.md @@ -9,11 +9,34 @@ lists can henceforth be matched using the web UI. The tool will finally generate the necessary lines of code to register the relevant terms in the concordance lists. +Good to know +------------ + +### Handling table-based imports + +If all entries of a given type are currently in a table column +- e.g. of a CSV file - just open the file in a spreadsheet editor +and select the column contents to copy and paste them over to +the text area of the concordance checker. + +### XML-based imports + +Here, [XPath](https://en.wikipedia.org/wiki/XPath) may be helpful. +Different programs support the use of XPath from the command line. +A useful one-liner to select the value of all elements of the name +`OkuFunktionS` using [xmlstarlet](https://en.wikipedia.org/wiki/XMLStarlet) +and filter out duplicate entries is, e.g.: + +``` +xmlstarlet sel -t -v "//OkuFunktionS" Obj_Kuenstler.xml | sort | awk '!seen[$0]++' +``` + See also -------- - [Repository for concordance lists](https://gitea.armuli.eu/museum-digital/MDImporterConcordanceLists) - [Repository for museum-digital's controlled lists](https://gitea.armuli.eu/museum-digital/MDAllowedValueSets) +- [md:quality: Quality checker built upon museum-digital's import tools. Unmatched entries will stop the quality checks.](https://quality.museum-digital.org/) TO-DO -----