I had a quick call with people (including @Michael_Markert) who contribute to the GND (Gemeinsame Normdatei), the authority database of the German National Library. They are interested in exploring how OpenRefine could be equipped with an integration similar to the Wikibase one, but for populating the GND instead.
OpenRefine is already used in such workflows, in combination with the lobid-GND reconciliation endpoint, to do the initial matching. To export the data to GND, one approach is to generate MARC data from OpenRefine using the templating exporter (I assume in XML format, but I am not sure). One problem with this is that it comes with no quality checks, not even validation that the resulting file is valid MARC. So a better way to export to MARC could be useful. It could take inspiration from the Wikibase integration: a schema editor to specify which data maps to which MARC fields, and an issues tab to validate the translated data.
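To make the validation gap concrete, here is a minimal sketch of the kind of structural check that is missing today, assuming the templating exporter is used to produce MARCXML (which I have not confirmed). It only uses Python's standard library and checks generic MARCXML structure (namespace, 24-character leader, three-character tags, one-character indicators and subfield codes). It knows nothing about GND-specific rules, and the function name `check_marcxml` and the sample record are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace of the Library of Congress MARCXML ("MARC21 slim") schema.
MARC_NS = "http://www.loc.gov/MARC21/slim"

def check_marcxml(xml_text):
    """Return a list of structural problems found in a MARCXML string.

    Hypothetical helper: generic MARCXML checks only, no GND rules.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as err:
        return [f"not well-formed XML: {err}"]

    ns = {"m": MARC_NS}
    # Accept either a single <record> or a <collection> of records.
    if root.tag == f"{{{MARC_NS}}}record":
        records = [root]
    else:
        records = root.findall("m:record", ns)
    if not records:
        return ["no record element in the MARC21 slim namespace"]

    issues = []
    for i, rec in enumerate(records, start=1):
        leader = rec.find("m:leader", ns)
        if leader is None or len(leader.text or "") != 24:
            issues.append(f"record {i}: leader missing or not 24 characters")
        for cf in rec.findall("m:controlfield", ns):
            tag = cf.get("tag", "")
            if len(tag) != 3 or not tag.startswith("00"):
                issues.append(f"record {i}: bad controlfield tag '{tag}'")
        for df in rec.findall("m:datafield", ns):
            tag = df.get("tag", "")
            if len(tag) != 3 or tag.startswith("00"):
                issues.append(f"record {i}: bad datafield tag '{tag}'")
            for ind in ("ind1", "ind2"):
                if len(df.get(ind, "")) != 1:
                    issues.append(f"record {i}: field {tag} has invalid {ind}")
            subfields = df.findall("m:subfield", ns)
            if not subfields:
                issues.append(f"record {i}: field {tag} has no subfields")
            for sf in subfields:
                if len(sf.get("code", "")) != 1:
                    issues.append(f"record {i}: field {tag} has a bad subfield code")
    return issues

# Made-up sample record, as a templating export might render one row.
SAMPLE = (
    '<collection xmlns="http://www.loc.gov/MARC21/slim"><record>'
    "<leader>00000nz  a2200000n  4500</leader>"
    '<controlfield tag="001">example-1</controlfield>'
    '<datafield tag="100" ind1="1" ind2=" ">'
    '<subfield code="a">Mustermann, Erika</subfield>'
    "</datafield></record></collection>"
)

if __name__ == "__main__":
    for problem in check_marcxml(SAMPLE):
        print(problem)
```

An issues tab like the Wikibase one would go well beyond this (for instance checking which fields and subfields GND actually accepts), but even a check at this level would catch templates that produce broken output.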
Other approaches that were mentioned include:
- adding support for data upload in the reconciliation API itself
- exporting from OpenRefine to other formats, such as RDF
I mentioned that we are interested in supporting use cases like this one and that we are thinking about how to improve our extension system to make it easier to develop such integrations outside of OpenRefine's code base.
There will probably be a follow-up meeting. The conversation was in German but I am sure we could switch to English if needed. Is anyone interested in discussing this? I have mentioned that we have the BarCamp coming up, where such discussions could also take place.
See also this thread on metadaten.community: GND-Updates aus OpenRefine - Gemeinsame Normdatei (GND) - metadaten.community