Hi,
I hope you are going well!
I came across Openrefine recently and it is very useful for the bibliometric work that I am currently pursuing (version used : Openrefine 3.8.2).
More precisely, I am using the facet called "Cluster and edit" to find titles of articles/books in my corpus that are very similar and yet don't share the same ID (an identifier that is useful to tell if two articles/books are the same or not). However, the number of clusters I get when using this facet is very big. I am able to cluster around a 1000 clusters at a time and the total number of clusters is aroud 170 000 . Manually doing this operation over and over again is very tedious and far from being time efficient :
Is there be a way to automate this process with a script for example? I am not well versed in programming, but I would like to achieve something that automatically 1) select all the cluster, 2) merge all the selected clusters, 3) repeat that operation until no clusters are left.
Do you think it is archivable? Any help will be appreciated
Best ,
Jacob