Reconcile many values into single entity

Podbrushkin · December 12, 2023, 5:40am

I need to reconcile csv with authors. A common task. Column looks like that:

Александр Дюма
Александр  Дюма
Александр Дюма.
Дюма Александр
Дюма, Александр
Сельма Лагерлёф
Лагерлеф Сельма
Лагерлёф Сельма
Лагерлёф, Сельма
Сельма Лагерлеф

What I did first: applied clustering and reconciled produced unique names.
I've found out reconciliation result was not so good. I think it would be better if it would take into considerations not a single final unique name produced by clustering, but all known invariants of this name. Is it possible? How can it be done?
I can try joining all invariants into list into single column, but it doesn't seem right.
Original data structure is not important, I can transform it freely. Thanks in advance.
Also, if some effective technique exists for this common task which will make my original question irrelevant, feel free to share!

Lucas.Belo · December 23, 2023, 11:33pm

Hello Podbrushkin, an alternative would be to create a column for each alternative name and then reconcile each column using facets so as not to do the same reconciliation more than once. For example:

name	alias1	alias2	alias3
Александр Дюма	Alexandre Dumas	Dumas Davy de la Pailleterie	...

The first reconciliation would be in the name column, after that activate the facet to only have the rows that were not reconciled, then reconcile the alias1 column and repeat the process as many times as necessary

Topic		Replies	Views
How to find / group more than one property (values) after 'Add column from reconciled values'? Support and Helpdesk reconciliation	19	743	March 29, 2023
Multiple values in one cell during reconcile Support and Helpdesk	7	138	June 30, 2024
How can I reconcile from my own CSV file? Support and Helpdesk reconciliation	3	911	May 31, 2023
Reconciling literature tables Support and Helpdesk reconciliation , hints-and-tips	2	152	December 13, 2023
Exact matching on a property during reconcile Support and Helpdesk wikidata , reconciliation	2	87	June 7, 2024

Reconcile many values into single entity

Related topics