Cluster and edit function returns values that look similar. What difference does it see in my Excel file that I don't? I tried different Method and Distance function but I always get similar results.
Those look like they could potentially have some whitespace differences (it's really hard to tell by eye, but based on the non-aligning (xx rows)
information next to the values there could be trailing whitespace after the visible string
Do you see this with the key collision / fingerprint clustering method? If so, which version of OpenRefine are you using?
2 Likes
Whitespace was the problem! Thank you very much!
1 Like
As a general hint, applying "trim()" to all columns (go to All
column, open dropdown menu, choose Transform
enter value.trim()
and (assuming you want to apply to all columns) leave all columns selected then click "OK") is often a good first step to just ensure you don't have any trailing whitespaces across the project
2 Likes