I noticed that Open Refine would be more complete as Data Management and Cleaning tool if we can implement functions to solve common data management/cleansing problem taking advantage of it's powerful clustering algorithm:
-Entity Resolution or Deduplication.
-Record Linkage between two datasets
Interesting. Could you tell more about this?