User Interface for Deduplicating Records
I am looking for UX interface patterns that can be used to dedupe records.
Suppose, for example, one has a database of customers. Perhaps Ronald T. Wiggins and Ronald Wiggins are both in the database and have the same address. Users would like to be able to merge those two records into one canonical record.
One pattern that facilitates record merging is provided by OpenRefine, the open source project by Google for data cleaning. We can see below various "clusters" where each cluster contains items that a user can merge into a single canonical record:
In my PhD research I was working on identifying visual copyright infringement, and rigged up a basic UI that let me identify whether or not images match and should be clustered:
Has anyone encountered interfaces that facilitate this kind of record merging? Any examples others can offer would be hugely helpful!