The trouble with de-duplication and web-scale discovery - The Distant Librarian
Christophe ICD on 18 May 10: "One of the topics of discussion at last week's Summon Advisory Board was the status of de-duping records returned by Summon. On the face of it it seems to be a simple issue - if the titles and authors match, throw the duplicate records out and you're good to go. The Summon technical team explained why it's a little harder than that though."
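
To make the "simple" approach from the excerpt concrete, here is a minimal Python sketch of de-duplicating on matching title and author. It is not Summon's implementation, and the excerpt does not say what difficulties the Summon team described. The record fields (`title`, `author`, `year`), the `normalize` helper, and the sample data are all assumptions made up for illustration.

```python
def normalize(text):
    # Hypothetical helper: lowercase and collapse whitespace so that
    # trivially different strings compare equal.
    return " ".join(text.lower().split())


def naive_dedupe(records):
    # Keep the first record seen for each (title, author) pair and
    # drop every later record with the same pair.
    seen = set()
    unique = []
    for record in records:
        key = (normalize(record["title"]), normalize(record["author"]))
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


# Made-up sample records, not real Summon data.
records = [
    {"title": "Moby Dick", "author": "Herman Melville", "year": 1851},
    {"title": "moby  dick", "author": "herman melville", "year": 1992},
    {"title": "Moby-Dick", "author": "Melville, Herman", "year": 1851},
]

result = naive_dedupe(records)
for record in result:
    print(record)
```

With this sample data the sketch goes wrong in both directions: the 1992 record is discarded even though it may be a different edition, and the "Moby-Dick" / "Melville, Herman" record survives even though it likely describes the same work as the first. These are only plausible examples of how exact matching can fail, not necessarily the issues raised at the Advisory Board.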