Breaking News!
60% Off the Hottest Halloween Costumes & Accessories

An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at @ Link.springer.com when you apply the 10% OFF coupon at checkout.
Click “Get Coupon & Buy” to copy the code and unlock the deal.

Set a price drop alert to never miss an offer.

2 Offers Price Range: $19.99 - $30.00
BEST PRICE

Single Product Purchase

$18.00
@ Link.springer.com with extra coupon

Price Comparison

Seller Contact Seller List Price On Sale Shipping Best Promo Final Price Volume Discount Financing Availability Seller's Page
BEST PRICE
1 Product Purchase
@ Link.springer.com
$19.99 $19.99

10% OFF
This deals requires coupon
$18.00
See Site In stock Visit Store

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9783031007071
Condition
New
Product Description

With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture examines closely the two main components to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to perform on very large volumes of data in search for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection. Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.

Available Colors
Available Sizes

Reviews

0
0 reviews
5 stars
4 stars
3 stars
2 stars
1 star

Questions & Answers

Similar Products

Money and Mathematics

Money and Mathematics

$89.99
LImmigration en France depuis 1945

LImmigration en France depuis 1945

$54.99
Refugee and Mixed Migration Flows

Refugee and Mixed Migration Flows

$129.00
Practices of Anorectal Surgery

Practices of Anorectal Surgery

$109.00
Swarm Intelligence

Swarm Intelligence

$54.99
High Performance Scientific and Engineering Computing

High Performance Scientific and Engineering Computing

$109.99
Life Cycle Assessment & Circular Economy

Life Cycle Assessment & Circular Economy

$139.99
Glaukom 2003

Glaukom 2003

$29.99
Landmark Trials in Oncology

Landmark Trials in Oncology

$99.99
Handbook of Quality Assurance in Mental Health

Handbook of Quality Assurance in Mental Health

$39.99
Options for a New Britain

Options for a New Britain

$54.99
Grundlagen der Politischen Theorie

Grundlagen der Politischen Theorie

$29.99
Decentralization and Rural Development in Indonesia

Decentralization and Rural Development in Indonesia

$79.99
Piper: A Model Genus for Studies of Phytochemistry, Ecology, and Evolution

Piper: A Model Genus for Studies of Phytochemistry, Ecology, and Evolution

$169.99
Set Theory and Hierarchy Theory

Set Theory and Hierarchy Theory

$44.99
Time-Resolved Spectroscopy in Complex Liquids

Time-Resolved Spectroscopy in Complex Liquids

$169.99
Nonlinear Optics

Nonlinear Optics

$54.99
European Bison

European Bison

$169.99
Mechanical Characterization of Load Bearing Fibre Composite Laminates

Mechanical Characterization of Load Bearing Fibre Composite Laminates

$219.99
Coral Reef Studies of Japan

Coral Reef Studies of Japan

$129.00
Presenting Futures

Presenting Futures

$129.00
Konstruktivistisch forschen

Konstruktivistisch forschen

$39.99
Anatomy of Dissent in Islamic Societies

Anatomy of Dissent in Islamic Societies

$39.99
Ready-To-Go 2 25 Book Classroom Library: Favorites, Grade 4

Ready-To-Go 2 25 Book Classroom Library: Favorites, Grade 4

$145.00
Design Data for Reinforced Plastics

Design Data for Reinforced Plastics

$129.00
Laparo-endoskopische Hernienchirurgie

Laparo-endoskopische Hernienchirurgie

$229.99
Branding

Branding

$169.99
Telepathology

Telepathology

$84.99
Agronomic Crops

Agronomic Crops

$219.99
Identifying Potential for Equitable Access to Tertiary Level Science

Identifying Potential for Equitable Access to Tertiary Level Science

$109.99
Biomedical Applications

Biomedical Applications

$84.99
Elementare statistische Bewertung von Messdaten der analytischen Chemie mit Excel

Elementare statistische Bewertung von Messdaten der analytischen Chemie mit Excel

$17.99
Research Advances in Database and Information Systems Security

Research Advances in Database and Information Systems Security

$84.99
Hormone Resistance and Other Endocrine Paradoxes

Hormone Resistance and Other Endocrine Paradoxes

$39.99
Understanding Neighbourhood Dynamics

Understanding Neighbourhood Dynamics

$109.99
The Paradoxes of Globalisation

The Paradoxes of Globalisation

$109.99
Ethnic and National Issues in Russian and East European History

Ethnic and National Issues in Russian and East European History

$109.99
The Politics of Shakespeare

The Politics of Shakespeare

$84.99
Political Party Funding and Private Donations in Italy

Political Party Funding and Private Donations in Italy

$79.99
Systems, Decision and Control in Energy VI

Systems, Decision and Control in Energy VI

$249.99
previous
next