An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at Link.springer.com when you apply the 10% OFF coupon at checkout.

2 Offers Price Range: $19.99 - $30.00
Best price (single product purchase): $18.00 at Link.springer.com with extra coupon

Price Comparison

Seller: Link.springer.com (best price, 1 product purchase)
List Price: $19.99
On Sale: $19.99
Best Promo: 10% OFF (this deal requires a coupon)
Final Price: $18.00
Availability: In stock

Product Details

Brand: Springer Nature
Manufacturer: N/A
Part Number: 0
GTIN: 9783031007071
Condition: New

Product Description

With the ever-increasing volume of data, data quality problems abound. Duplicates, that is, multiple yet different representations of the same real-world object in data, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, and catalogs are mailed multiple times to the same household. Automatically detecting duplicates is difficult: first, duplicate representations are usually not identical but differ slightly in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture closely examines the two main components used to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to search very large volumes of data for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection.

Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.
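To make the two components in the description concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not necessarily the method the book presents: the similarity measure is Jaccard similarity over character trigrams, and the algorithm is a simple sorted-neighborhood pass that compares each record only with its nearest neighbors in sorted order instead of with every other record. The function names, the sample records, the window size, and the 0.5 threshold are all hypothetical choices made for this example.

```python
from __future__ import annotations


def trigrams(text: str) -> set[str]:
    """Character trigrams of a lowercased, whitespace-normalized string."""
    normalized = " ".join(text.lower().split())
    padded = f"  {normalized} "  # padding so short strings still yield trigrams
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity of the two trigram sets, a value in [0, 1]."""
    ta, tb = trigrams(a), trigrams(b)
    return len(ta & tb) / len(ta | tb)


def find_duplicates(
    records: list[str], window: int = 2, threshold: float = 0.5
) -> list[tuple[int, int, float]]:
    """Sorted-neighborhood sketch: sort, then compare only nearby records.

    Each record is compared with its (window - 1) successors in sorted
    order, which avoids comparing all pairs. The sort key (the lowercased
    record text) and the threshold are assumptions for illustration.
    """
    order = sorted(range(len(records)), key=lambda i: records[i].lower())
    pairs = []
    for pos, i in enumerate(order):
        for j in order[pos + 1:pos + window]:
            sim = jaccard(records[i], records[j])
            if sim >= threshold:
                pairs.append((min(i, j), max(i, j), sim))
    return pairs


# Hypothetical sample data: records 0/1 and 2/4 describe the same person.
records = [
    "John Smith, 12 Main Street, Springfield",
    "Jon Smith, 12 Main St., Springfield",
    "Maria Garcia, 5 Oak Avenue, Riverton",
    "Peter Jones, 40 Hill Road, Lakeside",
    "Maria Garcia, 5 Oak Ave, Riverton",
]

for i, j, sim in find_duplicates(records):
    print(f"records {i} and {j} look like duplicates (similarity {sim:.2f})")
```

With a window of 2, the five sample records need only 4 comparisons rather than the 10 required to compare all pairs. The trade-off is that duplicates whose sort keys differ early in the string can land far apart and be missed, so the sort key and window size need care in practice.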

Similar Products

Health Effects of the New Labour Market ($84.99)
Banking Supervision and Criminal Investigation ($129.99)
Perinatal and Prenatal Disorders ($169.99)
Ausstrahlung, Ausbreitung und Aufnahme Elektromagnetischer Wellen ($54.99)
Tobacco ($39.99)
Innovation Practices for Digital Transformation in the Global South ($99.99)
Synergetik und Marktprozesse ($59.99)
Generationenwechsel im Mittelstand ($17.99)
Was haben wir bei unserer Ernährung im Haushalt zu beachten? ($59.99)
Methoden zur durchgängigen virtuellen Eigenschaftsentwicklung von Fahrzeugen mit Bremsregelsystem ($79.99)
Singular Integral Operators, Quantitative Flatness, and Boundary Problems ($109.99)
Das Tornado-Phänomen ($74.99)
Environmental Performance and Social Inclusion in Informal Settlements ($119.99)
Memory as Colonial Capital ($119.99)
Logos of Phenomenology and Phenomenology of The Logos. Book Three ($219.99)
Handbuch der Drahtlosen Telegraphie und Telephonie ($84.99)
Normal Development of Voice ($49.99)
EuroKarst 2022, Málaga ($189.00)
Data Mining and Machine Learning in High-Performance Sport ($54.99)
Supercollider 2 ($39.99)
Heidelberger Jahrbücher ($69.99)
Math Physics Foundation of Advanced Remote Sensing Digital Image Processing ($89.00)
Durability of Disease Resistance ($129.00)
Decision and Control in Hybrid Wind Farms ($84.99)
Ländliche Armut im Umbruch ($59.99)
Design your mind – Denkfallen entlarven und überwinden ($29.99)
Management of Telecommunication Systems and Services ($54.99)
Computational Science – ICCS 2023 ($84.99)
Gesundheitsverhalten ($59.99)
The Growing Spine ($189.00)
Burn Care and Treatment ($89.00)
Creative Conservation ($329.99)
Praxisbuch Adipositas in der Geburtshilfe ($54.99)
Innovationen bei Rechen- und Kommunikationssystemen ($69.99)
Fortschritte der Chemie organischer Naturstoffe / Progress in the Chemistry of Organic Natural Produ ($84.99)
Measuring and Understanding Complex Phenomena ($84.99)
Allegory in Enlightenment Britain ($49.99)
Immunobiology of Bacterial CpG-DNA ($84.99)
Geschäftsmodelle erfolgreich entwickeln und implementieren ($39.99)
Adhesion 14 ($84.99)