Breaking News!
60% Off the Hottest Halloween Costumes & Accessories

An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at @ Link.springer.com when you apply the 10% OFF coupon at checkout.
Click “Get Coupon & Buy” to copy the code and unlock the deal.

Set a price drop alert to never miss an offer.

2 Offers Price Range: $19.99 - $30.00
BEST PRICE

Single Product Purchase

$18.00
@ Link.springer.com with extra coupon

Price Comparison

Seller Contact Seller List Price On Sale Shipping Best Promo Final Price Volume Discount Financing Availability Seller's Page
BEST PRICE
1 Product Purchase
@ Link.springer.com
$19.99 $19.99

10% OFF
This deals requires coupon
$18.00
See Site In stock Visit Store

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9783031007071
Condition
New
Product Description

With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture examines closely the two main components to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to perform on very large volumes of data in search for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection. Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.

Available Colors
Available Sizes

Reviews

0
0 reviews
5 stars
4 stars
3 stars
2 stars
1 star

Questions & Answers

Similar Products

Das Rechnungswesen bei automatisierter Datenverarbeitung

Das Rechnungswesen bei automatisierter Datenverarbeitung

$59.99
Understanding and Responding to Sibling Sexual Abuse

Understanding and Responding to Sibling Sexual Abuse

$49.99
Redeploying Urban Infrastructure

Redeploying Urban Infrastructure

$79.99
Thinking History Globally

Thinking History Globally

$119.99
Betriebliches Umweltschutzengagement

Betriebliches Umweltschutzengagement

$49.99
Management of Information, Process and Cooperation

Management of Information, Process and Cooperation

$54.99
Drugs, Crime, and Other Deviant Adaptations

Drugs, Crime, and Other Deviant Adaptations

$109.99
The Mass Media

The Mass Media

$54.99
Zuverlssigkeits- und Instandhaltungstheorie

Zuverlssigkeits- und Instandhaltungstheorie

$49.99
International Handbook on the Demography of Sexuality

International Handbook on the Demography of Sexuality

$379.99
Abstract State Machines

Abstract State Machines

$54.99
Third-World Political Organizations

Third-World Political Organizations

$19.99
Higher Education and Hope

Higher Education and Hope

$79.99
Staatliche Wirtschaftsaufsicht in Deutschland

Staatliche Wirtschaftsaufsicht in Deutschland

$59.99
Mental Health Care and National Health Insurance

Mental Health Care and National Health Insurance

$39.99
Computing in Intelligent Transportation Systems

Computing in Intelligent Transportation Systems

$99.99
The Bridge

The Bridge

$9.74
Democracy and the Kingdom of God

Democracy and the Kingdom of God

$129.00
Cancer, Stress, and Death

Cancer, Stress, and Death

$39.99
Optimale Stufenrdergetriebe fr Werkzeugmaschinen

Optimale Stufenrdergetriebe fr Werkzeugmaschinen

$44.99
Development of an Environmental and Economic Assessment Tool (Enveco Tool) for Fire Events

Development of an Environmental and Economic Assessment Tool (Enveco Tool) for Fire Events

$39.99
Simulating Knowledge Dynamics in Innovation Networks

Simulating Knowledge Dynamics in Innovation Networks

$109.99
Grundschulpdagogik zwischen Wissenschaft und Transfer

Grundschulpdagogik zwischen Wissenschaft und Transfer

$59.99
Proceedings of 2021 4th International Conference on Civil Engineering and Architecture

Proceedings of 2021 4th International Conference on Civil Engineering and Architecture

$129.00
Kindernotfall-ABC

Kindernotfall-ABC

$29.99
Network Biology

Network Biology

$299.99
Reconstruction of Urban Forests

Reconstruction of Urban Forests

$139.00
Cultural Practices and Dermatoses

Cultural Practices and Dermatoses

$39.99
Pulmonary Function Measurement in Noninvasive Ventilatory Support

Pulmonary Function Measurement in Noninvasive Ventilatory Support

$159.99
Ethical Exploration in a Multifaith Society

Ethical Exploration in a Multifaith Society

$99.99
The Philosophy of Mathematics and Logic in the 1920s and 1930s in Poland

The Philosophy of Mathematics and Logic in the 1920s and 1930s in Poland

$39.99
Graph-Based Semi-Supervised Learning

Graph-Based Semi-Supervised Learning

$29.99
Advances in Grid and Pervasive Computing

Advances in Grid and Pervasive Computing

$39.99
Evolutionary Developmental Biology

Evolutionary Developmental Biology

$219.99
Neue Herausforderungen im Employer Branding

Neue Herausforderungen im Employer Branding

$29.99
Diagnosis of human peroxisomal disorders

Diagnosis of human peroxisomal disorders

$54.99
Selbstbestimmung, Privatheit und Datenschutz

Selbstbestimmung, Privatheit und Datenschutz

$59.99
Orthopdie fr die Praxis

Orthopdie fr die Praxis

$59.99
The Geometry of Biological Time

The Geometry of Biological Time

$109.99
Food Industry and the Environment

Food Industry and the Environment

$39.99
previous
next