Breaking News!
60% Off the Hottest Halloween Costumes & Accessories

An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at @ Link.springer.com when you apply the 10% OFF coupon at checkout.
Click “Get Coupon & Buy” to copy the code and unlock the deal.

Set a price drop alert to never miss an offer.

2 Offers Price Range: $19.99 - $30.00
BEST PRICE

Single Product Purchase

$18.00
@ Link.springer.com with extra coupon

Price Comparison

Seller Contact Seller List Price On Sale Shipping Best Promo Final Price Volume Discount Financing Availability Seller's Page
BEST PRICE
1 Product Purchase
@ Link.springer.com
$19.99 $19.99

10% OFF
This deals requires coupon
$18.00
See Site In stock Visit Store

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9783031007071
Condition
New
Product Description

With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture examines closely the two main components to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to perform on very large volumes of data in search for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection. Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.

Available Colors
Available Sizes

Reviews

0
0 reviews
5 stars
4 stars
3 stars
2 stars
1 star

Questions & Answers

Similar Products

Handbuch Geschichte der deutschsprachigen Soziologie

Handbuch Geschichte der deutschsprachigen Soziologie

$49.99
Electronic Properties of High-Tc Superconductors and Related Compounds

Electronic Properties of High-Tc Superconductors and Related Compounds

$109.99
Die Autonomie von Landesorganisationen bei der Marktbearbeitung

Die Autonomie von Landesorganisationen bei der Marktbearbeitung

$74.99
Flows of Non-Smooth Vector Fields and Degenerate Elliptic Equations

Flows of Non-Smooth Vector Fields and Degenerate Elliptic Equations

$19.99
High-Performance Big-Data Analytics

High-Performance Big-Data Analytics

$109.99
Economics of International Migration

Economics of International Migration

$219.99
Interpretationen der Modallogik

Interpretationen der Modallogik

$89.00
Therapy of Malignant Brain Tumors

Therapy of Malignant Brain Tumors

$39.99
Hello Reader! Level 2: Harriet Tubman

Hello Reader! Level 2: Harriet Tubman

$4.95
Gemischbildungs-, Selbstzndungs- und Verbrennungsvorgnge im Hinblick auf die Vorgnge bei Gasturbi

Gemischbildungs-, Selbstzndungs- und Verbrennungsvorgnge im Hinblick auf die Vorgnge bei Gasturbi

$59.99
Organic Chemistry of the Earths Atmosphere

Organic Chemistry of the Earths Atmosphere

$54.99
Refinements of the Nash Equilibrium Concept

Refinements of the Nash Equilibrium Concept

$54.99
Konsens als normatives Prinzip der Demokratie

Konsens als normatives Prinzip der Demokratie

$49.99
Asian Organized Crime and the Anglosphere

Asian Organized Crime and the Anglosphere

$119.99
Analyse digitaler Signale

Analyse digitaler Signale

$69.99
Two Centuries of Local Autonomy

Two Centuries of Local Autonomy

$109.99
Debating Women, Politics, and Power in Early Modern Europe

Debating Women, Politics, and Power in Early Modern Europe

$54.99
The Agro-Food Chains and Networks for Development

The Agro-Food Chains and Networks for Development

$219.99
Qualitt huslicher Lernumwelten im Vorschulalter

Qualitt huslicher Lernumwelten im Vorschulalter

$44.99
The Economics of Audit Quality

The Economics of Audit Quality

$109.99
Industrielle Automatisierungs- und Informationstechnik

Industrielle Automatisierungs- und Informationstechnik

$49.99
Islamic Sustainable Finance, Law and Innovation

Islamic Sustainable Finance, Law and Innovation

$169.00
Sustainable Development and Social ResponsibilityVolume 1

Sustainable Development and Social ResponsibilityVolume 1

$249.99
Phenomenology: East and West

Phenomenology: East and West

$219.99
Nachhaltige Betriebliche Umweltinformationssysteme

Nachhaltige Betriebliche Umweltinformationssysteme

$49.99
Scientific Computing in Electrical Engineering

Scientific Computing in Electrical Engineering

$109.99
The Politics of Immigration in Multi-Level States

The Politics of Immigration in Multi-Level States

$29.99
Regional Development Reconsidered

Regional Development Reconsidered

$109.99
Human Body Odor

Human Body Odor

$39.99
Bioimaging in Neurodegeneration

Bioimaging in Neurodegeneration

$219.99
Young Migrants

Young Migrants

$54.99
Working Below Capacity

Working Below Capacity

$69.99
Inquiry into the Singapore Science Classroom

Inquiry into the Singapore Science Classroom

$84.99
IUTAM Symposium on Modelling Nanomaterials and Nanosystems

IUTAM Symposium on Modelling Nanomaterials and Nanosystems

$129.00
Entwicklung eines universell gltigen Regressionsmodells zur Ermittlung von Planzeitwerten fr vorwi

Entwicklung eines universell gltigen Regressionsmodells zur Ermittlung von Planzeitwerten fr vorwi

$44.99
Transport in Plants III

Transport in Plants III

$84.99
Re-Constructing the Man of Steel

Re-Constructing the Man of Steel

$32.99
Infinite Dimensional Analysis

Infinite Dimensional Analysis

$74.99
Immunoinformatics

Immunoinformatics

$129.00
First Steps in Mathematica

First Steps in Mathematica

$74.99
previous
next