Breaking News!
60% Off the Hottest Halloween Costumes & Accessories

An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at @ Link.springer.com when you apply the 10% OFF coupon at checkout.
Click “Get Coupon & Buy” to copy the code and unlock the deal.

Set a price drop alert to never miss an offer.

2 Offers Price Range: $19.99 - $30.00
BEST PRICE

Single Product Purchase

$18.00
@ Link.springer.com with extra coupon

Price Comparison

Seller Contact Seller List Price On Sale Shipping Best Promo Final Price Volume Discount Financing Availability Seller's Page
BEST PRICE
1 Product Purchase
@ Link.springer.com
$19.99 $19.99

10% OFF
This deals requires coupon
$18.00
See Site In stock Visit Store

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9783031007071
Condition
New
Product Description

With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture examines closely the two main components to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to perform on very large volumes of data in search for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection. Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.

Available Colors
Available Sizes

Reviews

0
0 reviews
5 stars
4 stars
3 stars
2 stars
1 star

Questions & Answers

Similar Products

Aktuelle Forschungsfragen im Dienstleistungsmarketing

Aktuelle Forschungsfragen im Dienstleistungsmarketing

$69.99
Androgenetic Alopecia

Androgenetic Alopecia

$39.99
Plant Conservation and Biodiversity

Plant Conservation and Biodiversity

$169.00
Artificial Intelligence Techniques for a Scalable Energy Transition

Artificial Intelligence Techniques for a Scalable Energy Transition

$99.00
Computer Science  CACIC 2017

Computer Science CACIC 2017

$54.99
Hyper-V for VMware Administrators

Hyper-V for VMware Administrators

$49.99
Islamic Sufism Unbound

Islamic Sufism Unbound

$54.99
Wohnumfeldverbesserungen fr Menschen mit Demenz

Wohnumfeldverbesserungen fr Menschen mit Demenz

$74.99
Computational Analysis of Terrorist Groups: Lashkar-e-Taiba

Computational Analysis of Terrorist Groups: Lashkar-e-Taiba

$129.99
Mglichkeit und Grenzen des Effizienzvergleichs von Wirtschaftssystemen

Mglichkeit und Grenzen des Effizienzvergleichs von Wirtschaftssystemen

$59.99
Expanding Environmental Awareness in Education Through the Arts

Expanding Environmental Awareness in Education Through the Arts

$159.99
Fielding, Dickens, Gosse, Iris Murdoch and Oedipal Hamlet

Fielding, Dickens, Gosse, Iris Murdoch and Oedipal Hamlet

$29.99
Self-Organizing Systems

Self-Organizing Systems

$54.99
Implementation of the Small-Scale Fisheries Guidelines

Implementation of the Small-Scale Fisheries Guidelines

$159.99
Flchtlingsschutz als globale und lokale Herausforderung

Flchtlingsschutz als globale und lokale Herausforderung

$49.99
Philosophy and Oscar Wilde

Philosophy and Oscar Wilde

$129.99
Pharmazeutische Analytik

Pharmazeutische Analytik

$69.99
Ecological Time Series

Ecological Time Series

$109.99
Aerospace Robotics II

Aerospace Robotics II

$84.99
Chemical Instabilities

Chemical Instabilities

$219.99
Advances in Databases and Information Systems

Advances in Databases and Information Systems

$54.99
Hydroxylapatitkeramik als Knochenersatzstoff

Hydroxylapatitkeramik als Knochenersatzstoff

$59.99
Memory Matters in Transitional Peru

Memory Matters in Transitional Peru

$39.99
The Crustacean Nervous System

The Crustacean Nervous System

$329.99
Selected Areas in Cryptography

Selected Areas in Cryptography

$39.99
Practical Use of Mathcad

Practical Use of Mathcad

$74.99
Notch Effects in Fatigue and Fracture

Notch Effects in Fatigue and Fracture

$169.99
Platelets and Megakaryocytes

Platelets and Megakaryocytes

$169.00
Introduction to Multiple Time Series Analysis

Introduction to Multiple Time Series Analysis

$54.99
Risk Assessment Methods

Risk Assessment Methods

$169.99
Towards Industry 5.0

Towards Industry 5.0

$299.00
Grasping in Robotics

Grasping in Robotics

$129.00
Earthquake Geology and Tectonophysics around Eastern Tibet and Taiwan

Earthquake Geology and Tectonophysics around Eastern Tibet and Taiwan

$109.00
Evolution of Matter and Energy on a Cosmic and Planetary Scale

Evolution of Matter and Energy on a Cosmic and Planetary Scale

$109.99
Abwanderung, Geburtenrckgang und regionale Entwicklung

Abwanderung, Geburtenrckgang und regionale Entwicklung

$74.99
kologische Belastungsgrenzen - Critical Loads & Levels

kologische Belastungsgrenzen - Critical Loads & Levels

$119.99
Soft Computing in Data Science

Soft Computing in Data Science

$54.99
Space-Time Computing with Temporal Neural Networks

Space-Time Computing with Temporal Neural Networks

$54.99
On the Brink

On the Brink

$69.99
Bezugsgruppenwechsel und Bildungsaufstieg

Bezugsgruppenwechsel und Bildungsaufstieg

$59.99
previous
next