An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at Link.springer.com by applying the 10% OFF coupon at checkout.

2 Offers Price Range: $19.99 - $30.00

Price Comparison

Seller: Link.springer.com
Offer: 1 Product Purchase (BEST PRICE)
List Price: $19.99
On Sale: $19.99
Best Promo: 10% OFF (this deal requires a coupon)
Final Price: $18.00
Availability: In stock

Product Details

Brand: Springer Nature
Manufacturer: N/A
Part Number: 0
GTIN: 9783031007071
Condition: New
Product Description

With the ever-increasing volume of data, data quality problems abound. Duplicates, that is, multiple yet different representations of the same real-world object in the data, are among the most intriguing of these problems. Their effects are detrimental: for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, and catalogs are mailed multiple times to the same household.

Automatically detecting duplicates is difficult. First, duplicate representations are usually not identical but differ slightly in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture closely examines the two main components used to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to search very large volumes of data for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection.

Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.
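As a rough illustration of the two components named in the description, the Python sketch below pairs a similarity measure with a simple way to avoid comparing every pair of records. It is not taken from the book. The similarity function (the standard library's `difflib.SequenceMatcher`), the blocking key (first letter of the name), the 0.85 threshold, and the sample customer names are all assumptions chosen for the example.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def similarity(a: str, b: str) -> float:
    """Return a string similarity in [0, 1]; 1.0 means identical (ignoring case)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_duplicates(records, threshold=0.85):
    """Return sorted index pairs of records that look like duplicates.

    Blocking: only records that share a key are compared. The key used here,
    the first letter of the name, is a hypothetical choice for illustration.
    """
    blocks = defaultdict(list)
    for idx, name in enumerate(records):
        key = name.strip().lower()[:1]
        blocks[key].append(idx)

    pairs = []
    for indices in blocks.values():
        # Compare pairs within a block only, not across the whole data set.
        for i, j in combinations(indices, 2):
            if similarity(records[i], records[j]) >= threshold:
                pairs.append((i, j))
    return sorted(pairs)


# Made-up sample data: the first two entries represent the same customer.
customers = ["Jon Smith", "John Smith", "Jane Doe", "Mary Jones"]
duplicates = find_duplicates(customers)
print(duplicates)
```

The trade-off is visible in the choice of blocking key: a coarse key keeps more true duplicates in the same block but leaves more pairs to compare, while a narrow key is faster but can miss duplicates. Here, for example, two spellings that differ in their first letter would never be compared.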
