An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at Link.springer.com when you apply the 10% OFF coupon at checkout.

2 Offers Price Range: $19.99 - $30.00

Price Comparison

Seller: Link.springer.com (1 product purchase, best price)
List Price: $19.99
On Sale: $19.99
Best Promo: 10% OFF (this deal requires a coupon)
Final Price: $18.00
Availability: In stock

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9783031007071
Condition
New
Product Description

With the ever-increasing volume of data, data quality problems abound. Duplicates, that is, multiple yet different representations of the same real-world object in data, are among the most intriguing of these problems. Their effects are detrimental: for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, and catalogs are mailed multiple times to the same household.

Automatically detecting duplicates is difficult. First, duplicate representations are usually not identical but differ slightly in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture closely examines the two main components used to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to search very large volumes of data for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection.

Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.
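
To make the two components in the description concrete, here is a minimal Python sketch. It is not taken from the book: the trigram-based Jaccard similarity, the 0.4 threshold, the first-letter blocking key, and the function names `trigrams`, `jaccard`, and `find_duplicates` are all assumptions chosen for illustration. It shows one possible similarity measure for near-identical values, and one simple way to avoid comparing every pair of records.

```python
from collections import defaultdict
from itertools import combinations


def trigrams(text):
    """Return the set of character 3-grams of a whitespace-normalized, lowercased string."""
    s = " ".join(text.lower().split())
    # Strings shorter than 3 characters fall back to a single token.
    return {s[i:i + 3] for i in range(len(s) - 2)} or {s}


def jaccard(a, b):
    """Jaccard similarity of the trigram sets of two strings (illustrative choice)."""
    ta, tb = trigrams(a), trigrams(b)
    return len(ta & tb) / len(ta | tb)


def find_duplicates(records, threshold=0.4, block_key=None):
    """Return (candidate duplicate pairs, number of comparisons made).

    With block_key=None every pair is compared (quadratic in the number of
    records). With a block_key function, only records sharing the same key
    are compared; this is a hypothetical, very simple blocking scheme.
    """
    blocks = defaultdict(list)
    for record in records:
        key = block_key(record) if block_key else None
        blocks[key].append(record)

    pairs, comparisons = [], 0
    for block in blocks.values():
        for a, b in combinations(block, 2):
            comparisons += 1
            if jaccard(a, b) >= threshold:
                pairs.append((a, b))
    return pairs, comparisons


records = ["john smith", "jon smith", "mary jones", "peter pan"]
all_pairs, n_all = find_duplicates(records)
blocked_pairs, n_blocked = find_duplicates(records, block_key=lambda r: r[0])
print(all_pairs, n_all)
print(blocked_pairs, n_blocked)
```

In this toy input both runs flag the same pair, but the blocked run makes fewer comparisons. The trade-off of any such shortcut is that true duplicates landing in different blocks (for example, after a typo in the first letter) are never compared.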

Similar Products

Headache in Children and Adolescents
$109.99
Antiviral Agents
$54.99
Elektrische Energieversorgung
$69.99
The Hagendorf-Pleystein Province: the Center of Pegmatites in an Ensialic Orogen
$84.99
Coming Up Cuban (Hardcover)
$14.24
Psychosoziale Medizin Gesundheit und Krankheit in bio-psycho-sozialer Sicht
$69.95
Oneida Roulette
$64.99
Salt-affected Soils and Marginal Waters
$179.99
Structural Analysis of Historical Constructions
$379.99
Data Science and Artificial Intelligence
$119.99
Education, Culture and the Singapore Developmental State
$39.99
return Jahrgang 2021
$99.99
Universitäten im Wettbewerb
$59.99
Über spezielle Probleme der Zerkleinerungstechnik von Weichstoffen
$59.99
Latent Variable Analysis and Signal Separation
$39.99
Applications of Microscopy in Materials and Life Sciences
$169.99
Sustainable Finance for SMEs
$109.99
Struggling for Leadership: Antwerp-Rotterdam Port Competition between 1870-2000
$54.99
Al-Kashi's Miftah al-Hisab
$219.99
Chromosome atlas: Fish, Amphibians, Reptiles and Birds
$59.99
Advances in Design Methods from Modeling Languages for Embedded Systems and SoCs
$129.00
Chest Wall Deformities and Corrective Procedures
$99.99
Potential and Challenges of Low Carbon Fuels for Sustainable Transport
$179.99
Öffentlichkeit, Partizipation, Empowerment
$44.99
Facetten der Mathematikdidaktik
$49.99
Evaluation of Science and Technology Education at the Dawn of a New Millennium
$39.99
Die Chromosomenstörungen
$69.99
Global Production and Trade in East Asia
$129.00
Current Problems and Ways of Industry Development: Equipment and Technologies
$169.99
Unschooling
$89.00
Energiemanagement bei Öffentlich-Privaten Partnerschaften
$84.99
Liquids and Solids
$54.99
2. Deutsch-Österreichisch-Schweizerische Unfalltagung in Berlin
$69.99
From Signals to Image
$79.99
Blockchain Technology for Smart Cities
$139.99
The Romance of Gambling in the Eighteenth-Century British Novel
$54.99
Bericht über die Untersuchungen des Berliner Leitungswassers in der Zeit vom 1. Juni 1885 bis 1. Apr
$54.99
Theory, Methodology, Tools and Applications for Modeling and Simulation of Complex Systems
$109.99
Urban Informatics Using Mobile Network Data
$199.99
Governing Europe under a Constitution
$109.99