Breaking News!
60% Off the Hottest Halloween Costumes & Accessories

An Introduction to Duplicate Detection

Best Price (Coupon Required):
Buy An Introduction to Duplicate Detection for $18.00 at @ Link.springer.com when you apply the 10% OFF coupon at checkout.
Click “Get Coupon & Buy” to copy the code and unlock the deal.

Set a price drop alert to never miss an offer.

2 Offers Price Range: $19.99 - $30.00
BEST PRICE

Single Product Purchase

$18.00
@ Link.springer.com with extra coupon

Price Comparison

Seller Contact Seller List Price On Sale Shipping Best Promo Final Price Volume Discount Financing Availability Seller's Page
BEST PRICE
1 Product Purchase
@ Link.springer.com
$19.99 $19.99

10% OFF
This deals requires coupon
$18.00
See Site In stock Visit Store

Product Details

Brand
Springer Nature
Manufacturer
N/A
Part Number
0
GTIN
9783031007071
Condition
New
Product Description

With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate representations are usually not identical but slightly differ in their values. Second, in principle all pairs of records should be compared, which is infeasible for large volumes of data. This lecture examines closely the two main components to overcome these difficulties: (i) Similarity measures are used to automatically identify duplicates when comparing two records. Well-chosen similarity measures improve the effectiveness of duplicate detection. (ii) Algorithms are developed to perform on very large volumes of data in search for duplicates. Well-designed algorithms improve the efficiency of duplicate detection. Finally, we discuss methods to evaluate the success of duplicate detection. Table of Contents: Data Cleansing: Introduction and Motivation / Problem Definition / Similarity Functions / Duplicate Detection Algorithms / Evaluating Detection Success / Conclusion and Outlook / Bibliography.

Available Colors
Available Sizes

Reviews

0
0 reviews
5 stars
4 stars
3 stars
2 stars
1 star

Questions & Answers

Similar Products

Investigation of Membrane-Located Receptors

Investigation of Membrane-Located Receptors

$84.99
Subject/Verb Agreement Grade 5 Differentiation Pack

Subject/Verb Agreement Grade 5 Differentiation Pack

$5.99
Unternehmensstrategien erfolgreich umsetzen durch Commitment Management

Unternehmensstrategien erfolgreich umsetzen durch Commitment Management

$59.99
Organisation der Innovation im Konzern

Organisation der Innovation im Konzern

$39.99
Value at Risk fr Kreditinstitute

Value at Risk fr Kreditinstitute

$74.99
Parliamentary Democracy

Parliamentary Democracy

$139.99
Intelligent Systems in Production Engineering and Maintenance  ISPEM 2017

Intelligent Systems in Production Engineering and Maintenance ISPEM 2017

$129.00
Domestication in Action

Domestication in Action

$119.99
Bioinformatics

Bioinformatics

$34.99
Progress in Intercalation Research

Progress in Intercalation Research

$169.99
Evolutionary Ethnobiology

Evolutionary Ethnobiology

$109.99
Handbuch des Wgens

Handbuch des Wgens

$69.99
Newtons Scientific and Philosophical Legacy

Newtons Scientific and Philosophical Legacy

$189.00
Effective Medical Communication

Effective Medical Communication

$84.99
Remote Sensing of Plant Biodiversity

Remote Sensing of Plant Biodiversity

$59.99
Methods in Pulmonary Research

Methods in Pulmonary Research

$39.99
Internationale Mergers & Acquisitions

Internationale Mergers & Acquisitions

$44.99
Wertschpfung im digitalisierten Buchmarkt

Wertschpfung im digitalisierten Buchmarkt

$59.99
Economic Innovations in Public Utility Regulation

Economic Innovations in Public Utility Regulation

$169.99
Postcolonial Writers in the Global Literary Marketplace

Postcolonial Writers in the Global Literary Marketplace

$39.99
Emerging Nanotechnologies

Emerging Nanotechnologies

$169.99
Global Dynamics of the Earth

Global Dynamics of the Earth

$109.99
Stahlkunde fr Ingenieure

Stahlkunde fr Ingenieure

$54.99
Subrecursive Programming Systems

Subrecursive Programming Systems

$84.99
Human Perception of Visual Information

Human Perception of Visual Information

$179.99
Time-Resolved Vibrational Spectroscopy VI

Time-Resolved Vibrational Spectroscopy VI

$109.99
Molecular Physics and Hypersonic Flows

Molecular Physics and Hypersonic Flows

$329.99
Types, Tableaus, and Gdels God

Types, Tableaus, and Gdels God

$84.99
AIDS and South Africa: The Social Expression of a Pandemic

AIDS and South Africa: The Social Expression of a Pandemic

$49.99
Verbreitungsatlas der Farn- und Bltenpflanzen der Schweiz Bd. 1 + 2

Verbreitungsatlas der Farn- und Bltenpflanzen der Schweiz Bd. 1 + 2

$269.00
Reactive Programming with Angular and ngrx

Reactive Programming with Angular and ngrx

$69.99
Genetic Programming Theory and Practice XVIII

Genetic Programming Theory and Practice XVIII

$129.00
Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web

Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web

$39.99
Biocompatible Glasses

Biocompatible Glasses

$169.99
Crazy Plants (A True Book: Incredible Plants!)

Crazy Plants (A True Book: Incredible Plants!)

$5.96
World Ecosystems Grades 3-5

World Ecosystems Grades 3-5

$31.00
Beginning F#

Beginning F#

$34.99
Bayesian Statistics and New Generations

Bayesian Statistics and New Generations

$109.99
Vorlesungen ber Numerische Mathematik

Vorlesungen ber Numerische Mathematik

$59.99
Entwicklungsbedingungen im Kontext der Eltern-Kind-Beziehung

Entwicklungsbedingungen im Kontext der Eltern-Kind-Beziehung

$84.99
previous
next