Article | REF: H3701 V1

Detection and correction of data quality problems with machine learning

Author: Laure BERTI-ÉQUILLE

Publication date: May 10, 2023

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


Français

2. Detection and correction by machine learning

Recent work has shown that machine learning models can be used to accurately identify problems in data and correct certain types of error with complex (semi-)automatic correction mechanisms that were previously performed manually or based on hard-to-maintain heuristics. New learning strategies can also determine which corrections are necessary depending on the analysis objective, as it is not necessarily necessary (or even feasible) to correct all problems.

Learning-based approaches can be used to detect or correct erroneous data. They rely on examples of erroneous and correct records to train the model. But designing models that are sufficiently expressive and therefore complex requires the use of a large number of examples. Depending on the task (detection of outliers, duplicates, inconsistencies, etc.), the creation of these sets of examples can prove very difficult,...

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors
+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year
From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Software technologies and System architectures

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
Detection and correction by machine learning