Article | REF: H3701 V1

Detection and correction of data quality problems with machine learning

Author: Laure BERTI-ÉQUILLE

Publication date: May 10, 2023

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


Français

1. The impact of data quality in machine learning

The main types of error to be considered when assessing the quality of a dataset are: missing values, outliers, inconsistent values (i.e. values that do not satisfy a set of predefined constraints), and finally, duplicates, as illustrated in the table 1 .

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors
+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year
From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Software technologies and System architectures

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
The impact of data quality in machine learning