Article | REF: H3728 V3

Automatic speech recognition

Author: Jean-Paul HATON

Publication date: October 10, 2018

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


Overview

Français

ABSTRACT

Great progress has recently been made in speech recognition performance (close to that of humans), but the level of understanding of present systems remains very low. Such systems are based on statistical modeling of speech: Hidden Markov Models (HMM) for acoustics, and n-gram models storing the conditional probabilities of sequences of linguistic units. Recent progress has been achieved by coupling classical HMMs with deep neural networks that are made up of a large number of hidden layers and trained by deep learning algorithms using very large amounts of training data. Applications concern mainly text dictation, transcription of media (radio, television) and especially vocal telematics.

Read this article from a comprehensive knowledge base, updated and supplemented with articles reviewed by scientific committees.

Read the article

AUTHOR

  • Jean-Paul HATON: Professor at the University of Lorraine, LORIA/INRIA – Member of the Institut Universitaire de France

 INTRODUCTION

The use of speech as a means of communication between man and machine has been widely studied in recent decades. In this article, we focus on automatic speech recognition (ASR), i.e. all the techniques used to communicate verbally with a machine. ALR is of undeniable practical interest, under certain conditions of use (remote access, heavy workload, disabled people, etc.). Commercial products have been available for over thirty years, initially mainly for the recognition of isolated and concatenated words, and now for continuously spoken sentences. Most are based on dynamic programming algorithms and stochastic models (Markov sources). However, there are still problems to be solved in order to increase the robustness of these systems and extend their dialog capabilities. Current research focuses on the recognition of noisy speech, the processing of incomplete or incorrect utterances, the definition of dialog procedures, etc.

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors
+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year
From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Digital documents and content management

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
Automatic speech recognition