7. Deep neural networks

Around 2006, acoustic recognition models were significantly improved thanks to Deep Neural Networks (DNNs), with a far greater number of hidden layers than traditional multi-layer perceptrons (several dozen or more, with thousands of nodes in the hidden layers). These networks, inspired by the functioning of the animal cortex, are capable of learning much more complex functions than ever before. A possible learning algorithm for these deep networks, proposed by G. Hinton, is of the semi-supervised type. The principle is to initialize the weights of each layer's connections in an unsupervised way, then to adapt the whole network in a supervised way. DNNs have proven their worth in a wide variety of fields, including speech recognition, text processing, computer vision and diagnostics. We should also mention the precursors of these deep models designed for image processing and handwriting...

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!

The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors

+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year

From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Digital documents and content management

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
Deep neural networks

Previous
page Robust recognition methods

Perspectives and conclusions

Bibliography

(1) - RABINER (L.), HUANG (B.H.) - Fundamentals of speech recognition. – - Prentice-Hall, Englewood Cliffs (1993).
(2) - JUNQUA (J.-C.), HATON (J.-P.) - Robustness in automatic speech recognition. – - Kluwer Academic, Dordrecht...

Software tools

HTK (HMM ToolKit): open-source software for the development of complete speech recognition applications based on MMC http://www.htk.eng.cam.ac.uk/

VISPER (Visual speech processing system): free software for visualizing dynamic programming and MMC recognition stages, developed by the Technical University of Liberec, Czech...