7. Deep neural networks
From around 2006, acoustic models for speech recognition improved significantly thanks to deep neural networks (DNNs), which have far more hidden layers than traditional multi-layer perceptrons (several dozen or more, with thousands of nodes in the hidden layers). These networks, inspired by the functioning of the animal cortex, can learn much more complex functions than the shallower networks that preceded them. One possible learning algorithm for these deep networks, proposed by G. Hinton, is semi-supervised: the weights of each layer's connections are first initialized in an unsupervised way, and the whole network is then adapted in a supervised way. DNNs have proven their worth in a wide variety of fields, including speech recognition, text processing, computer vision and diagnostics. We should also mention the precursors of these deep models designed for image processing and handwriting...
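The two-stage principle described above (unsupervised initialization of each layer, then supervised adaptation of the whole network) can be sketched in a few lines of NumPy. This is only a minimal illustration under stated assumptions, not the procedure of the original work: Hinton's deep belief networks use restricted Boltzmann machines, which are replaced here by simple tied-weight autoencoders. The function names (`pretrain_layer`, `pretrain_stack`, `fine_tune`), the layer sizes, the learning rates and the random toy data are all hypothetical choices made for the example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def pretrain_layer(X, n_hidden, rng, epochs=50, lr=0.5):
    """Unsupervised step: fit one tied-weight autoencoder, return its encoder."""
    W = rng.normal(0.0, 0.1, size=(X.shape[1], n_hidden))
    b = np.zeros(n_hidden)       # encoder bias
    c = np.zeros(X.shape[1])     # decoder bias (discarded after pretraining)
    for _ in range(epochs):
        H = sigmoid(X @ W + b)               # encode
        R = sigmoid(H @ W.T + c)             # decode with the same weights
        dR = (R - X) * R * (1 - R)           # squared-error gradient at decoder input
        dH = (dR @ W) * H * (1 - H)          # gradient at encoder input
        W -= lr * (X.T @ dH + dR.T @ H) / len(X)
        b -= lr * dH.mean(axis=0)
        c -= lr * dR.mean(axis=0)
    return W, b

def pretrain_stack(X, layer_sizes, rng):
    """Initialize each hidden layer in turn on the output of the previous one."""
    layers, H = [], X
    for n_hidden in layer_sizes:
        W, b = pretrain_layer(H, n_hidden, rng)
        layers.append([W, b])
        H = sigmoid(H @ W + b)
    return layers

def forward(X, layers):
    """Return the activations of every layer, input included."""
    acts = [X]
    for W, b in layers:
        acts.append(sigmoid(acts[-1] @ W + b))
    return acts

def fine_tune(X, y, layers, rng, epochs=200, lr=0.5):
    """Supervised step: add an output unit and backpropagate through all layers."""
    layers = [[W.copy(), b.copy()] for W, b in layers]
    n_top = layers[-1][0].shape[1]
    layers.append([rng.normal(0.0, 0.1, size=(n_top, 1)), np.zeros(1)])
    losses = []
    for _ in range(epochs):
        acts = forward(X, layers)
        p = acts[-1][:, 0]
        losses.append(-np.mean(y * np.log(p + 1e-12)
                               + (1 - y) * np.log(1 - p + 1e-12)))
        delta = (p - y)[:, None] / len(X)    # cross-entropy gradient at output input
        for i in reversed(range(len(layers))):
            W, b = layers[i]
            grad_W = acts[i].T @ delta
            grad_b = delta.sum(axis=0)
            if i > 0:                        # propagate before updating W
                delta = (delta @ W.T) * acts[i] * (1 - acts[i])
            W -= lr * grad_W
            b -= lr * grad_b
    return layers, losses

# Toy data (hypothetical): 200 random 8-dimensional vectors with a binary label.
rng = np.random.default_rng(0)
X = rng.random((200, 8))
y = (X[:, :4].sum(axis=1) > X[:, 4:].sum(axis=1)).astype(float)

pretrained = pretrain_stack(X, [6, 4], rng)        # stage 1: no labels used
model, losses = fine_tune(X, y, pretrained, rng)   # stage 2: labels used
```

The labels `y` only enter in `fine_tune`; the stacked layers are initialized from the inputs alone, which is what makes the overall scheme semi-supervised. A real acoustic model would use far larger layers, mini-batches and a softmax output over phonetic classes.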
This article is included in
Digital documents and content management
Bibliography
Software tools
HTK (HMM ToolKit): open-source software for developing complete speech recognition applications based on hidden Markov models (HMMs) http://www.htk.eng.cam.ac.uk/
VISPER (Visual speech processing system): free software for visualizing the stages of dynamic programming and HMM-based recognition, developed by the Technical University of Liberec, Czech...
Directory
Manufacturers – Suppliers – Distributors (non-exhaustive list)
Companies specializing in automatic speech processing:
Vecsys http://www.vecsys.fr/presentation/index.htm
Loquendo http://www.loquendo.com/fr/
...
Documentation
Speech Communication journal (4 issues/year)
IEEE Transactions on Pattern Analysis and Machine Intelligence (6 issues/year)
International Journal of Pattern Recognition and Artificial Intelligence (4 issues/year)
Journal of the Acoustical Society of America (12 issues/year)
Traitement du Signal journal (4 issues/year)...