Overview
ABSTRACT
Artificial intelligence (AI) has grown rapidly in the digital media field since the mid-2010s. This article first reviews the principles, components and techniques of artificial intelligence, in particular machine learning and deep learning with neural networks. It then presents a sample of AI applications developed in the field of images: photography, old films and video. In the field of sound, the article presents some examples related to automatic speech processing, 3D audio and music.
AUTHOR
Jean-Noël GOUYET: Training engineer in digital media techniques and management - Former Research Manager at INA (Institut National de l'Audiovisuel)
INTRODUCTION
Back in the 1950s, the pioneers of artificial intelligence (AI) assumed that learning and intelligence could be simulated by a machine. Since the 2000s in particular, the many research projects and application developments testify both to the growth of this IT sector and to the major human and financial investments made by the world's leading players in projects and products incorporating AI: in the United States, Google, Apple, Facebook, Amazon and Microsoft; in China, Baidu, Alibaba, Tencent...
Another characteristic of AI is the wide range of knowledge and technologies involved: cognitive sciences, learning modes and machine learning, automatic speech processing, signal and image analysis and processing, computer vision, robotics...
The aim of this series of two articles is to provide an overview of the quantity and diversity of AI applications in digital media, which have been multiplying since the mid-2010s.
This first article is divided into three parts:
a review of the principles, components and techniques of AI, as well as its uses;
a sample of AI applications in the field of images (photo, film, video);
a sample of AI applications in the field of sound (speech, 3D audio, music).
The second article:
presents these and other AI applications in the broadcast and media industry;
focuses on two case studies: journalism and AI, and deepfakes.
Specific products or services mentioned in this article are for illustrative purposes only and do not represent a promotion, recommendation or endorsement by the author of this document. All articles or specialized sites presenting and evaluating them (referenced in the appendix) are the sole responsibility of their respective authors.
Numerous references detailing AI techniques and models used in applications are provided in the "Further reading" appendix, for the interested reader to consult. These are generally...
KEYWORDS
film | artificial intelligence | photo | video | 3D audio | AI | speech processing | music
This article is included in
Signal processing and its applications
Digital media and Artificial Intelligence (AI): Image and sound applications
Bibliography
- (1) - KAVLAKOGLU (E.) - AI vs. Machine Learning vs. Deep Learning vs. Neural Networks: What's the difference? https://www.ibm.com/cloud/blog/ai-vs-machine-learning-vs-deep-learning-vs-neural-networks...
Standards and norms
- Moving Picture, Audio, and Data Coding by Artificial Intelligence. https://mpai.community/standards/. - MPAI - 2020
- JPEG AI Learning-based Image Coding System. https://www.iso.org/standard/81984.html. - ISO/IEC AWI 6048 - 2020
- Information technology – Multimedia content description interface – Part 17: Compression of neural networks for multimedia content description and analysis. https://www.iso.org/standard/78480.html https://www.mpegstandards.org/standards/MPEG-7/17/....
Websites and software for learning AI
Source: A specialized site, run by a young AI engineer in Canada, offers a detailed overview of resources for learning AI or improving one's skills in it: books, YouTube videos, free or paid online courses...
https://www.louisbouchard.ai/learnai/ (in English)
AI & media software
Source: Patrick ARNECKE – Dive into AI and machine learning. EBU tech-i 35, March 2018
https://tech.ebu.ch/publications/tech-i-035
Table B (non-exhaustive list)