Article | REF: TE5897 V1

Digital Media and Artificial Intelligence (AI): Applications for sounds and pictures

Author: Jean-Noël GOUYET

Publication date: August 10, 2022

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


Overview

Français

ABSTRACT

Artificial intelligence (AI) has experienced accelerated growth in the digital media field since the 2015s. This article first provides a reminder of the principles, components and techniques of artificial intelligence, in particular machine learning, and deep learning with neural networks. It then proposes a sample of AI applications developed in the field of images in photography, for old films, and in video. In the field of sounds, the article presents some examples related to the automatic processing of speech, 3D audio and music.

Read this article from a comprehensive knowledge base, updated and supplemented with articles reviewed by scientific committees.

Read the article

AUTHOR

  • Jean-Noël GOUYET: Training engineer in digital media techniques and management - Former Research Manager at INA (Institut National de l'Audiovisuel)

 INTRODUCTION

Back in the 1950s, the pioneers of artificial intelligence (AI) assumed that learning and artificial intelligence (AI) could be simulated by a machine. Particularly since the 2000s, the numerous projects, research and application developments testify, on the one hand, to the growth of this IT sector, and, on the other, to the major human and financial investments made by the world's leading players in the development of projects and products incorporating AI. In the United States: Google, Apple, Facebook, Amazon, Microsoft – and in China: Baidu, Alibaba, Tencent...

Another characteristic of AI is the wide range of knowledge and technologies involved: cognitive sciences, learning modes and machine learning, automatic speech processing, signal and image analysis and processing, computer vision, robotics...

The aim of this series of two articles is to provide an overview of the quantity and diversity of AI applications in digital media, which have been multiplying since the mid-2010s.

This first article is divided into three parts:

  • a review of the principles, components and techniques of AI, as well as its uses;

  • a sample of AI applications in the field of images (photo, film, video);

  • a sample of AI applications in the field of sound (speech, 3D audio, music).

The second article [TE 5 898] :

  • presents these and other AI applications in the broadcast and media industry;

  • focuses on two case studies:

    • journalism and AI,

    • deepfakes.

       

Specific products or services mentioned in this article are for illustrative purposes only and do not represent a promotion, recommendation or endorsement by the author of this document. All articles or specialized sites presenting and evaluating them (referenced in the appendix) are the sole responsibility of their respective authors.

Numerous references detailing AI techniques and models used in applications are provided in the "Further reading" appendix, for the interested reader to consult. These are generally...

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors
+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year
From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Signal processing and its applications

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
Digital media and Artificial Intelligence (AI): Image and sound applications