A Hybrid Deep Animation Codec for Low-Bitrate Video Conferencing - Laboratoire Traitement et Communication de l'Information Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

A Hybrid Deep Animation Codec for Low-Bitrate Video Conferencing

Résumé

Deep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While these schemes bring significant coding gains over conventional video codecs at low bitrates, their performance saturates quickly when the available bandwidth increases. In this paper, we propose a layered, hybrid coding scheme to overcome this limitation. Specifically, we extend a codec based on facial animation by adding an auxiliary stream consisting of a very low bitrate version of the video, obtained through a conventional video codec (e.g., HEVC). The animated and auxiliary videos are combined through a novel fusion module. Our results show consistent average BD-Rate gains in excess of-30% on a large dataset of video conferencing sequences, extending the operational range of bitrates of a facial animation codec alone.
Fichier principal
Vignette du fichier
ICIP_deep_animation_coding.pdf (1.21 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03720713 , version 1 (27-07-2022)
hal-03720713 , version 2 (29-12-2022)

Identifiants

  • HAL Id : hal-03720713 , version 1

Citer

Goluck Konuko, Stéphane Lathuilière, Giuseppe Valenzise. A Hybrid Deep Animation Codec for Low-Bitrate Video Conferencing. IEEE International Conference on Image Processing, Oct 2022, Bordeaux, France. ⟨hal-03720713v1⟩

Collections

GS-ENGINEERING
408 Consultations
139 Téléchargements

Partager

Gmail Facebook X LinkedIn More