Video Compression with Entropy-Constrained Neural Representations

We propose a novel convolutional architecture for video representation that better represents spatio-temporal information and a training strategy capable of jointly optimizing rate and distortion.

June 4, 2023

IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Authors

Carlos Gomes (ETH Zürich)

Roberto Azevedo (DisneyResearch|Studios)

Christopher Schroers (DisneyResearch|Studios)

Video Compression with Entropy-Constrained Neural Representations

Download Publication PDF

Abstract

Encoding videos as neural networks is a recently proposed approach that allows new forms of video processing. However, traditional techniques still outperform such neural video representation (NVR) methods for the task of video compression. This performance gap can be explained bythe fact that current NVR methods: i) use architectures that do not efficiently obtain a compact representation of temporal and spatial information; and ii) minimize rate and distortion disjointly (first overfitting a network on a video and then using heuristic techniques such as post-training quantization or weight pruning to compress the model). We propose a novel convolutional architecture for video representation that better represents spatio-temporal information and a training strategy capable of jointly optimizing rate and distortion. All network and quantization parameters are jointly learned end-to-end, and the post-training operations used in previous works are unnecessary. We evaluate our method on the UVG dataset, achieving new state-ofthe-art results for video compression with NVRs.

Copyright Notice

The documents contained in these directories are included by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.

Video Compression with Entropy-Constrained Neural Representations

We propose a novel convolutional architecture for video representation that better represents spatio-temporal information and a training strategy capable of jointly optimizing rate and distortion.

Authors

Video Compression with Entropy-Constrained Neural Representations

Abstract

Copyright Notice

Research at Disney

Legal

MORE