谷歌浏览器插件
订阅小程序
在清言上使用

A Neural Video Codec with Spatial Rate-Distortion Control.

2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)(2023)

引用 2|浏览37
暂无评分
摘要
Neural video compression algorithms are nearly competitive with hand-crafted codecs in terms of rate-distortion performance and subjective quality. However, many neural codecs are inflexible black boxes, and give users little to no control over the reconstruction quality and bitrate. In this work, we present a flexible neural video codec that combines ideas from variable-bitrate codecs and region-of-interest-based coding. By conditioning our model on a global rate-distortion tradeoff parameter and a region-of-interest (ROI) mask, we obtain dynamic control over the per-frame bitrate and the reconstruction quality in the ROI at test time. The resulting codec enables practical use cases such as coding under bitrate constraints with fixed ROI quality, while taking a negligible hit in performance compared to a fixed-rate model. We find that our codec performs best on sequences with complex motion, where we substantially outperform non-ROI codecs in the region of interest with Bjøntegaard-Delta rate savings exceeding 60%.
更多
查看译文
关键词
Algorithms: Image recognition and understanding (object detection,categorization,segmentation),Machine learning architectures,formulations,and algorithms (including transfer,low-shot,semi-,self-,and un-supervised learning),Video recognition and understanding (tracking,action recognition,etc.)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要