VCIP 2021: Super Resolution Using Decoded Information
2022-06-22 14:10:00 【Dillon2015】
This post summarizes the VCIP 2021 paper "CNN-based Super Resolution for Video Coding Using Decoded Information".

Introduction
As high-resolution video becomes more common, transmitting it over limited bandwidth is a major challenge. One way to address this is resampling-based coding, shown in Fig. 1: the video is downsampled before encoding, the low-resolution video is encoded, and after decoding the decoder upsamples the result back to the original resolution. AV1 has a mode that encodes downsampled frames and upsamples them at the decoder. VVC also supports this through RPR (reference picture resampling).

CNN-based super resolution (SR) has advanced quickly and has great potential in video coding. The paper proposes an SR method that exploits the coding information available in the codec. Existing work usually treats SR and the codec as independent components, whereas this paper feeds the SR network not only the reconstruction but also the prediction signal, the QP, and other coding information.
Model design
Because luma and chroma have different characteristics, separate SR models are designed for luma and chroma.

Fig. 2 shows the structure of the luma model. Its inputs are the reconstruction, the prediction, and a QP map. The baseline is single-scale EDSR; since each convolutional layer has only 64 feature channels, the model has no residual scaling layer. RB in the figure denotes a residual block, and there are 16 of them. The last convolutional layer has 4 output channels, and a shuffle layer then produces the high-resolution reconstructed image.
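To make the last step concrete, here is a minimal pure-Python sketch of what a shuffle (depth-to-space) layer does with 4 channels and an upscaling factor of 2. The function name `pixel_shuffle` and the toy input are illustrative and not taken from the paper; the channel ordering is assumed to follow `torch.nn.PixelShuffle` for a single output channel.

```python
def pixel_shuffle(channels, r=2):
    """Rearrange r*r low-resolution planes into one plane that is r times larger.

    channels: list of r*r planes, each a list of rows (H x W).
    Assumed ordering (same as torch.nn.PixelShuffle with one output channel):
    out[h*r + i][w*r + j] = channels[i*r + j][h][w]
    """
    if len(channels) != r * r:
        raise ValueError("expected r*r input planes")
    height, width = len(channels[0]), len(channels[0][0])
    out = [[0] * (width * r) for _ in range(height * r)]
    for k, plane in enumerate(channels):
        i, j = divmod(k, r)  # sub-pixel offset inside each r x r output cell
        for h in range(height):
            for w in range(width):
                out[h * r + i][w * r + j] = plane[h][w]
    return out


# Four 1x2 planes (standing in for the 4 output channels) become one 2x4 plane.
planes = [[[1, 2]], [[3, 4]], [[5, 6]], [[7, 8]]]
upscaled = pixel_shuffle(planes, r=2)
print(upscaled)  # [[1, 3, 2, 4], [5, 7, 6, 8]]
```

Each group of 4 channel values at one low-resolution position fills one 2x2 cell of the output, which is why 4 channels are exactly enough for 2x upsampling of a single luma plane.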
Fig. 3 shows the structure of the chroma model. The main difference from the luma model is the input: to make full use of texture information, the luma reconstruction is also used as an input. The luma reconstruction first passes through a convolutional layer with stride 2, which downsamples it to the same resolution as chroma. The model inputs also include the U reconstruction, the V reconstruction, and a QP map; prediction information is not used.
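The resolution matching can be checked with the standard convolution output-size formula. This is only a sketch: the post states the stride (2) but not the kernel size or padding, so the 3x3 kernel with padding 1 below is an assumption, and the 540x960 luma size is a hypothetical example.

```python
def conv_out_size(size, kernel=3, stride=2, padding=1):
    """Standard convolution output length: floor((size + 2*padding - kernel) / stride) + 1.

    kernel=3 and padding=1 are assumed values; only stride=2 comes from the post.
    """
    return (size + 2 * padding - kernel) // stride + 1


def chroma_input_shapes(luma_h, luma_w):
    """Return (luma branch size after the stride-2 conv, chroma plane size in YUV420)."""
    luma_branch = (conv_out_size(luma_h), conv_out_size(luma_w))
    chroma_plane = (luma_h // 2, luma_w // 2)  # YUV420 halves both chroma dimensions
    return luma_branch, chroma_plane


# Hypothetical low-resolution luma plane of 540 x 960 samples.
luma_branch, chroma_plane = chroma_input_shapes(540, 960)
print(luma_branch, chroma_plane)  # (270, 480) (270, 480)
```

With these assumed settings the downsampled luma features line up sample-for-sample with the U and V planes, so they can be combined with the chroma inputs and the QP map.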
Experiments
Dataset
The models are trained on the DIV2K dataset. Images are converted to YUV420 format and encoded with VTM 11.0 under the RPR configuration with QP={22,27,32,37,42}. Each training image is downsampled by a factor of 2 before encoding, and the decoded low-resolution image together with the corresponding original-resolution image forms a training pair.
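The QP map that both models take as input can be pictured as a constant plane with the same size as the decoded image. The sketch below is an assumption about how such a map might be built, not the paper's code: the name `make_qp_map` is hypothetical, and dividing by 63 (the maximum QP in VVC for 8-bit video) is an assumed normalization.

```python
TRAIN_QPS = [22, 27, 32, 37, 42]  # QP values listed in the post
MAX_QP = 63  # assumed normalizer: maximum VVC QP for 8-bit video


def make_qp_map(qp, height, width, max_qp=MAX_QP):
    """Build a height x width plane filled with the normalized QP value."""
    value = qp / max_qp
    return [[value] * width for _ in range(height)]


# One tiny 2x3 QP map per training QP (real maps would match the image size).
qp_maps = {qp: make_qp_map(qp, 2, 3) for qp in TRAIN_QPS}
```

Feeding the QP as a plane rather than a scalar lets it be stacked with the image inputs as an extra channel, so a single model can be trained across all five QP values.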
Experimental configuration
The models are trained with PyTorch on a Tesla V100 GPU. The mini-batch size is 16, the optimizer is Adam with a learning rate of 1e-4, and the learning rate is multiplied by a decay factor of 0.5 every 200 epochs.
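The schedule described above is a plain step decay. As a minimal sketch in pure Python (the function name `learning_rate` is illustrative), it can be written as follows; in PyTorch the same behavior is usually obtained with `torch.optim.lr_scheduler.StepLR(optimizer, step_size=200, gamma=0.5)`.

```python
def learning_rate(epoch, base_lr=1e-4, step=200, gamma=0.5):
    """Step decay: multiply base_lr by gamma once every `step` epochs."""
    return base_lr * gamma ** (epoch // step)


# Learning rate at a few epochs: it halves at epochs 200, 400, 600, ...
schedule = {epoch: learning_rate(epoch) for epoch in (0, 199, 200, 400, 600)}
```

The total number of training epochs is not given in the post, so the epochs sampled here are only examples.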
Experimental results


The encoder uses the All Intra configuration with QP={22,27,32,37,42}. The results are shown in Table 1, and Fig. 4 shows the RD curve of each sequence. The gain comes mainly from the low-bitrate range, which suggests that the method is better suited to low-bandwidth scenarios.
