当前位置:网站首页>Reference frame generation based on deep learning
Reference frame generation based on deep learning
2022-07-06 20:56:00 【Dillon2015】
This article comes from the proposal JVET-T0058 and JVET-U0087, This method generates virtual reference frames for inter frame prediction by inserting frames . The whole model consists of several sub models , Perform optical flow estimation respectively 、 Compensation and detail enhancement .
The overall architecture
The overall architecture is as follows Fig.1 Shown , In the process of video coding DPB There is a reference frame for motion estimation , according to GOP Structure the current frame has one or more forward 、 Backward reference frame . The default in the proposal is POC The two reference frames closest to the current frame generate a virtual reference frame , Such as Fig.1 Current frame in POC yes 5, Then use POC by 4 and 6 The frame of generates a reference frame . The generated virtual reference frame will be put into DPB For reference , Virtual reference frame POC Set to the same as the current frame . In order to prevent affecting the time domain MVP According to the POC Distant MV Zoom process , Virtual reference frame MV All set to 0 And is used as a long-term reference frame . In the proposal , After the current frame is decoded, the virtual reference frame starts from DPB Remove .
For high resolution sequences (4K or 8K) Due to resource constraints, neural network processing cannot be directly used for the whole frame , At this time, it is assumed that the virtual reference frame is divided into multiple regions , Each area uses network generation separately , Then put these areas together into a reference frame .
A network model
Optical flow estimation and compensation are mostly used in general video interpolation , Generally, bidirectional optical flow method is used , Then the two optical flows are combined into one through a linear model . Only the single optical flow model is used in the proposal .
Such as Fig.2, First, optical flow is generated by optical flow estimation model ( Input is POC The two nearest reference frames ), And then through backward warping Process processing optical flow , The processed optical flow and two reference frames pass through fusion Process synthesis intermediate frame . The intermediate frame will enhance the quality of the model through details , The detail enhancement model consists of two parts ,PCD(Pyramid, Cascading and Deformable) For space-time optimization and TSA (Temporal and Spatial Attention) Used to improve important features attention.
experimental result
Interested parties, please pay attention to WeChat official account Video Coding
边栏推荐
- Laravel笔记-自定义登录中新增登录5次失败锁账户功能(提高系统安全性)
- #yyds干货盘点#重新梳理箭头函数的this
- 数据湖(八):Iceberg数据存储格式
- Deployment of external server area and dual machine hot standby of firewall Foundation
- 性能测试过程和计划
- use. Net analysis Net talent challenge participation
- Notes - detailed steps of training, testing and verification of yolo-v4-tiny source code
- 2022 portal crane driver registration examination and portal crane driver examination materials
- [DSP] [Part 1] start DSP learning
- What is the difference between procedural SQL and C language in defining variables
猜你喜欢
Statistical inference: maximum likelihood estimation, Bayesian estimation and variance deviation decomposition
OLED屏幕的使用
1500万员工轻松管理,云原生数据库GaussDB让HR办公更高效
看过很多教程,却依然写不好一个程序,怎么破?
2022 construction electrician (special type of construction work) free test questions and construction electrician (special type of construction work) certificate examination
Mécanisme de fonctionnement et de mise à jour de [Widget Wechat]
15 millions d'employés sont faciles à gérer et la base de données native du cloud gaussdb rend le Bureau des RH plus efficace
【mysql】触发器
The most comprehensive new database in the whole network, multidimensional table platform inventory note, flowus, airtable, seatable, Vig table Vika, flying Book Multidimensional table, heipayun, Zhix
Implementation of packaging video into MP4 format and storing it in TF Card
随机推荐
过程化sql在定义变量上与c语言中的变量定义有什么区别
Logic is a good thing
Web开发小妙招:巧用ThreadLocal规避层层传值
知识图谱构建流程步骤详解
Activiti global process monitors activitieventlistener to monitor different types of events, which is very convenient without configuring task monitoring in acitivit
Tips for web development: skillfully use ThreadLocal to avoid layer by layer value transmission
快过年了,心也懒了
Pycharm remote execution
Leetcode hot topic Hot 100 day 32: "minimum coverage substring"
(工作记录)2020年3月11日至2021年3月15日
Select data Column subset in table R [duplicate] - select subset of columns in data table R [duplicate]
新型数据库、多维表格平台盘点 Notion、FlowUs、Airtable、SeaTable、维格表 Vika、飞书多维表格、黑帕云、织信 Informat、语雀
华为设备命令
SAP Fiori应用索引大全工具和 SAP Fiori Tools 的使用介绍
Performance test process and plan
PG基础篇--逻辑结构管理(事务)
How to turn a multi digit number into a digital list
[weekly pit] output triangle
[weekly pit] positive integer factorization prime factor + [solution] calculate the sum of prime numbers within 100
Huawei device command