Reference frame generation based on deep learning
2022-07-06 20:56:00 【Dillon2015】
This article is based on proposals JVET-T0058 and JVET-U0087. The method generates virtual reference frames for inter prediction by frame interpolation. The whole model consists of several sub-models, which perform optical flow estimation, compensation, and detail enhancement respectively.
Overall architecture
The overall architecture is shown in Fig. 1. During video coding, the DPB holds the reference frames used for motion estimation, and depending on the GOP structure the current frame has one or more forward and backward reference frames. By default, the proposal uses the two reference frames closest in POC to the current frame to generate a virtual reference frame. In Fig. 1, for example, the current frame has POC 5, so the frames with POC 4 and 6 are used to generate the reference frame. The generated virtual reference frame is placed in the DPB for reference, and its POC is set to that of the current frame. To avoid affecting the POC-distance-based MV scaling in temporal MVP, all MVs of the virtual reference frame are set to 0 and the frame is used as a long-term reference frame. In the proposal, the virtual reference frame is removed from the DPB after the current frame has been decoded.
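The DPB handling described above can be sketched in a few lines of Python. This is only an illustration, not code from the proposal or from any codec: the `Frame` class and the functions `pick_nearest_refs`, `insert_virtual_ref` and `remove_virtual_refs` are hypothetical names, and breaking distance ties in favour of the lower POC is an assumption.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Frame:
    """Hypothetical DPB entry; only the fields needed for this sketch."""
    poc: int
    is_long_term: bool = False
    is_virtual: bool = False


def pick_nearest_refs(dpb: List[Frame], cur_poc: int) -> Tuple[Frame, Frame]:
    """Return the two real reference frames closest in POC to the current frame."""
    candidates = [f for f in dpb if not f.is_virtual and f.poc != cur_poc]
    if len(candidates) < 2:
        raise ValueError("need at least two reference frames in the DPB")
    # Assumption: ties in POC distance are broken in favour of the lower POC.
    nearest = sorted(candidates, key=lambda f: (abs(f.poc - cur_poc), f.poc))[:2]
    first, second = sorted(nearest, key=lambda f: f.poc)
    return first, second


def insert_virtual_ref(dpb: List[Frame], cur_poc: int) -> Frame:
    """Add a virtual reference with the current POC, marked as long-term."""
    virtual = Frame(poc=cur_poc, is_long_term=True, is_virtual=True)
    dpb.append(virtual)
    return virtual


def remove_virtual_refs(dpb: List[Frame]) -> None:
    """Drop virtual references once the current frame has been decoded."""
    dpb[:] = [f for f in dpb if not f.is_virtual]


if __name__ == "__main__":
    dpb = [Frame(0), Frame(4), Frame(6), Frame(8)]
    ref_a, ref_b = pick_nearest_refs(dpb, cur_poc=5)
    print(ref_a.poc, ref_b.poc)
    insert_virtual_ref(dpb, cur_poc=5)
    remove_virtual_refs(dpb)
```

Marking the virtual frame as long-term is what keeps it out of the POC-distance-based MV scaling; the sketch only records the flag and does not model the scaling itself.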
For high-resolution sequences (4K or 8K), resource constraints mean the neural network cannot process the whole frame directly. In that case the virtual reference frame is divided into multiple regions, each region is generated by the network separately, and the regions are then stitched together into one reference frame.
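The region-by-region generation can be sketched as follows. This is a minimal illustration under stated assumptions: the regions are non-overlapping rectangles (the source does not say whether the proposal uses overlap or padding), frames are plain lists of rows, and `generate` is a hypothetical stand-in for the network. Here it is replaced by a simple average of the two references so the example runs without any model.

```python
from typing import Callable, List, Tuple

Image = List[List[float]]
Box = Tuple[int, int, int, int]  # (y0, x0, y1, x1), end-exclusive


def tile_boxes(height: int, width: int, tile_h: int, tile_w: int) -> List[Box]:
    """Cover the frame with non-overlapping tiles; edge tiles may be smaller."""
    boxes = []
    for y in range(0, height, tile_h):
        for x in range(0, width, tile_w):
            boxes.append((y, x, min(y + tile_h, height), min(x + tile_w, width)))
    return boxes


def crop(img: Image, box: Box) -> Image:
    y0, x0, y1, x1 = box
    return [row[x0:x1] for row in img[y0:y1]]


def generate_by_region(ref0: Image, ref1: Image, tile_h: int, tile_w: int,
                       generate: Callable[[Image, Image], Image]) -> Image:
    """Run `generate` on each region and stitch the outputs into one frame."""
    height, width = len(ref0), len(ref0[0])
    out = [[0.0] * width for _ in range(height)]
    for box in tile_boxes(height, width, tile_h, tile_w):
        patch = generate(crop(ref0, box), crop(ref1, box))
        y0, x0, y1, x1 = box
        for dy, row in enumerate(patch):
            out[y0 + dy][x0:x1] = row
    return out


def average_frames(a: Image, b: Image) -> Image:
    """Placeholder for the network: per-pixel average of two patches."""
    return [[(p + q) / 2 for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]


if __name__ == "__main__":
    ref0 = [[float(x + y) for x in range(7)] for y in range(5)]
    ref1 = [[float(x * y) for x in range(7)] for y in range(5)]
    virtual = generate_by_region(ref0, ref1, 2, 3, average_frames)
    print(len(virtual), len(virtual[0]))
```

With a per-pixel placeholder the stitched result equals the whole-frame result. A real network has a receptive field larger than one pixel, so tile borders would likely need extra care that this sketch does not attempt.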
Network model
General video frame interpolation mostly relies on optical flow estimation and compensation. Typically a bidirectional optical flow method is used, and the two flows are then combined into one through a linear model. The proposal uses only a single optical flow model.
As shown in Fig. 2, an optical flow estimation model first generates the optical flow (its input is the two reference frames nearest in POC). The optical flow is then processed by a backward warping step, and the processed optical flow and the two reference frames are combined by a fusion step into an intermediate frame. The intermediate frame then passes through a detail enhancement model to improve its quality. The detail enhancement model consists of two parts: PCD (Pyramid, Cascading and Deformable) for spatio-temporal optimization, and TSA (Temporal and Spatial Attention) for strengthening attention on important features.
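To make the warping and fusion steps concrete, here is a classical, non-learned sketch of interpolating a middle frame from a single optical flow. It is not the proposal's network. It assumes linear motion, reuses the flow from the first to the second reference at the same pixel position of the middle frame, and fuses the two warped frames by a plain average. The function names, the `(dx, dy)` flow layout and the border clamping are all illustrative choices, and the detail enhancement stage (PCD and TSA) is not modelled.

```python
import math
from typing import List, Tuple

Image = List[List[float]]
Flow = List[List[Tuple[float, float]]]  # flow[y][x] = (dx, dy) from ref0 to ref1


def sample_bilinear(img: Image, y: float, x: float) -> float:
    """Bilinear sample with coordinates clamped to the image border."""
    h, w = len(img), len(img[0])
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    wy, wx = y - y0, x - x0
    top = img[y0][x0] * (1 - wx) + img[y0][x1] * wx
    bottom = img[y1][x0] * (1 - wx) + img[y1][x1] * wx
    return top * (1 - wy) + bottom * wy


def backward_warp(img: Image, flow: Flow, scale: float) -> Image:
    """For each output pixel, fetch img at the position displaced by scale * flow."""
    h, w = len(img), len(img[0])
    return [[sample_bilinear(img,
                             y + scale * flow[y][x][1],
                             x + scale * flow[y][x][0])
             for x in range(w)] for y in range(h)]


def interpolate_midframe(ref0: Image, ref1: Image, flow01: Flow) -> Image:
    """Warp both references to the temporal midpoint and average them."""
    warped0 = backward_warp(ref0, flow01, -0.5)  # look half a step back into ref0
    warped1 = backward_warp(ref1, flow01, 0.5)   # look half a step ahead into ref1
    return [[(a + b) / 2 for a, b in zip(r0, r1)]
            for r0, r1 in zip(warped0, warped1)]


if __name__ == "__main__":
    h, w = 4, 8
    ref0 = [[float(x) for x in range(w)] for _ in range(h)]
    ref1 = [[float(x - 2) for x in range(w)] for _ in range(h)]  # content moved right by 2
    flow01 = [[(2.0, 0.0)] * w for _ in range(h)]
    mid = interpolate_midframe(ref0, ref1, flow01)
    print(mid[1][3])  # content moved right by 1 at the midpoint
```

In the example the second reference is the first one shifted right by two pixels, so away from the borders the interpolated frame is the first reference shifted right by one pixel.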
Experimental results
If you are interested, follow the WeChat official account Video Coding.