当前位置:网站首页>[point cloud compression] variable image compression with a scale hyperprior
[point cloud compression] variable image compression with a scale hyperprior
2022-06-12 02:49:00 【Jonathan_ Paul 10】
Variational Image Compression with A Scale Hyperprior
This paper presents a new method of compression : Using transcendental knowledge . Transcendental yes ” A priori a priori ”.
Intro
This paper gives the edge information (Side information) The definition of : Side information is an additional bit stream from the encoder to the decoder , The information is modified to the entropy model , This reduces mismatches (additional bits of information sent from the encoder to the decoder, which signal modifications to the entropy model intended to reduce the mismatch). therefore , This kind of edge information is regarded as a priori of entropy model parameters , And edge information has become a hidden representation “ A priori a priori ” 了 .
Ideas
Background
Transformation based models
Coding of transformations (Transform coding) Now in the depth of learning is popular . Input the vector of the image x x x You can use a parameterized transformation , become :
y = g a ( x ; ϕ g ) y=g_a(x;\phi_g) y=ga(x;ϕg)
there y y y Is a potential feature ; ϕ g \phi_g ϕg Yes converter ( Encoder ) Parameters of ; This process is called Parametric Analysis The process . And notice , there y y y It needs to be quantized before entropy coding ( Quantized to discrete values , So that it can be entropy encoded losslessly ). It is assumed that the potential characteristics after quantification are y ^ \hat y y^, Then the transformation used in the reconstruction , bring :
x ^ = g s ( y ^ ; θ g ) \hat x = g_{s}\left(\hat{ {y}} ; {\theta}_{g}\right) x^=gs(y^;θg)
among , This process is called Parametric Synthesis The process ( Here, it can also be regarded as a decoder ). θ g {\theta}_{g} θg Is the parameter of the decoder .
VAE
Variational self encoder (Variational Autoencoder, VAE) Compare with AE, It maps the input to a distribution ( This distribution is usually Gussian) Not a specific vector , As described in the previous section Transformation based models Medium y y y. stay VAE in , He used “ Inferential model ”(Inference Model) Deduce the potential representation in the probability source of the image (“inferring” the latent representation from the source image), use “ Generate models ”(Generative model) Generate the probability to get the reconstructed image .
For more details, please refer to [1]. But notice , In this paper , We use z z z To express super prior information rather than potential distribution . Please distinguish .
Model
Pictured 2 Shown , Potential representations obtained by using prior knowledge y y y( chart 2 The second graph from the left of ) There are structural dependencies ( Spatial coupling ), This cannot be captured by the total decomposition of the variational model . therefore , The model will be modeled in a super prior way .

The so-called super a priori is a priori of a priori . therefore , Then a potential representation is established y y y Potential representation of z z z, To capture this spatial dependency . It is worth mentioning that , there z z z That is, edge information ( z z z is then quantized, compressed, and transmitted as side information). Capture potential representations z z z after , After quantification z ^ \hat z z^ To estimate σ ^ \hat \sigma σ^. This σ ^ \hat \sigma σ^ Will be used to reconstruct at the decoder side y ^ \hat y y^, In order to obtain x ^ \hat x x^.

Reference
边栏推荐
- 博创智能冲刺科创板:年营收11亿 应收账款账面价值3亿
- [no title] 2022 coal mine safety inspection test questions and online simulation test
- Force deduction solution summary 1037- effective boomerang
- SSH public key login failed with error: Sign_ and_ send_ pubkey: no mutual signature supported
- 跨域有哪些解决方法?
- ACL 2022 - strong combination of pre training language model and graphic model
- alertmanager告警配置
- errno: -4078, code: ‘ECONNREFUSED‘, syscall: ‘connect‘, address: ‘127.0.0.1‘, port: 3306; Postman error
- Intel case
- Force deduction programming problem - solution summary
猜你喜欢

errno: -4078, code: ‘ECONNREFUSED‘, syscall: ‘connect‘, address: ‘127.0.0.1‘, port: 3306;postman报错

Graduation design of fire hydrant monitoring system --- thesis (add the most comprehensive hardware circuit design - > driver design - > Alibaba cloud Internet of things construction - > Android App D

In 2022, don't you know the difference between arrow function and ordinary function?

Demand and business model innovation - demand 11 - overview of demand analysis

Calculus review 2

Requirements and business model innovation - Requirements 7- user requirements acquisition based on use case / scenario model

Cupp dictionary generation tool (similar tools include crunch)

One article to show you how to understand the harmonyos application on the shelves

Unity3d ugui translucent or linear gradient pictures display abnormally (blurred) problem solving (color space mismatch)

架构入门讲解 - 谁动了我的蛋糕
随机推荐
Calculus review 2
Getting started with RPC
min25筛
Force deduction solution summary 386 dictionary order
Force deduction solution summary 1037- effective boomerang
Alertmanager alarm configuration
Force deduction solution summary 449 serialization and deserialization binary search tree
For the first time, why not choose "pure medium platform" for byte beating data platform
微信小程序项目实例——体质计算器
Drawcall, batches, setpasscall in unity3d
Depth copy
1 minute to understand the essential difference between low code and zero code
Selection (045) - what is the output of the following code?
Summary of force deduction solution 436- finding the right interval
Force deduction solution summary 713- subarray with product less than k
How to build urban smart bus travel? Quick code to answer
oracle之模式对象
WPS表格 学习笔记 - 高亮显示重复值
Force deduction solution summary 668- the smallest number k in the multiplication table
Demand and business model innovation - demand 11 - overview of demand analysis