当前位置：网站首页>BN folding and its quantification

BN folding and its quantification

2022-07-06 08:56:00 【cyz0202】

This paper introduces the process of quantification BN Fold ;

The following pictures are quoted from Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

BN Fold

First introduced BN Fold ;BN Folding comes from BN The implementation difference between the training phase and the inference phase of the module ;

As shown in the figure below ： With conv+BN For example , chart 1 For the training stage BN, chart 2 For the inferential stage BN

chart 2： Inference stage BN,BN Folded knife CONV in

It can be proved that such folding is reasonable , Pay attention to the picture 2 Medium w It refers to the convolution kernel ;

$\\ O_f=\gamma * \frac{\sum(W * I) + b_{conv} - \mu}{\sigma} + b_{bn} \\ O_f=\sum( \frac{\gamma}{\sigma}*W*I) + \frac{\gamma}{\sigma}*(b_{conv} - \mu) + b_{bn} \\ => W' = \frac{\gamma}{\sigma}*W \\ => b'_{conv} = \frac{\gamma}{\sigma}*(b_{conv} - \mu) + b_{bn} \\ => b'_{conv} = \frac{\gamma}{\sigma}*(- \mu) + b_{bn} = b_{bn}- \frac{\gamma\mu}{\sigma} \space\ \space\ \space\ if \space\ \space\ b_{conv} == 0$

In quantification BN Fold

For quantification , Especially quantitative perception （ Pseudo quantization ）, We need to keep the pseudo quantization in the training stage and the above figure 2 The quantitative implementation of the inference phase of is consistent , Therefore, for the training stage CONV+BN Fold in two steps , Here's the picture 3：

chart 3： In the training phase BN Folding transformation , Ideas come from pictures 2 Implementation shown

chart 3 Our design idea comes from figure 2, The basic idea is to seek $\mu/\sigma$ Of EMA（ In the picture moment And the steps before and after ） After taking it off, you should match the picture 2 Agreement ;

According to the figure 3 Shown BN Fold , You can do post training quantification or pseudo quantification of the folding in the training stage , Here's the picture 4

chart 4：BN After folding the training stage （ false ） Quantification process

Code implementation

The following code block is from github distiller, There is no complete context implementation , Interested readers can read step by step

chart 5：BN Fold quantization implementation

summary

The above is a brief introduction to BN Folding and its quantification , The thought is quite ingenious , Interested readers can further read in depth according to the literature mentioned in the article , thank you
Please correct any misunderstandings

原网站

版权声明
本文为[cyz0202]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/187/202207060850360928.html