当前位置：网站首页>19. Up and down sampling and batchnorm

19. Up and down sampling and batchnorm

2022-07-27 05:59:00 【Pie star's favorite spongebob】

Catalog

Pool layer and sampling
- downsample Down sampling
- upsample On the sampling
BatchNorm
- feature scaling

A unit in the convolution neural network process ：
conv2D->batchnorm->pooling->Relu
The last three orders depend on the mainstream and experience , Inversion doesn't have much effect .

Pool layer and sampling

Down sampling means right map narrow , Upsampling and amplification methods are very similar

downsample Down sampling

pooling and subsampl The result is similar but the operation is different
max pooling
It's right kernel Values in the range , Choose a maximum number .
avg pooling
It's right kernel Values in the range , Calculate the average value and output .

x=torch.rand(1,16,14,14)
layer=nn.MaxPool2d(2,stride=2)
out=layer(x)
print('out shape:',out.shape)

2 It means size 2×2, Step length is 2, Halve the result .

Insert picture description here

upsample On the sampling

in the light of tensor Of . Simply copy the latest value , Play the role of amplification .

x=torch.rand(1,16,7,7)
out1=F.interpolate(x,scale_factor=2,mode='nearest')
print('out1 shape:',out1.shape)
out2=F.interpolate(x,scale_factor=3,mode='nearest')
print('out2 shape:',out2.shape)

Insert picture description here

BatchNorm

We usually use relu Function instead of sigmod, Because sigmod Function is outside a certain range , The gradient information is 0, But have to use sigmo when , We need to put our input value x, To a certain extent . best x stay 0 near .
When multiple inputs ,x1 and x2 When there is a large difference in the range of , The weight w1w2 Different changes in , Yes loss The impact of .

feature scaling

Image normalization

Picture three channels RGB There is mean variance

normalize=transforms.Normalize(mean=[0.485,0.456,0.406],std=[0.229,0.224,0.225])

(Xr-0.485)/0.229,(Xg-0.456)/0.224,(X\b-0.406)/0.225, The new distribution obtained is more in line with what we said N(0,1), And input to the next layer con2d when , It can solve well .

Batch Normalization

Insert picture description here
hypothesis [16,3,784]
our channel yes 3, For channel0 All of this passage 16 Number of pictures of all feature, Calculate , Get the mean and variance . Finally, the common dimension is 1,shape by 3 Number of numbers , Represents the 3 individual channel.

μ and σ Is based on the current batch Statistics of , The global mean and variance are stored in running_mean,running_var in .
β and γ We learned , It will automatically update , Even if there is an initial value at the beginning , And need gradient information .
Obey separately N(μ,σ),N(β,γ) Insert picture description here

x=torch.rand(100,16,784)
layer=nn.BatchNorm1d(16)
out=layer(x)

print('running_mean：',layer.running_mean)
print('running_val：',layer.running_var)

Insert picture description here

Standardized writing

Insert picture description here

nn.BatchNorm2d

x=torch.rand(1,16,7,7)
layer=nn.BatchNorm2d(16)
out=layer(x)
print('out shape:',out.shape)
print(vars(layer))

running_mean,running_var Is the global mean and variance , We can't know every... From the parameters at present batch The mean and variance of .
layer.weight and layer.bias Parameter is γ and β
‘affine:True’：β and γ Whether to learn automatically , And automatically update .
Insert picture description here

test And train Use difference

test At the time of the μ and σ Is a global , Can be obtained from running_mean,running_var Copy value .test No, backward, therefore γ and β It doesn't need to be updated .

layer.eval()
BatchNorm1d(16,eps=1e-05,momentum=0.1,affine=True,track_running_stats=True)

out=layer(x)

We need to call eval(), Transfer to test Pattern

advantage

Convergence is faster
Better performance
A more stable

原网站

版权声明
本文为[Pie star's favorite spongebob]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/208/202207270511314689.html

当前位置：网站首页>19. Up and down sampling and batchnorm

19. Up and down sampling and batchnorm

Catalog

Pool layer and sampling

downsample Down sampling

upsample On the sampling

BatchNorm

feature scaling

Image normalization

Batch Normalization

Standardized writing

nn.BatchNorm2d

test And train Use difference

advantage

边栏推荐

猜你喜欢

随机推荐