1.4 nn. Module neural network (II)
2022-07-29 03:22:00 【smiling0927】
Loss functions:
1. nn.MSELoss is used to compute the mean squared error, and nn.CrossEntropyLoss is used to compute the cross-entropy loss.
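Before the fuller example below, here is a minimal sketch of how these two criteria are typically called; the tensor shapes and values are illustrative assumptions, not part of the example that follows.
import torch as t
import torch.nn as nn

# MSELoss expects prediction and target with the same shape
mse = nn.MSELoss()
pred = t.randn(4, 10)                    # a batch of 4 predictions (assumed shape)
target = t.randn(4, 10)
print(mse(pred, target))                 # scalar loss tensor

# CrossEntropyLoss expects raw scores of shape (N, C) and class indices of shape (N,)
ce = nn.CrossEntropyLoss()
scores = t.randn(4, 10)                  # 4 samples, 10 classes
labels = t.LongTensor([1, 0, 3, 7])      # class indices in [0, 10)
print(ce(scores, labels))                # scalar loss tensor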
Example:
import torch.nn as nn
import torch.nn.functional as F
from torch.autograd import Variable
import torch as t

class Net(nn.Module):
    def __init__(self):
        # A subclass of nn.Module must call the parent constructor in its own constructor.
        # The line below is equivalent to nn.Module.__init__(self)
        super(Net, self).__init__()
        # Convolution layer: '1' means the input image has a single channel,
        # '6' is the number of output channels, '5' means a 5*5 convolution kernel
        self.conv1 = nn.Conv2d(1, 6, 5)
        # Convolution layer
        self.conv2 = nn.Conv2d(6, 16, 5)
        # Affine (fully connected) layers, y = wx + b
        # A linear layer is generally defined as nn.Linear(in_features, out_features)
        self.fc1 = nn.Linear(16 * 5 * 5, 120)
        self.fc2 = nn.Linear(120, 84)
        self.fc3 = nn.Linear(84, 10)

    def forward(self, x):
        # Convolution --> activation --> pooling
        x = F.max_pool2d(F.relu(self.conv1(x)), (2, 2))
        x = F.max_pool2d(F.relu(self.conv2(x)), 2)
        # reshape, where '-1' means the size is inferred automatically.
        # This line usually appears in a model's forward function, right before the classifier.
        # The classifier is a simple nn.Linear() whose input and output are one-dimensional,
        # so x = x.view(x.size(0), -1) flattens the preceding multi-dimensional tensor into one dimension.
        # view() is similar to reshape: it changes the tensor's size. In x = x.view(batchsize, -1),
        # batchsize is the number of rows after the conversion, and -1 lets the function infer
        # the number of columns from the original tensor's data and the batchsize.
        x = x.view(x.size()[0], -1)
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        x = self.fc3(x)
        return x

net = Net()
input = Variable(t.randn(1, 1, 32, 32))
out = net(input)
print(out)
net.zero_grad()
out.backward(t.randn(1, 10))

output = net(input)
target = Variable(t.randn(1, 10))  # hypothetical target (the book uses 1,2,...,10; see the note below)
criterion = nn.MSELoss()
loss = criterion(output, target)
print(loss)
out:
tensor([[-0.1351, -0.0720, -0.0460, 0.1184, -0.0810, 0.0441, -0.0008,
-0.0082, -0.1029, 0.0620]])
tensor(2.6400)

Note: if target = Variable(t.arange(1,11)) is used instead (the hypothetical target 1,2,3,4,5,6,7,8,9,10), the target is tensor([ 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.]), which is a vector, while output is a 1*10 matrix. The shapes therefore do not match and the following error is raised:
RuntimeError: input and target shapes do not match: input [1 x 10], target [10] at /opt/conda/conda-bld/pytorch_1524584710464/work/aten/src/THNN/generic/MSECriterion.c:13
The book and some blogs do show normal output for this snippet, so the point of difference (most likely how that PyTorch version handles the shape mismatch) still needs to be investigated.
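A minimal sketch of one way to avoid the mismatch, assuming the intent is still the hypothetical target 1..10: reshape the target to the same 1*10 shape as output (and make it floating point) before passing it to the criterion.
# Assumption: make the hypothetical target 1..10 match output's shape [1, 10]
target = Variable(t.arange(1, 11).float().view(1, -1))
criterion = nn.MSELoss()
loss = criterion(output, target)
print(loss)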
2. If you backpropagate from loss, you can inspect its computation graph through the grad_fn attribute. When loss.backward() is called, the graph is dynamically generated and automatically differentiated, and the gradients of the parameters in the computation graph are computed as well.
print(loss)
# Call .backward() and observe conv1.bias.grad before and after the call
print(loss.grad_fn)
print(loss.grad_fn.next_functions)
print(loss.grad_fn.next_functions[0][0].next_functions)
net.zero_grad()  # zero the gradients of all parameters
print('conv1.bias gradient before backpropagation')
print(net.conv1.bias.grad)
loss.backward()
print('conv1.bias gradient after backpropagation')
print(net.conv1.bias.grad)

out:
<MseLossBackward object at 0x7f80b5af9320>
((<AddmmBackward object at 0x7f80b5728cc0>, 0),)
((<ExpandBackward object at 0x7f80b5728cc0>, 0), (<ReluBackward object at 0x7f80b57009b0>, 0), (<TBackward object at 0x7f80f0ac7198>, 0))
conv1.bias gradient before backpropagation
tensor([ 0., 0., 0., 0., 0., 0.])
conv1.bias gradient after backpropagation
tensor(1.00000e-02 *
[-0.2564, 0.5056, 0.5424, 0.2758, -1.5066, -0.0828])
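As the net.zero_grad() call above suggests, gradients accumulate across backward passes, so they are normally cleared before each one. A minimal sketch, reusing net, input, target and criterion from above (the loop is only for illustration):
# Gradients accumulate: without zero_grad(), a second backward pass adds to the
# gradients left over from the first one.
for i in range(2):
    net.zero_grad()                  # comment this out and conv1.bias.grad roughly doubles
    output = net(input)              # rebuild the graph for each backward pass
    loss = criterion(output, target)
    loss.backward()
    print(net.conv1.bias.grad)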