当前位置:网站首页>Pytorch(二) —— 激活函数、损失函数及其梯度
Pytorch(二) —— 激活函数、损失函数及其梯度
2022-07-01 04:35:00 【CyrusMay】
Pytorch(二) —— 激活函数、损失函数及其梯度
1.激活函数
1.1 Sigmoid / Logistic
δ ( x ) = 1 1 + e − x δ ′ ( x ) = δ ( 1 − δ ) \delta(x)=\frac{1}{1+e^{-x}}\\\delta'(x)=\delta(1-\delta) δ(x)=1+e−x1δ′(x)=δ(1−δ)
import matplotlib.pyplot as plt
import torch.nn.functional as F
x = torch.linspace(-10,10,1000)
y = F.sigmoid(x)
plt.plot(x,y)
plt.show()
1.2 Tanh
t a n h ( x ) = e x − e − x e x + e − x ∂ t a n h ( x ) ∂ x = 1 − t a n h 2 ( x ) tanh(x)=\frac{e^x-e^{-x}}{e^x+e^{-x}}\\\frac{\partial tanh(x)}{\partial x}=1-tanh^2(x) tanh(x)=ex+e−xex−e−x∂x∂tanh(x)=1−tanh2(x)
import matplotlib.pyplot as plt
import torch.nn.functional as F
x = torch.linspace(-10,10,1000)
y = F.tanh(x)
plt.plot(x,y)
plt.show()
1.3 ReLU
f ( x ) = m a x ( 0 , x ) f(x)=max(0,x) f(x)=max(0,x)
import matplotlib.pyplot as plt
import torch.nn.functional as F
x = torch.linspace(-10,10,1000)
y = F.relu(x)
plt.plot(x,y)
plt.show()
1.4 Softmax
p i = e a i ∑ k = 1 N e a k ∂ p i ∂ a j = { p i ( 1 − p j ) i = j − p i p j i ≠ j p_i=\frac{e^{a_i}}{\sum_{k=1}^N{e^{a_k}}}\\ \frac{\partial p_i}{\partial a_j}=\left\{ \begin{array}{lc} p_i(1-p_j) & i=j \\ -p_ip_j&i\neq j\\ \end{array} \right. pi=∑k=1Neakeai∂aj∂pi={ pi(1−pj)−pipji=ji=j
import torch.nn.functional as F
logits = torch.rand(10)
prob = F.softmax(logits,dim=0)
print(prob)
tensor([0.1024, 0.0617, 0.1133, 0.1544, 0.1184, 0.0735, 0.0590, 0.1036, 0.0861,
0.1275])
2.损失函数
2.1 MSE
import torch.nn.functional as F
x = torch.rand(100,64)
w = torch.rand(64,1)
y = torch.rand(100,1)
mse = F.mse_loss(y,[email protected])
print(mse)
tensor(238.5115)
2.2 CorssEntorpy
import torch.nn.functional as F
x = torch.rand(100,64)
w = torch.rand(64,10)
y = torch.randint(0,9,[100])
entropy = F.cross_entropy([email protected],y)
print(entropy)
tensor(3.6413)
3. 求导和反向传播
3.1 求导
- Tensor.requires_grad_()
- torch.autograd.grad()
import torch.nn.functional as F
import torch
x = torch.rand(100,64)
w = torch.rand(64,1)
y = torch.rand(100,1)
w.requires_grad_()
mse = F.mse_loss([email protected],y)
grads = torch.autograd.grad(mse,[w])
print(grads[0].shape)
torch.Size([64, 1])
3.2 反向传播
- Tensor.backward()
import torch.nn.functional as F
import torch
x = torch.rand(100,64)
w = torch.rand(64,10)
w.requires_grad_()
y = torch.randint(0,9,[100,])
entropy = F.cross_entropy([email protected],y)
entropy.backward()
w.grad.shape
torch.Size([64, 10])
by CyrusMay 2022 06 28
人生 只是 须臾的刹那
人间 只是 天地的夹缝
——————五月天(因为你 所以我)——————
边栏推荐
- 2022年化工自动化控制仪表操作证考试题库及答案
- Loop filtering based on Unet
- LM small programmable controller software (based on CoDeSys) note 19: errors do not match the profile of the target
- [leetcode skimming] February summary (updating)
- Haskell lightweight threads overhead and use on multicores
- The junior college students were angry for 32 days, four rounds of interviews, five hours of soul torture, and won Ali's offer with tears
- Common thread methods and daemon threads
- How to choose the right server for website data collection?
- 嵌入式系統開發筆記80:應用Qt Designer進行主界面設計
- Possible problems and solutions of using scroll view to implement slider view
猜你喜欢
Measurement of quadrature axis and direct axis inductance of three-phase permanent magnet synchronous motor
The index is invalid
25.k sets of flipped linked lists
Simple implementation of slf4j
[recommended algorithm] C interview question of a small factory
"Target detection" + "visual understanding" realizes the understanding of the input image
[human version] Web3 privacy game in the dark forest
NFT: start NFT royalty journey with eip-2981
【LeetCode】100. Same tree
[ue4] event distribution mechanism of reflective event distributor and active call event mechanism
随机推荐
2022年上海市安全员C证考试题模拟考试题库及答案
Internet winter, how to spend three months to make a comeback
2022 G2 power station boiler stoker examination question bank and G2 power station boiler stoker simulation examination question bank
How to use maixll dock
TCP/IP 详解(第 2 版) 笔记 / 3 链路层 / 3.4 桥接器与交换机 / 3.4.2 多属性注册协议(Multiple Registration Protocol (MRP))
[today in history] June 30: von Neumann published the first draft; The semiconductor war in the late 1990s; CBS acquires CNET
OSPF notes [dr and bdr]
Embedded System Development Notes 81: Using Dialog component to design prompt dialog box
嵌入式系统开发笔记81:使用Dialog组件设计提示对话框
嵌入式系统开发笔记79:为什么要获取本机网卡IP地址
"Target detection" + "visual understanding" realizes the understanding of the input image
283. move zero
2022 Shanghai safety officer C certificate examination question simulation examination question bank and answers
嵌入式系统开发笔记80:应用Qt Designer进行主界面设计
2022年化工自动化控制仪表操作证考试题库及答案
[difficult] sqlserver2008r2, can you recover only some files when recovering the database?
2022 hoisting machinery command registration examination and hoisting machinery command examination registration
Grey correlation cases and codes
软件研发的十大浪费:研发效能的另一面
[recommended algorithm] C interview question of a small factory