当前位置:网站首页>Pytorch training process was interrupted
Pytorch training process was interrupted
2022-07-05 11:17:00 【IMQYT】
I'm scared to death , Training 3 The process of Tian's model was killed by his own hand , I almost cried , Has the money for renting a server for a week been wasted , Is time wasted , Can it be remedied ! For the first time , And my code runs very slowly (RTXA5000, It's reasonable to say that it's not slow , Too much data , In order to reduce the number of logs IO Wasted time , There is no log ), Only the model is saved . Already my hands are shaking
Don't talk much , How to remedy it ?

Save the model in the code only torch.save. Other parameters are not saved .epoch Nothing is saved , Found a lot of experience , Finally find a remedy
Reload the model
path='autodl-tmp/GraphDTA-master/model_GINConvNet_kiba.model'
model.load_state_dict(torch.load(path))In this case , What the model learned is back , Include loss And so on. .

From here I can see ,loss It did continue 294 The training of the time , The same is true of the predicted value. Continue 294 The result after the first time , Fortunately, I got it back , But there was a problem , Because I saw epoch It seems to be from 1 Here we go , In this case, we need to train 600 Time ?, So remember to revise epoch The total number of times ,600-294=306, Although the control interrupt writes this 1, But retraining 306 This time it will end . Be accomplished
边栏推荐
- 边缘计算如何与物联网结合在一起?
- When using gbase 8C database, an error is reported: 80000502, cluster:%s is busy. What's going on?
- R3Live系列学习(四)R2Live源码阅读(2)
- Three suggestions for purchasing small spacing LED display
- websocket
- matlab cov函数详解
- Modulenotfounderror: no module named 'scratch' ultimate solution
- Ffmpeg calls avformat_ open_ Error -22 returned during input (invalid argument)
- go语言学习笔记-初识Go语言
- Applet framework taro
猜你喜欢
![[office] eight usages of if function in Excel](/img/ce/ea481ab947b25937a28ab5540ce323.png)
[office] eight usages of if function in Excel

Codeforces Round #804 (Div. 2)

【广告系统】Parameter Server分布式训练

Detailed explanation of DDR4 hardware schematic design

9、 Disk management

数据库三大范式

华为设备配置信道切换业务不中断

COMSOL--三维图形的建立

【DNS】“Can‘t resolve host“ as non-root user, but works fine as root

紫光展锐全球首个5G R17 IoT NTN卫星物联网上星实测完成
随机推荐
NFT 交易市场主要使用 ETH 本位进行交易的局面是如何形成的?
The art of communication III: Listening between people
分类TAB商品流多目标排序模型的演进
MFC pet store information management system
我用开天平台做了一个城市防疫政策查询系统【开天aPaaS大作战】
COMSOL--三维随便画--扫掠
[there may be no default font]warning: imagettfbbox() [function.imagettfbbox]: invalid font filename
Dspic33ep clock initialization program
vite//
R3Live系列学习(四)R2Live源码阅读(2)
Spark Tuning (I): from HQL to code
COMSOL--三维图形的建立
uniapp
Array
Leetcode 185 All employees with the top three highest wages in the Department (July 4, 2022)
Variables///
R3live series learning (IV) r2live source code reading (2)
如何将 DevSecOps 引入企业?
Process control
紫光展锐全球首个5G R17 IoT NTN卫星物联网上星实测完成