当前位置:网站首页>Pytorch training process was interrupted
Pytorch training process was interrupted
2022-07-05 11:17:00 【IMQYT】
I'm scared to death , Training 3 The process of Tian's model was killed by his own hand , I almost cried , Has the money for renting a server for a week been wasted , Is time wasted , Can it be remedied ! For the first time , And my code runs very slowly (RTXA5000, It's reasonable to say that it's not slow , Too much data , In order to reduce the number of logs IO Wasted time , There is no log ), Only the model is saved . Already my hands are shaking
Don't talk much , How to remedy it ?

Save the model in the code only torch.save. Other parameters are not saved .epoch Nothing is saved , Found a lot of experience , Finally find a remedy
Reload the model
path='autodl-tmp/GraphDTA-master/model_GINConvNet_kiba.model'
model.load_state_dict(torch.load(path))In this case , What the model learned is back , Include loss And so on. .

From here I can see ,loss It did continue 294 The training of the time , The same is true of the predicted value. Continue 294 The result after the first time , Fortunately, I got it back , But there was a problem , Because I saw epoch It seems to be from 1 Here we go , In this case, we need to train 600 Time ?, So remember to revise epoch The total number of times ,600-294=306, Although the control interrupt writes this 1, But retraining 306 This time it will end . Be accomplished
边栏推荐
- C # to obtain the filtered or sorted data of the GridView table in devaexpress
- Web Components
- 2022 t elevator repair operation certificate examination questions and answers
- [SWT component] content scrolledcomposite
- Process control
- Do you really understand the things about "prototype"? [part I]
- A mining of edu certificate station
- I used Kaitian platform to build an urban epidemic prevention policy inquiry system [Kaitian apaas battle]
- Summary of websites of app stores / APP markets
- [advertising system] parameter server distributed training
猜你喜欢

关于vray 5.2的使用(自研笔记)

Intelligent metal detector based on openharmony

如何将 DevSecOps 引入企业?
![[office] eight usages of if function in Excel](/img/ce/ea481ab947b25937a28ab5540ce323.png)
[office] eight usages of if function in Excel

Go language learning notes - first acquaintance with go language

A mining of edu certificate station

Summary of thread and thread synchronization under window

修复动漫1K变8K

7 大主题、9 位技术大咖!龙蜥大讲堂7月硬核直播预告抢先看,明天见

About the use of Vray 5.2 (self research notes)
随机推荐
【爬虫】wasm遇到的bug
DDRx寻址原理
IPv6与IPv4的区别 网信办等三部推进IPv6规模部署
DOM//
如何将 DevSecOps 引入企业?
Lombok makes ⽤ @data and @builder's pit at the same time. Are you hit?
How to close the log window in vray5.2
Technology sharing | common interface protocol analysis
Operators
紫光展锐全球首个5G R17 IoT NTN卫星物联网上星实测完成
居家办公那些事|社区征文
COMSOL--建立几何模型---二维图形的建立
DDR4的特性与电气参数
9、 Disk management
String
32: Chapter 3: development of pass service: 15: Browser storage media, introduction; (cookie,Session Storage,Local Storage)
Explanation of message passing in DGL
regular expression
Basics - rest style development
Bracket matching problem (STL)