PyTorch training process was interrupted
2022-07-05 11:17:00 【IMQYT】
I was scared to death: with my own hands I killed the process of a model that had been training for 3 days. I almost cried. Was the money for a week of server rental wasted? Was the time wasted? Could it be salvaged? This was the first time it had happened to me, and my code runs very slowly. (On an RTX A5000 it should not be slow, but there is a lot of data. To cut the time spent on log I/O, I wrote no logs.) Only the model was saved. My hands were already shaking.
Enough talk. How can it be remedied?
The code saves the model with torch.save and nothing else. No other parameters are saved, not even the epoch. After reading through a lot of other people's experience, I finally found a remedy.
Reload the model
path='autodl-tmp/GraphDTA-master/model_GINConvNet_kiba.model'
model.load_state_dict(torch.load(path))
With this, what the model had learned is back: the loss and the other metrics continue from where training stopped instead of starting over.
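Loading only the weights does not bring back the epoch counter or the optimizer state, because neither was saved. As a minimal sketch of how a later run could avoid that, the code below stores all three in one file. The helper names `save_checkpoint` and `load_checkpoint`, the dictionary keys, and the small `nn.Linear` stand-in model are assumptions for illustration and are not part of the original GraphDTA code.

```python
import os
import tempfile

import torch
from torch import nn


def save_checkpoint(path, model, optimizer, epoch):
    """Save weights, optimizer state, and the last finished epoch in one file."""
    torch.save(
        {
            "model_state": model.state_dict(),
            "optimizer_state": optimizer.state_dict(),
            "epoch": epoch,
        },
        path,
    )


def load_checkpoint(path, model, optimizer):
    """Restore both state dicts and return the epoch to resume from."""
    checkpoint = torch.load(path, map_location="cpu")
    model.load_state_dict(checkpoint["model_state"])
    optimizer.load_state_dict(checkpoint["optimizer_state"])
    return checkpoint["epoch"] + 1


# Hypothetical stand-ins; the real model would be the network being trained.
model = nn.Linear(4, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

ckpt_path = os.path.join(tempfile.mkdtemp(), "checkpoint.pt")
save_checkpoint(ckpt_path, model, optimizer, epoch=294)

# A fresh model and optimizer, as after a restart of the process.
restored_model = nn.Linear(4, 1)
restored_optimizer = torch.optim.Adam(restored_model.parameters(), lr=1e-3)
start_epoch = load_checkpoint(ckpt_path, restored_model, restored_optimizer)
```

Calling `save_checkpoint` at the end of every epoch (or every few epochs) would let an interrupted run resume at `start_epoch` without any manual arithmetic.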
From the training output I could see that the loss really did continue from the 294th epoch, and the predicted values were likewise the results of continuing after epoch 294. Luckily I got it back. One problem remained: the epoch counter appeared to start from 1 again. Did that mean training another 600 epochs? So remember to change the total number of epochs: 600 - 294 = 306. The console shows the resumed run starting at epoch 1, but training ends after another 306 epochs. Done.
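The epoch arithmetic can be sketched as follows. The function name `remaining_epochs` and the variable names are hypothetical; only the numbers 600 and 294 come from the run described here.

```python
def remaining_epochs(total_epochs, completed_epochs):
    """Return how many epochs are still needed to reach the planned total."""
    if not 0 <= completed_epochs <= total_epochs:
        raise ValueError("completed_epochs must be between 0 and total_epochs")
    return total_epochs - completed_epochs


TOTAL_EPOCHS = 600      # originally planned number of epochs
COMPLETED_EPOCHS = 294  # epochs finished before the process was killed

NUM_EPOCHS = remaining_epochs(TOTAL_EPOCHS, COMPLETED_EPOCHS)

# The resumed loop counts from 1 again, so map each displayed epoch
# back to its position in the original 600-epoch schedule.
labels = []
for epoch in range(NUM_EPOCHS):
    displayed = epoch + 1
    actual = COMPLETED_EPOCHS + displayed
    labels.append((displayed, actual))
```

Here `NUM_EPOCHS` is the value to set as the new total, and `labels` pairs the epoch number shown on the console with the true epoch in the original schedule.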