PyTorch training process was interrupted
2022-07-05 11:17:00 【IMQYT】
I was scared to death: with my own hand I killed a model training process that had been running for 3 days. I almost cried. Was a week of server rental money wasted? Was the time wasted? Could it be salvaged? This was the first time it had happened to me, and my code runs very slowly (on an RTX A5000 it should not be slow, but there is a lot of data). To cut the time lost to log I/O I wrote no logs; only the model was saved. My hands were shaking.
Enough talk. How can this be recovered?
The code saved the model with torch.save and nothing else. No other parameters were saved, not even the epoch. After reading through a lot of other people's experience, I finally found a fix.
Reload the model
import torch
# model must already be constructed with the same architecture as the saved one
path='autodl-tmp/GraphDTA-master/model_GINConvNet_kiba.model'
model.load_state_dict(torch.load(path))
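As a minimal sketch of this reload step, assuming the saved file holds a state_dict written with torch.save(model.state_dict(), path): the tiny nn.Linear model, the build_model helper, and the temporary file name below are hypothetical stand-ins, not the actual GraphDTA network or path.

```python
import os
import tempfile

import torch
import torch.nn as nn


def build_model():
    # Hypothetical stand-in for the real network (e.g. GINConvNet).
    return nn.Linear(4, 1)


# Pretend this is the interrupted run: only the weights were saved.
trained = build_model()
path = os.path.join(tempfile.mkdtemp(), "demo.model")  # illustrative path
torch.save(trained.state_dict(), path)

# Recovery: rebuild the same architecture, then load the saved weights.
model = build_model()
state = torch.load(path, map_location="cpu")
model.load_state_dict(state)

# Check that every restored tensor equals the saved one.
same = all(
    torch.equal(a, b)
    for a, b in zip(trained.state_dict().values(), model.state_dict().values())
)
print(same)
```

Note that this brings back only the weights. Anything that was never written to disk, such as the optimizer state or the epoch counter, is not restored by load_state_dict.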
With this, what the model had learned is back: the weights are restored, so the loss and the predictions pick up from where training stopped rather than from scratch.
From the output I could see that the loss really did continue from the level reached after 294 epochs of training, and the predicted values likewise looked like the results after the 294th epoch. Luckily I got it back. There was one problem, though: the epoch counter started again from 1. Does that mean training another 600 epochs? No. Remember to change the total number of epochs: 600 - 294 = 306. The console still shows epoch 1 at the start, but training ends after another 306 epochs. Done.
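To avoid having to work out the epoch count by hand next time, one option is to save a fuller checkpoint that also records the epoch and the optimizer state. The sketch below is only an assumption about how that could look, not the original GraphDTA code: the checkpoint keys, the save_checkpoint helper, the tiny model, the Adam optimizer, and the file name are all illustrative.

```python
import os
import tempfile

import torch
import torch.nn as nn

TOTAL_EPOCHS = 600  # total number of epochs planned in the post
ckpt_path = os.path.join(tempfile.mkdtemp(), "checkpoint.pt")  # hypothetical path

model = nn.Linear(4, 1)  # stand-in for the real network
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # assumed optimizer


def save_checkpoint(epoch):
    # Store everything needed to resume, not just the weights.
    torch.save(
        {
            "epoch": epoch,
            "model": model.state_dict(),
            "optimizer": optimizer.state_dict(),
        },
        ckpt_path,
    )


# Pretend the run was interrupted right after epoch 294.
save_checkpoint(294)

# On restart: restore weights, optimizer state, and the epoch counter.
ckpt = torch.load(ckpt_path, map_location="cpu")
model.load_state_dict(ckpt["model"])
optimizer.load_state_dict(ckpt["optimizer"])
start_epoch = ckpt["epoch"] + 1
remaining = TOTAL_EPOCHS - ckpt["epoch"]
print(start_epoch, remaining)  # 295 306

for epoch in range(start_epoch, TOTAL_EPOCHS + 1):
    pass  # train one epoch here, then call save_checkpoint(epoch)
```

With the epoch stored in the checkpoint, the console would continue from 295 instead of restarting at 1, and the total number of epochs would not need to be edited.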