当前位置:网站首页>Anomaly-Transformer (ICLR 2022 Spotlight)复现过程及问题
Anomaly-Transformer (ICLR 2022 Spotlight)复现过程及问题
2022-07-01 22:56:00 【理心炼丹】
作者推荐的是 python3.6,pytorch 1.4
1. 环境修改
尝试安装 pytorch 1.4 运行,但是代码会卡住,并且没有报错。定位错误在:Anomaly-Transformer/model/attn.py
self.distances = torch.zeros((window_size, window_size)).cuda().cuda() 卡住:原因是 安装的 pytorch 1.4 对应的CUDA 版本为 10.x,算力是 sm_86,CUDA 10.x 最高支持到 sm_75,因此需要CUDA 11.x来支持sm_8.x。
因此升级 我的环境 python3.7, pytorch 1.12 , 显卡3080Ti, CUDA 版本:11.3
conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch再次运行训练脚本,又报错:
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [512, 25]], which is output 0 of AsStridedBackward0, is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).
解决:注释掉Anomaly-Transformer/solver.py 的第一个 .step():
# Minimax strategy
loss1.backward(retain_graph=True)
# self.optimizer.step()
loss2.backward()
self.optimizer.step()参考:Why the optimizer.step() write twice? · Issue #8 · thuml/Anomaly-Transformer · GitHub
2. 恭喜! 成功运行!
python main.py --anormly_ratio 1 --num_epochs 3 --batch_size 128 --mode train --dataset PSM --data_path dataset/PSM --input_c 25 --output_c 25------------ Options -------------
anormly_ratio: 1.0
batch_size: 128
data_path: dataset/PSM
dataset: PSM
input_c: 25
k: 3
lr: 0.0001
mode: train
model_save_path: checkpoints
num_epochs: 3
output_c: 25
pretrained_model: None
win_size: 100
======================TEST MODE======================
/opt/conda/lib/python3.7/site-packages/torch/nn/_reduction.py:42: UserWarning: size_average and reduce args will be deprecated, please use reduction='none' instead.
warnings.warn(warning.format(ret))
Threshold : 0.002150955616962149
pred: (87800,)
gt: (87800,)
pred: (87800,)
gt: (87800,)
Accuracy : 0.9848, Precision : 0.9713, Recall : 0.9739, F-score : 0.9726
论文中的结果:对于PSM数据集
P: 96.91,R: 98.9, F1: 97.89
复现的 Recall 略低。但是 Precision 略高。二者本就是需要权衡。可以通过调整上面的 Threshold : 0.002150955616962149 平衡二者。
边栏推荐
- MySQL binlog cleanup
- [MySQL] database optimization method
- 认识--Matplotlib
- 2021 RoboCom 世界机器人开发者大赛-高职组复赛
- Matplotlib常用設置
- STM32F030F4驱动TIM1637数码管芯片
- 【微服务|Sentinel】@SentinelResource详解
- [机缘参悟-35]:鬼谷子-飞箝篇-远程连接、远程控制与远程测试之术
- Notes on problems - /usr/bin/perl is needed by mysql-server-5.1.73-1 glibc23.x86_ sixty-four
- Zero foundation tutorial of Internet of things development
猜你喜欢

问题随记 —— /usr/bin/perl is needed by MySQL-server-5.1.73-1.glibc23.x86_64

The online beggar function of Japanese shopping websites

玻璃马赛克

Programming English vocabulary notebook

【小程序】通过scroll-view组件实现左右【滑动】列表

Stm32f030f4 drives tim1637 nixie tube chip

Notes to problems - file /usr/share/mysql/charsets/readme from install of mysql-server-5.1.73-1 glibc23.x86_ 64 c

日本购物网站的网络乞丐功能

认识--Matplotlib

硅谷产品实战学习感触
随机推荐
转行软件测试,知道这四点就够了!
2022 examination questions and online simulation examination for safety management personnel of hazardous chemical business units
问题随记 —— /usr/bin/perl is needed by MySQL-server-5.1.73-1.glibc23.x86_64
YOGA27多维一体电脑,兼具出色外观与高端配置
CKS CKA ckad change terminal to remote desktop
[LeetCode] 最后一个单词的长度【58】
建模和影视后期有什么关联?
The online beggar function of Japanese shopping websites
[applet] realize the left and right [sliding] list through the scroll view component
Why is PHP called hypertext preprocessor
flutter Unable to load asset: assets/images/888. png
Wechat personal small store one click opening assistant applet development
玻璃马赛克
CADD课程学习(3)-- 靶点药物相互作用
2022 crane driver (limited to bridge crane) examination questions and simulation examination
Matplotlib常用設置
Daily three questions 6.30
[micro service sentinel] @sentinelresource details
CADD course learning (3) -- target drug interaction
Distance measurement - Hamming distance