Project scenario: [with ERRTYPE = cudaError] CUDA failure 999: unknown error
2022-08-02 02:20:00 【mtl1994】
Project scenario: [with ERRTYPE = cudaError; bool THRW = true] CUDA failure 999: unknown error ; GPU=24
An older program needed to be upgraded. It previously ran on CUDA 10.2.
Problem description:
Environment
CUDA 11.2 (previously 10.2)
onnxruntime-gpu 1.10
python 3.9.7

Starting the program fails with:
Traceback (most recent call last):
  File "/home/aiuser/cover/liheng-foggun/app.py", line 15, in <module>
    model = DetectMultiBackend(weights=config.paddle.model_file)
  File "/home/aiuser/miniconda3/envs/cover/lib/python3.9/site-packages/torch/autograd/grad_mode.py", line 28, in decorate_context
    return func(*args, **kwargs)
  File "/home/aiuser/cover/liheng-foggun/models/yolo.py", line 37, in __init__
    self.session = onnxruntime.InferenceSession(weights, providers=['CUDAExecutionProvider'])
  File "/home/aiuser/miniconda3/envs/cover/lib/python3.9/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 335, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "/home/aiuser/miniconda3/envs/cover/lib/python3.9/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 379, in _create_inference_session
    sess.initialize_session(providers, provider_options, disabled_optimizers)
RuntimeError: /onnxruntime_src/onnxruntime/core/providers/cuda/cuda_call.cc:122 bool onnxruntime::CudaCall(ERRTYPE, const char*, const char*, ERRTYPE, const char*) [with ERRTYPE = cudaError; bool THRW = true] /onnxruntime_src/onnxruntime/core/providers/cuda/cuda_call.cc:116 bool onnxruntime::CudaCall(ERRTYPE, const char*, const char*, ERRTYPE, const char*) [with ERRTYPE = cudaError; bool THRW = true] CUDA failure 999: unknown error ; GPU=24 ; hostname=aiserver-sl-01 ; expr=cudaSetDevice(info_.device_id);
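The end of that RuntimeError carries the useful fields: the CUDA error code, the host, and the expression that failed. Here the expression is cudaSetDevice, so the failure happens while selecting the GPU during session initialization, which suggests a driver or runtime problem rather than a problem with the model. The sketch below pulls those fields out of such a message. The helper name parse_cuda_failure and the field layout are assumptions based on this single sample, not a documented onnxruntime format.

```python
import re

# Layout inferred from the one error message shown above (an assumption,
# not a documented format).
_CUDA_FAILURE = re.compile(
    r"CUDA failure (?P<code>\d+): (?P<text>.*?) ; "
    r"GPU=(?P<gpu>-?\d+) ; hostname=(?P<host>\S+) ; expr=(?P<expr>.*?);\s*$"
)


def parse_cuda_failure(message):
    """Return the fields of a 'CUDA failure' message as a dict, or None."""
    match = _CUDA_FAILURE.search(message)
    if match is None:
        return None
    fields = match.groupdict()
    fields["code"] = int(fields["code"])
    fields["gpu"] = int(fields["gpu"])
    return fields


# Tail of the error message from the traceback above.
sample = (
    "[with ERRTYPE = cudaError; bool THRW = true] "
    "CUDA failure 999: unknown error ; GPU=24 ; "
    "hostname=aiserver-sl-01 ; expr=cudaSetDevice(info_.device_id);"
)
info = parse_cuda_failure(sample)
print(info["code"], info["expr"])
```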
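Cause analysis:
1. At first I suspected the onnxruntime-gpu version, but upgrading to 1.12 still produced the same error.
2. Posts online suggested it was a compatibility problem.
3. I then tried reinstalling the driver. After uninstalling 11.2, nvidia-smi showed that the old driver from the 10.2 installation was still present.
4. So the cause was that the previous driver had never been fully uninstalled.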
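A stale driver like this can be spotted from the header of the nvidia-smi output, which reports the driver version and the highest CUDA version that driver supports. The following is a minimal sketch of that check. The function names are hypothetical, the sample header line is illustrative and not taken from the author's machine, and the comparison is deliberately conservative: CUDA's minor-version compatibility may allow some combinations that it rejects.

```python
import re
import subprocess

_HEADER = re.compile(
    r"Driver Version:\s*(?P<driver>[\d.]+)\s+CUDA Version:\s*(?P<cuda>[\d.]+)"
)


def parse_smi_header(text):
    """Extract (driver_version, max_cuda_version) from nvidia-smi output."""
    match = _HEADER.search(text)
    if match is None:
        return None
    return match.group("driver"), match.group("cuda")


def version_tuple(version):
    """Turn '11.2' into (11, 2) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def driver_supports(max_cuda, required_cuda):
    """Conservative check: the driver's reported CUDA version must be >= required."""
    return version_tuple(max_cuda) >= version_tuple(required_cuda)


def read_smi():
    """Run nvidia-smi and return its output (needs an NVIDIA driver installed)."""
    result = subprocess.run(
        ["nvidia-smi"], capture_output=True, text=True, check=True
    )
    return result.stdout


# Illustrative header line; on a real machine pass read_smi() instead.
sample = "| NVIDIA-SMI 440.33.01    Driver Version: 440.33.01    CUDA Version: 10.2     |"
driver, max_cuda = parse_smi_header(sample)
print(driver, max_cuda, driver_supports(max_cuda, "11.2"))
```

If the reported CUDA version is lower than the toolkit the program now targets (11.2 here), the old driver is most likely still the one loaded.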
Solution:
1. Uninstall 10.2
sudo /usr/local/cuda-10.2/bin/cuda-uninstaller
2. Install the new driver
# Offline install of 515.57
sudo ./NVIDIA-Linux-x86_64-515.57.run -no-x-check -no-nouveau-check
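After both steps it can help to confirm that no old toolkit directory was left behind. The sketch below lists versioned toolkit directories, assuming toolkits live under /usr/local/cuda-<version> as the uninstaller path above suggests. The function name installed_toolkits is hypothetical, and the demonstration runs against a temporary directory rather than a real system.

```python
from pathlib import Path
import tempfile


def installed_toolkits(root="/usr/local"):
    """List versioned CUDA toolkit directories such as cuda-10.2 under root."""
    return sorted(p.name for p in Path(root).glob("cuda-*") if p.is_dir())


# Demonstration against a throwaway directory that mimics /usr/local
# before the cleanup; on a real machine call installed_toolkits() directly.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "cuda-11.2").mkdir()
    (Path(tmp) / "cuda-10.2").mkdir()
    demo = installed_toolkits(tmp)
print(demo)
```

On the upgraded machine, cuda-10.2 should no longer appear in the result once the uninstaller has run.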