当前位置:网站首页>项目场景 with ERRTYPE = cudaError CUDA failure 999 unknown error
项目场景 with ERRTYPE = cudaError CUDA failure 999 unknown error
2022-08-02 02:20:00 【mtl1994】
项目场景 [with ERRTYPE = cudaError; bool THRW = true] CUDA failure 999: unknown error ; GPU=24 :
需要升级之前老的程序,之前的cuda 是10.2
问题描述:
环境
cuda 11.2 (之前是10.2)
onnxruntime-gpu 1.10
python 3.9.7

启动程序的时候
Traceback (most recent call last):
File "/home/aiuser/cover/liheng-foggun/app.py", line 15, in <module>
model = DetectMultiBackend(weights=config.paddle.model_file)
File "/home/aiuser/miniconda3/envs/cover/lib/python3.9/site-packages/torch/autograd/grad_mode.py", line 28, in decorate_context
return func(*args, **kwargs)
File "/home/aiuser/cover/liheng-foggun/models/yolo.py", line 37, in __init__
self.session = onnxruntime.InferenceSession(weights, providers=['CUDAExecutionProvider'])
File "/home/aiuser/miniconda3/envs/cover/lib/python3.9/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 335, in __init__
self._create_inference_session(providers, provider_options, disabled_optimizers)
File "/home/aiuser/miniconda3/envs/cover/lib/python3.9/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 379, in _create_inference_session
sess.initialize_session(providers, provider_options, disabled_optimizers)
RuntimeError: /onnxruntime_src/onnxruntime/core/providers/cuda/cuda_call.cc:122 bool onnxruntime::CudaCall(ERRTYPE, const char*, const char*, ERRTYPE, const char*) [with ERRTYPE =
cudaError; bool THRW = true] /onnxruntime_src/onnxruntime/core/providers/cuda/cuda_call.cc:116 bool onnxruntime::CudaCall(ERRTYPE, const char*, const char*
, ERRTYPE, const char*) [with ERRTYPE = cudaError; bool THRW = true] CUDA failure 999: unknown error ; GPU=24 ; hostname=aiserver-sl-01 ; expr=cudaSetDevice(info_.device_id);
原因分析:
1.刚开始以为是onnxruntime-gpu 版本问题 升级到了 1.12 还是报错
2.网上又说是不兼容的问题
3.试试重装下驱动,卸载了11.2 的时候 通过nvidia-smi 发现之前10.2的驱动还存在
4.是因为之前的驱动没有卸载干净
解决方案:
1.卸载10.2
sudo /usr/local/cuda-10.2/bin/cuda-uninstaller
2.安装新驱动
#离线安装 515.57
sudo ./NVIDIA-Linux-x86_64-515.57.run -no-x-check -no-nouveau-check
VIDIA-Linux-x86_64-515.57.run -no-x-check -no-nouveau-check
边栏推荐
- 拼多多借力消博会推动国内农产品品牌升级 看齐国际精品农货
- 2022-08-01 mysql/stoonedb slow SQL-Q18 analysis
- Rasa 3 x learning series - Rasa - 4873 dispatcher Issues. Utter_message study notes
- AWR analysis report questions for help: How can SQL be optimized from what aspects?
- Effects of Scraping and Aggregation
- FOFAHUB usage test
- LeetCode brushing diary: 53, the largest sub-array and
- 面对职场“毕业”,PM&PMO应该如何从容的应对?如何跳槽能够大幅度升职加薪?
- 永磁同步电机36问(二)——机械量与电物理量如何转化?
- 2023年起,这些地区软考成绩低于45分也能拿证
猜你喜欢

Remember a pit for gorm initialization

2023年起,这些地区软考成绩低于45分也能拿证

Install mysql using docker

Remember a gorm transaction and debug to solve mysql deadlock

列表常用方法
![[Unity entry plan] 2D Game Kit: A preliminary understanding of the composition of 2D games](/img/8a/07ca69c6dcc22757156cb615e241f8.png)
[Unity entry plan] 2D Game Kit: A preliminary understanding of the composition of 2D games

Project Background Technology Express

接口测试神器Apifox究竟有多香?

BioVendor Human Club Cellular Protein (CC16) Elisa Kit Research Fields

拼多多借力消博会推动国内农产品品牌升级 看齐国际精品农货
随机推荐
BI - SQL 丨 WHILE
LeetCode 213. Robbery II (2022.08.01)
Win Go development kit installation configuration, GoLand configuration
LeetCode刷题日记: 33、搜索旋转排序数组
A good book for newcomers to the workplace
cocos中使用async await异步加载资源
Install mysql using docker
JVM调优实战
oracle查询扫描全表和走索引
永磁同步电机36问(三)——SVPWM代码实现
永磁同步电机36问(二)——机械量与电物理量如何转化?
MySQL8 download, start, configure, verify
【Unity入门计划】2D Game Kit:初步了解2D游戏组成
The underlying data structure of Redis
nacos startup error, the database has been configured, stand-alone startup
CodeTon Round 2 D. Magical Array 规律
[LeetCode Daily Question] - 103. Zigzag Level Order Traversal of Binary Tree
swift项目,sqlcipher3 -&gt; 4,无法打开旧版数据库有办法解决吗
Constructor instance method inheritance of typescript37-class (extends)
Use DBeaver for mysql data backup and recovery