MindSpore: [MindSpore 1.1] "cudaSetDevice failed" error when verifying the MindSpore installation
2022-07-30 19:04:00 【小乐快乐】
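Problem description:
After building an Ubuntu image with Singularity and installing MindSpore in it, the error "cudaSetDevice failed, ret[999], unknown error" appears.
Host operating system: CentOS Linux release 7.4.1708 (Core)
Singularity version: 3.5.2
Image operating system: Ubuntu 18.04.5 LTS (Bionic Beaver)
Image source: docker://nvidia/cuda:10.1-cudnn7-devel-ubuntu18.04
MindSpore 1.1.1 was installed in the image operating system by following the installation steps, and the installation succeeded. Verifying it with the sample program reports the following error: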
[ERROR] DEVICE(11474,python):2021-04-01-01:26:20.266.013 [mindspore/ccsrc/runtime/device/gpu/cuda_driver.cc:244] set_current_device] cudaSetDevice failed, ret[999], unknown error
[ERROR] SESSION(11474,python):2021-04-01-01:26:20.266.099 [mindspore/ccsrc/backend/session/gpu_session.cc:97] Init] GPUSession failed to set current device id.
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/opt/anaconda3/envs/python3.7/lib/python3.7/site-packages/mindspore/ops/primitive.py", line 186, in __call__
return _run_op(self, self.name, args)
File "/opt/anaconda3/envs/python3.7/lib/python3.7/site-packages/mindspore/common/api.py", line 75, in wrapper
results = fn(*arg, **kwargs)
File "/opt/anaconda3/envs/python3.7/lib/python3.7/site-packages/mindspore/ops/primitive.py", line 525, in _run_op
output = real_run_op(obj, op_name, args)
RuntimeError: mindspore/ccsrc/backend/session/gpu_session.cc:97 Init] GPUSession failed to set current device id.
Does anyone know what causes this, and how it can be fixed?
Solution:
The problem lies in the machine environment, not in MindSpore itself.
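The answer above does not say which part of the environment was at fault. Return code 999 is CUDA's generic "unknown error", so a reasonable first step is to check whether the GPU is visible from inside the container at all. The following is a minimal sketch for that check, not the fix confirmed in the original thread. The function names collect_gpu_environment and summarize are hypothetical helpers introduced here, and the script uses only the Python standard library. One possible cause (an assumption, not stated in the thread) is that the Singularity container was started without the --nv option, which is what exposes the host NVIDIA driver and devices to the container.

```python
import ctypes.util
import glob
import shutil


def collect_gpu_environment():
    """Gather basic facts about GPU visibility in the current environment."""
    return {
        # Device nodes such as /dev/nvidia0 are provided by the host NVIDIA driver.
        "device_nodes": sorted(glob.glob("/dev/nvidia*")),
        # Full path of nvidia-smi if it is on PATH, otherwise None.
        "nvidia_smi": shutil.which("nvidia-smi"),
        # Name of the CUDA driver library (libcuda) if the loader finds it, otherwise None.
        "libcuda": ctypes.util.find_library("cuda"),
    }


def summarize(report):
    """Return a list of warnings; an empty list means nothing is obviously missing."""
    warnings = []
    if not report["device_nodes"]:
        warnings.append("no /dev/nvidia* device nodes are visible")
    if report["nvidia_smi"] is None:
        warnings.append("nvidia-smi was not found on PATH")
    if report["libcuda"] is None:
        warnings.append("the CUDA driver library (libcuda) was not found")
    return warnings


if __name__ == "__main__":
    report = collect_gpu_environment()
    for key, value in report.items():
        print(f"{key}: {value}")
    for warning in summarize(report):
        print("WARNING:", warning)
```

Running the script once on the host and once inside the container (for example via singularity exec --nv <image> python <script>, where <image> and <script> are placeholders) shows whether the driver pieces disappear at the container boundary. If all three checks pass in both places, the cause is more likely on the host side, such as the driver state or a mismatch between the driver and CUDA 10.1, and this script cannot diagnose that.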