MindSpore: graph-kernel fusion error
2022-08-04 09:03:00 【小乐快乐】
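Problem description:
I am trying graph-kernel fusion on MindSpore 1.1.1, with Ascend 910 as the hardware environment. The program runs without problems, but after adding the graph-kernel fusion parameter (enable_graph_kernel=True) it fails: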
ms.context.set_context(mode=ms.context.GRAPH_MODE, device_target=args.device_target,
save_graphs=False, max_call_depth=10000, device_id=args.device_id, enable_graph_kernel=True)
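One way to confirm that the fusion switch alone triggers the failure is to keep every other argument fixed and flip only enable_graph_kernel. The following is a minimal sketch under that assumption; the helper names build_context_kwargs and apply_context are hypothetical and not part of MindSpore, and the argument values are the ones from the call above.

```python
def build_context_kwargs(device_target, device_id, enable_graph_kernel):
    """Collect the set_context arguments from the post.

    Hypothetical helper: only the fusion switch is meant to vary between runs.
    """
    return {
        "device_target": device_target,
        "save_graphs": False,
        "max_call_depth": 10000,
        "device_id": device_id,
        "enable_graph_kernel": enable_graph_kernel,
    }


def apply_context(kwargs):
    """Apply the arguments; needs a machine with MindSpore installed."""
    import mindspore as ms  # imported lazily so the helper above works anywhere

    ms.context.set_context(mode=ms.context.GRAPH_MODE, **kwargs)


# The two configurations to compare: identical except for the fusion switch.
baseline = build_context_kwargs("Ascend", 0, enable_graph_kernel=False)
fused = build_context_kwargs("Ascend", 0, enable_graph_kernel=True)
```

Running the same script once with apply_context(baseline) and once with apply_context(fused) should show whether the error appears only in the second case.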
The error log is as follows:
Traceback (most recent call last):
File "/opt/python3.7.5/lib/python3.7/site-packages/mindspore/_extends/parallel_compile/akg_compiler/compiler.py", line 35, in <module>
run_compiler(sys.argv[1])
File "/opt/python3.7.5/lib/python3.7/site-packages/mindspore/_extends/parallel_compile/akg_compiler/compiler.py", line 28, in run_compiler
p = __import__("akg", globals(), locals(), ['ms'], 0)
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/__init__.py", line 84, in <module>
from . import autodiff
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/autodiff.py", line 19, in <module>
from akg.tvm._ffi.function import _init_api
File "<frozen importlib._bootstrap>", line 983, in _find_and_load
File "<frozen importlib._bootstrap>", line 967, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 668, in _load_unlocked
File "<frozen importlib._bootstrap>", line 638, in _load_backward_compatible
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/__init__.py", line 62, in load_module
__import__(self.__rname, globals(), locals())
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/tvm/__init__.py", line 27, in <module>
from . import tensor
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/tvm/tensor.py", line 20, in <module>
from ._ffi.node import NodeBase, NodeGeneric, register_node, convert_to_node
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/tvm/_ffi/node.py", line 24, in <module>
from .object import Object, register_object, _set_class_node
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/tvm/_ffi/object.py", line 23, in <module>
from .base import _FFI_MODE, _RUNTIME_ONLY, check_call, _LIB, c_str
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/tvm/_ffi/base.py", line 81, in <module>
_LIB, _LIB_NAME = _load_lib()
File "/opt/python3.7.5/lib/python3.7/site-packages/akg/tvm/_ffi/base.py", line 73, in _load_lib
lib = ctypes.CDLL(lib_path[0], ctypes.RTLD_GLOBAL)
File "/opt/python3.7.5/lib/python3.7/ctypes/__init__.py", line 364, in __init__
self._handle = _dlopen(self._name, mode)
OSError: /opt/python3.7.5/lib/python3.7/site-packages/mindspore/lib/libakg.so: undefined symbol: pthread_atfork
Traceback (most recent call last):
Traceback (most recent call last):
File "/opt/python3.7.5/lib/python3.7/site-packages/mindspore/_extends/parallel_compile/akg_compiler/compiler.py", line 35, in <module>
File "/opt/python3.7.5/lib/python3.7/site-packages/mindspore/_extends/parallel_compile/akg_compiler/compiler.py", line 35, in <module>
run_compiler(sys.argv[1])
run_compiler(sys.argv[1])
File "/opt/python3.7.5/lib/python3.7/site-packages/mindspore/_extends/parallel_compile/akg_compiler/compiler.py", line 28, in run_compiler
File "/opt/python3.7.5/lib/python3.7/site-packages/mindspore/_extends/parallel_compile/akg_compiler/compiler.py", line 28, in run_compiler
p = __import__("akg", globals(), locals(), ['ms'], 0)
p = __import__("akg", globals(), locals(), ['ms'], 0)
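The full error log is too long and exceeds the maximum post length, so I have put it in the attachment error.txt.
Could anyone advise on how to solve this problem?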
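Solution:
The cause is that akg was built with a newer version of gcc and pthread_atfork could not be resolved, so libakg.so fails to load with an undefined symbol.
pthread_atfork is defined by the POSIX standard; add -pthread when compiling.
-lpthread only adds the library at link time and is the older habit; -pthread sets the flags for both compiling and linking, so with newer gcc versions use -pthread in place of -lpthread.
You can add pthread to target_link_libraries in akg's CMakeLists.txt and rebuild.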
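After rebuilding, the library can be checked without running a full training job. The sketch below loads a shared library with the same ctypes.CDLL(..., ctypes.RTLD_GLOBAL) call that appears in the traceback and, on failure, pulls the symbol name out of the error message. The function names try_load and undefined_symbol are hypothetical helpers, and the regular expression assumes the message has the same "undefined symbol: NAME" form as the OSError in the log.

```python
import ctypes
import re


def undefined_symbol(message):
    """Return the symbol named in an 'undefined symbol: NAME' message, or None."""
    match = re.search(r"undefined symbol: (\S+)", message)
    return match.group(1) if match else None


def try_load(lib_path):
    """Try to load lib_path the way the traceback shows akg doing it.

    Returns (True, None) on success, or (False, error message) on failure.
    """
    try:
        ctypes.CDLL(lib_path, ctypes.RTLD_GLOBAL)
    except OSError as err:
        return False, str(err)
    return True, None
```

For example, try_load("/opt/python3.7.5/lib/python3.7/site-packages/mindspore/lib/libakg.so") uses the path from the error log; if it returns a failure, passing the message to undefined_symbol shows whether pthread_atfork is still the missing symbol.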