当前位置:网站首页>Huawei modelarts training alexnet model
Huawei modelarts training alexnet model
2022-07-01 07:20:00 【Flower boy】
One 、 Reference material
Two 、 Key steps
2.1 Upload Source code To obs
.
|-- dataset # Data sets
| |-- train
| `-- val `-- train
|-- data.py
|-- model.py
`-- train_npu.py # Startup file
2.2 obs Create directory
obs Directory structure
log: Log storage path
output: Training output path

2.3 Create algorithm
Algorithm management
---》 establish
name : Customize
describe : Customize
How it was created : Customize
AI engine :tensorflow_1.15-cann_5.0.3-py_3.7-euler_2.8.3-aarch64
Code directory :/xxx/alexnet/train/
Startup file :/xxx/alexnet/train/train_npu.py
Input data configuration : Dataset path
Mapping name : Customize ( The default can be )
Code path parameters :data_url
Output data configuration : Training output path
Mapping name : Customize ( The default can be )
Code path parameters :train_url
other : The default can be

2.4 Create training assignments
Training management
---》 Training assignment
---》 establish
name : Customize
describe : Customize
Algorithm :
---》 My algorithm
---》 Check the algorithm just created
Training input :
---》data_url:/xxx/alexnet/dataset/
Training output :
---》train_url:/xxx/alexnet/output/
Resource pool : According to the actual demand
The resource type :Ascend
specifications : According to the actual demand
Calculate the number of nodes : According to the actual demand
Job log path :/xxx/alexnet/log/

2.5 Training done

3、 ... and 、FAQ
Q:The input shape of GeOp5_0 is dynamic
File "/home/ma-user/modelarts/user-job-dir/train/train_npu.py", line 114, in main
test_loss, test_acc, summary = sess.run([cost, accuracy, summary_op], feed_dict=val_feed)
File "/home/ma-user/anaconda/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 956, in run
run_metadata_ptr)
File "/home/ma-user/anaconda/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 1180, in _run
feed_dict_tensor, options, run_metadata)
File "/home/ma-user/anaconda/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 1359, in _do_run
run_metadata)
File "/home/ma-user/anaconda/lib/python3.7/site-packages/tensorflow_core/python/client/session.py", line 1384, in _do_call
raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InternalError: The input shape of GeOp5_0 is dynamic, please ensure that npu option[dynamic_input] is set correctly, for more details please refer to the migration guide.
[[{
{
node GeOp5_0}}]]
The reason for the error :
There are inputs during model training shape Problem of change
terms of settlement :
train_npu.py Add
custom_op.parameter_map["dynamic_input"].b = True
custom_op.parameter_map["dynamic_graph_execute_mode"].s = tf.compat.as_bytes("lazy_recompile")
边栏推荐
- The computer has a network, but all browser pages can't be opened. What's the matter?
- Buildreoot override mechanism
- Which securities company is better or safer for mobile phone account opening
- Subclasses call methods and properties of the parent class with the same name
- weback5基础配置详解
- 如何画产品架构图?
- 华泰证券开户是安全可靠的么?怎么开华泰证券账户
- 关于“2022年度网络安全教育线上培训”相关问题的复盘和说明
- kdtree(kd树)笔记
- Operation and maintenance management system, humanized operation experience
猜你喜欢

Will Internet talents be scarce in the future? Which technology directions are popular?

【LINGO】求无向图的最短路问题
![C language implementation [minesweeping game] full version (implementation source code)](/img/70/60f9a61bd99fa5fb5fab679a32528e.png)
C language implementation [minesweeping game] full version (implementation source code)

Why did grayscale fall from the altar?

Is it suitable for girls to study product manager? What are the advantages?

微软宣布开源 (GODEL) 语言模型聊天机器人

【推荐系统】美团外卖推荐场景的深度位置交互网络DPIN的突破与畅想

ctfshow-web352,353(SSRF)

【编程强训2】排序子序列+倒置字符串

图像风格迁移 CycleGAN原理
随机推荐
go-etcd
Cadence OrCAD Capture “网络名”相同,但是未连接或连接错误的解放方案之nodename的用法
Buildreoot override mechanism
Those high-frequency written tests and interview questions in [Jianzhi offer & Niuke 101] - linked list
[chapter 72 of the flutter problem series] a solution to the problem that pictures taken in the flutter using the camera plug-in are stretched
[Shenzhen IO] precise Food Scale (some understanding of assembly language)
Easynvs cloud management platform function reconfiguration: support adding users, modifying information, etc
Paging in servlets and JSPS
Programming examples of stm32f1 and stm32subeide infrared receiving and decoding of NEC protocol
redisson使用全解——redisson官方文檔+注釋(上篇)
[Electrical dielectric number] electrical dielectric number and calculation considering HVDC and facts components
ctfshow-web352,353(SSRF)
开源了!文心大模型ERNIE-Tiny轻量化技术,又准又快,效果全开
如何制作专属的VS Code主题
Mysql与Redis一致性解决方案
灰度何以跌下神坛?
Is it safe to do fund fixed investment on Great Wall Securities?
北漂程序员深夜emo发帖求助:女朋友走了我很孤独 ......
[classification model] Q-type cluster analysis
【剑指offer&牛客101】中那些高频笔试,面试题——链表篇