当前位置:网站首页>AI chief architect 10-aica-lanxiang, propeller frame design and core technology
AI chief architect 10-aica-lanxiang, propeller frame design and core technology
2022-06-30 18:39:00 【The mountain of ignorance, the valley of despair, the slope of 】
0、 Introduce
platform
1、 Overall introduction to the propeller platform
The development process of deep learning
In depth learning framework process
Panoramic view of propeller
Facing the challenge
Leading technology to solve the challenge
Unity of movement and stillness
High low fusion
Simplify development process
Rich model base
Rich course
2、 The design of the training frame and the core technology of the propeller
Users and industry landing
The overall structure of the training framework
IR, Reduce complexity
Unify front-end conversion and performance
First multi-layer IR
for the first time IR
Static diagram
mapping
The second time IR
Dynamic and static conversion
Dynamic graphs contain static graphs
Dynamic tree rotation , Tree optimization , Tree generated static graph code
Tree transformation
Analysis and transcription
Dynamic and static conversion
3、 performance optimization
Optimization of each stage
Mixing accuracy , Reduce space and speed up
Facing problems
solve the problem , Zoom in , Back up high-precision data
Overall hybrid accuracy solution
Sparsity acceleration
Structured hardware acceleration and unstructured software acceleration
Hardware acceleration , Compress
effect
Model quantification
Quantitative training
Quantify after training
Quantify the effect , The data distribution
OP Fusion optimization
Multiple integration of one
Vertical integration and horizontal integration , The horizontal direction can be split at the back
Commonly used
Coding optimization
contrast
Operator optimization , Middle layer , Focus logic is different and common
kernel Optimize the system
effect
Hardware auto sensing optimization
effect
4、 Deep learning compiler
XLA
TVM
Compiler comparison
Baidu compiler
Training and reasoning can be automatically tuned
effect
summary
边栏推荐
- LeetCode之合并二叉树
- countdownlatch 和 completableFuture 和 CyclicBarrier
- C语言结构体
- Redis - persistent RDB and persistent AOF
- MySQL找不到mysql.sock文件的临时解
- Ardunio esp32 obtains real-time temperature and humidity in mqtt protocol (DH11)
- 冰河老师的书
- Deep understanding of JVM (III) - memory structure (III)
- Rust 书籍资料 - 芽之家书馆
- 英飞凌--GTM架构-Generic Timer Module
猜你喜欢
剑指 Offer 16. 数值的整数次方
Helping the ultimate experience, best practice of volcano engine edge computing
Deep understanding of JVM (III) - memory structure (III)
Deep understanding of JVM (V) - garbage collection (II)
医院在线问诊小程序源码 互联网医院源码 智慧医院源码
In distributed scenarios, do you know how to generate unique IDs?
C# Winform程序界面优化实例
AI首席架构师10-AICA-蓝翔 《飞桨框架设计与核心技术》
[PROJECT] Xiaomao school (IX)
Customer relationship CRM management system based on SSH
随机推荐
MySQL advanced - basic index and seven joins
Tensorflow2 ten must know for deep learning
Development and construction of NFT mining tour gamefi chain tour system
Deep understanding of JVM (VI) -- garbage collection (III)
Redis (IX) - enterprise level solution (II)
Advanced embedded application of uni app [day14]
漏洞复现----37、Apache Unomi 远程代码执行漏洞 (CVE-2020-13942)
小程序容器技术,促进园区运营效率提升
Apple Watch无法开机怎么办?苹果手表不能开机解决方法!
Rust 操控大疆可编程无人机 tello
Deep understanding of JVM (IV) - garbage collection (I)
Ardunio esp32 obtains real-time temperature and humidity in mqtt protocol (DH11)
TCP session hijacking based on hunt1.5
「经验」浅谈聚类分析在工作中的应用
Apache parsing vulnerability (cve-2017-15715)_ Vulnerability recurrence
uni-app进阶之内嵌应用【day14】
Dropout: immediate deactivation
Post office - post office issues (dynamic planning)
Ardunio esp32 DH11 real time uploading temperature and humidity Alibaba cloud self built mqtt
Compilation problems and solutions of teamtalk winclient