当前位置:网站首页>ONEFLOW V0.8.0 officially released
ONEFLOW V0.8.0 officially released
2022-07-25 21:10:00 【InfoQ】
- In addition to the original ZeRO-DP outside ,ZeRO The zero redundancy optimizer can work with MP,2-D,3-D Use in parallel , Further save the cost of video memory .
- Graph Proposed a new pipelined parallel API, While simplifying pipelined parallel configuration, accelerate pipelined parallel and 3-D Parallel performance .
- In order to further improve Graph.debug Debugging efficiency , Add about logic diagram 、light plan Physical diagram 、 Memory analysis 、Python Multi dimensional debugging functions such as stack information .


- Support for tiered storage , Dynamic capacity expansion Embedding, Users can expand at a lower cost Embedding Capacity
- Hybrid parallel strategy , It can easily expand the model horizontally to the scene of multiple machines and multiple cards
- Communication quantization compression function , In the parallel scenario , Quantize and compress the communication data , To reduce traffic , Improve your training speed
- Efficient data pipeline , Execute the parts of the model without data dependency in advance , Overlap in time
- Support automatic hybrid accuracy training , Part of the calculation will be converted into FP16 Data type calculation , Improve training speed while reducing the occupation of video memory , And it can ensure the convergence accuracy of the model
- Provide a series of high-performance for common operations of recommended system models CUDA operator
- Support flexible model building

- OneFlow Source list :GDB Compile debugging
- Reading Pathways: The next step forward is OneFlow
- OneFlow The source code parsing : Automatic inference of operator signature
- Hinton: My 50 years of in-depth study career and Research on mental skills
- LLVM The father of : Why should we rebuild AI Infrastructure software
- Quantitative model of parallel computing and its application in deep learning engine
- Large model training is difficult ? Superior efficiency 、 Easy-to-use “ Li Bai ” Here comes the model library
边栏推荐
- yuv422转rgb(422sp转420p)
- Huatai Securities account opening process, is it safe to open an account on your mobile phone
- MPI learning notes (II): two implementation methods of matrix multiplication
- MySQL inserts three tables with different values. The association condition is the primary foreign key. How about the syntax of the insertion statement?
- MPI学习笔记(二):矩阵相乘的两种实现方法
- CV image flipping, emgucv image rotation "recommended collection"
- Achieve accurate positioning based on Tencent map, and realize the attendance punch function of wechat applet
- SSH private key realizes login to remote target server
- Debugged PEB (beingdebugged, ntglobalflag)
- Temperature and humidity environment monitoring system based on stm32
猜你喜欢

Detailed explanation of document operation

Leetcode-6125: equal row and column pairs

Illustration leetcode - 3. longest substring without repeated characters (difficulty: medium)

As a test, how to understand thread synchronization and asynchrony

Leetcode-6131: the shortest dice sequence impossible to get
![[online tutorial] iptables official tutorial -- learning notes 2](/img/7d/5f8328d1b4c8878f17c95d2658d2d6.jpg)
[online tutorial] iptables official tutorial -- learning notes 2

Record the transfer of domain names from Alibaba cloud service providers to Huawei cloud

leetcode-155:最小栈

黑盒(功能)测试基本方法

Struct, enum type and union
随机推荐
Detailed explanation of document operation
What's special about Huawei's innovative solutions to consolidate the foundation of ERP for small and medium-sized enterprises?
CV image flipping, emgucv image rotation "recommended collection"
DDD go practice
Intel base instruction -- bnd
Golang language quickly get started to comprehensive practical notes (go language, beego framework, high concurrency chat room, crawler)
Airtest解决“自动装包”过程中需要输入密码的问题(同适用于随机弹框处理)
day04_ array
kali修改更新源(无法安全的用该源更新)
Huawei occupies half of the folding mobile phone market, proving its irreplaceable position in the high-end market
preprocessor directives
Miscellaneous notes -- a hodgepodge
Leetcode skimming -- guess the size of numbers II 375 medium
Decompile app
How to automatically generate short chains? How to generate links with UTM parameters online in batches?
Pycharm跑程序时自动进入测试模式
cuda_error_out_of_memory(out of memory怎么办)
MySQL master-slave replication data synchronization, summary of common problems
Yolov7 training error indexerror: list index out of range
[online tutorial] iptables official tutorial -- learning notes 2