OneFlow v0.8.0 officially released
2022-07-25 21:10:00 【InfoQ】
- In addition to the original ZeRO-DP, the ZeRO (zero redundancy optimizer) can now be used together with model parallelism (MP), 2-D parallelism, and 3-D parallelism, further reducing GPU memory usage.
- Graph introduces a new pipeline-parallel API that simplifies pipeline-parallel configuration while speeding up pipeline-parallel and 3-D parallel training.
- To make Graph.debug more efficient, multi-dimensional debugging features have been added, covering the logical graph, the light plan physical graph, memory analysis, and Python stack information.
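To give a rough sense of why combining ZeRO with other forms of parallelism saves GPU memory, here is a minimal back-of-the-envelope sketch. It is not OneFlow code. It assumes the accounting commonly used for ZeRO with mixed-precision Adam (2 bytes per parameter for FP16 weights, 2 for FP16 gradients, 12 for optimizer state), assumes model parallelism splits parameters evenly, and ignores activations and buffers. The function name `zero_memory_bytes` and its parameters are hypothetical.

```python
def zero_memory_bytes(num_params, dp_degree, stage, mp_degree=1):
    """Estimate per-device model-state memory in bytes (illustrative only).

    Assumptions (not taken from the OneFlow release notes):
      - mixed-precision Adam: 2 B weights + 2 B gradients + 12 B optimizer state
      - model parallelism splits the parameters evenly across mp_degree devices
      - ZeRO shards state across the dp_degree data-parallel devices
      - activations, buffers, and fragmentation are ignored
    """
    if stage not in (0, 1, 2, 3):
        raise ValueError("stage must be 0, 1, 2, or 3")
    if dp_degree < 1 or mp_degree < 1:
        raise ValueError("parallel degrees must be >= 1")

    local_params = num_params / mp_degree  # parameters held per model-parallel rank
    weights = 2.0 * local_params
    grads = 2.0 * local_params
    optimizer = 12.0 * local_params

    if stage >= 1:  # stage 1 shards the optimizer state
        optimizer /= dp_degree
    if stage >= 2:  # stage 2 also shards the gradients
        grads /= dp_degree
    if stage >= 3:  # stage 3 also shards the weights
        weights /= dp_degree
    return weights + grads + optimizer


if __name__ == "__main__":
    for stage in range(4):
        gb = zero_memory_bytes(7_500_000_000, dp_degree=64, stage=stage) / 1e9
        print(f"stage {stage}: {gb:.2f} GB per device")
```

Under these assumptions, a 7.5-billion-parameter model on 64 data-parallel devices goes from 120 GB of model state per device without ZeRO to about 1.9 GB with all three kinds of state sharded. Raising `mp_degree` divides the estimate further, which is the intuition behind using ZeRO alongside MP, 2-D, or 3-D parallelism.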


- Supports tiered storage and dynamically expandable Embedding, so users can grow Embedding capacity at lower cost.
- Hybrid parallel strategies make it easy to scale a model out to multi-machine, multi-GPU setups.
- Communication quantization and compression: in parallel scenarios, the communicated data is quantized and compressed to reduce communication volume and speed up training.
- An efficient data pipeline: parts of the model with no data dependency are executed ahead of time, so that they overlap with other work.
- Supports automatic mixed-precision training: part of the computation is converted to FP16, which speeds up training and reduces GPU memory usage while preserving the model's convergence accuracy.
- Provides a set of high-performance CUDA operators for operations common in recommender system models.
- Supports flexible model building.
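The idea behind communication quantization can be sketched in a few lines. The example below is a generic symmetric int8 scheme written in plain Python; it is an assumption for illustration and not necessarily the scheme OneFlow uses. The helper names `quantize_int8` and `dequantize_int8` are hypothetical.

```python
def quantize_int8(values):
    """Map floats to integers in [-127, 127] plus one shared scale factor.

    Sending one byte per value instead of four (FP32) cuts traffic by
    roughly 4x, at the cost of a bounded rounding error.
    """
    max_abs = max((abs(v) for v in values), default=0.0)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate floats on the receiving side."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    embedding_row = [0.5, -1.0, 0.25, 0.0, 0.9]
    q, scale = quantize_int8(embedding_row)
    restored = dequantize_int8(q, scale)
    worst = max(abs(a - b) for a, b in zip(embedding_row, restored))
    print("quantized:", q)
    print("worst-case error:", worst)
```

With this scheme, the error per value is at most about half of `scale`, which is why quantized communication can often reduce traffic without noticeably changing training results.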

- OneFlow source code overview: compiling and debugging with GDB
- Interpreting Pathways: the next step forward is OneFlow
- OneFlow source code analysis: automatic inference of operator signatures
- Hinton: my 50-year career in deep learning and my research methods
- The father of LLVM: why we should rebuild AI infrastructure software
- Quantitative models of parallel computing and their application in deep learning engines
- Is large model training difficult? The efficient, easy-to-use "LiBai" model library is here