YaLM 100B: a 100-billion-parameter open-source large model from Yandex, Russia, licensed for commercial use
2022-06-27 01:55:00 【Zhiyuan community】
GitHub address: https://github.com/yandex/YaLM-100B (released only a few days ago, it already has 2,400 stars)
Yandex is the Russian search giant. Its official blog introduces the model as follows:
Over the past year, we have been using the YaLM family of language models in the Alice voice assistant and in Yandex Search. Today we are open-sourcing our largest YaLM model, which has 100 billion parameters. It took us 65 days to train the model on 800 A100 graphics cards and 1.7 TB of online texts, books, and countless other sources. We have published the model and helpful materials on GitHub under the Apache 2.0 license, which permits both research and commercial use. It is the largest GPT-like neural network freely available for English in the world.
The blog also carefully describes many lessons learned about accelerating model training, including how to find bottlenecks, use fast data types, speed up GPU operations, reduce memory access, disable Dropout, handle communication, and use the ZeRO optimizer. It is recommended reading.
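To put the quoted figures in perspective, here is a rough back-of-the-envelope sketch in Python. It only uses the numbers stated in the post (65 days, 800 A100 cards, 100 billion parameters). The 2 bytes per parameter is an assumption (16-bit weights) made for illustration and is not stated in the post, and the function names are hypothetical.

```python
# Back-of-the-envelope figures derived from the numbers quoted in the post.
DAYS = 65        # training duration quoted in the post
GPUS = 800       # number of A100 cards quoted in the post
PARAMS = 100e9   # 100 billion parameters

# Assumption for illustration only: weights stored as 16-bit floats.
BYTES_PER_PARAM_FP16 = 2


def gpu_hours(days: float, gpus: int) -> float:
    """Total GPU-hours if every card runs for the whole period."""
    return days * 24 * gpus


def weights_size_gb(params: float, bytes_per_param: int) -> float:
    """Size of the raw weights in gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(f"GPU-hours: {gpu_hours(DAYS, GPUS):,.0f}")
    print(f"Weights at 16 bits: {weights_size_gb(PARAMS, BYTES_PER_PARAM_FP16):,.0f} GB")
```

Under these assumptions the weights alone would not fit on a single A100 (40 or 80 GB), which is one reason techniques such as the ZeRO optimizer mentioned above matter at this scale.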