当前位置:网站首页>Tencent cloud won the first place in the cloud natural language understanding classification task
Tencent cloud won the first place in the cloud natural language understanding classification task
2022-06-24 01:59:00 【Tencent cloud Ti platform】
In recent days, , Tencent cloud AI With the acceleration team of Tencent Youtu laboratory in CLUE Experiment on the task of language classification , At one fell swoop in the classification task 1.0 and 1.1 Won the first place in the industry .
How to communicate with AI Have an accessible dialogue ?
In recent years, with the development of artificial intelligence ,NLP( natural language processing ) It has been the focus of attention both inside and outside the industry , The pre training model (Pre-Trained Model,PTM) Technology is the most revolutionary innovation achievement at present , It is becoming the focus of Internet enterprises at home and abroad , It is imperative to build a super large-scale pre training model with Chinese as the core and ecological environment , The major companies are feeding back their own businesses, and at the same time, they are turning to CLUE List initiation “ charge ”.
CLUE It is one of the most authoritative benchmarks in the field of Chinese language understanding , It covers text similarity 、 classification 、 Many semantic analysis and understanding subtasks such as reading comprehension . As “ A sharp weapon to brush the list ” The pre training model of is to gather a large amount of computing power on large-scale texts , Train the big model intensively , The pre training has developed common language features , It can be used by a large number of enterprises , It greatly reduces the threshold of naturallanguageprocessing research and application .
“ A good workman does his work well , You must sharpen your tools first ”
Tencent cloud TI The platform is a one-stop machine learning ecological service platform based on the powerful computing power of Tencent cloud . It can be used for a variety of data sources 、 Components 、 Algorithm 、 The model and the evaluation module are combined , Algorithm Engineers and data scientists can easily train models on it 、 Evaluate and predict .TI Series of products support public cloud access 、 Privatized deployment and exclusive cloud deployment .
TI-ACC Tencent cloud AI And the latest product released by Youtu laboratory AI Acceleration component products , It can provide AI Model training and reasoning acceleration services , Support a variety of frameworks and scenarios , It can significantly improve the reasoning efficiency of model training 、 cost reduction .
The pre training of this large model completely relies on Tencent cloud TI platform , And USES the TI-ACC Train and speed up . The overall training program is as follows :
First , The excellent effect of the model is inseparable from the support of a large number of high-quality Chinese pre training corpus behind it . Tencent cloud team is TI The preprocessing of massive corpus is built on the platform 、 Cleaning and evaluation tasks , A collection of novels 、 Journalism 、 Quality content in different areas such as community reviews , And papers on various subjects 、 Application description and other specialized contents , Filter out hundreds GB High quality Chinese corpus , Ensure data “ Wide source ” And “ The quality is fine ”.
On this basis , in the light of NLP The characteristics and existing problems of super large model , Tencent cloud team has conducted in-depth optimization in terms of single machine computing performance and multi machine expansion in combination with the underlying infrastructure . In computing performance optimization ,TI-ACC Yes Transformer The structure model is sparsely calculated 、 Operator fusion 、 Dynamic text length input optimization . On multi machine extensions , Adopted Zero-DP Technology combined with reverse graphics memory saving 、 Multi round communication with large model parameters 、 application layer NCCL Communication optimization, parameter automatic optimization and other optimization means . Final ,TI-ACC Capable of efficiently training hundreds of billions of level parameters NLP Big model , It greatly improves the efficiency of model pre training .
Besides , We are right on the model Transformer The structure has been fine tuned , Plus the progressive course learning and training program , Make large models faster 、 Learn knowledge better .
This summit CLUE The list , On the one hand, it represents that Tencent cloud is NLP The ecological field has reached the leading level in the industry , On the other hand, it indicates TI-ACC Help Chinese pre training model to reach a new level in efficient training and reasoning .
Click to learn more Tencent cloud AI Product solutions
边栏推荐
- Why promote steam education?
- Modify the original place where the method needs to be called and triggered
- SAP mm Migo 411k error - correct the customizing settings for the unique
- Baysor: cell segmentation in imaging based spatial transcriptomics
- [guide to cloud first] point north before tdsql elite challenge
- SAP mm UB type sto cannot be transferred to vendor consignment inventory?
- Line/kotlin jdsl: kotlin DSL for JPA criteria API
- Five things programmers need to consider when developing with low code – thenewstack
- Tencent cloud double 11 Live Room activity rules
- Kubesphere upgrade & enable plug-ins after installation
猜你喜欢

Stm32g474 infrared receiving based on irtim peripherals

layer 3 switch
![[SQL injection 13] referer injection foundation and Practice (based on burpseuite tool and sqli labs less19 target platform)](/img/b5/a8c4bbaf868dd20b7dc9449d2a4378.jpg)
[SQL injection 13] referer injection foundation and Practice (based on burpseuite tool and sqli labs less19 target platform)
![[SQL injection 12] user agent injection foundation and Practice (based on burpsuite tool and sqli labs LESS18 target machine platform)](/img/c8/f6c2a62b8ab8fa88bd2b3d8f35f592.jpg)
[SQL injection 12] user agent injection foundation and Practice (based on burpsuite tool and sqli labs LESS18 target machine platform)

Review of AI hotspots this week: the Gan compression method consumes less than 1/9 of the computing power, and the open source generator turns your photos into hand drawn photos

I, a 27 year old female programmer, feel that life is meaningless, not counting the accumulation fund deposit of 430000

BIM model example

It's too difficult for me. Ali has had 7 rounds of interviews (5 years of experience and won the offer of P7 post)
随机推荐
How do users of Fortress computers add servers? How much does it cost to add servers for fortress users?
It's too difficult for me. Ali has had 7 rounds of interviews (5 years of experience and won the offer of P7 post)
Railway patrol system - Railway Intelligent Patrol communication system
Echo framework: implementing service end flow limiting Middleware
[new function!] How anycast CLB supports multi location & dynamically accelerated load balancing services and high-speed Internet forwarding!
[untitled]
MySQL architecture
[technical grass planting] how can this double eleven be cost-effective!
Collation of commonly used glusterfs commands
Grpc: implement grpc proxy
Detailed explanation of SSH tunnel and stable intranet penetration using autossh
Thorough and thorough analysis of factory method mode
Comparison between rule engine and ML model - xlaszlo
Gin framework: implementing timeout Middleware
How to restart the server through the fortress machine how to log in to the fortress machine
6、 Symbols and commands for numerical calculation of variables
Grp: implement GRP timeout interceptor
I, a 27 year old female programmer, feel that life is meaningless, not counting the accumulation fund deposit of 430000
Tcapulusdb Jun · industry news collection (November 22)
Five things programmers need to consider when developing with low code – thenewstack