当前位置:网站首页>Thesis reading_ Chinese NLP_ ELECTRA
Thesis reading_ Chinese NLP_ ELECTRA
2022-07-03 04:43:00 【xieyan0811】
- Introduce :ELECTRA from Manning Jointly released by Google , Later, iFLYTEK Joint Laboratory of Harbin Institute of technology trained the corresponding Chinese model . The reduced model effect and BERT Not so much , The size of the model is only BERT Of 1/10,ELECTRA-small Only 46M.
- Code & Model download & Detailed instructions :https://github.com/ymcui/Chinese-ELECTRA
- Use :LTP Use it as the base model .
- principle : Training natural language models using generative confrontation Networks , Time is short , Less parameters . The model is divided into two parts : Generators and discriminators , Build implementation MLM, The discriminator is used to identify whether each word is generated by the model .
- effect : Take Chinese reading comprehension as an example , The effect comparison is as follows , For other experiments, see github

边栏推荐
- [set theory] Cartesian product (concept of Cartesian product | examples of Cartesian product | properties of Cartesian product | non commutativity | non associativity | distribution law | ordered pair
- Arthas watch grabs a field / attribute of the input parameter
- Internationalization and localization, dark mode and dark mode in compose
- RSRS index timing and large and small disc rotation
- Wine travel Jianghu War: Ctrip is strong, meituan is strong, and Tiktok is fighting
- [set theory] binary relation (example of binary relation operation | example of inverse operation | example of composite operation | example of limiting operation | example of image operation)
- Ffmpeg mix
- Priv-app permission异常
- Introduction to message queuing (MQ)
- Leetcode simple question: check whether the array is sorted and rotated
猜你喜欢

When using the benchmarksql tool to preheat data for kingbasees, execute: select sys_ Prewarm ('ndx_oorder_2 ') error

Sdl2 + OpenGL glsl practice (Continued)

The reason why the entity class in the database is changed into hump naming
![[set theory] relational representation (relational matrix | examples of relational matrix | properties of relational matrix | operations of relational matrix | relational graph | examples of relationa](/img/a9/92059db74ccde03b84c69dfce35b37.jpg)
[set theory] relational representation (relational matrix | examples of relational matrix | properties of relational matrix | operations of relational matrix | relational graph | examples of relationa

MC Layer Target

Human resource management system based on JSP

Review the old and know the new: Notes on Data Science

Php+mysql registration landing page development complete code

I stepped on a foundation pit today

4 years of experience to interview test development, 10 minutes to end, ask too
随机推荐
document. The problem of missing parameters of referer is solved
Market status and development prospects of the global IOT active infrared sensor industry in 2022
data2vec! New milestone of unified mode
The usage of micro service project swagger aggregation document shows all micro service addresses in the form of swagger grouping
Youdao cloud notes
[tools run SQL blind note]
Day 51 - tree problem
Leetcode simple question: check whether the array is sorted and rotated
Crazy scientist
联发科技2023届提前批IC笔试(题目)
4 years of experience to interview test development, 10 minutes to end, ask too
怎么用Kotlin去提高生产力:Kotlin Tips
Market status and development prospect prediction of the global forward fluorescent microscope industry in 2022
《牛客刷verilog》Part II Verilog进阶挑战
2022 tea master (intermediate) examination questions and tea master (intermediate) examination skills
Career planning of counter attacking College Students
Auman Galaxy new year of the tiger appreciation meeting was held in Beijing - won the double certification of "intelligent safety" and "efficient performance" of China Automotive Research Institute
关于开学的准备与专业认知
Why should programmers learn microservice architecture if they want to enter a large factory?
stm32逆向入门