当前位置:网站首页>Thesis reading_ Chinese NLP_ ELECTRA
Thesis reading_ Chinese NLP_ ELECTRA
2022-07-03 04:43:00 【xieyan0811】
- Introduce :ELECTRA from Manning Jointly released by Google , Later, iFLYTEK Joint Laboratory of Harbin Institute of technology trained the corresponding Chinese model . The reduced model effect and BERT Not so much , The size of the model is only BERT Of 1/10,ELECTRA-small Only 46M.
- Code & Model download & Detailed instructions :https://github.com/ymcui/Chinese-ELECTRA
- Use :LTP Use it as the base model .
- principle : Training natural language models using generative confrontation Networks , Time is short , Less parameters . The model is divided into two parts : Generators and discriminators , Build implementation MLM, The discriminator is used to identify whether each word is generated by the model .
- effect : Take Chinese reading comprehension as an example , The effect comparison is as follows , For other experiments, see github
边栏推荐
- Some information about the developer environment in Chengdu
- 雇佣收银员(差分约束)
- Smart contract security audit company selection analysis and audit report resources download - domestic article
- Priv app permission exception
- MC Layer Target
- Joint search set: the number of points in connected blocks (the number of points in a set)
- Shell script -- condition judgment
- "Niuke brush Verilog" part II Verilog advanced challenge
- Market status and development prospect prediction of global neutral silicone sealant industry in 2022
- Kubernetes源码分析(一)
猜你喜欢
A outsourcing boy's mid-2022 summary
Career planning of counter attacking College Students
Leetcode simple question: check whether the array is sorted and rotated
并发操作-内存交互操作
Jincang KFS data bidirectional synchronization scenario deployment
"Niuke brush Verilog" part II Verilog advanced challenge
2022 tea master (intermediate) examination questions and tea master (intermediate) examination skills
GFS distributed file system (it's nice to meet it alone)
C language self-made Games: Sanzi (tic tac toe chess) intelligent chess supplement
FFMpeg filter
随机推荐
Some information about the developer environment in Chengdu
Introduction to JVM principle
Shell script -- condition judgment
Wine travel Jianghu War: Ctrip is strong, meituan is strong, and Tiktok is fighting
Asp access teaching management system design finished product
Summary of training competition (Lao Li's collection of questions)
[PCL self study: filtering] introduction and use of various filters in PCL (continuously updated)
Why does I start with =1? How does this code work?
How to retrieve the password for opening word files
2022 new examination questions for the main principals of hazardous chemical business units and examination skills for the main principals of hazardous chemical business units
Preparation for school and professional cognition
普通本科大学生活避坑指南
UiPath实战(08) - 选取器(Selector)
Number of uniform strings of leetcode simple problem
2022 chemical automation control instrument examination summary and chemical automation control instrument certificate examination
4 years of experience to interview test development, 10 minutes to end, ask too
[USACO 2009 Dec S]Music Notes
[luatos sensor] 1 light sensing bh1750
When using the benchmarksql tool to test the concurrency of kingbasees, there are sub threads that are not closed in time after the main process is killed successfully
How to choose cross-border e-commerce multi merchant system