当前位置:网站首页>【AI4Code】《IntelliCode Compose: Code Generation using Transformer》 ESEC/FSE 2020
【AI4Code】《IntelliCode Compose: Code Generation using Transformer》 ESEC/FSE 2020
2022-07-25 13:08:00 【chad_ lee】
《IntelliCode Compose: Code Generation using Transformer》 ESEC/FSE 2020
It's not just about generating a word , Instead, generate a line . It's using GPT-2. The dataset is 12 Billion rows Python, C#, Javascript, TypeScript Code of language
Byte-Pair Encoding (BPE)
On the sequence token Chemical treatment , One is to use subtoken To narrow the vocabulary , One is to shield strings to prevent sensitive data leakage .
IntelliCode Compose
The model uses GPT, When inferring, it will sequence decoding The process of is regarded as the search process of the tree , until token appear :
Use when building trees beam search,beam with by K, Suppose the length of the final generated sequence is L, The model needs to predict K*L Time , But the model can batch perform , So all you need is L Time .
Multilingual model
Four ways of modeling multilingualism are compared :
1) Ignore differences between languages , Train multiple languages with a unified model 【 Experiments show that : This method is worse than monolingual training alone 】
2) Join in language type embedding Information , Each language is represented by a vector , And the original token embedding Equal combination .
3) Add a sentence at the beginning of each training sample "lang * remaining token sequence", among l a n g ∈ { P y t h o n , C # , J a v a S c r i p t , T y p e S c r i p t } lang \in \{Python, C\#, JavaScript,TypeScript\} lang∈{ Python,C#,JavaScript,TypeScript}
4) In pre training , Add a language type classification Mission , That is, one more head, Predict the type of the language each time .
边栏推荐
- Substance designer 2021 software installation package download and installation tutorial
- 艰辛的旅程
- B tree and b+ tree
- Detailed explanation of switch link aggregation [Huawei ENSP]
- Shell常用脚本:判断远程主机的文件是否存在
- Mlx90640 infrared thermal imager temperature sensor module development notes (V)
- Business visualization - make your flowchart'run'(3. Branch selection & cross language distributed operation node)
- [today in history] July 25: IBM obtained the first patent; Verizon acquires Yahoo; Amazon releases fire phone
- Docker learning - redis cluster -3 master and 3 slave - capacity expansion - capacity reduction building
- If you want to do a good job in software testing, you can first understand ast, SCA and penetration testing
猜你喜欢

Leetcode 0133. clone diagram

卷积神经网络模型之——VGG-16网络结构与代码实现

Seven lines of code made station B crash for three hours, but "a scheming 0"

Docekr学习 - MySQL8主从复制搭建部署

word样式和多级列表设置技巧(二)

Leetcode 1184. distance between bus stops

Zero basic learning canoe panel (14) -- led control and LCD control

"Wei Lai Cup" 2022 Niuke summer multi school training camp 2 supplementary problem solution (g, J, K, l)

Connotation and application of industrial Internet

【AI4Code】《CoSQA: 20,000+ Web Queries for Code Search and Question Answering》 ACL 2021
随机推荐
MySQL remote connection permission error 1045 problem
工业互联网的内涵及其应用
若依如何实现用户免密登录配置方法?
[machine learning] experimental notes - emotion recognition
EMQX Cloud 更新:日志分析增加更多参数,监控运维更省心
Shell common script: get the IP address of the network card
吕蒙正《破窑赋》
【AI4Code】《GraphCodeBERT: Pre-Training Code Representations With DataFlow》 ICLR 2021
“蔚来杯“2022牛客暑期多校训练营2 补题题解(G、J、K、L)
pytorch创建自己的Dataset加载数据集
Microsoft proposed CodeT: a new SOTA for code generation, with 20 points of performance improvement
零基础学习CANoe Panel(16)—— Clock Control/Panel Control/Start Stop Control/Tab Control
零基础学习CANoe Panel(13)—— 滑条(TrackBar )
A hard journey
Moving Chinese figure liushenglan
AtCoder Beginner Contest 261E // 按位思考 + dp
cv2.resize函数报错:error: (-215:Assertion failed) func != 0 in function ‘cv::hal::resize‘
Force deduction 83 biweekly T4 6131. The shortest dice sequence impossible to get, 303 weeks T4 6127. The number of high-quality pairs
Atcoder beginer contest 261 f / / tree array
Shell常用脚本:获取网卡IP地址