当前位置:网站首页>【AI4Code】《IntelliCode Compose: Code Generation using Transformer》 ESEC/FSE 2020
【AI4Code】《IntelliCode Compose: Code Generation using Transformer》 ESEC/FSE 2020
2022-07-25 13:08:00 【chad_ lee】
《IntelliCode Compose: Code Generation using Transformer》 ESEC/FSE 2020
It's not just about generating a word , Instead, generate a line . It's using GPT-2. The dataset is 12 Billion rows Python, C#, Javascript, TypeScript Code of language
Byte-Pair Encoding (BPE)
On the sequence token Chemical treatment , One is to use subtoken To narrow the vocabulary , One is to shield strings to prevent sensitive data leakage .
IntelliCode Compose
The model uses GPT, When inferring, it will sequence decoding The process of is regarded as the search process of the tree , until token appear :
Use when building trees beam search,beam with by K, Suppose the length of the final generated sequence is L, The model needs to predict K*L Time , But the model can batch perform , So all you need is L Time .
Multilingual model
Four ways of modeling multilingualism are compared :
1) Ignore differences between languages , Train multiple languages with a unified model 【 Experiments show that : This method is worse than monolingual training alone 】
2) Join in language type embedding Information , Each language is represented by a vector , And the original token embedding Equal combination .
3) Add a sentence at the beginning of each training sample "lang * remaining token sequence", among l a n g ∈ { P y t h o n , C # , J a v a S c r i p t , T y p e S c r i p t } lang \in \{Python, C\#, JavaScript,TypeScript\} lang∈{ Python,C#,JavaScript,TypeScript}
4) In pre training , Add a language type classification Mission , That is, one more head, Predict the type of the language each time .
边栏推荐
- ESP32-C3 基于Arduino框架下Blinker点灯控制10路开关或继电器组
- Summary of Niuke forum project deployment
- A hard journey
- 【CSDN 年终总结】结束与开始,一直在路上—— “1+1=王”的2021总结
- Business visualization - make your flowchart'run'(3. Branch selection & cross language distributed operation node)
- R语言GLM广义线性模型:逻辑回归、泊松回归拟合小鼠临床试验数据(剂量和反应)示例和自测题
- 错误: 找不到或无法加载主类 xxxx
- [300 opencv routines] 239. accurate positioning of Harris corner detection (cornersubpix)
- 【问题解决】org.apache.ibatis.exceptions.PersistenceException: Error building SqlSession.1 字节的 UTF-8 序列的字
- 【视频】马尔可夫链蒙特卡罗方法MCMC原理与R语言实现|数据分享
猜你喜欢

Mid 2022 review | latest progress of large model technology Lanzhou Technology

EMQX Cloud 更新:日志分析增加更多参数,监控运维更省心

【AI4Code】《Contrastive Code Representation Learning》 (EMNLP 2021)

Substance designer 2021 software installation package download and installation tutorial

程序的内存布局

2022.07.24 (lc_6124_the first letter that appears twice)

Moving Chinese figure liushenglan

跌荡的人生

mysql函数汇总之日期和时间函数

Seven lines of code made station B crash for three hours, but "a scheming 0"
随机推荐
web安全入门-UDP测试与防御
Zero basic learning canoe panel (14) -- led control and LCD control
2022.07.24 (lc_6126_design food scoring system)
ECCV2022 | TransGrasp类级别抓取姿态迁移
程序员奶爸自制AI喂奶检测仪,预判宝宝饿点,不让哭声影响老婆睡眠
【历史上的今天】7 月 25 日:IBM 获得了第一项专利;Verizon 收购雅虎;亚马逊发布 Fire Phone
Can flinkcdc import multiple tables in mongodb database together?
Masscode is an excellent open source code fragment manager
[operation and maintenance, implementation of high-quality products] interview skills for technical positions with a monthly salary of 10k+
State mode
MySQL remote connection permission error 1045 problem
Mid 2022 review | latest progress of large model technology Lanzhou Technology
状态(State)模式
Use of Spirng @conditional conditional conditional annotation
艰辛的旅程
Summary of Niuke forum project deployment
go : gin 自定义日志输出格式
基于JEECG制作一个通用的级联字典选择控件-DictCascadeUniversal
Microsoft proposed CodeT: a new SOTA for code generation, with 20 points of performance improvement
【AI4Code】《GraphCodeBERT: Pre-Training Code Representations With DataFlow》 ICLR 2021