当前位置:网站首页>Bert's summary of me
Bert's summary of me
2022-06-25 17:37:00 【Green Lantern swordsman】
BERT Read a lot of information , I think I have some insight . For two years , I didn't sort it out myself . Now start sorting :
One 、Google Bert In the source modeling file
modeling yes bert The origin of , It's better to understand here first . You can refer to the materials of other great gods :
1. Code interpretation , Analysis of a three-year-old brother , It's very clear
2. bert The paper of , The first article should read 《BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding》, This link has a good explanation in Chinese Add link description
3. The second important paper is 《Pre-Training with Whole Word Maskingfor Chinese BERT》. The idea is google Bring up the , The Chinese version was trained by Harbin Institute of technology , Hada's link is This github. Relevant supporting materials include :BERT-WWM note 、BERT-wwm、BERT-wwm-ext
4. Met a summary BERT Information articles , Look at this link . however , I think he wrote too much , This means that these things are not necessarily useful .
Two 、transform You should make a good understanding of
2.1 The first one is Wang Yudi's pdf, It's really good . After seeing , combination tensorflow Code , View paper Attention Is All You Need
3、 ... and 、 How to load the code in the application ?
(1)keras The loading method is simple , There is a tool developed by sujianlin's team . See here for its use : Introduction 、github Address
(2)huggingface Of github see here ,Google Officially recommended PyTorch BERB Version implementation . For example , see B The graduate student at the station Example , You can also learn by hand Bert Text classification of this Example
(3) Official Google Code , It seems that loading is also good , Sure
Four 、 Other matters needing attention
(1) Optimizer used adamw, It is different from the conventional adam What improvements have been made , see here
边栏推荐
- Learn Tai Chi Maker - mqtt (III) connect to mqtt server
- 十大证券公司哪个佣金最低 办理开户安全吗
- WARNING: Unsupported upgrade request.
- 上线移动ERP系统有哪些步骤?环环紧扣很重要
- Treasure and niche Chinese painting 3D texture material website sharing
- conda安装的py3.6和py3.7
- Comprehensive optimization of the six topics, Alibaba performance optimization booklet open source, leading you to the ultimate performance
- 用连续自然数之和来表达整数
- 杰理之获取复位源和唤醒的 IO 口的方法【篇】
- 启牛涨乐财付通下载是可以开户吗?开户安全吗
猜你喜欢

芝士糖豆打造AR潮玩新体验

【Matlab】数值微积分与方程求解

Good fat man takes you to learn Flink series -flink source code analysis episode I standalone startup script analysis

超全金属PBR多通道贴图素材网站整理

Why are there few embedded system designers in the soft test?

LSF如何看job预留slot是否合理?

Mathematical modeling -- integer programming

HMS Core机器学习服务实现同声传译,支持中英文互译和多种音色语音播报
![[matlab] curve fitting](/img/58/3fdcc4d34e7c7c71b73324517ff69d.png)
[matlab] curve fitting

配电室环境的分布式远程管理
随机推荐
Jerry's ADC_ get_ Incorrect voltage value obtained by voltage function [chapter]
WPF开发随笔收录-心电图曲线绘制
How Jerry used to output a clock source to the outside world [chapter]
Mathematical modeling - linear programming
学习太极创客 — MQTT(三)连接MQTT服务端
ACY100油烟浓度在线监控仪针对饮食业厨房油烟排放
Distinguishing seven kinds of facial expressions by deep separable convolution neural network
Best practices for data relocation: using CDM to relocate offline Mysql to DWS
Kotlin入门(20)几种常见的对话框
How to solve the problem of network disconnection after enabling hotspot sharing in win10?
杰理之唤醒口使用注意事项【篇】
TCP chat + transfer file server server socket v2.8 - fix 4 known problems
mysql mysql-8.0.19-winx64 安装与navicat连接
求满足条件的最长子串长度
Can I open an account? Is it safe to open an account
Super Full Metal PBR Multi - channel Mapping Materials website collation
学习太极创客 — MQTT(一)MQTT 是什么
HMS Core机器学习服务实现同声传译,支持中英文互译和多种音色语音播报
用连续自然数之和来表达整数
ES6知识点