当前位置:网站首页>NLP model Bert: from introduction to mastery (2)
NLP model Bert: from introduction to mastery (2)
2020-11-06 01:22:00 【Elementary school students in IT field】
Named entity recognition
First download the corresponding bert modular
pip install bert-base==0.0.9 -i https://pypi.python.org/simple
Also can reference Official website Handle
install

What the package now supports
1. Named entity recognition training
2. Services for Named Entity Recognition C/S
3. Inherit excellent open source software :bert_as_service(hanxiao) Of BERT All services
4. Text categorization Services
The following functions will continue to increase
Training named entity recognition model based on named row :
installed bert-base after , Two tools based on named rows will be generated , among bert-base-ner-train Support the training of named entity recognition model , You just need to specify the directory of training data ,BERT The directory of relevant parameters can be . You can use the following command to view help

The examples of training are named as follows :
bert-base-ner-train \
-data_dir {your dataset dir}\
-output_dir {training output dir}\
-init_checkpoint {Google BERT model dir}\
-bert_config_file {bert_config.json under the Google BERT model dir} \
-vocab_file {vocab.txt under the Google BERT model dir}
Parameter description
among data_dir It's the directory where your data is located , Training data , The naming format of validation data and test data is :train.txt, dev.txt,test.txt, Please name the file in this format , Otherwise, an error will be reported .
The format of training data is as follows :
The sea O
fishing O
Than O
" O
The earth O
spot O
stay O
mansion B-LOC
door I-LOC
And O
gold B-LOC
door I-LOC
And O
between O
Of O
The sea O
Domain O
. O
The first word in each line is , The second is its label , Use spaces ’ ' Separate , Please make sure to use spaces . Use blank lines between sentences . The program will automatically read your data .
output_dir: Training model output file path , Model checkpoint And some tag mapping tables will be stored here , This path is used as a service , Can be specified as -ner_model_dir
init_checkpoint: Download Google BERT Model
bert_config_file : Google BERT Under the model bert_config.json
vocab_file: Google BERT Under the model vocab.txt
After training , You can specify in your output_dir To see the results of your training .
More operations :
https://blog.csdn.net/macanv/article/details/85684284
One more bert Encapsulation of models
https://www.jianshu.com/p/1d6689851622
https://cloud.tencent.com/developer/article/1470051
https://www.h3399.cn/201908/714454.html
版权声明
本文为[Elementary school students in IT field]所创,转载请带上原文链接,感谢
边栏推荐
- How do the general bottom buried points do?
- Python + appium automatic operation wechat is enough
- 业内首发车道级导航背后——详解高精定位技术演进与场景应用
- Want to do read-write separation, give you some small experience
- Linked blocking Queue Analysis of blocking queue
- 全球疫情加速互联网企业转型,区块链会是解药吗?
- Python crawler actual combat details: crawling home of pictures
- Filecoin最新动态 完成重大升级 已实现四大项目进展!
- 快快使用ModelArts,零基础小白也能玩转AI!
- 6.2 handleradapter adapter processor (in-depth analysis of SSM and project practice)
猜你喜欢

合约交易系统开发|智能合约交易平台搭建

axios学习笔记(二):轻松弄懂XHR的使用及如何封装简易axios

Filecoin的经济模型与未来价值是如何支撑FIL币价格破千的

小程序入门到精通(二):了解小程序开发4个重要文件

快快使用ModelArts,零基础小白也能玩转AI!

Aprelu: cross border application, adaptive relu | IEEE tie 2020 for machine fault detection

From zero learning artificial intelligence, open the road of career planning!

Examples of unconventional aggregation

DevOps是什么

教你轻松搞懂vue-codemirror的基本用法:主要实现代码编辑、验证提示、代码格式化
随机推荐
I'm afraid that the spread sequence calculation of arbitrage strategy is not as simple as you think
How long does it take you to work out an object-oriented programming interview question from Ali school?
Architecture article collection
[C / C + + 1] clion configuration and running C language
数字城市响应相关国家政策大力发展数字孪生平台的建设
Group count - word length
Troubleshooting and summary of JVM Metaspace memory overflow
Working principle of gradient descent algorithm in machine learning
ES6 essence:
Relationship between business policies, business rules, business processes and business master data - modern analysis
Serilog原始碼解析——使用方法
Python + appium automatic operation wechat is enough
大数据应用的重要性体现在方方面面
Did you blog today?
Vuejs development specification
如何玩转sortablejs-vuedraggable实现表单嵌套拖拽功能
The choice of enterprise database is usually decided by the system architect - the newstack
你的财务报告该换个高级的套路了——财务分析驾驶舱
How to select the evaluation index of classification model
Using Es5 to realize the class of ES6