当前位置:网站首页>Texttext data enhancement method data argument
Texttext data enhancement method data argument
2022-07-06 10:26:00 【How about a song without trace】
Knowledge point :text Data to enhance data argumentation
random insertion Insert randomly
random deletion Random delete
random swap Random exchange
Reference paper : EDA : Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
Back Translation
give an example : English --> chinese --> English
# Need to install : pip install google_trans_new
from google_trans_new import google_translator
translator = google_translator()
sentence = ['stay hungry, stay foolish. -- spoken / said by Steve Jobs']
# Britain --> in
translation_cn = translator.translate(sentence, lang_tgt='zh-cn')
translation_cn
# in --> Britain
translation_en = translator.translate(translation_cn, lang_tgt='en')
translation_en
Choose a language translation randomly
import random
import google_trans_new
languages = list(google_trans_new.LANGUAGES.keys())
len(languages) # Translatable languages 108 Kind of
object_lang = random.choice(languages)
object_lang
# Forward translation
translations = translator.translate(sentence, lang_tgt=object_lang)
translations
# Reverse translation
back_trans = translator.translate(translations, lang_tgt='en')
back_trans
# Reverse translation
back_trans = translator.translate(translations, lang_tgt='en')
back_trans
边栏推荐
- UEditor国际化配置,支持中英文切换
- Mexican SQL manual injection vulnerability test (mongodb database) problem solution
- Implement context manager through with
- Ueeditor internationalization configuration, supporting Chinese and English switching
- The programming ranking list came out in February. Is the result as you expected?
- ByteTrack: Multi-Object Tracking by Associating Every Detection Box 论文阅读笔记()
- ① BOKE
- How to make shell script executable
- Routes and resources of AI
- 实现微信公众号H5消息推送的超级详细步骤
猜你喜欢
Mysql32 lock
Complete web login process through filter
实现以form-data参数发送post请求
[after reading the series of must know] one of how to realize app automation without programming (preparation)
C miscellaneous lecture continued
基于Pytorch肺部感染识别案例(采用ResNet网络结构)
Super detailed steps for pushing wechat official account H5 messages
Security design verification of API interface: ticket, signature, timestamp
使用OVF Tool工具从Esxi 6.7中导出虚拟机
MySQL real battle optimization expert 11 starts with the addition, deletion and modification of data. Review the status of buffer pool in the database
随机推荐
Carolyn Rosé博士的社交互通演讲记录
MySQL real battle optimization expert 08 production experience: how to observe the machine performance 360 degrees without dead angle in the process of database pressure test?
Docker MySQL solves time zone problems
A new understanding of RMAN retention policy recovery window
MySQL combat optimization expert 02 in order to execute SQL statements, do you know what kind of architectural design MySQL uses?
Time in TCP state_ The role of wait?
14 medical registration system_ [Alibaba cloud OSS, user authentication and patient]
How to make shell script executable
Software test engineer development planning route
text 文本数据增强方法 data argumentation
Use JUnit unit test & transaction usage
cmooc互联网+教育
评估方法的优缺点
The governor of New Jersey signed seven bills to improve gun safety
PyTorch RNN 实战案例_MNIST手写字体识别
15 medical registration system_ [appointment registration]
NLP routes and resources
MySQL real battle optimization expert 11 starts with the addition, deletion and modification of data. Review the status of buffer pool in the database
Nanny hand-in-hand teaches you to write Gobang in C language
Set shell script execution error to exit automatically