当前位置:网站首页>Texttext data enhancement method data argument
Texttext data enhancement method data argument
2022-07-06 10:26:00 【How about a song without trace】
Knowledge point :text Data to enhance data argumentation
random insertion Insert randomly
random deletion Random delete
random swap Random exchange
Reference paper : EDA : Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks
Back Translation
give an example : English --> chinese --> English
# Need to install : pip install google_trans_new
from google_trans_new import google_translator
translator = google_translator()
sentence = ['stay hungry, stay foolish. -- spoken / said by Steve Jobs']
# Britain --> in
translation_cn = translator.translate(sentence, lang_tgt='zh-cn')
translation_cn
# in --> Britain
translation_en = translator.translate(translation_cn, lang_tgt='en')
translation_en
Choose a language translation randomly
import random
import google_trans_new
languages = list(google_trans_new.LANGUAGES.keys())
len(languages) # Translatable languages 108 Kind of
object_lang = random.choice(languages)
object_lang
# Forward translation
translations = translator.translate(sentence, lang_tgt=object_lang)
translations
# Reverse translation
back_trans = translator.translate(translations, lang_tgt='en')
back_trans
# Reverse translation
back_trans = translator.translate(translations, lang_tgt='en')
back_trans边栏推荐
- 14 medical registration system_ [Alibaba cloud OSS, user authentication and patient]
- If someone asks you about the consistency of database cache, send this article directly to him
- How to make shell script executable
- [paper reading notes] - cryptographic analysis of short RSA secret exponents
- 百度百科数据爬取及内容分类识别
- 高并发系统的限流方案研究,其实限流实现也不复杂
- MySQL实战优化高手07 生产经验:如何对生产环境中的数据库进行360度无死角压测?
- MySQL combat optimization expert 09 production experience: how to deploy a monitoring system for a database in a production environment?
- MySQL storage engine
- South China Technology stack cnn+bilstm+attention
猜你喜欢

Jar runs with error no main manifest attribute

MySQL36-数据库备份与恢复

The underlying logical architecture of MySQL
![[C language] deeply analyze the underlying principle of data storage](/img/d6/1c0cd38c75da0d0cc1df7f36938cfb.png)
[C language] deeply analyze the underlying principle of data storage

Emotional classification of 1.6 million comments on LSTM based on pytoch

MySQL实战优化高手02 为了执行SQL语句,你知道MySQL用了什么样的架构设计吗?

该不会还有人不懂用C语言写扫雷游戏吧

寶塔的安裝和flask項目部署

数据库中间件_Mycat总结
![16 medical registration system_ [order by appointment]](/img/7f/d94ac2b3398bf123bc97d44499bb42.png)
16 medical registration system_ [order by appointment]
随机推荐
简单解决phpjm加密问题 免费phpjm解密工具
软件测试工程师必备之软技能:结构化思维
评估方法的优缺点
UEditor国际化配置,支持中英文切换
Mysql32 lock
cmooc互联网+教育
MySQL35-主从复制
If someone asks you about the consistency of database cache, send this article directly to him
用于实时端到端文本识别的自适应Bezier曲线网络
MySQL实战优化高手06 生产经验:互联网公司的生产环境数据库是如何进行性能测试的?
宝塔的安装和flask项目部署
数据库中间件_Mycat总结
MySQL实战优化高手03 用一次数据更新流程,初步了解InnoDB存储引擎的架构设计
MNIST implementation using pytoch in jupyter notebook
Complete web login process through filter
MySQL combat optimization expert 09 production experience: how to deploy a monitoring system for a database in a production environment?
Installation of pagoda and deployment of flask project
【C语言】深度剖析数据存储的底层原理
MySQL實戰優化高手04 借著更新語句在InnoDB存儲引擎中的執行流程,聊聊binlog是什麼?
Sed text processing