CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

此版本基于Pytorch后端的huggingface进行实现。由于此实现使用了Oneflow的dataloader作为数据读入的方式，因此也需要安装Oneflow。其它框架的数据读取可以参考OneflowDataloaderToPytorchDataset类的实现。

使用说明

安装依赖（前置要求：已在环境中安装好Pytorch和Oneflow）

pip install transformers pandas
git clone https://github.com/tea321000/hugging_face_competition
cd hugging_face_competition

运行train_BERT_base.sh和train_BERT_large.sh 单机单卡的baseline。保持其它参数不变，通过调节shell文件里的hidden_size参数，即可观察不同hidden_size所占显存的变化（可通过watch -n 0.1 nvidia-smi直观观察）

python train.py \
--ofrecord_path sample_seq_len_512_example \
--lr 1e-4 --epochs 10 \
--train_batch_size 2 \
--seq_length=512 \
--max_predictions_per_seq=80 \
--num_hidden_layers=24 \
--num_attention_heads=16 \
--hidden_size=1024 \#要调节的参数
--vocab_size=30522

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

Related tags

Overview

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

使用说明

Owner

Ziqi Zhou

VampiresVsWerewolves - Our Implementation of a MiniMax algorithm with alpha beta pruning in the context of an in-class competition

Tool which allow you to detect and translate text.

Simple text to phones converter for multiple languages

TruthfulQA: Measuring How Models Imitate Human Falsehoods

A paper list of pre-trained language models (PLMs).

A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, and deploy transformer specific models.

Machine learning models from Singapore's NLP research community

End-to-end image captioning with EfficientNet-b3 + LSTM with Attention

Mkdocs + material + cool stuff

SpikeX - SpaCy Pipes for Knowledge Extraction

The (extremely) naive sentiment classification function based on NBSVM trained on wisesight_sentiment

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

Bnagla hand written document digiiztion

Final Project for the Intel AI Readiness Boot Camp NLP (Jan)

Text to speech for Vietnamese, ez to use, ez to update

⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡

Simple and efficient RevNet-Library with DeepSpeed support

Officile code repository for "A Game-Theoretic Perspective on Risk-Sensitive Reinforcement Learning"

Data preprocessing rosetta parser for python

Use PaddlePaddle to reproduce the paper：mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer