Two-tower model: ERNIE-Gram pre-training and fine-tuning for matching
2022-06-09 23:16:00 【Artificial intelligence Zeng Xiaojian】
Background
This project trains a pair-wise model based on ERNIE-Gram. A pair-wise matching model suits ranking scenarios in which the similarity of a text pair is passed to an upper-level ranking module as one of its features.
ERNIE-Gram
1. Technical approach and evaluation metrics
Technical approach
A two-tower model that uses the pre-trained ERNIE-Gram model and is trained with margin_ranking_loss.
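To make the training objective concrete, the following is a minimal pure-Python sketch of a margin ranking loss for pair-wise training. It is not the project's model.py: the function name, the margin value, and the scores are illustrative assumptions. For each pair, the loss is max(0, margin - (s_pos - s_neg)), averaged over the batch, which is the usual definition when the positive text is expected to rank above the negative one.

```python
from typing import Sequence


def margin_ranking_loss(pos_scores: Sequence[float],
                        neg_scores: Sequence[float],
                        margin: float = 0.1) -> float:
    """Mean of max(0, margin - (pos - neg)) over all score pairs."""
    if len(pos_scores) != len(neg_scores):
        raise ValueError("pos_scores and neg_scores must have the same length")
    if not pos_scores:
        raise ValueError("at least one pair is required")
    # A pair contributes zero loss once the positive score beats the
    # negative score by at least `margin`.
    losses = [max(0.0, margin - (p - n)) for p, n in zip(pos_scores, neg_scores)]
    return sum(losses) / len(losses)


# Hypothetical similarity scores for three (query, positive) and
# (query, negative) pairs; the values are made up for illustration.
pos = [0.9, 0.6, 0.3]
neg = [0.2, 0.55, 0.5]
loss = margin_ranking_loss(pos, neg, margin=0.1)
print(f"loss = {loss:.4f}")
```

Only the second and third pairs contribute here: the first pair is already separated by more than the margin, so its loss is zero.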
Evaluation metrics
(1) AUC is used to evaluate the ranking quality of the ranking model.
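As a reminder of what AUC measures in a ranking setting, here is a small sketch that computes it as the fraction of (positive, negative) pairs in which the positive example receives the higher score, with ties counted as one half. The function name and the toy labels and scores are assumptions for illustration; this is not the project's evaluate.py, and the quadratic loop is meant for clarity rather than speed.

```python
from typing import Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of correctly ordered (positive, negative) pairs."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0   # positive ranked above negative
            elif p == n:
                correct += 0.5   # ties count as half
    return correct / (len(pos) * len(neg))


# Toy data: 1 marks a similar pair, 0 a dissimilar pair.
labels = [1, 0, 1, 0, 1]
scores = [0.9, 0.4, 0.6, 0.8, 0.7]
value = auc(labels, scores)
print(f"AUC = {value:.4f}")
```

A value of 0.5 corresponds to random ordering and 1.0 to a perfect ranking, which gives some context for the score reported in the results table.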
Results
| Model | AUC |
|---|---|
| ERNIE-Gram | 0.801 |
2. Environment dependencies and installation instructions
Dependencies
- python >= 3.x
- paddlepaddle >= 2.1.3
- paddlenlp >= 2.2
- pandas >= 0.25.1
- scipy >= 1.3.1
3. Code structure
The main code structure of the project, with a short description of each file, is as follows:
ernie_matching/
├── deply # Deployment
    └── python
        ├── deploy.sh # Bash script for inference deployment
        └── predict.py # Python inference deployment example
├── scripts
    ├── export_model.sh # Bash script that exports dynamic-graph parameters to static-graph parameters
    ├── train_pairwise.sh # Bash script for training the pair-wise single-tower matching model
    ├── evaluate.sh # Bash script for evaluation on the validation set
    ├── predict_pairwise.sh # Bash script for prediction with the pair-wise single-tower matching model
├── export_model.py # Script that exports dynamic-graph parameters to static-graph parameters
├── model.py # Pair-wise matching model
├── data.py # Conversion logic for pair-wise training samples and logic for generating random negative examples
├── train_pairwise.py # Training script for the pair-wise single-tower matching model
├── evaluate.py # Evaluation script for the validation set
├── predict_pairwise.py # Prediction script for the pair-wise single-tower matching model; outputs whether each text pair is similar
4. Data preparation
Dataset description
Sample data:
Personal income tax planning Based on the new personal income tax perspective of tax planning analysis of the new personal income tax; Individual income tax; Tax planning Personal income tax salary tax planning research on personal income tax, Wages and salaries, Tax preparation
Stress analysis of hydraulic support base ZY4000/09/19D Finite element analysis of hydraulic support, Finite element analysis, Load both ends, Partial load, Reverse Based on ANSYS Multi working condition stress analysis of hydraulic support, Four working conditions, Simulation analysis, ANSYS, Stress concentration, Optimize
Delayed vasospasm Effect of cilostazol on cerebral vasospasm after aneurysmal subarachnoid hemorrhage Meta Analysis of cilostazol, Subarachnoid hemorrhage, Cerebral vasospasm, Meta analysis Effect of cilostazol on cerebral vasospasm after aneurysmal subarachnoid hemorrhage Meta Analysis of cilostazol, Subarachnoid hemorrhage, Cerebral vasospasm, Meta analysis
Silicon oxide Composite sol - Preparation of silicon oxide for lithium ion batteries by gel one pot method / Carbon composite anode material silicon oxide, Sol - Gel method, Nanoparticles, Negative pole, Lithium ion battery Supported polyimide - silicon dioxide - Preparation and characterization of silver hybrid film polyimide, silicon dioxide, silver, Hybrid membrane, Promote transmission
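Each sample row appears to hold three text fields: a query, a matching text, and a non-matching text. The sketch below reads such rows into named triples. The tab separator, the column order, the `PairwiseExample` type, and the `parse_pairwise_lines` helper are all assumptions made for illustration, since the separators are not visible in the samples above; the input strings are placeholders, not real dataset rows.

```python
from typing import Iterable, Iterator, NamedTuple


class PairwiseExample(NamedTuple):
    """One training triple (assumed column order)."""
    query: str
    positive: str
    negative: str


def parse_pairwise_lines(lines: Iterable[str],
                         sep: str = "\t") -> Iterator[PairwiseExample]:
    """Yield one example per well-formed line; skip blank or malformed lines."""
    for line in lines:
        fields = [f.strip() for f in line.rstrip("\n").split(sep)]
        # Assumption: exactly three non-empty columns per valid row.
        if len(fields) != 3 or not all(fields):
            continue
        yield PairwiseExample(*fields)


# Placeholder rows standing in for lines of a file such as train_pairwise.csv.
raw = [
    "query a\tpositive text a\tnegative text a\n",
    "\n",
    "broken line with only two fields\tx\n",
    "query b\tpositive text b\tnegative text b\n",
]
examples = list(parse_pairwise_lines(raw))
for ex in examples:
    print(ex.query, "|", ex.positive, "|", ex.negative)
```

If the real files use a different delimiter or include a header row, the `sep` argument and the filtering condition would need to be adjusted accordingly.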
Dataset Download
├── milvus # Dataset for building the Milvus database
    ├── milvus_data.csv. # Data for building the recall library
├── recall # Recall (semantic index) dataset
    ├── corpus.csv # Recall library used for testing
    ├── dev.csv # Recall validation set
    ├── test.csv # Recall test set
    ├── train.csv # Recall training set
    ├── train_unsupervised.csv # Unsupervised training set
├── sort # Ranking dataset
    ├── test_pairwise.csv # Ranking test set
    ├── dev_pairwise.csv # Ranking validation set
    └── train_pairwise.csv # Ranking training set