当前位置:网站首页>Brief introduction of prompt paradigm
Brief introduction of prompt paradigm
2022-07-02 07:58:00 【MezereonXP】
Prompt Brief description of paradigm
Traditional Framework:
- pre-train
- fine-tune
The traditional training framework is , First, pre train the model on a large-scale data set , Then fine tune the data set of the target task .
Prompt Framework
- pre-train
- prompt
- predict
Prompt The framework is divided into three parts , Preliminary training ,Prompt Generate , And prediction
Goal: Let the pertained model itself can be used to predict the desired output without any task-specific training.
Prompt In essence, it is to transform task data , Put the original goal 、 label , Make a conversion , Integrate into the data .
For example, the emotion label of the sentence “I won the game.” is good.
We can also get a longer sentence “I won the game, so I felt good.”
The above example is , Label good Convert to additional statements , Add after input .
Such a transformation makes , We learn through natural self-monitoring , You can achieve the goals required by the task
Prompt Basics
Generally speaking ,Prompt There are three steps :
- Prompt Addition
- Answer Search
- Answer Mapping
Prompt Addition
This step is actually to modify the input
such as [X] Overall, it was a [Z] movie Form like this
We fill in the input to [X] The location of , Returns an entire statement , Set aside [Z] The location of , Wait for the answer to fill .
Answer Search
z ^ = search z ∈ Z P ( f fill ( x ′ , z ) ; θ ) \hat z = \text{search}_{z\in \mathcal{Z}}P(f_{\text{fill}}(x',z);\theta) z^=searchz∈ZP(ffill(x′,z);θ)
As shown in the above formula , In this step , Here's what we're trying to do , Fill in the most likely answer .
Answer Mapping
Match the searched answer with the output value
That is, the answer that will be filled , Map to the final output , It could be a label , It may also be the corresponding sentence directly .
边栏推荐
- 【Batch】learning notes
- JVM instructions
- Open3d learning notes II [file reading and writing]
- The internal network of the server can be accessed, but the external network cannot be accessed
- Common CNN network innovations
- 【学习笔记】Matlab自编图像卷积函数
- open3d学习笔记五【RGBD融合】
- Target detection for long tail distribution -- balanced group softmax
- How to turn on night mode on laptop
- 【Programming】
猜你喜欢
【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
【Hide-and-Seek】《Hide-and-Seek: A Data Augmentation Technique for Weakly-Supervised Localization xxx》
【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
【Mixup】《Mixup:Beyond Empirical Risk Minimization》
【Batch】learning notes
使用C#语言来进行json串的接收
简易打包工具的安装与使用
[learning notes] matlab self compiled Gaussian smoother +sobel operator derivation
【Random Erasing】《Random Erasing Data Augmentation》
随机推荐
Open3d learning notes II [file reading and writing]
论文tips
[CVPR‘22 Oral2] TAN: Temporal Alignment Networks for Long-term Video
JVM instructions
The internal network of the server can be accessed, but the external network cannot be accessed
Common CNN network innovations
CVPR19-Deep Stacked Hierarchical Multi-patch Network for Image Deblurring论文复现
【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
【Random Erasing】《Random Erasing Data Augmentation》
[multimodal] clip model
[learning notes] numerical differentiation of back error propagation
【MagNet】《Progressive Semantic Segmentation》
【Wing Loss】《Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks》
Prompt 范式简述
Yolov3 trains its own data set (mmdetection)
【Sparse-to-Dense】《Sparse-to-Dense:Depth Prediction from Sparse Depth Samples and a Single Image》
【MagNet】《Progressive Semantic Segmentation》
open3d学习笔记五【RGBD融合】
Target detection for long tail distribution -- balanced group softmax
[binocular vision] binocular correction