A Brief Introduction to the Prompt Paradigm
2022-07-02 07:58:00 【MezereonXP】
Traditional Framework:
- pre-train
- fine-tune
In the traditional training framework, the model is first pre-trained on a large-scale dataset and then fine-tuned on the dataset of the target task.
Prompt Framework
- pre-train
- prompt
- predict
The Prompt framework has three stages: pre-training, prompt generation, and prediction.
Goal: let the pre-trained model itself predict the desired output without any task-specific training.
In essence, prompting transforms the task data: the original target (the label) is converted into text and folded into the input.
For example, suppose the sentiment label of the sentence “I won the game.” is good.
From this we can build a longer sentence: “I won the game, so I felt good.”
In this example, the label good is converted into an additional clause and appended to the input.
With this transformation, the goal of the task can be reached through ordinary self-supervised learning.
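
The label-to-text conversion described above can be sketched in a few lines of Python. This is only an illustration: the function name `verbalize` and the template string are assumptions made for this example, not part of any library.

```python
def verbalize(text: str, label: str,
              template: str = "{text}, so I felt {label}.") -> str:
    """Fold a label into the input as an extra clause (hypothetical helper)."""
    # Drop trailing punctuation so the joined sentence reads naturally.
    stem = text.rstrip(" .!?")
    return template.format(text=stem, label=label)


if __name__ == "__main__":
    print(verbalize("I won the game.", "good"))
    # prints: I won the game, so I felt good.
```

The result is an ordinary sentence with no separate label field, which is why a model trained only on raw text can be applied to it.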
Prompt Basics
Generally speaking, prompting involves three steps:
- Prompt Addition
- Answer Search
- Answer Mapping
Prompt Addition
This step modifies the input, using a template of the following form:
[X] Overall, it was a [Z] movie
The input is filled into the [X] slot to produce a complete sentence, while the [Z] slot is left empty, waiting for the answer to be filled in.
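
As a rough sketch, Prompt Addition amounts to simple string substitution. The names `TEMPLATE` and `add_prompt`, and the sample review text, are assumptions chosen for illustration.

```python
# Template from the text: [X] is the input slot, [Z] is the answer slot.
TEMPLATE = "[X] Overall, it was a [Z] movie"


def add_prompt(x: str, template: str = TEMPLATE) -> str:
    """Fill the input into [X]; the [Z] slot is left open for the answer."""
    return template.replace("[X]", x)


if __name__ == "__main__":
    x_prime = add_prompt("I love this film.")
    print(x_prime)
    # prints: I love this film. Overall, it was a [Z] movie
```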
Answer Search
$$\hat z = \text{search}_{z\in \mathcal{Z}}P(f_{\text{fill}}(x',z);\theta)$$
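As the formula shows, this step searches the set of candidate answers $\mathcal{Z}$ for the most likely answer $\hat z$ to fill into the prompt. Here $x'$ is the prompt produced in the previous step, $f_{\text{fill}}$ fills the [Z] slot with a candidate $z$, and $\theta$ denotes the parameters of the pre-trained model.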
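
A minimal sketch of this search, assuming the simplest strategy (an argmax over a small candidate set). The scorer `toy_score` is a hypothetical stand-in for the model probability P(·; θ); in practice the score would come from a pre-trained language model. All names here are illustrative.

```python
from typing import Callable, Iterable


def fill(prompt: str, z: str) -> str:
    """f_fill: put a candidate answer z into the [Z] slot."""
    return prompt.replace("[Z]", z)


def search(prompt: str, candidates: Iterable[str],
           score: Callable[[str], float]) -> str:
    """Return the candidate whose filled prompt scores highest (argmax)."""
    return max(candidates, key=lambda z: score(fill(prompt, z)))


def toy_score(sentence: str) -> float:
    """Hypothetical stand-in for P(.; theta), NOT a real language model.

    Sentences whose sentiment words all agree get a higher score.
    """
    positive = {"love", "great", "fantastic"}
    negative = {"hate", "boring", "terrible"}
    words = [w.strip(".,!?").lower() for w in sentence.split()]
    pos = sum(w in positive for w in words)
    neg = sum(w in negative for w in words)
    return float(abs(pos - neg))


if __name__ == "__main__":
    prompt = "I love this film. Overall, it was a [Z] movie"
    print(search(prompt, ["fantastic", "boring"], toy_score))
    # prints: fantastic
```

Argmax is only one possible search strategy; the same `search` signature could wrap sampling or beam search when the answer space is large.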
Answer Mapping
This step maps the answer found by the search to the final output value.
The filled-in answer may be mapped to a label, or it may itself be used directly as the output sentence.
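
Answer Mapping can be illustrated with a plain lookup table. The mapping `ANSWER_TO_LABEL` and its entries are assumptions made for this example; when an answer has no entry, the sketch returns the answer unchanged, which corresponds to the case where the answer itself is the output.

```python
# Hypothetical answer-to-label table for a sentiment task.
ANSWER_TO_LABEL = {
    "fantastic": "positive",
    "great": "positive",
    "boring": "negative",
    "terrible": "negative",
}


def map_answer(z_hat: str, mapping: dict = ANSWER_TO_LABEL) -> str:
    """Map the searched answer to the final output.

    Falls back to the answer itself when no label is defined, as in
    generation tasks where the filled text is the output.
    """
    return mapping.get(z_hat, z_hat)


if __name__ == "__main__":
    print(map_answer("fantastic"))
    # prints: positive
```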