当前位置:网站首页>Paper reading (62):pointer networks
Paper reading (62):pointer networks
2022-07-29 07:18:00 【Inji】
List of articles
1 introduce
1.1 subject
2015 Pointer network (Pointer networks)
1.2 Code
Github:https://github.com/shirgur/PointerNet
1.3 Abstract
A new neural architecture is introduced to Learn the conditional probability of the output sequence , Its elements are discrete markers corresponding to positions in the input sequence . Some existing methods , For example, sequence to sequence and neural Turing machines cannot deal with this problem , Because in every step of their output , The number of target classes depends on the length of the input , But the length of the input sequence is variable . Sorting of variable size sequences , And all kinds of combinatorial optimization problems belong to this kind of problem .
In particular , Variable size output dictionary By introducing Attention mechanism To solve . It differs from previous attention in , It does not use attention to fuse the hidden units of the encoder into context vectors in each decoder step , It is Use attention as a pointer to select the members of the input sequence as the output . Such an architecture is called Pointer network (Ptr-Net).
In order to prove Ptr-Net The effectiveness of the , Three famous geometric problems are used , For finding plane convex hull 、 Calculation Delaunay Triangulation and plane traveling salesman problem .Ptr-Net It not only improves sequence to sequence by inputting attention , And it can also be extended to variable size output dictionaries . The final results show that the generalization ability of the learning models exceeds the maximum length of their training . These results will encourage neural learning to explore a wider range of discrete problems .
1. Bib
@article{
Vinyals:2015:pointer,
author = {
Oriol Vinyals and Meire Fortunato and Navdeep Jaitly},
title = {
Pointer networks},
journal = {
Advances in neural information processing systems},
volume = {
28},
year = {
2015}
}
2 Model
First of all, two background works are introduced , That is, sequence to sequence and input attention model . Then it leads to the Ptr-Net framework .
2.1 Sequence to sequence model
Given a training pair ( P , C P ) (\mathcal{P},\mathcal{C^P}) (P,CP), Sequence to sequence The model uses parameters θ \theta θ Of RNN Calculate the conditional probability based on the probability chain rule :
p ( C P ∣ P ; θ ) = ∏ i = 1 m ( P ) p ( C i ∣ C 1 , … , C i − 1 , P ; θ ) (1) \tag{1} p(\mathcal{C^P}|\mathcal{P};\theta)=\prod_{i=1}^{m(\mathcal{P})}p(C_i|C_1,\dots,C_{i-1},\mathcal{P};\theta) p(CP∣P;θ)=i=1∏m(P)p(Ci∣C1,…,Ci−1,P;θ)(1) A schematic is as follows chart 1.
边栏推荐
- [redis] redis development specifications and precautions
- 330. 按要求补齐数组
- [OpenGL] use of shaders
- gin 路由,参数,输出
- Win11vmware turns on the virtual machine and restarts on the blue screen and the solution that cannot be started
- vagrant box 集群 处理
- Nodejs installation tutorial
- Comparison of advantages between can & canfd integrated test analysis software lkmaster and PCA Explorer 6 analysis software
- MutationObserver文档学习
- 后缀自动机(SAM)讲解 + Luogu p3804【模板】后缀自动机 (SAM)
猜你喜欢

彻底搞懂kubernetes调度框架与插件

Kubernetes (V) -- deploy kubernetes dashboard

WPF simple login page completion case

After three years of outsourcing, the salary of automatic testing after job hopping is twice that of the original. The secret is

WPF嵌套布局案例

CAN&CANFD综合测试分析软件LKMaster与PCAN-Explorer 6分析软件的优势对比

Vite3.0 has been released, can you still roll it (list of new features)

Section 7 - compilation of programs (preprocessing operations) + links

CVPR2021| 基于自监督学习的多视图立体匹配 (CVPR2021)

女研究生做“思维导图”与男友吵架!网友:吵架届的“内卷之王”....
随机推荐
Gin template
Operator3 - design an operator
Nodejs安装教程
Spark Learning Notes (VII) -- spark core core programming - RDD serialization / dependency / persistence / partition / accumulator / broadcast variables
[C language brush leetcode] 1054. Bar code with equal distance (m)
城市花样精~侬好!DESIGN#可视化电台即将开播
接口测试实战项目03:执行测试用例
Why does ETL often become ELT or even let?
0 9 布隆过滤器(Bloom Filter)
WPF 界面布局必知基础
Explanation of suffix automata (SAM) + Luogu p3804 [template] suffix automata (SAM)
Redis Basics
实现改变一段文字的部分颜色效果
Fillder use
Student status management system based on C language design
暑期总结(二)
JS chicken laying eggs and egg laying chickens. Who appeared earlier, object or function? Is function an instance of function?
Error 1045 (28000) access denied for user 'root' @ 'localhost' solution
怎么会不喜欢呢,CICD中轻松发送邮件
npm install报错npm ERR Could not resolve dependency npm ERR peer