当前位置:网站首页>Introduction to sakt method
Introduction to sakt method
2022-07-07 14:10:00 【Try more, record more, accumulate more】
Network architecture and embedded interpretation :
SAKT The Internet : At each timestamp , The attention weight is estimated only for each of the preceding elements . key 、 Values and queries are extracted from the embedding layer shown below . When the first j The first element is the query element and the i When elements are key elements , Note that the weight is a i j a_{ij} aij.
Embedded layer : Embed the current exercise the student is trying and his past interactions . At each mark t + 1 t+1 t+1 when , Use exercises to embed the current problem e t + 1 e_{t+1} et+1 Embedded in the query space , Use interaction to embed elements that will interact in the past x t x_t xt Embedded in key and value spaces .
The method is introduced in detail :
Model purpose : According to the students 1 To t moment Answer the exercises of ,( Interaction sequence X = x 1 , x 2 , . . . , x t X = x_1, x_2, ..., x_t X=x1,x2,...,xt,) Forecast on t + 1 t+1 t+1 moment , exercises e t + 1 e_{t+1} et+1 Response of ( That is, predict the real situation , The right probability ).
Interactive tuples : x t = ( e t , r t ) x_t = ( e_t, r_t) xt=(et,rt) : t t t Time exercises e t e_t et Answer of r t r_t rt Composed of . x t x_t xt When numbering , Use both to express ,: y t = e t + r t × E y_t = e_t + r_t × E yt=et+rt×E , E E E Is the number of topics , You can see the interaction number , Wrong answer The time is the same as the title number y t = e t y_t = e_t yt=et, When the answer is correct , Number plus the total number of topics y t = e t + E y_t = e_t + E yt=et+E.
Embedded layer description :
The interaction sequence needs to be divided , Ensure that the length of all interaction sequences is consistent , Many are truncated , Short fill .
Therefore, the interaction sequence is composed of y = ( y 1 , y 2 , . . . , y t ) y = (y_1, y_2, ...,y_t) y=(y1,y2,...,yt) Turn into s = ( s 1 , s 2 , . . . , s n ) s = (s_1,s_2,...,s_n) s=(s1,s2,...,sn).
Train an interactive embedding matrix : M ∈ R 2 E × d M ∈ R^{2E×d} M∈R2E×d, among d It's a potential dimension , Used to get interactive embedding . s i s_i si The embedding of is expressed as M s i M_{s_i} Msi
Practice embedding a matrix : E ∈ R E × d E ∈ R^{E×d} E∈RE×d, Users get exercises embedded . e i e_i ei The embedding of is expressed as E e i E_{e_i} Eei
Location code :
In order to encode the sequence sequence , Introduce parameters P ∈ R n × d P ∈ R^{n×d} P∈Rn×d, Add to interactive embedding , Form a new code . P i P_i Pi Add to section i i i An interactive embedding vector , Form an interactive embedding vector with position coding .
From the attention level
Q: Exercises embedded
K: Answer interactively embedded
V : Answer interactively embedded
Using the attention mechanism of scaling dot product
The current exercise interacts with each previous answer Have a relationship , Calculate the attention weight .
long position
Capture information from different subspaces .
Causal relationship
Because of the sequence , Unable to know the information of the predicted topic , So use the causality layer to mask the weights learned from future interactions .
Feedforward layer
In order to add nonlinearity to the model and consider the interaction between different potential dimensions , We use a feedforward network .
Residual connection
Use low-level information
Prediction layer
The probability of getting the prediction
Network training
Cross entropy
边栏推荐
- Evolution of customer service hotline of dewu
- [high frequency interview questions] difficulty 2.5/5, simple combination of DFS trie template level application questions
- 2022-7-6 Leetcode 977.有序数组的平方
- Sliding rail stepping motor commissioning (national ocean vehicle competition) (STM32 master control)
- [1] Basic knowledge of ros2 - summary version of operation commands
- 3D detection: fast visualization of 3D box and point cloud
- The reason why data truncated for column 'xxx' at row 1 appears in the MySQL import file
- 3D Detection: 3D Box和点云 快速可视化
- js 获取当前时间 年月日,uniapp定位 小程序打开地图选择地点
- How does MySQL control the number of replace?
猜你喜欢
2022-7-7 Leetcode 34.在排序数组中查找元素的第一个和最后一个位置
手把手教会:XML建模
2022-7-6 beginner redis (I) download, install and run redis under Linux
2022-7-6 Leetcode 977. Square of ordered array
LeetCode简单题分享(20)
libSGM的horizontal_path_aggregation程序解读
数据库系统概论-第一章绪论【概念模型、层次模型和三级模式(外模式、模式、内模式)】
Evolution of customer service hotline of dewu
docker部署oracle
Xshell connection server changes key login to password login
随机推荐
【立体匹配论文阅读】【三】INTS
CSMA/CD 载波监听多点接入/碰撞检测协议
[network security] SQL injection syntax summary
现在网上开户安全么?那么网上开户选哪个证券公司?
Regular expression integer positive integer some basic expressions
Hands on Teaching: XML modeling
3D detection: fast visualization of 3D box and point cloud
参数关键字Final,Flags,Internal,映射关键字Internal
Excerpt from "misogyny: female disgust in Japan"
XML文件的解析操作
FCOS3D label assignment
[untitled]
【AI实战】应用xgboost.XGBRegressor搭建空气质量预测模型(二)
Laravel Form-builder使用
请问,redis没有消费消息,都在redis里堆着是怎么回事?用的是cerely 。
Battle Atlas: 12 scenarios detailing the requirements for container safety construction
高等数学---第八章多元函数微分学1
得物客服热线的演进之路
Advanced Mathematics - Chapter 8 differential calculus of multivariate functions 1
【网络安全】sql注入语法汇总