当前位置:网站首页>Introduction to sakt method
Introduction to sakt method
2022-07-07 14:10:00 【Try more, record more, accumulate more】
Network architecture and embedded interpretation :
SAKT The Internet : At each timestamp , The attention weight is estimated only for each of the preceding elements . key 、 Values and queries are extracted from the embedding layer shown below . When the first j The first element is the query element and the i When elements are key elements , Note that the weight is a i j a_{ij} aij.
Embedded layer : Embed the current exercise the student is trying and his past interactions . At each mark t + 1 t+1 t+1 when , Use exercises to embed the current problem e t + 1 e_{t+1} et+1 Embedded in the query space , Use interaction to embed elements that will interact in the past x t x_t xt Embedded in key and value spaces .
The method is introduced in detail :
Model purpose : According to the students 1 To t moment Answer the exercises of ,( Interaction sequence X = x 1 , x 2 , . . . , x t X = x_1, x_2, ..., x_t X=x1,x2,...,xt,) Forecast on t + 1 t+1 t+1 moment , exercises e t + 1 e_{t+1} et+1 Response of ( That is, predict the real situation , The right probability ).
Interactive tuples : x t = ( e t , r t ) x_t = ( e_t, r_t) xt=(et,rt) : t t t Time exercises e t e_t et Answer of r t r_t rt Composed of . x t x_t xt When numbering , Use both to express ,: y t = e t + r t × E y_t = e_t + r_t × E yt=et+rt×E , E E E Is the number of topics , You can see the interaction number , Wrong answer The time is the same as the title number y t = e t y_t = e_t yt=et, When the answer is correct , Number plus the total number of topics y t = e t + E y_t = e_t + E yt=et+E.
Embedded layer description :
The interaction sequence needs to be divided , Ensure that the length of all interaction sequences is consistent , Many are truncated , Short fill .
Therefore, the interaction sequence is composed of y = ( y 1 , y 2 , . . . , y t ) y = (y_1, y_2, ...,y_t) y=(y1,y2,...,yt) Turn into s = ( s 1 , s 2 , . . . , s n ) s = (s_1,s_2,...,s_n) s=(s1,s2,...,sn).
Train an interactive embedding matrix : M ∈ R 2 E × d M ∈ R^{2E×d} M∈R2E×d, among d It's a potential dimension , Used to get interactive embedding . s i s_i si The embedding of is expressed as M s i M_{s_i} Msi
Practice embedding a matrix : E ∈ R E × d E ∈ R^{E×d} E∈RE×d, Users get exercises embedded . e i e_i ei The embedding of is expressed as E e i E_{e_i} Eei
Location code :
In order to encode the sequence sequence , Introduce parameters P ∈ R n × d P ∈ R^{n×d} P∈Rn×d, Add to interactive embedding , Form a new code . P i P_i Pi Add to section i i i An interactive embedding vector , Form an interactive embedding vector with position coding .
From the attention level
Q: Exercises embedded
K: Answer interactively embedded
V : Answer interactively embedded
Using the attention mechanism of scaling dot product
The current exercise interacts with each previous answer Have a relationship , Calculate the attention weight .
long position
Capture information from different subspaces .
Causal relationship
Because of the sequence , Unable to know the information of the predicted topic , So use the causality layer to mask the weights learned from future interactions .
Feedforward layer
In order to add nonlinearity to the model and consider the interaction between different potential dimensions , We use a feedforward network .
Residual connection
Use low-level information
Prediction layer
The probability of getting the prediction
Network training
Cross entropy
边栏推荐
- 請問,在使用flink sql sink數據到kafka的時候出現執行成功,但是kafka裏面沒有數
- VSCode 配置使用 PyLint 语法检查器
- 现在网上开户安全么?那么网上开户选哪个证券公司?
- Mysql怎样控制replace替换的次数?
- Beginner XML
- wpf dataGrid 实现单行某个数据变化 ui 界面随之响应
- 请问,在使用flink sql sink数据到kafka的时候出现执行成功,但是kafka里面没有数
- Leecode3. Longest substring without repeated characters
- 请问,如图,pyhon云函数提示使用了 pymysql模块,这个是怎么回事?
- [daily training] 648 Word replacement
猜你喜欢
2022-7-7 Leetcode 34. Find the first and last positions of elements in a sorted array
566. Reshaping the matrix
2022-7-7 Leetcode 34.在排序数组中查找元素的第一个和最后一个位置
Redis can only cache? Too out!
高等數學---第八章多元函數微分學1
Advanced Mathematics - Chapter 8 differential calculus of multivariate functions 1
Did login metamask
SAKT方法部分介绍
. Net core about redis pipeline and transactions
一个简单LEGv8处理器的Verilog实现【四】【单周期实现基础知识及模块设计讲解】
随机推荐
Supply chain supply and demand estimation - [time series]
得物客服热线的演进之路
供应链供需预估-[时间序列]
社会责任·价值共创,中关村网络安全与信息化产业联盟对话网信企业家海泰方圆董事长姜海舟先生
The delivery efficiency is increased by 52 times, and the operation efficiency is increased by 10 times. See the compilation of practical cases of financial cloud native technology (with download)
Environment configuration of lavarel env
AutoCAD - how to input angle dimensions and CAD diameter symbols greater than 180 degrees?
requires php ~7.1 -> your PHP version (7.0.18) does not satisfy that requirement
Huawei image address
Flink | multi stream conversion
Introduction to database system - Chapter 1 introduction [conceptual model, hierarchical model and three-level mode (external mode, mode, internal mode)]
How to check the ram and ROM usage of MCU through Keil
c#利用 TCP 协议建立连接
Excerpt from "misogyny: female disgust in Japan"
Best practice | using Tencent cloud AI willingness to audit as the escort of telephone compliance
AI人才培育新思路,这场直播有你关心的
VSCode 配置使用 PyLint 语法检查器
Indoor ROS robot navigation commissioning record (experience in selecting expansion radius)
3D Detection: 3D Box和点云 快速可视化
Wired network IP address of VMware shared host