Development trend of Alibaba Taobao fine-ranking models
2022-07-06 01:06:00 【learner_ ctr】
1. DIN network (July 19, 2018)
DIN proposes an attention structure for user behavior sequence features. This attention is not self-attention: the candidate item being scored is used as the query, and each item in the user sequence is used as key and value.
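A minimal sketch of this target attention in plain Python, not the DIN implementation. The candidate item's embedding is the query, and the behavior embeddings serve as both keys and values. Scaled dot-product scoring with a softmax is an assumption made for brevity (the DIN paper scores each pair with a small learned network instead), and `target_attention`, `candidate`, and `history` are hypothetical names.

```python
import math

def target_attention(query, keys, values):
    """Pool a behavior sequence, weighting each item by its match to the candidate."""
    dim = len(query)
    # One score per behavior: scaled dot product of candidate and behavior (assumed scorer).
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim) for key in keys]
    # Softmax over the sequence; subtracting the max keeps exp() numerically stable.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum of the value vectors gives the candidate-specific user interest.
    pooled = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(len(values[0]))]
    return weights, pooled

candidate = [1.0, 0.0]                            # embedding of the item being scored
history = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]   # user's clicked-item embeddings
weights, user_interest = target_attention(candidate, history, history)
```

Because the query is the candidate item, the same user history yields a different pooled vector for every candidate, which is what separates this from self-attention over the sequence.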
2. DIEN network (2018)
DIEN improves on DIN as follows. DIN only applies attention over the user sequence. DIEN first adds an auxiliary loss: a GRU runs over the click sequence (length T) and produces T outputs, then the first T-1 outputs are paired with the last T-1 positions of the sequence and the per-step losses are summed, so each output is trained to predict the next click.
These T outputs then go through a GRU with attention to produce the final output final_state2, which is concatenated with the other features (the user embedding, the item embedding, the various sequences, and the dot product of the first two) and sent to the MLP layers that follow.
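The pairing behind the auxiliary loss can be sketched as below. This is a simplified illustration under stated assumptions, not the DIEN code: the GRU outputs are passed in as plain lists rather than computed, one sampled non-clicked item per step is used as the negative (the text above does not describe negatives), and all function and variable names are hypothetical.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def auxiliary_loss(hidden_states, click_embs, negative_embs):
    """Sum next-click losses: output t is scored against the item at position t+1."""
    total = 0.0
    steps = len(hidden_states)
    for t in range(steps - 1):  # only the first T-1 outputs have a "next" click
        h = hidden_states[t]
        # The item actually clicked next should score high.
        total += -math.log(sigmoid(dot(h, click_embs[t + 1])))
        # A sampled item that was not clicked should score low (assumed negative sample).
        total += -math.log(1.0 - sigmoid(dot(h, negative_embs[t + 1])))
    return total

# Toy inputs standing in for GRU outputs and item embeddings (T = 3).
hidden_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
click_embs = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
negative_embs = [[0.0, 0.0], [-1.0, 0.0], [0.0, -1.0]]
loss = auxiliary_loss(hidden_states, click_embs, negative_embs)
```

In training, this term would be added to the main click-prediction loss with a weighting factor, so the GRU states receive supervision at every step instead of only at the end of the sequence.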
Reference: DIEN paper translation (1066196847's blog, CSDN)
3. Long-sequence modeling: MIMN (July 25, 2019)
MIMN removes the bottleneck that DIN and DIEN cannot handle long sequences. It can model sequences of length up to 1000.
Reference: Alimama's modeling of long-term historical user behavior: the MIMN model (Zhihu)
4. SIM model (June 2020)
MIMN aims for length but does not take the target item into account; DIN is precise but cannot handle long sequences. SIM can model sequences of length up to 10000, which are very common in e-commerce (Taobao) and content (Douyin) scenarios. For example, according to the article I read, in 180 days of behavior data the click sequences of 30% of samples exceed 10000, which is a considerable share.
SIM first searches out a batch of relevant items from the long sequence, then processes them with the DIN approach.
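The two stages can be sketched as follows. This is an illustration under assumptions, not the SIM implementation: the retrieval rule used here (keep behaviors whose category matches the candidate's) is one possible choice that the text above does not specify, the second stage uses plain dot-product attention as a stand-in for DIN, and every name and data value is hypothetical.

```python
import math

def search_relevant(behaviors, candidate_category, max_items):
    """Stage 1: cut the long history down to behaviors in the candidate's category."""
    hits = [b for b in behaviors if b["category"] == candidate_category]
    # Behaviors are ordered oldest to newest, so the tail holds the most recent hits.
    return hits[-max_items:] if max_items > 0 else []

def attend(query, embeddings):
    """Stage 2: DIN-style pooling of the retrieved items, with the candidate as query."""
    if not embeddings:
        return [0.0] * len(query)
    scores = [sum(q * e for q, e in zip(query, emb)) for emb in embeddings]
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]  # softmax numerator, stabilized
    total = sum(exps)
    return [sum((x / total) * emb[i] for x, emb in zip(exps, embeddings))
            for i in range(len(query))]

# Synthetic history of 10000 behaviors spread over 100 categories.
long_history = [{"category": i % 100, "emb": [math.cos(i), math.sin(i)]}
                for i in range(10000)]
retrieved = search_relevant(long_history, candidate_category=7, max_items=50)
user_interest = attend([1.0, 0.0], [b["emb"] for b in retrieved])
```

The expensive attention step only sees the 50 retrieved behaviors instead of all 10000, which is what makes very long histories tractable at serving time.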