Embedding cutting-edge understanding
2022-07-31 06:14:00 【Young_win】
Content from https://mp.weixin.qq.com/s/j34nJGomvR23ZJiqIFMoAQ
Q: With massive sparse features, how do we find a good Embedding representation for each feature?
(1) For Item Embeddings in behavior-sequence models, what kind of Embedding representation works better?
(2) For recommendation models that do not use behavior sequences, the usual practice is to treat each feature's Embedding size as a hyperparameter and tune it by hand until a good size is found. Is there a better way?
A1: Res-embedding for Deep Learning Based Click-Through Rate Prediction Modeling.
Res-embedding first proves theoretically that the generalization error of a neural-network CTR model is closely tied to how Items are distributed in the Embedding space: the smaller the envelope radius of Items that reflect similar user interests, the smaller the generalization error. In other words, the more tightly Items of the same interest cluster in the Embedding space, the better the model generalizes. This conclusion is useful because it can be turned into a constraint on Item Embeddings during training, which improves the model. Building on it, Res-embedding proposes a more general method: the Embedding of an Item is the sum of two parts. One is the Central Embedding, the interest center shared by all Items that belong to the same interest. The other is the Residual Embedding, which is specific to the Item itself.
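The two-part structure described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the paper's implementation: the class name `ResEmbedding`, the hard one-interest-per-Item mapping `item_to_interest`, and the `residual_scale` bound on the residual are all hypothetical choices made here to show why Items of the same interest stay close together.

```python
import random


def make_table(rows, dim, scale, rng):
    """Build a rows x dim table with entries drawn uniformly from [-scale, scale]."""
    return [[rng.uniform(-scale, scale) for _ in range(dim)] for _ in range(rows)]


class ResEmbedding:
    """Item Embedding = Central Embedding (per interest) + Residual Embedding (per Item).

    Assumption: each Item belongs to exactly one interest, given by item_to_interest.
    """

    def __init__(self, item_to_interest, num_interests, dim, residual_scale=0.01, seed=0):
        rng = random.Random(seed)
        self.item_to_interest = item_to_interest
        # One shared center per interest.
        self.central = make_table(num_interests, dim, 1.0, rng)
        # One small residual per Item; keeping it small keeps each cluster compact.
        self.residual = make_table(len(item_to_interest), dim, residual_scale, rng)

    def lookup(self, item_id):
        center = self.central[self.item_to_interest[item_id]]
        residual = self.residual[item_id]
        return [c + r for c, r in zip(center, residual)]


def distance(u, v):
    """Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5


# Items 0, 1, 2 share interest 0; Items 3, 4 share interest 1 (toy assignment).
item_to_interest = [0, 0, 0, 1, 1]
emb = ResEmbedding(item_to_interest, num_interests=2, dim=4)
same_interest_distance = distance(emb.lookup(0), emb.lookup(1))
print(same_interest_distance)
```

Because Items 0 and 1 share a center, their distance depends only on their residuals, so bounding the residual bounds the cluster radius. In a trained model the tables would be learned parameters and the residual would be constrained or regularized rather than fixed at initialization.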
A2: Neural Input Search for Large Scale Recommendation Models (NIS).
First imagine an ideal Embedding allocation scheme. If one exists, it should look like this. High-frequency features get a longer Embedding, so that they can encode more information. Low-frequency features get a shorter Embedding: they appear rarely in the training data, so a longer Embedding is more likely to overfit and hurt the model's generalization. Very low-frequency features offer almost nothing to learn and mostly add noise, so they can either get no Embedding at all or share a single common Embedding. How large is the decision (search) space? In the scheme shown in the source article's figure, there are 4 decision steps with 5 choices each, so the decision space contains 5^4 = 625 allocation schemes. ENAS evaluates each allocation scheme by its AUC on the validation set and by the amount of Embedding space it consumes. Schemes with good validation metrics and low space consumption are encouraged, and the reinforcement-learning Reward is designed around this idea. In this way, a reinforcement-learning procedure can search for the best Embedding allocation scheme.
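The size of the decision space and the shape of the Reward can be made concrete with a small sketch. The numbers 5 and 4 come from the text above. Everything else is an assumption for illustration: the function names, the linear penalty form, the `cost_weight` value, and the AUC and parameter counts passed in are hypothetical and are not taken from the NIS paper.

```python
from itertools import product

NUM_CHOICES = 5  # choices available at each decision step (from the text)
NUM_STEPS = 4    # number of decision steps (from the text)


def enumerate_schemes(num_choices=NUM_CHOICES, num_steps=NUM_STEPS):
    """List every allocation scheme as a tuple of choice indices, one per step."""
    return list(product(range(num_choices), repeat=num_steps))


def reward(val_auc, embedding_params, param_budget, cost_weight=0.1):
    """Hypothetical Reward: validation AUC minus a penalty for Embedding space used.

    Assumption: the penalty is linear in the fraction of the budget consumed.
    """
    return val_auc - cost_weight * (embedding_params / param_budget)


schemes = enumerate_schemes()
print(len(schemes))  # 5 ** 4 allocation schemes

# Illustrative inputs only: same AUC, different space consumption.
small = reward(val_auc=0.75, embedding_params=100, param_budget=1000)
large = reward(val_auc=0.75, embedding_params=500, param_budget=1000)
print(small > large)  # the scheme using less space gets the higher Reward
```

A real search would not enumerate the space. A controller samples schemes, each sampled scheme is trained and scored on the validation set, and the Reward updates the controller so that it samples better schemes over time.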