Understanding deep learning attention
2022-06-10 21:11:00 【Tofu doesn't close】
The so-called attention mechanism, whether in computer vision or in natural language processing, is essentially the same thing: weighting. Weighting means paying more attention to some parts of the input and less to others.
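For example, self-attention comes down to a single formula:
$$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{QK^{T}}{\sqrt{d_k}}\right)V$$
Here $\mathrm{softmax}$ computes the weights; $Q$, $K$, and $V$ are obtained from the input $X$ by linear transformations; and $\sqrt{d_k}$ is the square root of the dimension of $K$, used to keep the values from growing too large.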
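The self-attention computation described above, softmax(QK^T / sqrt(d_k)) V with Q, K, V obtained from X by linear transformations, can be sketched in NumPy roughly as follows. This is a minimal illustration, not a reference implementation: the function names, the weight matrices `W_q`, `W_k`, `W_v`, and the shapes (4 tokens, 8 input features, 16 projected features) are assumptions chosen for the example, and the weights are random rather than learned.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum first for numerical stability.
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / np.sum(e, axis=axis, keepdims=True)

def self_attention(X, W_q, W_k, W_v):
    # Q, K, V are linear transformations of the same input X.
    Q = X @ W_q
    K = X @ W_k
    V = X @ W_v
    d_k = K.shape[-1]
    # Scale by sqrt(d_k) so the scores do not grow too large.
    scores = Q @ K.T / np.sqrt(d_k)
    # Each row of `weights` sums to 1: how much one token attends to every token.
    weights = softmax(scores, axis=-1)
    return weights @ V, weights

# Hypothetical sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
W_q, W_k, W_v = (rng.normal(size=(8, 16)) for _ in range(3))
out, weights = self_attention(X, W_q, W_k, W_v)
print(out.shape, weights.shape)
```

Each output row is a weighted average of the rows of V, which is the "weighting" the post refers to.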
The so-called multi-head attention mechanism is simply several attention operations put together.
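A rough sketch of putting several attention operations together might look like the following. It assumes the common Transformer-style arrangement in which each head has its own projection matrices, the head outputs are concatenated, and a final matrix `W_o` mixes them; the post itself only says the heads are "put together", so the concatenation, `W_o`, and all names and shapes here are assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum first for numerical stability.
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / np.sum(e, axis=axis, keepdims=True)

def multi_head_attention(X, heads, W_o):
    # `heads` is a list of (W_q, W_k, W_v) tuples, one per attention head.
    outputs = []
    for W_q, W_k, W_v in heads:
        Q, K, V = X @ W_q, X @ W_k, X @ W_v
        weights = softmax(Q @ K.T / np.sqrt(K.shape[-1]), axis=-1)
        outputs.append(weights @ V)
    # Concatenate the per-head outputs along the feature axis, then mix them
    # with the (assumed) output projection W_o.
    return np.concatenate(outputs, axis=-1) @ W_o

# Hypothetical sizes: 4 tokens, 8 features, 2 heads of width 4 each.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
heads = [tuple(rng.normal(size=(8, 4)) for _ in range(3)) for _ in range(2)]
W_o = rng.normal(size=(8, 8))
out = multi_head_attention(X, heads, W_o)
print(out.shape)
```

Because each head has its own projections, different heads can weight the same input differently.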