当前位置:网站首页>Pytorch's LSTM parameters explained
Pytorch's LSTM parameters explained
2022-08-02 09:33:00 【A little cute c】
LSTM has a total of 7 parameters:
1: input_size - the size of the input data
2: hidden_size – the size of the hidden layer (ie the number of hidden layer nodes), the dimension of the output vector is equal to the number of hidden nodes
3: num_layers – the number of layers of LSTM stack, the default value is 1 layer, if set to 2, the second LSTM receives the calculation result of the first LSTM.That is, the first layer inputs [ X0 X1 X2 ... Xt ], calculates [ h0 h1 h2 ... ht ], and the second layer takes [ h0 h1 h2 ... ht ] as [ X0 X1 X2 ... Xt ]] The input is calculated again, outputting the final [ h0 h1 h2 ... ht ].
4: bias – whether the hidden layer state has bias, the default is true.bias is the bias value, or offset value
5: batch_first – whether the first dimension of input and output is batch_size, the default value is False
6: dropout – default value 0.Whether to add dropout layers after other RNN layers except the last RNN layer.The input value is a decimal between 0-1 and represents a probability.0 means 0 probability dropout, i.e. no dropout
7: bidirectional – whether it is a bidirectional RNN, the default is: false, if it is true, then: num_directions=2, otherwise it is 1.
边栏推荐
- Worship, Alibaba distributed system development and core principle analysis manual
- node制作一个视频帧长图生成器
- 大厂外包,值得拥有吗?
- LeetCode_2358_分组的最大数量
- [Concurrent programming] - Thread pool uses DiscardOldestPolicy strategy, DiscardPolicy strategy
- 中国发布丨滴滴因违反网络安全法等被罚80.26亿元!调查细节公布
- HCIA动态主机配置协议实验(dhcp)
- spark:页面单跳转换率统计(案例)
- Scala类型转换
- 理解JS的三座大山
猜你喜欢

Re22:读论文 HetSANN An Attention-based Graph Neural Network for Heterogeneous Structural Learning

每天花2小时恶补腾讯T8纯手打688页SSM框架和Redis,成功上岸美团

Tencent T8 architect, teach you to learn small and medium R&D team architecture practice PDF, senior architect shortcut

李航《统计学习方法》笔记之k近邻法

leetcode 62. Unique Paths(独特的路径)

openpyxl 单元格合并

Overview of Edge Computing Open Source Projects

In the whole development of chi V853 board tried to compile QT test

HCIA静态路由综合练习

shell脚本
随机推荐
【Flink 问题】Flink 如何提交轻量jar包 依赖该如何存放 会遇到哪些问题
恋爱十不要
Tencent T8 architect, teach you to learn small and medium R&D team architecture practice PDF, senior architect shortcut
Qt读取文件中内容(通过判断GBK UTF-8格式进行读取显示)
Spend 2 hours a day to make up for Tencent T8, play 688 pages of SSM framework and Redis, and successfully land on Meituan
Jenkins--基础--07--Blue Ocean
智能网络安全网卡|这是不是你要的安全感
动态规划每日一练(2)
十、 网络管理
WebGPU 导入[1] - 入门常见问题与个人分享
leetcode:639. 解码方法 II
Jenkins--基础--6.1--Pipeline--介绍
MySQL安装与卸载详细教程
1对1视频源码——快速实现短视频功能提升竞争力
Golang ORM框架 — GORM
EdrawMax Crack,多合一的图表应用程序
干货|如何在海量文件系统中选择合适自己的文件系统
三国演义小说
tf.where使用
每天花2小时恶补腾讯T8纯手打688页SSM框架和Redis,成功上岸美团