PyTorch's LSTM parameters explained
2022-08-02 09:33:00 【A little cute c】
The nn.LSTM constructor has 7 main parameters (PyTorch 1.8 and later also accept proj_size, which is not covered here):
1: input_size – the number of features in the input at each time step (the length of each input vector)
2: hidden_size – the number of features in the hidden state (i.e. the number of hidden nodes). The dimension of each output vector equals hidden_size, or 2 * hidden_size when bidirectional is True
3: num_layers – the number of stacked LSTM layers; the default is 1. If set to 2, the second LSTM layer takes the outputs of the first as its inputs. That is, the first layer receives [ X0 X1 X2 ... Xt ] and computes [ h0 h1 h2 ... ht ]; the second layer then treats [ h0 h1 h2 ... ht ] as its input sequence [ X0 X1 X2 ... Xt ] and computes the final [ h0 h1 h2 ... ht ].
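4: bias – whether the layer uses the bias terms b_ih and b_hh; the default is True. If False, the gates are computed without an additive offset
5: batch_first – whether the first dimension of the input and output tensors is batch_size, i.e. (batch, seq, feature) instead of (seq, batch, feature); the default is False. It does not change the layout of the returned hidden and cell states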
6: dropout – the default is 0. If non-zero, a dropout layer is applied to the outputs of every LSTM layer except the last one, so it only has an effect when num_layers is greater than 1. The value is a probability between 0 and 1; 0 means no dropout
7: bidirectional – whether the LSTM is bidirectional; the default is False. If True, num_directions = 2; otherwise it is 1.
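To make the effect of these parameters on tensor shapes concrete, here is a minimal sketch. The sizes (seq_len, batch_size and so on) are arbitrary values chosen for illustration and do not come from the article.

```python
import torch
from torch import nn

# Illustrative sizes (assumptions, chosen only for this example)
seq_len, batch_size = 5, 3
input_size, hidden_size, num_layers = 10, 20, 2

# All seven parameters from the list above, spelled out explicitly
lstm = nn.LSTM(
    input_size=input_size,
    hidden_size=hidden_size,
    num_layers=num_layers,
    bias=True,
    batch_first=False,   # input is (seq, batch, feature)
    dropout=0.0,
    bidirectional=True,
)
num_directions = 2 if lstm.bidirectional else 1

x = torch.randn(seq_len, batch_size, input_size)
output, (h_n, c_n) = lstm(x)

# output: (seq_len, batch_size, num_directions * hidden_size)
print(output.shape)
# h_n and c_n: (num_layers * num_directions, batch_size, hidden_size)
print(h_n.shape, c_n.shape)

# With batch_first=True the batch dimension of input and output comes first
lstm_bf = nn.LSTM(input_size, hidden_size, batch_first=True)
out_bf, (h_bf, c_bf) = lstm_bf(torch.randn(batch_size, seq_len, input_size))

# out_bf: (batch_size, seq_len, hidden_size)
print(out_bf.shape)
# h_bf keeps the layout (num_layers * num_directions, batch_size, hidden_size)
print(h_bf.shape)
```

Note that batch_first only changes the layout of the input and output tensors; the returned hidden and cell states keep the batch in the second dimension either way.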