当前位置:网站首页>ML - Speech - advanced speech model
ML - Speech - advanced speech model
2022-07-25 15:23:00 【sword_ csdn】
Catalog
Reference resources
Huawei cloud College
RNN
Cyclic neural network (Recurrent Neural Networks,RNN) It is a periodic connection through hidden layer nodes , A neural network to capture dynamic information in serialized data , The serialized data can be classified .
No connection with other networks ,RNN You can save the state of a context , Can store in any long context window 、 Study 、 Express relevant information . And it is no longer limited to the boundary of traditional neural network in space , It can be extended in time series .
RNN It is widely used in scenes related to sequences , Such as a video composed of frames of images , A piece of audio , And a sentence of words .
RNN Network structure

RNN Structural expansion

x Is an input that is being read at the current time , And output a value h;s Is the state of one of the sequences , Which has been processed by the corresponding activation function .
standard RNN

BPTT
RNN Forward propagation of is : Calculate once in chronological order ,BPTT Is to pass the accumulated residual back from the last , This is similar to ordinary neural network training , The difference is that we add the gradients at each moment .
LSTM
Long and short term memory network (Long Short-Term Memmory,LSTM), It is a kind of time recurrent neural network , It is suitable for processing and predicting events with long intervals and delays in time series .
LSTM And RNN The difference is this , It adds a method to judge whether the information is useful “ processor ”, The structure of this processor becomes cell.cell Three doors are placed in , They are called input gates , Forgetting gate and output gate . A message enters the network , You can judge whether it is useful according to the rules . Only the information that conforms to the algorithm authentication will be left .
LSTM And speech recognition
DNN Each frame of voice and several frames before and after it are spliced together as the input of the network , So as to use the context information in the speech sequence .DNN The number of frames per input is fixed , Different window lengths will affect the results .
RNN To a certain extent, customer service DNN The shortcomings of , however RNN It's easy to see the gradient disappear , Unable to remember long-term information .
LSTM Through a specific gating unit, the error of the current time can be saved and selectively transmitted to a specific unit , So as to avoid the problem of gradient disappearance . Suitable for long relevant information and location intervals . It is applicable to the need to connect the previous long-term information to the current task 

LSTM: The initial state

LSTM: Oblivion gate

LSTM: Input gate

LSTM: to update


边栏推荐
- 记一次Yarn Required executor memeory is above the max threshold(8192MB) of this cluster!
- Spark partition operators partitionby, coalesce, repartition
- golang复习总结
- 什么是物联网
- SublimeText-win10光标跟随问题
- MySql的安装配置超详细教程与简单的建库建表方法
- Introduction to raspberry Pie: initial settings of raspberry pie
- 解决DBeaver SQL Client 连接phoenix查询超时
- The development summary of the function of fast playback of audio and video in any format on the web page.
- Spark SQL null value, Nan judgment and processing
猜你喜欢

从 join on 和 where 执行顺序认识T-sql查询执行顺序

Docker上运行redis以配置文件方式启动,连接客户端报错Error: Server closed the connection

Spark partition operators partitionby, coalesce, repartition

Spark AQE

Spark memory management mechanism new version

What is the Internet of things

npm的nexus私服 E401 E500错误处理记录

图论及概念

ML - 自然语言处理 - 自然语言处理简介

VMware Workstation fails to start VMware authorization service when opening virtual machine
随机推荐
记一次Yarn Required executor memeory is above the max threshold(8192MB) of this cluster!
RedisCluster搭建和扩容
dpdk 收发包问题案例:使用不匹配的收发包函数触发的不收包问题定位
Visual Studio 2022 查看类关系图
Spark memory management mechanism new version
Spark提交参数--files的使用
图片裁剪cropper 示例
基于OpenCV和YOLOv3的目标检测实例应用
How to understand the maximum allowable number of errors per client connection of MySQL parameters in Seata?
简易轮播图和打地鼠
本地缓存--Ehcache
MySql的安装配置超详细教程与简单的建库建表方法
MeanShift聚类-01原理分析
Spark SQL空值Null,NaN判断和处理
C语言函数复习(传值传址【二分查找】,递归【阶乘,汉诺塔等】)
UITextField的inputView和inputAccessoryView注意点
一个程序最多可以使用多少内存?
Record a redis timeout
Idea护眼色设置
Iframe nested other website page full screen settings