当前位置:网站首页>Speech synthesis: overview [generation task of unequal length sequence relation modeling]
Speech synthesis: overview [generation task of unequal length sequence relation modeling]
2022-06-29 08:20:00 【u013250861】
One 、 What is speech synthesis ?
Speech synthesis is a “ Generation task of unequal length sequence relation modeling ”
- Input :【tex len 】; Input :【frequency dim, spectrum length】
- Enter shape : Text token Sequence length ; Shape of the output :( Frequency dimension , Spectrum sequence length )

“ 739 ”5 individual “token” Corresponding 20 Multiple voices “ frame ”
Cannot model alone “ 7、 ... and ” And X The relationship between frames ,“ hundred ” And Y The relationship between frames ,...., And then put them together , This is against the nature of human pronunciation .
Two 、 Basic training framework of speech synthesis

1、 Introduction to training data
Sampling rate = 16000

2、Token Embedding Layer

To map characters to floating point numbers ,pytorch To take the “ Trainable query table ” The way , Set the... Contained in the dataset token Number &
边栏推荐
- Hook 简介
- Seven common sorts
- Program debugging - debug/release version
- [Kerberos] analysis of Kerberos authentication
- 语音处理工具:sox
- C compiler - implicit function declaration
- Use GPU training in the cloud on the laboratory (take yolov5 as an example)
- PHP clear empty values in multidimensional array
- 笔记本电脑快速连接手机热点的方法
- [eye of depth Wu Enda's fourth operation class] summary of multiple linear regression with multiple variables
猜你喜欢

Talking about Nacos configuration center from Nacos client

Flutter 文件读写-path_provider

AC automata

互联网公司的组织结构与产品经理岗位职责是什么?

【6G】算力网络技术白皮书整理

Hands on deep learning (I) -- linear neural network

AI deep dive of Huawei cloud

After crossing, she said that the multiverse really exists

友元,静态关键字,静态方法以及对象间的关系

Un voyage profond d'IA dans Huawei Cloud
随机推荐
C compiler - implicit function declaration
关于SqlSugar的多对多的级联插入的问题(无法获取集合属性的id,导致无法维护中间表)
C mqtt subscription message
Un voyage profond d'IA dans Huawei Cloud
手撕二叉搜索树(Binary Search Tree)
华为云的AI深潜之旅
Hook introduction
自注意力机制超级详解(Self-attention)
Soliciting articles and contributions - building a blog environment with a lightweight application server
Notice on organizing the second round of the Northwest Division (Shaanxi) of the 2021-2022 National Youth electronic information intelligent innovation competition
1284_ Implementation analysis of FreeRTOS task priority acquisition
Reflection - project management thinking of small and medium-sized companies - make the products first and the functions first
【LoRaWAN节点应用】安信可Ra-08/Ra-08H模组入网LoRaWAN网络的应用及功耗情况
Introduction to taro
aws iam内联策略示例
Excel中VLOOKUP函数简易使用——精确匹配或近似匹配数据
JS to implement a detailed scheme for lazy loading of pictures (it can be used after being imported)
常见的七大排序
Notes mosaïque
PostgreSQL installation: the database cluster initialization failed, stack hbuilder installation