Huawei Machine Learning Service speech recognition brings apps to life with "sound" and color
2022-06-29 13:29:00 【HMS Core】
As people pursue a greater sense of ritual in daily life, mobile devices, wearables, smart home devices, and in-car infotainment systems are becoming more and more common. On these devices a mouse and keyboard are no longer convenient, and speech is the most natural way for people to communicate, so speech recognition has become a standard feature of major apps. It is used in a wide range of scenarios, such as voice input methods, voice search, real-time subtitles, games and entertainment, social chat, human-computer interaction, and driving mode. Integrating speech recognition into an app therefore not only frees the user's hands but also delivers a better human-computer interaction experience.
1. Service overview
The real-time speech recognition service of HMS Core Machine Learning Service converts short speech input (no longer than 60 seconds) into text in real time. The service uses industry-leading deep learning technology, and with continuous iteration of its algorithms and data, recognition accuracy in a typical ideal environment can now reach 95% or higher. It currently supports Mandarin Chinese (including mixed Chinese and English), English, French, German, Spanish, Italian, Arabic, Russian, Thai, Malay, and Filipino.
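Because the service accepts at most 60 seconds of speech per request, an app may want to check the length of a recording before submitting it. The following is a minimal sketch of that check, not part of the HMS Core SDK. It assumes raw 16-bit mono PCM at 16 kHz, which is an assumption about the audio format rather than something stated above, and the function names are hypothetical.

```python
# Hypothetical helper: checks whether a raw PCM buffer fits the 60-second limit.
# Assumption: 16-bit samples (2 bytes each); sample rate and channel count
# default to 16 kHz mono but can be overridden.

BYTES_PER_SAMPLE = 2   # assumes 16-bit PCM
MAX_SECONDS = 60       # limit stated for the real-time recognition service


def pcm_duration_seconds(num_bytes: int, sample_rate_hz: int = 16000,
                         channels: int = 1) -> float:
    """Duration in seconds of a raw PCM buffer of the given size."""
    bytes_per_second = BYTES_PER_SAMPLE * channels * sample_rate_hz
    return num_bytes / bytes_per_second


def fits_recognition_limit(num_bytes: int, sample_rate_hz: int = 16000,
                           channels: int = 1) -> bool:
    """True if the buffer is no longer than the 60-second limit."""
    return pcm_duration_seconds(num_bytes, sample_rate_hz, channels) <= MAX_SECONDS
```

For example, under these assumptions one second of audio occupies 32,000 bytes, so a buffer of 1,920,000 bytes is exactly at the limit.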

Demo

2. Use cases
The real-time speech recognition service of HMS Core Machine Learning Service covers many areas of daily life and work. Its recognition has been deeply optimized for scenarios such as shopping search, movie search, music search, and navigation, which further improves accuracy in those scenarios. When a user searches for a product in a shopping app, the spoken product name or feature is recognized as text and used to find the target product. Similarly, in a music app, a spoken song title or artist name is recognized as text and used to search for the song. And when it is inconvenient for a driver to type while driving, spoken input can be converted to text and used to search for a destination, which makes driving safer.
3. Features
• Outputs recognized text in real time as the user speaks
• Offers two integration modes: with the speech pickup UI or without it
• Supports endpoint detection, accurately locating where speech starts and ends
• Supports silence detection: no voice packets are sent for the silent parts of the audio
• Supports intelligent number formatting: for example, a spoken year is output in digits, as in "2020".
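To make endpoint detection and silence detection more concrete, here is a minimal sketch based on a simple per-frame energy threshold. This is a textbook-style illustration, not the algorithm the service actually uses (which is not described here). The function names, frame length, and threshold are all assumptions chosen for the example.

```python
from typing import List, Optional, Tuple


def frame_energies(samples: List[int], frame_len: int) -> List[float]:
    """Mean absolute amplitude of each full frame of samples."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(abs(s) for s in frame) / frame_len)
    return energies


def detect_endpoints(samples: List[int], frame_len: int = 160,
                     threshold: float = 500.0) -> Optional[Tuple[int, int]]:
    """Return (start, end) sample indices of the voiced region.

    Returns None if every frame is below the threshold (all silence).
    """
    energies = frame_energies(samples, frame_len)
    voiced = [i for i, e in enumerate(energies) if e >= threshold]
    if not voiced:
        return None
    return voiced[0] * frame_len, (voiced[-1] + 1) * frame_len


def frames_to_send(samples: List[int], frame_len: int = 160,
                   threshold: float = 500.0) -> List[List[int]]:
    """Keep only frames loud enough to count as speech; drop silent frames."""
    energies = frame_energies(samples, frame_len)
    return [samples[i * frame_len:(i + 1) * frame_len]
            for i, e in enumerate(energies) if e >= threshold]


# Synthetic signal: 2 silent frames, 3 loud frames, 1 silent frame.
signal = [0] * 320 + [1000, -1000] * 240 + [0] * 160
```

With this synthetic signal, `detect_endpoints(signal)` locates the voiced region between samples 320 and 800, and `frames_to_send(signal)` keeps only the three loud frames. A production detector would typically add smoothing and adapt the threshold to background noise.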
How do I integrate Huawei Machine Learning Service?
The official Huawei Machine Learning Service website provides detailed documentation and guidance.
Learn more >>
Visit the official Huawei Developer Alliance website
Get the development guide
Huawei Mobile Services open source repositories: GitHub, Gitee
Follow us to be the first to get the latest HMS Core technical news.