当前位置:网站首页>Jinglianwen technology provides voice data acquisition and labeling services
Jinglianwen technology provides voice data acquisition and labeling services
2022-06-13 06:40:00 【Jinglianwen Technology】
What is voice tagging ?
Voice annotation is a common annotation type in the data annotation industry , The annotator marks and transcribes the voice information continuously , Let the manual system learn further , The marked data is mainly used for artificial intelligence machine learning , It is equivalent to installing the computer system “ Ears ”, Make it have “ Can hear ” The function of , So that the computer can have accurate speech recognition ability .
Voice annotation mainly includes ASR Voice transcribe 、 Speech cutting 、 Voice cleaning 、 Cleaning judgment 、 Voiceprint recognition 、 Phoneme labeling 、 Prosody tagging 、 Pronunciation proofreading these eight common ways of marking .
Voice tagging is closely related to artificial intelligence , At present , Speech recognition technology has been popularized in many aspects of daily life , Such as voice assistant 、 Intelligent speakers 、 Intelligent customer service, etc . With the development of artificial intelligence , The human-computer voice interaction scene will extend in more directions , In recognition accuracy 、 Scene optimization 、 It puts forward higher requirements for speech recognition technology .

AI The importance of data
In recent years, , Artificial intelligence continues to develop , The tool chain that enables AI is not perfect . Data is one of the core elements of AI iterative innovation , Optimize training data AI The model is an important way to further improve the accuracy . For advancement AI Apply high-quality landing , Basic data service providers of artificial intelligence need to collect data 、 cleaning 、 Information extraction 、 mark 、 Quality testing 、 Management and other links are more finely controlled , To provide higher quality data .
Jinglianwen technology provides data support for voice annotation
Jinglianwen technology is the largest enterprise in the Yangtze River Delta AI One of the basic data service providers , The existing database has a voice dataset 100T, Language readings covering tens of thousands of hours have been collected 、 Natural language conversation voice data , It can quickly provide data sets that meet the requirements . for example 《50800 Data set of recording and acquisition in depot 》、《60000 Segment Chinese voice data set 》、《100 individual id12000 A data set of Chinese reading English wake-up words 》、《21000 paragraph ASR Voice transcribe audio training set 》、《13000 Segment speech cutting audio training set 》 And other data sets that can be used to study the algorithms of speech recognition technology , It can effectively improve the test efficiency .
Jinglianwen technology has built a national 27 Provinces, cities and municipalities directly under the central government are all over the world 52 Data collection resource networks in countries , Rich in dialects , Collection channels for small languages 、 Scene building ability , Special scene data acquisition capability , Support speech recognition ASR collection 、 speech synthesis TTS collection 、 Wake up word collection 、 Multiplayer conversation collection 、 Vehicle voice acquisition 、 Mandarin collection 、 Dialect collection 、 English collection 、 Collection of small languages 、 Near and far field acquisition 、 voice VAD Collection, etc . It can be designed according to the scheme , For target areas 、 Collect the specific data of the scene .
Jinglianwen technology has successively established Hangzhou data headquarters , wuhan 、 jinhua 、 Data processing divisions in different provinces and cities such as Hengyang , Adopt amoeba internal competition management mode , Cultivate the 930 A full-time team of people , Research and develop jinglianwen technology data annotation platform , Support ASR Voice transcribe 、 Speech cutting 、 Voice cleaning 、 Emotional judgment 、 Voiceprint recognition 、 Phoneme labeling 、 Prosody tagging 、 Pronunciation proofreading , Meet the data annotation requirements of the diversity and richness of artificial intelligence .

边栏推荐
- Solution: vscode open file will always overwrite the last opened label
- Subtotal of constraintlayout
- Use of smalidea
- Kotlin base generics
- Failed to extract manifest from apk: processexception:%1 is not a valid Win32 Application.
- 景联文科技:数据采集标注行业现状及解决方案
- Jetpack - basic use of room
- BlockingQueue source code
- [kernel] two methods of driver compilation: compiling into modules and compiling into the kernel (using miscellaneous device driver templates)
- Detailed explanation of the player network data reading process of ijkplayer code walkthrough 2
猜你喜欢

【新手上路常见问答】关于技术管理

Failed to extract manifest from apk: processexception:%1 is not a valid Win32 Application.

The web server failed to start Port 7001 was already in use
![[FAQs for novices on the road] about technology management](/img/6f/cb2152d5ddb4714e1b249e50096947.jpg)
[FAQs for novices on the road] about technology management

JS case Xiaomi second kill countdown New Year Countdown

RN Metro packaging process and sentry code monitoring

Two uses of bottomsheetbehavior

Glide usage notes

十五、IO流(一)

电镀挂具RFID工序管理解决方案
随机推荐
Scrcpy source code walk 2 how to connect a client to a mobile server
ADB shell CMD overlay debugging command facilitates viewing system framework character resource values
【新手上路常见问答】关于技术管理
Construction and verification of Alibaba cloud server webrtc system
Select all select none JS code implementation
The web server failed to start Port 7001 was already in use
[FAQs for novices on the road] about technology management
Kotlin basic string operation, numeric type conversion and standard library functions
景联文科技提供一站式智能家居数据采集标注解决方案
MFS详解(六)——MFS Chunk Server服务器安装与配置
Array operations in JS
欧姆龙平替国产大货—JY-V640半导体晶元盒读写器
IIS batch bind domain name
Ijkplayer compilation process record
Subtotal of constraintlayout
Unable to find method 'org gradle. api. artifacts. result. ComponentSelectionReason. getDesc
JS case Xiaomi second kill countdown New Year Countdown
《MATLAB 神经网络43个案例分析》:第11章 连续Hopfield神经网络的优化——旅行商问题优化计算
时间格式化工具----moment.js(网页时间实时展示)
App performance test: (III) traffic monitoring