当前位置:网站首页>Typical case of data annotation: how does jinglianwen technology help enterprises build data solutions
Typical case of data annotation: how does jinglianwen technology help enterprises build data solutions
2022-07-02 23:04:00 【Jinglianwen Technology】
As one of the three elements of the development of artificial intelligence , The role of data is crucial .
Jinglianwen technology provides professional data scheme design for enterprises , Help enterprises quickly build data solutions that match their algorithm models , Explore new business areas .
Jinglianwen technology is the largest in the Yangtze River Delta AI Basic data service providers , Self developed data annotation platform , Establish mature annotations 、 to examine 、 Quality inspection mechanism , Support computer vision 、 Speech Engineering 、 Multi type data annotation projects such as natural language processing .
With high-quality training data and hundreds of large-scale technology enterprises around the world 、 Scientific research institutions maintain in-depth cooperation .
User stories
1. Well known security manufacturers contraband 2d Split dimension
One 、 demand :
50 10000 images 33 Categories of contraband segmentation labels
Two 、 Project difficulties :
1) Large amount of annotation data , Tight schedule .
2) There are many prohibited varieties and labels , It is difficult to distinguish similar objects .
3、 ... and 、 Solution :
1) For the annotation rule system , Configure exclusive business 、 project manager 、 Annotator 、 Quality inspector 、 Technical personnel , Work overtime to train the team , Learning professional knowledge , Adjust the marking specification and repair data .
2) The pre annotation ability and automatic quality inspection function of jinglianwen annotation platform effectively improve the annotation efficiency and quality
3) Full quantity quality inspection and two rounds of random inspection, three times of data quality inspection, and then submit data in batches , Timely confirm with customers about the problems of submitted data feedback
Four 、 Delivery results :
Complete the delivery in full within the construction period , First pass rate 98%.
2. Government smart city video annotation project
One 、 demand :
2 Ten thousand vehicles are tracked 、1000+ Event type annotation
Two 、 Project difficulties :
1) Total video duration 4000 minute , There are many annotation scenes ( Including urban roads , Expressway , Urban viaduct, etc ), There are many types of events , Tight schedule .
2) Video frame extraction 、 Data cleaning is time-consuming .
3) There are many types of events displayed in the video data , The effect of ordinary algorithm auxiliary annotation is not obvious .
3、 ... and 、 Solution :
1) For the annotation rule system , Arrange business 、 project 、 mark 、 Quality testing 、 The technical personnel are familiar with the background and purpose of the project 、 The rules 、 matters needing attention 、 difficulty 、 Platform operation 、 Project requirement ( Accuracy rate 、 Daily output ) Training .
2) The continuous frame pre annotation function of Jinglian text annotation platform effectively improves the annotation efficiency and quality
3) Full quantity quality inspection and two rounds of random inspection, three times of data quality inspection, and then submit data in batches , Monitor progress in real time , Check while repairing , Solve problems at any time .
Four 、 Delivery results :
Complete the delivery in full within the construction period , First pass rate 98.5%.
Besides , In the field of computer vision , Jinglianwen technology in rectangular pull frame 、 Polygon annotation 、 Semantic segmentation 、 Instance segmentation 、 Face key points 、 Lane line 、 Video continuous frame annotation 、 I have rich annotation experience in emotional analysis and other projects .
- National Laboratory Voiceprint recognition voice mark
One 、 demand :
Multi person conversation voice in uncertain scenes 50000 Bar labels
Two 、 Project difficulties :
1) The marking requirements in the early stage of the project are not clear , Modify the annotation rules repeatedly .
2) Part of the audio has multi regional accents , The effect of algorithm aided annotation is not obvious , It is also difficult to identify manually .
3、 ... and 、 Solution :
1) Equipped with 3 Project manager and tagging team with more than years of experience in voice tagging project management .
2) Annotation rules for changes , Instantly feed back the problems encountered in the labeling process and provide multiple solutions , Plan to reserve repair time , Establish an early warning mechanism , Strictly control key time nodes .
3) The pre annotation and automatic quality inspection functions of jinglianwen annotation platform effectively improve the annotation efficiency and quality
Four 、 Delivery results :
Complete the delivery in full within the construction period , First pass rate 98%.
Besides , In the field of voice Engineering , Jinglianwen technology is in ASR Voice transcribe 、 Voice cleaning 、 Speech cutting 、 Emotional judgment 、 Phoneme labeling 、 Prosody tagging 、 I have rich annotation experience in machine translation and other projects .
- National Laboratory Text label
One 、 demand :
Text label 20 Ten thousand , Contains intention matching 、 General abbreviation 、 Integrated induction 、 Different rules such as label classification
Two 、 Project difficulties :
1) It is difficult to mark a wide range of contents , High requirements for the quality and ability of marking personnel .
2) There are many text types 、 Lots of scenes 、 Length , Lots of labels , Cumulative labels and options up to 530 More than a
3) Accuracy requirements 99%, Tight schedule .
3、 ... and 、 Solution :
1) Equipped with 3 In the above NLP Project managers and marking teams with project management experience .
2) Arrange to mark the team's understanding of the project background 、 Purpose 、 The rules 、 matters needing attention 、 difficulty 、 Platform operation 、 Project requirement ( Accuracy rate 、 Daily output ) Conduct training and assessment , Assessment and elimination 40%, The remaining 60% Personnel enter the formal task .
3) Analyze the project structure according to the project requirements , be based on WBS The principle decomposes the project into a tree view layer by layer according to its internal structure and the order of implementation process , Form a relatively independent 、 Project responsibilities of each unit of the project that is easy to manage and check 、 Progress, etc. shall be specifically implemented to each participant of the project , Ensure the marking quality .
Four 、 Delivery results :
Complete the delivery in full within the construction period , First pass rate 99%.
Besides , In the field of natural language processing , Jinglianwen technology is in ocr Transcribe 、 Statement generalization 、 Sentiment analysis 、 Named entity 、 Part of speech tagging 、 Slot extraction 、 Text matching 、 Intention matching 、 Text cleaning 、 I have rich annotation experience in information extraction and other projects .
边栏推荐
- STM32串口DAM接收253字节就死机原因排查
- Jerry's prototype has no touch, and the reinstallation becomes normal after dismantling [chapter]
- [chestnut sugar GIS] ArcScene - how to make elevation map with height
- Share 10 JS closure interview questions (diagrams), come in and see how many you can answer correctly
- 【板栗糖GIS】arcmap—如何批量修改注记要素的字体,颜色,大小等
- Data analysis learning records -- complete a simple one-way ANOVA with Excel
- Redis 过期策略+conf 记录
- [羊城杯2020]easyphp
- STM32之ADC
- Hanging mirror security won four global infosec awards on rsac2022
猜你喜欢
数据标注典型案例,景联文科技如何助力企业搭建数据方案
Splunk audit setting
[羊城杯2020]easyphp
China Academy of information technology, Tsinghua University, Tencent security, cloud native security, industry university research and use strong alliance!
P7072 [CSP-J2020] 直播获奖
Looking at Ctrip's toughness and vision from the Q1 financial report in 2022
地平线2022年4月最新方案介绍
海思3559万能平台搭建:在截获的YUV图像上旋转操作
力扣刷题(2022-6-28)
AES高級加密協議的動機闡述
随机推荐
pytorch训练CPU占用持续增长(bug)
antd组件upload上传xlsx文件,并读取文件内容
Value sequence < detailed explanation of daily question >
移动端 1px 像素兼容性问题,实现1px 边框
Qt QSplitter拆分器
Kubernetes uses the host name to allocate the pod on the specified node
P1007 single log bridge
Webrtc audio and video capture and playback examples and mediastream media stream analysis
Splunk audit setting
全面解析分享购商业模式逻辑?分享购是如何赋能企业
从2022年Q1财报看携程的韧性和远景
Freshman learning sharing
Introduction and response to high concurrency
编辑卡顿
MySQL reset password, forget password, reset root password, reset MySQL password
Go four singleton modes
【板栗糖GIS】arcscene—如何做出有高度的高程图
JS syntax ES6, ES7, es8, es9, ES10, es11, ES12 new features (Abstract)
P7072 [CSP-J2020] 直播获奖
Analyse des données dossiers d'apprentissage - - analyse simple de la variance à facteur unique avec Excel