当前位置:网站首页>深度学习图像数据自动标注[通俗易懂]
深度学习图像数据自动标注[通俗易懂]
2022-07-02 13:39:00 【全栈程序员站长】
大家好,又见面了,我是你们的朋友全栈君。
Tensorflow和Caffe等深度学习中,监督学习的数据标注是一件非常繁琐和耗时的工作,目前大多数公司都采用外包给标注公司进行处理,或者购买现有的数据集,使得进行深度学习研究的成本异常高。本文介绍一种以人工智能解决数据标注的思路和方法。
一、思路
步骤:
1、以一个初步模型对小批量待标注数据进行检测,这里的初步模型可以是自己用少批量数据集训练出来的,也可以用网上公布的;
2、对检测出来的结果进行人为干预纠正;
3、把纠正后的数据训练新的模型;
4、用新模型对中等批量待测数据进行检测;
5、通过1~5步骤的循环迭代,可以逐步求精;
6、虽然也需要人工参与,但可以极大减少工作量。
实现方法:
1、Anno-Mage
Anno-Mage是一个半自动标注工具,通过一个通用模型对数据集进行检测。但这个工具能标注的物品类型有限,也没有模型迭代逐步求精的过程,可以自行对其源码进行修改优化。
github代码地址:https://github.com/virajmavani/semi-auto-image-annotation-tool
2、easyDL智能标注
2.1、智能标注
百度easyDL提供了智能标注的功能,跟以上思路差不多,都是先对小批量数据进行标注学习训练,然后以学习结果去标注剩下的数据集,然后人工纠正,迭代求精。
easyDL平台网址:https://ai.baidu.com/easydl/lite
智能检测技术文档:https://ai.baidu.com/ai-doc/EASYDL/lk38n327g
2.2、数据导出
但easyDL官方不提供数据导出功能和api,这阻碍了我们把数据拿到Tensorflow和Caffe进行训练。所以我们可以通过爬虫技术来爬取训练好的数据。
工具github地址:https://github.com/kooky126/easydl2labelImg
发布者:全栈程序员栈长,转载请注明出处:https://javaforall.cn/147820.html原文链接:https://javaforall.cn
边栏推荐
- Aujourd'hui dans l'histoire: Alipay lance le paiement par code à barres; La naissance du père du système de partage du temps; La première publicité télévisée au monde...
- 触发器:Mysql实现一张表添加或删除一条数据,另一张表同时添加
- Kubernetes three open interfaces first sight
- 大廠面試總結大全
- Machine learning perceptron model
- Classic quotations
- How to use stustr function in Oracle view
- 月报总结|Moonbeam6月份大事一览
- John blasting appears using default input encoding: UTF-8 loaded 1 password hash (bcrypt [blowfish 32/64 x3])
- What is the difference between self attention mechanism and fully connected graph convolution network (GCN)?
猜你喜欢

Does bone conduction earphone have external sound? Advantages of bone conduction earphones

Typescript array out of order output

HMS core machine learning service helps zaful users to shop conveniently

sql解决连续登录问题变形-节假日过滤
![[North Asia data recovery] data recovery case of raid crash caused by hard disk disconnection during data synchronization of hot spare disk of RAID5 disk array](/img/51/f9c1eed37794db8c8d0eefd60b9e3d.jpg)
[North Asia data recovery] data recovery case of raid crash caused by hard disk disconnection during data synchronization of hot spare disk of RAID5 disk array

TypeScript数组乱序输出

LeetCode 1. 两数之和

隐私计算技术创新及产业实践研讨会:学习

Analyzing more than 7million R & D needs, it is found that these eight programming languages are the most needed in the industry!

PCL point cloud image transformation
随机推荐
自注意力机制和全连接的图卷积网络(GCN)有什么区别联系?
LeetCode 5. 最长回文子串
PCL 最小中值平方法拟合平面
john爆破出現Using default input encoding: UTF-8 Loaded 1 password hash (bcrypt [Blowfish 32/64 X3])
串口控制舵机转动
Kubernetes three open interfaces first sight
Global and Chinese market of desktop hot melt equipment 2022-2028: Research Report on technology, participants, trends, market size and share
Yyds dry goods inventory # look up at the sky | talk about the way and principle of capturing packets on the mobile terminal and how to prevent mitm
Take you ten days to easily complete the go micro service series (I)
Analyzing more than 7million R & D needs, it is found that these eight programming languages are the most needed in the industry!
Classifier visual interpretation stylex: Google, MIT, etc. have found the key attributes that affect image classification
Unity uses ugui to set a simple multi-level horizontal drop-down menu (no code required)
Rock PI Development Notes (II): start with rock PI 4B plus (based on Ruixing micro rk3399) board and make system operation
忆当年高考|成为程序员的你,后悔了吗?
Hard core! One configuration center for 8 classes!
Which software is good for machine vision?
[fluent] dart data type boolean type (boolean type definition | logical operation)
Global and Chinese markets of stainless steel surgical suture 2022-2028: Research Report on technology, participants, trends, market size and share
How to solve the failure of printer driver installation of computer equipment
Some problems about MySQL installation