当前位置:网站首页>Popular science of data annotation: ten common image annotation methods
Popular science of data annotation: ten common image annotation methods
2022-06-24 11:44:00 【Lift up】
The rapid development of computer vision is inseparable from the support of a large number of image annotation data , With all kinds of image detection 、 Commercialization of recognition algorithm , The market is becoming more and more strict about the accuracy of image annotation , At the same time, for different application scenarios , Different image annotation methods are also derived .
1、 Semantic segmentation
Semantic segmentation is based on the attributes of objects , Area division of complex and irregular pictures , And mark the corresponding attributes , To help train the image recognition model , It is often used for automatic driving 、 human-computer interaction 、 Virtual reality and other fields .
2、 Rectangular box dimension
Rectangular box labeling is also called pull box labeling , It is the most widely used image annotation method at present , In a relatively simple way 、 Convenient way in image or video data , Quickly frame the specified target object .
3、 Polygon annotation
Polygonal annotation refers to in still pictures , Use polygon boxes , Mark irregular target objects , Dimension relative to rectangular box , Polygon annotation can frame the target more accurately , At the same time, for irregular objects , It is also more targeted .
4、 Key point marking
Key point annotation refers to the manual way , Mark the key points at the specified position , For example, facial feature points 、 Human bone connection points, etc , It is often used to train facial recognition models and statistical models .
5、 Point cloud annotation
Point cloud is an important expression of 3D data , Through sensors such as lidar , It can collect all kinds of obstacles and their position coordinates , The annotator needs to classify these dense point clouds , And mark different attributes , It is often used in the field of automatic driving .
6、3D Cube dimension
Different from point cloud annotation ,3D Cube annotation or annotation based on two-dimensional plane image , The announcer frames the edges of three-dimensional objects , And then get the vanishing point , Measure the relative distance between objects .
7、2D/3D Fusion annotation
2D/3D Fusion annotation refers to the simultaneous annotation of 2D and 3D Label the image data collected by the sensor , And build relationships . This method can mark the position and size of the object in plane and three-dimensional , Help the autopilot model enhance vision and radar perception .
8、 Target tracking
Target tracking refers to moving images , Frame extraction and annotation , Mark the target object in each frame , Then describe their trajectory , Such annotations are often used to train automatic driving models and video recognition models .
9、OCR Transcribe
OCR Transcribe is to mark and transcribe the text content in the image , Help train and improve the image and text recognition model . at present , Jing Lianwen supports simplified Chinese 、 Traditional Chinese 、 English 、 Japanese 、 Korean 、 French 、 German 、 Spanish 、 Transfer of printed or handwritten pictures in more than ten languages such as Arabic .
10、 Attribute discrimination
Attribute discrimination refers to the way of manual or machine cooperation , Identify the target object in the image , And mark it with the corresponding attribute .
边栏推荐
猜你喜欢

Qt: judge whether the string is in numeric format

PHP短信通知+语音播报自动双呼

万名校园开发者花式玩AI,亮点看这张图就够啦!

软件测试 对前一日函数的基本路径测试

Tools and methods - use code formatting tools in source insight

齐次坐标的理解

Programmers spend most of their time not writing code, but...

math_ Summation and derivation of proportional series & derivation of sum and difference of equal powers / difference between two nth power numbers/
![[graduation season · attacking technology Er] three turns around the tree, what branch can we rely on?](/img/0a/0ebfa1e5c1bea6033b538528242252.png)
[graduation season · attacking technology Er] three turns around the tree, what branch can we rely on?

TP-LINK 1208路由器教程(2)
随机推荐
Jenkins performance test
GLOG from getting started to getting started
Libuv的安装及运行使用
美团基于 Flink 的实时数仓平台建设新进展
Fizz gateway secondary development integration tutorial
RPM installation percona5.7.34
11+! 结肠癌中基于 m6A 调节因子的甲基化修饰模式以不同的肿瘤微环境免疫谱为特征
u盘安装kali并且持久化
Realization of alarm clock with AHK
Why does the virtual machine Ping the host but not the virtual machine
GLOG从入门到入门
Tools and methods - use code formatting tools in source insight
Linker --- linker
How to develop hospital information system (his) with SMS notification and voice function
深圳市人民医院程立新课题组提出多组学数据在肝细胞癌的诊断与预后分析的新方法meGPS
11+文章-机器学习打造ProTICS框架-深度揭示了不同分子亚型中肿瘤浸润免疫细胞对预后的影响
What code did the full stack programmer write this month?
Install wpr Exe command
Internship experience sharing in ByteDance 𞓜 ten thousand word job guide
集群控制管理