Convolutional multi-grasp detection using grasp path for RGBD images
2022-07-03 05:16:00 【Qianyu QY】
Current planar grasp detection is mostly based on the rectangular grasp representation proposed by Jiang in 2014. Its biggest drawback is that the ground-truth grasps of an object cannot be annotated exhaustively (or, put differently, the existing datasets are not fully annotated), which leads to two problems:
(1) The neural network cannot be given complete ground truth;
(2) Validation is inaccurate: a predicted grasp that works in practice may fail the validation criterion (IoU > 0.25 and angle difference < 30°), and vice versa.
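For reference, the validation criterion in (2) can be sketched as follows. This is a minimal illustration, not the evaluation code of the paper or of any dataset toolkit: only the thresholds 0.25 and 30° come from the text above, while the grasp layout `(cx, cy, angle_deg, w, h)` and every function name are assumptions made for the example. The rotated-rectangle IoU is computed with Sutherland-Hodgman polygon clipping, chosen here only because it needs no third-party library.

```python
import math

def rect_corners(grasp):
    """Corners of a grasp (cx, cy, angle_deg, w, h), in counter-clockwise order."""
    cx, cy, angle_deg, w, h = grasp
    a = math.radians(angle_deg)
    c, s = math.cos(a), math.sin(a)
    local = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    return [(cx + x * c - y * s, cy + x * s + y * c) for x, y in local]

def polygon_area(pts):
    """Absolute polygon area via the shoelace formula."""
    total = 0.0
    for i in range(len(pts)):
        (x1, y1), (x2, y2) = pts[i], pts[(i + 1) % len(pts)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0

def clip_convex(subject, clipper):
    """Sutherland-Hodgman: clip `subject` by a convex counter-clockwise `clipper`."""
    out = list(subject)
    for i in range(len(clipper)):
        (ax, ay), (bx, by) = clipper[i], clipper[(i + 1) % len(clipper)]
        inp, out = out, []
        for j in range(len(inp)):
            p, q = inp[j], inp[(j + 1) % len(inp)]
            # Signed side of p and q relative to the clip edge a->b (>= 0 is inside).
            sp = (bx - ax) * (p[1] - ay) - (by - ay) * (p[0] - ax)
            sq = (bx - ax) * (q[1] - ay) - (by - ay) * (q[0] - ax)
            if sp >= 0:
                out.append(p)
            if sp * sq < 0:  # the edge p->q crosses the clip line
                t = sp / (sp - sq)
                out.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
    return out

def rect_iou(g1, g2):
    """Intersection over union of two rotated grasp rectangles."""
    p1, p2 = rect_corners(g1), rect_corners(g2)
    inter_pts = clip_convex(p1, p2)
    inter = polygon_area(inter_pts) if len(inter_pts) >= 3 else 0.0
    union = polygon_area(p1) + polygon_area(p2) - inter
    return inter / union if union > 0 else 0.0

def angle_diff(a_deg, b_deg):
    """Smallest difference between two grasp angles, which repeat every 180 degrees."""
    d = abs(a_deg - b_deg) % 180.0
    return min(d, 180.0 - d)

def is_correct(pred, truth, iou_thresh=0.25, angle_thresh=30.0):
    """Rectangle criterion: IoU above the threshold and angle difference below it."""
    return rect_iou(pred, truth) > iou_thresh and angle_diff(pred[2], truth[2]) < angle_thresh
```

Note that the check only compares a prediction against the annotated rectangles, so a workable grasp that happens to lie between two annotations can still be rejected, which is exactly the problem described in (2).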
A straightforward solution is to annotate the dataset at pixel level, so that the annotated grasps come as close to the true set as possible, but this is hard to do in practice. As an alternative, Chen et al. propose a simplified method, which is the paper covered in this post:
@article{***,
author = {Lu Chen and Panfeng Huang and Zhongjie Meng},
title = {Convolutional multi-grasp detection using grasp path for {RGBD} images},
journal = {Robotics Auton. Syst.},
volume = {113},
pages = {94--103},
year = {2019}
}
The paper has two parts: the introduction of the grasp path, and the grasp detection network. This post mainly covers the concept of the grasp path.
1. Grasp path
The grasp path is introduced as the ground truth for network training. It consists of curves on the image: every point on a curve can serve as a grasp point, and the normal direction at that point is taken as the grasp angle. The width and height of the grasp rectangle are taken from the predicted grasp (explained below).
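To make the definition concrete, here is a minimal sketch that samples grasp points along a path and assigns each one the normal direction as its grasp angle. It assumes the path is stored as a polyline, i.e. a list of `(x, y)` points, which the post does not specify. The function name `grasp_candidates` and the `step` parameter are hypothetical.

```python
import math

def grasp_candidates(path, step=1.0):
    """Sample (x, y, angle_deg) along a polyline path.

    The angle is the normal of the segment the sample lies on, reduced to
    [0, 180) because a grasp rotated by 180 degrees is the same grasp.
    """
    samples = []
    for (x1, y1), (x2, y2) in zip(path, path[1:]):
        length = math.hypot(x2 - x1, y2 - y1)
        if length == 0:
            continue  # skip degenerate segments, which have no direction
        tangent_deg = math.degrees(math.atan2(y2 - y1, x2 - x1))
        normal_deg = (tangent_deg + 90.0) % 180.0
        for k in range(int(length // step) + 1):
            t = min(k * step / length, 1.0)
            samples.append((x1 + t * (x2 - x1), y1 + t * (y2 - y1), normal_deg))
    return samples
```

For a horizontal path every sample gets a grasp angle of 90°, meaning the gripper closes across the path rather than along it.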
The grasp path is generated from the original annotated rectangles of the Cornell dataset by connecting the centers of any two overlapping rectangles. This works in most cases, as in the figure below (left). However, when three rectangles all overlap one another, redundant grasp paths are produced, and when the object has a complex shape, the generated path may defy common sense, as in the figure below (middle) and (right).
The authors propose a separate fix for each of these two cases. For the second case they even use a neural network.
An example of the final generated grasp paths is shown in the figure below.
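The basic construction, before the authors' fixes, can be sketched as below. This is an illustration under assumptions rather than the paper's implementation: each annotation is taken to be a list of four corner points, overlap is decided with a separating-axis test, and the names `overlaps` and `naive_grasp_path` are hypothetical. The fixes for redundant or implausible paths are not reproduced.

```python
from itertools import combinations

def center(rect):
    """Center of a rectangle given as four (x, y) corners."""
    return (sum(p[0] for p in rect) / 4.0, sum(p[1] for p in rect) / 4.0)

def overlaps(r1, r2):
    """Separating-axis test for two convex quadrilaterals.

    Rectangles that only touch along an edge are treated as not overlapping.
    """
    for poly in (r1, r2):
        for i in range(4):
            (x1, y1), (x2, y2) = poly[i], poly[(i + 1) % 4]
            nx, ny = y1 - y2, x2 - x1  # normal of the current edge
            proj1 = [nx * x + ny * y for x, y in r1]
            proj2 = [nx * x + ny * y for x, y in r2]
            if max(proj1) <= min(proj2) or max(proj2) <= min(proj1):
                return False  # found an axis that separates the two rectangles
    return True

def naive_grasp_path(rects):
    """Segments joining the centers of every pair of overlapping rectangles."""
    centers = [center(r) for r in rects]
    return [
        (centers[i], centers[j])
        for i, j in combinations(range(len(rects)), 2)
        if overlaps(rects[i], rects[j])
    ]
```

With three mutually overlapping rectangles this returns three segments, one per pair, which illustrates the redundancy described above: two segments would already connect all three centers.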
2. How the grasp path is used
The authors borrow the network architecture from an ICRA grasp detection paper published a few years earlier. It is not described in detail here; see the figure below.
To compute the network loss, each predicted grasp needs a corresponding ground-truth grasp, which is derived from the predicted grasp and the grasp path. First, drop a perpendicular from the center of the predicted grasp to the nearest grasp path. The foot of the perpendicular is the grasp point of the ground-truth grasp, and the angle of the perpendicular is its grasp angle. The width and height of the ground-truth grasp are set equal to those of the predicted grasp. The loss is then computed and back-propagated. The ground-truth grasps used in the validation phase are generated in the same way. The generation process is shown in the figure below (solid boxes are predicted grasps, dashed boxes are ground-truth grasps).
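The matching step described above can be sketched as follows, again assuming polyline paths and a `(cx, cy, angle_deg, w, h)` grasp layout, neither of which is stated in the post. The function names are hypothetical. One simplification is flagged in the comments: the angle is taken as the normal of the nearest segment, which coincides with the perpendicular's direction except when the closest point is a segment endpoint.

```python
import math

def project_to_segment(p, a, b):
    """Closest point to p on the segment from a to b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    t = ((px - ax) * dx + (py - ay) * dy) / (dx * dx + dy * dy)
    t = max(0.0, min(1.0, t))  # clamp so the point stays on the segment
    return (ax + t * dx, ay + t * dy)

def truth_grasp(pred, paths):
    """Derive a ground-truth grasp for a prediction (cx, cy, angle_deg, w, h).

    `paths` is a list of polylines, each a list of (x, y) points.
    """
    best = None
    for path in paths:
        for a, b in zip(path, path[1:]):
            if a == b:
                continue  # skip degenerate segments
            q = project_to_segment(pred[:2], a, b)
            dist = math.hypot(q[0] - pred[0], q[1] - pred[1])
            if best is None or dist < best[0]:
                best = (dist, q, a, b)
    if best is None:
        raise ValueError("no usable grasp path segment")
    _, (qx, qy), (ax, ay), (bx, by) = best
    # Assumption: use the normal of the nearest segment as the grasp angle.
    angle = (math.degrees(math.atan2(by - ay, bx - ax)) + 90.0) % 180.0
    # Width and height are copied from the prediction, as the text describes.
    return (qx, qy, angle, pred[3], pred[4])
```

Because the width and height are copied from the prediction, only the position and angle of the ground-truth grasp actually constrain the network.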
The results are as follows:
In another paper (Detecting Graspable Rectangles of Objects in Robotic Grasping), the authors state that the low accuracy is due to the simple network architecture; after switching to a better architecture, the accuracy reaches 94.6%.
3. Summary
I think the motivation of this paper is good: it improves training by generating finer-grained grasp ground truth. However, the following problems remain:
(1) The width and height of the ground-truth grasp box should not be taken from the predicted box, because the predicted values may not satisfy the requirements of a real grasp, for example by being too long or too short.
(2) The automatically generated grasp path is finer than the grasp rectangles, but it can be discontinuous and can even deviate considerably from the object's edges.
(3) Could another path, parallel to the grasp path and only one pixel away, also be taken as ground truth? In other words, the grasp ground truth should be a region rather than just a line.
Finally, a shameless plug for my own Bilibili account, where I occasionally upload robotics-related videos.