当前位置:网站首页>Progressive multi grasp detection using grasp path for rgbd images
Progressive multi grasp detection using grasp path for rgbd images
2022-07-03 05:16:00 【Qianyu QY】
The current plane grab detection is mainly based on Jiang stay 2014 Rectangular grab representation proposed in , Its biggest disadvantage is , It is impossible to mark the grab truth value of the object in detail ( Or the existing data sets are not fully annotated ), Lead to two problems :
(1) Cannot provide perfect truth value for neural network ;
(2) Unable to accurately verify , Because a practical predictive crawl may not meet the verification criteria , namely IOU>0.25 & angle<30, vice versa .
An easy solution is , Annotate the data set at pixel level , Make the grab of the annotation as close to the true value as possible , But this plan is difficult to achieve , Alternative ,Chen A simplified method is proposed , That is, the paper introduced in this paper :
@article{***,
author = {Lu Chen and Panfeng Huang and Zhongjie Meng},
title = {Convolutional multi-grasp detection using grasp path for {RGBD} images},
journal = {Robotics Auton. Syst.},
volume = {113},
pages = {94--103},
year = {2019}
}The paper is divided into two parts : Introduce the crawl path , Crawl detection network . Here we mainly introduce the concept of crawl path .
One 、 Grab path
Crawl path is introduced as the truth value of network learning , It's some curves on the image , Every point on the curve can be used as a grab point , The normal direction at the grab point is taken as the grab angle , The width and height of the grab frame are given by the predicted grab ( Let's introduce ).
The crawl path is determined by Cornell The data set is generated by the original annotation rectangle , The specific way is : Connect the midpoint of any two overlapping rectangular boxes . This operation is applicable to most situations , Here's the picture ( Left ), But when all three rectangles have overlapping areas , Redundant crawl paths will be generated , When the shape of an object is complex , The generated crawl path may violate common sense , Here's the picture ( in ) and ( Right ):

In order to cope with the above two situations , The author puts forward solutions respectively , among , In response to the second situation , Even using neural networks .
An example of the finally generated crawl path is shown in the following figure :

Two 、 How to use crawl path
The author draws lessons from a few years ago ICRA A paper on the network structure for crawl detection , It's not detailed here , Here's the picture :

In order to calculate the loss of the network , You need to know the truth value capture corresponding to the predicted capture , This truth grab is obtained by predicting the grab and the grab path . First, make a vertical line from the midpoint of the predicted grasp to the nearest grasp path , The intersection is the grab point of truth value grab , The angle of the vertical line is the grasping angle of the truth value grasp , The height and width of the truth grab are equal to the width and height of the predicted grab , Then calculate the loss and back-propagation . Again , The truth fetching used for validation in the validation phase is the same as above . The generation process of truth grab is shown in the following figure ( The solid line box is predictive crawl , Dashed boxes are truth grabs ):

give the result as follows :

The author is in another article (Detecting Graspable Rectangles of Objects in Robotic Grasping) In the said , The low accuracy is due to the simple network structure , After changing to a better network structure , Accuracy of 94.6%.
3、 ... and 、 summary
I think the starting point of this article is good , Optimize the training process by generating a more detailed grab truth , But there are still the following problems :
(1) The width and height of the grab truth box should not be given by the predicted grab box , Because the predicted value may not meet the capture requirements , Such as too long or too short .
(2) The automatically generated crawl path is more detailed than the crawl box , But there are intermittent situations , It is even quite different from the edge of the object .
(3) Can another crawl path parallel to the crawl path and only one pixel away be taken as the true value , That is, capturing the truth value is not just a line , It should be an area .
Finally, shamelessly recommend your own B Station No , Occasionally upload robot Related videos

边栏推荐
- 最大连续子段和(动态规划,递归,递推)
- 1103 integer factorization (30 points)
- "250000 a year is just the price of cabbage" has become a thing of the past. The annual salary of AI posts has decreased by 8.9%, and the latest salary report has been released
- Make your own dataset
- Pessimistic lock and optimistic lock of multithreading
- [backtrader source code analysis 4] use Python to rewrite the first function of backtrader: time2num, which improves the efficiency by 2.2 times
- Yolov5 input (II) | CSDN creative punch in
- Botu uses peek and poke for IO mapping
- C language program ideas and several commonly used filters
- My first Smartphone
猜你喜欢

How to connect the network: Chapter 2 (Part 1): a life cycle of TCP connection | CSDN creation punch in

酒店公共广播背景音乐-基于互联网+的酒店IP网络广播系统设计

Congratulations to musk and NADELLA on their election as academicians of the American Academy of engineering, and Zhang Hongjiang and Fang daining on their election as foreign academicians
![[basic grammar] C language uses for loop to print Pentagram](/img/9e/021c6c0e748e0981d4233f74c83e76.jpg)
[basic grammar] C language uses for loop to print Pentagram

Yolov5 network structure + code + application details | CSDN creation punch in

Go practice -- design patterns in golang's singleton

Oracle SQL table data loss

appium1.22. Appium inspector after X version needs to be installed separately

微服务常见面试题

Introduction to deep learning (II) -- univariate linear regression
随机推荐
The consumption of Internet of things users is only 76 cents, and the price has become the biggest obstacle to the promotion of 5g industrial interconnection
Overview of basic knowledge of C language
动态规划——相关概念,(数塔问题)
appium1.22. Appium inspector after X version needs to be installed separately
Automatic voltage rise and fall 5-40v multi string super capacitor charging chip and solution
College campus IP network broadcasting - manufacturer's design guide for college campus IP broadcasting scheme based on campus LAN
【实战项目】自主web服务器
【批处理DOS-CMD命令-汇总和小结】-CMD窗口的设置与操作命令-关闭cmd窗口、退出cmd环境(exit、exit /b、goto :eof)
[research materials] annual report of China's pension market in 2021 - Download attached
Go practice -- generate and read QR codes in golang (skip2 / go QRcode and boombuilder / barcode)
1103 integer factorization (30 points)
Notes | numpy-08 Advanced index
About debugging the assignment of pagenum and PageSize of the formal parameter pageweb < T > (i.e. page encapsulation generic) in the controller
Deep embedding and alignment of Google | protein sequences
Webrtc M96 release notes (SDP abolishes Plan B and supports opus red redundant coding)
Use posture of sudo right raising vulnerability in actual combat (cve-2021-3156)
Ueditor, FCKeditor, kindeditor editor vulnerability
Why is go language particularly popular in China
Yolov5 network structure + code + application details | CSDN creation punch in
Webrtc protocol introduction -- an article to understand ice, stun, NAT, turn