当前位置:网站首页>Classification and discussion of plane grab detection methods based on learning
Classification and discussion of plane grab detection methods based on learning
2022-07-03 05:16:00 【Qianyu QY】
The task of plane grab detection is , Input perceptual data , Output crawl configuration . So far , There are mainly two kinds of learning based plane grab detection methods :
(1) A one-stage end-to-end learning approach .
(2) Two stage learning method .
1、 One stage learning
In this kind of method , Directly learn the mapping function from input data to crawl configuration , Neural networks . The input is usually RGB Image or depth image , Output is Crawl the set of configurations , Then select the optimal crawl configuration according to the confidence . Currently based on Rectangle grabbing indicates The main method is , Such as
(1)Fully Convolutional Grasp Detection Network with Oriented Anchor Box

(2)Real-world Multi-object, Multi-grasp Detection

(3)Densely Supervised Grasp Detector (DSGD)

2、 Two stage learning
It includes two stages : Grab configuration sampling , Crawl configuration evaluation . In the first stage, multiple crawl configurations are sampled , The second stage evaluates the quality of each candidate crawl , Then choose the best grab . At present, the best is Dex-Net 4.0:
(1)Dex-Net 4.0: Learning ambidextrous robot grasping policies

3、 Discuss
One stage learning Of GroundTruth It is all the feasible grab configurations on the object , Annotation grab configuration can be manually annotated 、 Robot actual trial and error annotation or automatic generation of virtual environment , There are several problems :
(1) There is no unified standard for manual annotation , There is an error .
(2) Robot tagging is too time-consuming ,2016 year ICRA Of Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours It took a lot of time , The final result is unsatisfactory .
(3) No matter which annotation method , It is impossible to mark all the feasible grasping methods of the object in detail , It means , Neural networks do not have the best learning target.
(4) Multi object stacking scenes are difficult to label , As a result, the current plane grab data sets are almost all single objects (cornell、Jacquard etc. , Now there should be a synthetic multi-object scene data set , Not paying much attention ), Make the learning network in bin-picking Poor performance in the task .
Two stage learning Of GroundTruth Is the quality of the grab sample , namely 0 or 1. Datasets can automatically generate datasets in a virtual environment , Study of the target And the best , But synthetic image Dex-Net4.0 That kind of millions of samples is not what ordinary laboratories can do . For all that , It can be seen that the two-stage learning method is better than the one-stage learning method , More worthy of study . The next research direction is how to perform crawl detection for small data sets .
边栏推荐
- 酒店公共广播背景音乐-基于互联网+的酒店IP网络广播系统设计
- 聊聊如何利用p6spy进行sql监控
- Learn libcef together -- set cookies for your browser
- How to connect the network: Chapter 2 (Part 1): a life cycle of TCP connection | CSDN creation punch in
- appium1.22.x 版本後的 appium inspector 需單獨安裝
- [basic grammar] C language uses for loop to print Pentagram
- Go practice -- gorilla/rpc (gorilla/rpc/json) used by gorilla web Toolkit
- Source insight garbled code solution
- Common interview questions of microservice
- Interview question -- output the same characters in two character arrays
猜你喜欢

SSM framework integration

Go practice -- gorilla / websocket used by gorilla web Toolkit

(subplots usage) Matplotlib how to draw multiple subgraphs (axis field)

音频焦点系列:手写一个demo理解音频焦点与AudioMananger

Web APIs exclusivity

BTC-密码学原理

Go practice -- design patterns in golang's singleton

Differences among bio, NiO and AIO

Yolov5 model construction source code details | CSDN creation punch in

JS scope
随机推荐
Gbase8s composite index (I)
[backtrader source code analysis 4] use Python to rewrite the first function of backtrader: time2num, which improves the efficiency by 2.2 times
Pessimistic lock and optimistic lock of multithreading
2022-02-12 daily clock in: problem fine brush
1106 lowest price in supply chain (25 points)
1115 counting nodes in a BST (30 points)
appium1.22. Appium inspector after X version needs to be installed separately
Go practice -- use redis in golang (redis and go redis / redis)
[backtrader source code analysis 5] rewrite several time number conversion functions in utils with Python
appium1.22.x 版本後的 appium inspector 需單獨安裝
1107 social clusters (30 points)
Redis 击穿穿透雪崩
Webrtc protocol introduction -- an article to understand ice, stun, NAT, turn
"Pthread.h" not found problem encountered in compiling GCC
1087 all roads lead to Rome (30 points)
Differences among bio, NiO and AIO
[research materials] annual report of China's pension market in 2021 - Download attached
[basic grammar] C language uses for loop to print Pentagram
Congratulations to musk and NADELLA on their election as academicians of the American Academy of engineering, and Zhang Hongjiang and Fang daining on their election as foreign academicians
My first Smartphone