当前位置:网站首页>Analysis of [paper] pointlanenet papers
Analysis of [paper] pointlanenet papers
2022-07-27 03:08:00 【It's Anlan】
Catalog
Structured representation of lane lines
Preface
PointLaneNet yes 2019 IEEE Intelligent Vehicles Symposium (IV) A paper on , Determine the location of the lane line by detecting the points on the lane line , And classify the lane lines , Is subject to YOLO Series inspired ont-stage Network structure .
Title:《PointLaneNet: Efficient end-to-end CNNs for Accurate Real-Time Lane Detection》
version 1.0
abstract
An end-to-end lane detection algorithm is proposed , The coordinates of lane line points can be obtained directly from the driving scene .
method description
Inspired by YOLO Series algorithm , The lane detection algorithm can simultaneously predict the location and classify the lane lines in a single network , In order to facilitate network training , The concept of lane line distance is also clarified .
Structured representation of lane lines
The article points out that : Generally, straight lines or polynomial curves are used to structurally represent lane lines , In fact, in reality , This is not advisable , Even the curve of cubic polynomial cannot completely represent the complex lane line , But a series of precise points can represent the lane line , These points can be obtained by interpolation , As long as you know the starting point in the lane line , Termination point , In the middle , The four parameters of lane line width , It can show the shape of the lane line . The width of the lane line is known ( According to the regulations ), The end point of the lane line is at the bottom of the picture ( This article defaults to ).
There is a unique intersection between the horizontal and equally spaced straight lines and each lane line , Now you only need to find the starting point of the lane line and the point of the lane line .Fig2 Is the representation of structured lane lines .
y It's pre-set ( Horizontal line ), So if you want to determine the coordinates of each point, you only need to determine the offset of that point , therefore , The predicted value of each lane line is : Offset , The starting point , Confidence of lane line . The following figure is the definition of points on the lane line :
Distance
Define the distance between lane lines ( The paper says : The definition of the distance between lane lines is very important for training . In a general way ,point-point Distance is Euclidean distance ,box-box yes IOU Definition , The distance between lane lines is not clearly defined ).
Yes point-line Distance is defined :
Euclidean distance .
Yes line-line Distance definition formula :
The distance between two lines is defined as the average of the distance between overlapping points . For non overlapping parts , The distance defined is infinite ( infinity ).
detection pipeline
Use one-stage Methods to detect the lane line , The model detects the location of points and classifies lane lines .
Use GoogleNet As feature extraction network , Conduct 8 Next sampling , The pictures are divided into w x h Of grid
Location detection
Use 2 Time 1 x1 Convolution of , take feature map The output dimension of is converted to n+1,n yes n Offset , another 1 Dimension is the starting position of the lane line .
Classification
Use 2 Time 1 x1 Convolution of , Convert the output channel to 2, To indicate whether the lane line passes through this grid, By modifying the channel , This can be used to detect the color or type of lane line , Because of the dataset used in this article , Here is just to detect whether the lane line exists or not . The final output is :w * h * (n+1+2)
post-processing:
Because a lane passes through many grid, Each lane line has many prediction results , So it needs post-processing .
Specific steps :
Initialization distance , The threshold for \delta = 10 pixels,confidence The threshold of is cls = 0.99;
Delete all cls < 0.99 The lane line of the road ;
Yes confidence Sort , selection confidence The tallest one , If confidence The highest one distance < \delta, Delete this lane line , Keep looking for the second highest confidence;
If lane number More than the 5, take \delta = \delta+10, Re step 3; Or it's over ;
notes : This data set is set with no more than five lanes ;
Be careful : Formula is Latex,\delta.
Model Analysis
Model evaluation index :
Yes Mem、 Amount of computation FLOPS And the computational strength of the model I Quantitative analysis . among , The computational strength of the model shows Floating point operands required for memory exchange per byte during calculation , Unit is Flops/Byte. The relationship between the three variables :
I = Flops/Mem
model Parameter comparison
contrast SCNN,LaneNet.( The article here uses three lightweight feature networks as a comparison ,GoogLeNet, SqueezeNet, and MobileNet)

Training
data: The operation of data enhancement ;
Matching strategy : To balance positive and negative samples , In training set , Set the grid center point within a certain distance threshold from the true lane line as a positive sample ; Otherwise, set as negative sample .threshold = 20 pixels.
Loss: Three kinds of predicted values output by the model , Confidence of lane line confidence, Offset of lane line point \triangle x(x-coordinate), Starting point of lane line y(y-coordinate), Correspondingly ,loss Function is also composed of these three parts , Of confidence loss Using cross entropy , The loss function of lane line points uses Euclidean distance .loss function As shown in the figure below :

Choosing scales: Determine the sampling magnification through experiments , This paper chooses 16 Double down sampling , Although the information of lane line may be lost after down sampling , However, the location of the lane line can still be known through the lane structure or environmental information .
Hard negative mining: Sort the negative samples according to the confidence , Select the sample with high score , And adjust the ratio of positive samples to negative samples to 1:3, To strengthen the network's ability to identify false positives .( I don't understand here , Leave a hole )
experiments
Used Tusimple Data set and CuLane Data sets .
The evaluation index :accuracy,FP,FN


summary
PointLaneNet yes one-stage Network structure , A network outputs the location information of lane lines and lane line classification information at the same time , Create a horizontal line on the input image , There is a unique intersection between the straight line and the lane line , By finding the starting point of the lane line , Termination point , The center point and the width of the lane line determine the shape of the lane line and the classification of the lane , The final output of the network is : Offset , The starting point , Confidence of lane line .( y It's pre-set ( Horizontal line ), So if you want to determine the coordinates of each point, you only need to determine the offset of that point ); Besides , Defines the lane line and between the lane lines , Distance between point and lane line ; Set a threshold to determine whether the detected lane line is a positive sample or a negative sample .
边栏推荐
- CS224W fall 1.2 Applications of Graph ML
- Cuteone: a onedrive multi network disk mounting program / with member / synchronization and other functions
- 小玩一个并行多线程MCU—MC3172
- 2513: 小勇学分数(公约数问题)
- CuteOne:一款OneDrive多网盘挂载程序/带会员/同步等功能
- Go to export excel form
- Arduino UNO +74HC164流水灯示例
- 杀毒软件 clamav 的安装和使用
- Favicon网页收藏图标在线制作PHP网站源码/ICO图片在线生成/支持多种图片格式转换
- Blog competition dare to try BAC for beginners
猜你喜欢

Favicon网页收藏图标在线制作PHP网站源码/ICO图片在线生成/支持多种图片格式转换

OD-Paper【3】:Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
![[redis] quick start](/img/42/09f08b4f78bc2ddd4d15693d62cb4b.png)
[redis] quick start

测试人需要的数据库知识:MySQL常用语法

毕业2年转行软件测试获得12K+,不考研月薪过万的梦想实现了

Manually build ABP framework from 0 -abp official complete solution and manually build simplified solution practice

Non global function of lua function

Why do people like to rank things

Blog competition dare to try BAC for beginners

关于url编解码应该选用的函数
随机推荐
一体式水利视频监控站 遥测终端视频图像水位水质水量流速监测
Information collection port scanning tool nmap instructions
全网最全的软件测试基础知识整理(新手入门必学)
Comprehensive summary of shell analysis log file commands
商城小程序项目完整源码(微信小程序)
Which securities firm is safer to open an account and buy REITs funds?
数据资产管理的概念
【RYU】安装RYU常见问题及解决办法
Cs224w fall course - --- 1.1 why graphs?
iNFTnews | GGAC联合中国航天ASES 独家出品《中国2065典藏版》
我的爬虫笔记(七) 通过爬虫实现blog访问量+1
数据库读写分离和分库分表
Arduino UNO +74hc164 water lamp example
CS224W fall 1.2 Applications of Graph ML
Database read-write separation and database and table segmentation
What did kubedmin do?
5、 MFC view windows and documents
人们为什么热衷于给事物排序
用最原始的方法纯手工实现常见的 20 个数组方法
localStorage与sessionStorage