当前位置:网站首页>Understand the first prediction stage of yolov1
Understand the first prediction stage of yolov1
2022-07-03 05:49:00 【code bean】
First of all to see YOLOV1 Network structure , Feeling is ordinary convolutional neural network :
The final output is a 7*7*30 The black box of , Think of it as a book 7*7 The size has 30 Page of a book .

Every page of the book , from 49 A square (grid cell) form , V1 Each in the version grid cell forecast 2 Boxes (bounding box,bounding box The center of lies in this gridcell) So we can generate 98 individual bounding box.
Every bounding box, contain 5 Parameters :
1 x,y bounding box The location of the center point of
2 h,w bounding box The width and height of
3 c Whether to include the confidence of the target object ( This confidence , It should be cross comparison )
The perspective is switching back grid cell,grid cell It also includes the probability of each category , use grid cell The category probability of , multiply grid cell Self generated bounding box The degree of confidence , Get this bounding box Probability corresponding to each category .
In this case , Filter out those with low probability , The frame containing location information and category information can be drawn .
Look at the black box again :

that , We just looked at the left view of the black box , That just mentioned every grid cell Also includes A lot of information . Then the front view of this picture will be clearer ( And each or 49 individual grid cell Are all made of depth , This depth is 30).
here , Let's focus on , Actually, one. grid cell And the depth information it contains :

As mentioned before , Every grid cell Predictable 2 Boxes (bounding box), Purple and green are these two bounding box part , Every bounding box contain 5 Parameters , Yes, it's the one mentioned above 5 Parameters . Last 20 Dimensions store the grid cell Corresponding 20 Category probability . Seeing here, we probably know .V1 Version a picture can generate at most 98 Boxes , At most 20 Species category .
Summary
For better understanding , Here are a few points :
- grid cell and bounding box There is a corresponding relationship : Every grid cell Corresponding to the two bounding box
- bounding box Include the confidence level of whether the object is included ( Here we need to further confirm the confidence of what it is )
- grid cell Include the probability of each category .
- Two bounding box Each has a confidence , If the figure below shows :

The red box represents grid cell, Two black frames represent grid cell Two generated bounding boxes, The thicker the line, the higher the confidence . Two bounding box Share the grid cell Probability of category . Multiply probability and confidence , Will get this category to judge Total probability .
Look at this dynamic diagram , Should be able to better understand :

Continue analysis : If each grid Apply your own color to the maximum probability of the middle category , Then you may get the following effects :

Here is another conclusion ,V1 At most versions are predicted 20 Species , And at most, you can box out 49 An object .
Reference material
【 intensive reading AI The paper 】YOLO V1 object detection , Just look at me _ Bili, Bili _bilibili
边栏推荐
- redis 遇到 NOAUTH Authentication required
- Qt读写Excel--QXlsx插入图表5
- @Import annotation: four ways to import configuration classes & source code analysis
- 期末复习(day3)
- 2022.6.30DAY591
- "C and pointer" - Chapter 13 function pointer 1: callback function 2 (combined with template to simplify code)
- 理解 期望(均值/估计值)和方差
- How to create your own repository for software packages on Debian
- [teacher Zhao Yuqiang] Alibaba cloud big data ACP certified Alibaba big data product system
- Download the corresponding version of chromedriver
猜你喜欢

Introduction to redis using Lua script

How to install and configure altaro VM backup for VMware vSphere

How do I migrate my altaro VM backup configuration to another machine?

Simpleitk learning notes

伯努利分布,二项分布和泊松分布以及最大似然之间的关系(未完成)

Understand one-way hash function
![[teacher Zhao Yuqiang] MySQL flashback](/img/93/75998e28fd309880661ea723dc8de6.jpg)
[teacher Zhao Yuqiang] MySQL flashback

Latest version of source insight

redis 无法远程连接问题。

Mapbox tasting value cloud animation
随机推荐
Qt读写Excel--QXlsx插入图表5
Introduction to redis using Lua script
今天很多 CTO 都是被干掉的,因为他没有成就业务
Apple submitted the new MAC model to the regulatory database before the spring conference
Solve the 1251 client does not support authentication protocol error of Navicat for MySQL connection MySQL 8.0.11
CAD插件的安裝和自動加載dll、arx
The server data is all gone! Thinking caused by a RAID5 crash
Ext4 vs XFS -- which file system should you use
Download the corresponding version of chromedriver
[escape character] [full of dry goods] super detailed explanation + code illustration!
Final review (Day6)
2022.7.2 模拟赛
伯努利分布,二项分布和泊松分布以及最大似然之间的关系(未完成)
一起上水碩系列】Day 9
Configure DTD of XML file
Redis使用Lua脚本简介
MySQL 5.7.32-winx64 installation tutorial (support installing multiple MySQL services on one host)
QT read write excel -- qxlsx insert chart 5
[minesweeping of two-dimensional array application] | [simple version] [detailed steps + code]
Final review Day8