当前位置:网站首页>Understand the first prediction stage of yolov1
Understand the first prediction stage of yolov1
2022-07-03 05:49:00 【code bean】
First of all to see YOLOV1 Network structure , Feeling is ordinary convolutional neural network :
The final output is a 7*7*30 The black box of , Think of it as a book 7*7 The size has 30 Page of a book .

Every page of the book , from 49 A square (grid cell) form , V1 Each in the version grid cell forecast 2 Boxes (bounding box,bounding box The center of lies in this gridcell) So we can generate 98 individual bounding box.
Every bounding box, contain 5 Parameters :
1 x,y bounding box The location of the center point of
2 h,w bounding box The width and height of
3 c Whether to include the confidence of the target object ( This confidence , It should be cross comparison )
The perspective is switching back grid cell,grid cell It also includes the probability of each category , use grid cell The category probability of , multiply grid cell Self generated bounding box The degree of confidence , Get this bounding box Probability corresponding to each category .
In this case , Filter out those with low probability , The frame containing location information and category information can be drawn .
Look at the black box again :

that , We just looked at the left view of the black box , That just mentioned every grid cell Also includes A lot of information . Then the front view of this picture will be clearer ( And each or 49 individual grid cell Are all made of depth , This depth is 30).
here , Let's focus on , Actually, one. grid cell And the depth information it contains :

As mentioned before , Every grid cell Predictable 2 Boxes (bounding box), Purple and green are these two bounding box part , Every bounding box contain 5 Parameters , Yes, it's the one mentioned above 5 Parameters . Last 20 Dimensions store the grid cell Corresponding 20 Category probability . Seeing here, we probably know .V1 Version a picture can generate at most 98 Boxes , At most 20 Species category .
Summary
For better understanding , Here are a few points :
- grid cell and bounding box There is a corresponding relationship : Every grid cell Corresponding to the two bounding box
- bounding box Include the confidence level of whether the object is included ( Here we need to further confirm the confidence of what it is )
- grid cell Include the probability of each category .
- Two bounding box Each has a confidence , If the figure below shows :

The red box represents grid cell, Two black frames represent grid cell Two generated bounding boxes, The thicker the line, the higher the confidence . Two bounding box Share the grid cell Probability of category . Multiply probability and confidence , Will get this category to judge Total probability .
Look at this dynamic diagram , Should be able to better understand :

Continue analysis : If each grid Apply your own color to the maximum probability of the middle category , Then you may get the following effects :

Here is another conclusion ,V1 At most versions are predicted 20 Species , And at most, you can box out 49 An object .
Reference material
【 intensive reading AI The paper 】YOLO V1 object detection , Just look at me _ Bili, Bili _bilibili
边栏推荐
- Jetson AgX Orin platform porting ar0233 gw5200 max9295 camera driver
- There is no one of the necessary magic skills PXE for old drivers to install!!!
- [function explanation (Part 2)] | [function declaration and definition + function recursion] key analysis + code diagram
- "C and pointer" - Chapter 13 advanced pointer int * (* (* (*f) () [6]) ()
- mapbox尝鲜值之云图动画
- Installation of CAD plug-ins and automatic loading of DLL and ARX
- 1. 兩數之和
- [teacher Zhao Yuqiang] the most detailed introduction to PostgreSQL architecture in history
- PHP notes are super detailed!!!
- Final review Day8
猜你喜欢
![[function explanation (Part 1)] | | knowledge sorting + code analysis + graphic interpretation](/img/c2/991b8febd262cf9237017adc9d1221.jpg)
[function explanation (Part 1)] | | knowledge sorting + code analysis + graphic interpretation

Kubernetes resource object introduction and common commands (V) - (configmap)

今天很多 CTO 都是被干掉的,因为他没有成就业务

Beaucoup de CTO ont été tués aujourd'hui parce qu'il n'a pas fait d'affaires
![[video of Teacher Zhao Yuqiang's speech on wot] redis high performance cache and persistence](/img/a7/2140744ebad9f1dc0a609254cc618e.jpg)
[video of Teacher Zhao Yuqiang's speech on wot] redis high performance cache and persistence

Understand one-way hash function

Latest version of source insight

How to create and configure ZABBIX
![[teacher Zhao Yuqiang] MySQL high availability architecture: MHA](/img/a7/2140744ebad9f1dc0a609254cc618e.jpg)
[teacher Zhao Yuqiang] MySQL high availability architecture: MHA

Communication - how to be a good listener?
随机推荐
[teacher Zhao Yuqiang] Cassandra foundation of NoSQL database
"C and pointer" - Chapter 13 advanced pointer int * (* (* (*f) () [6]) ()
Personal outlook | looking forward to the future from Xiaobai's self analysis and future planning
The programmer shell with a monthly salary of more than 10000 becomes a grammar skill for secondary school. Do you often use it!!!
"C and pointer" - Chapter 13 function pointer 1: callback function 2 (combined with template to simplify code)
Apache+PHP+MySQL环境搭建超详细!!!
Strategy pattern: encapsulate changes and respond flexibly to changes in requirements
Jetson AgX Orin platform porting ar0233 gw5200 max9295 camera driver
[function explanation (Part 2)] | [function declaration and definition + function recursion] key analysis + code diagram
How does win7 solve the problem that telnet is not an internal or external command
70 shell script interview questions and answers
There is no one of the necessary magic skills PXE for old drivers to install!!!
[trivia of two-dimensional array application] | [simple version] [detailed steps + code]
ES 2022 正式发布!有哪些新特性?
Ext4 vs XFS -- which file system should you use
How to set up altaro offsite server for replication
Btrfs and ext4 - features, strengths and weaknesses
期末复习(DAY6)
Xaml gradient issue in uwp for some devices
Azure file synchronization of altaro: the end of traditional file servers?