当前位置:网站首页>Pedestrian re identification (Reid) - Overview
Pedestrian re identification (Reid) - Overview
2022-07-06 15:09:00 【gmHappy】
What is? Re-ID?
- Pedestrian recognition (Person re-identification, abbreviation Re-ID) Also known as pedestrian recognition , Is the use of computer vision technology to determine whether there is a specific pedestrian in the image or video sequence . It is widely regarded as a sub problem of image retrieval . Given a monitored pedestrian image , Retrieve the pedestrian image under the cross device . It aims to make up for the visual limitations of the current fixed camera , And can detect with pedestrians / Combination of pedestrian tracking technology , It can be widely used in intelligent video surveillance 、 Intelligent security and other fields .
- As shown in the figure below : There are multiple cameras shooting video sequences in an area ,ReID The requirements of a camera under the interest of pedestrians , Retrieve all pictures of the pedestrian under other cameras .
Why Re-ID?
In surveillance video , Due to the camera resolution and shooting angle , Very high quality face images are usually not available . When face recognition fails ,ReID It has become a very important alternative technology .
Research forms
- Data sets are usually pedestrian images obtained by manual annotation or detection algorithms , At present, it is independent of detection , Pay attention to identification
- Data sets are divided into training sets 、 Verification set 、Query、Gallery
- Train the model on the training set , After getting the model, right Query And Gallery Image feature extraction and similarity calculation in , For each Query stay Gallery Find out before N A similar picture
- Training 、 The identity of the person in the test is not repeated
Two directions
- feature extraction : Learn to be able to cope with the characteristics of people changing under different cameras
- Measure learning : Mapping the learned features to a new space makes the same people closer and different people farther
There are challenges
- Different cameras cause great changes in the appearance of pedestrians ;
- Target occlusion (Occlusion) Some features are lost ;
- Different View,Illumination Differences in characteristics that lead to the same goal ;
- Different target clothes have similar colors 、 Feature approximation leads to a decrease in discrimination ;
Common data set
CUHK03
Market1501
DukeMTMC-reID
MSMT17
Only commonly used data sets are listed here , A more complete data set can be referred to : Person Re-identification Datasets
Commonly used evaluation index
- rank-k: The sorting list returned by the algorithm , front k If the bit is an existing search target, it is called rank-k hit .eg:rank1: The first is the search target rank-1 hit .
- Cumulative Match Characteristic (CMC)
Take a very simple example , Suppose in face recognition , There are 100 personal , Now comes 1 A face to be recognized ( If label by m1), After comparing with the faces in the bottom database, the faces in the bottom database are sorted from high to low , We found that :
If the recognition result is m1、m2、m3、m4、m5……, Now rank-1 The accuracy of is 100%;rank-2 The correct rate of is 100%;rank-5 The correct rate of is 100%;
If the recognition result is m2、m1、m3、m4、m5……, Now rank-1 The accuracy of is 0%;rank-2 The accuracy of is 100%;rank-5 The correct rate of is 100%;
If the recognition result is m2、m3、m4、m5、m1……, Now rank-1 The accuracy of is 0%;rank-2 The accuracy of is 0%;rank-5 The accuracy of is 100%;
Empathy , When there are many faces to be recognized , Take the average . For example, the face to be recognized has 3 individual ( If label by m1,m2,m3), Similarly, there is a score from high to low for everyone's face ,
such as :
Face 1 The result is m1、m2、m3、m4、m5……,
Face 2 The result is m2、m1、m3、m4、m5……,
Face 3 result m3、m1、m2、m4、m5……,
Now rank-1 The accuracy of is (1+1+1)/3=100%;
rank-2 The correct rate of is (1+1+1)/3=100%;
rank-5 The correct rate of is (1+1+1)/3=100%;
such as :
Face 1 The result is m4、m2、m3、m5、m6……,
Face 2 The result is m1、m2、m3、m4、m5……,
Face 3 result m3、m1、m2、m4、m5……,
Now rank-1 The accuracy of is (0+0+1)/3=33.33%;
rank-2 The accuracy of is (0+1+1)/3=66.66%;
rank-5 The correct rate of is (0+1+1)/3=66.66%;
curve: Calculation rank-k Hit rate of , formation rank-acc The curve of , Here's the picture :
- mAP(mean average precision): Reflect the extent to which all the correct pictures in the database of the person who searched are in front of the sorted list , It can be measured more comprehensively ReID Performance of the algorithm . Here's the picture , Suppose the search pedestrian is gallery There is 10 A picture , In the list Middle position (rank) Respectively 1、2、3、4、5、6、7、8、9, be ap by (1/ 1 + 2 / 2 + 3 / 3 + 4 / 4 + 5 / 5 + 6 / 6 + 7 / 7 + 8 / 8 + 9 / 9) / 10 = 0.90;ap large , The search results of this pedestrian are relatively high , For all query Of ap Take the average value to mAP
Generally speaking ,Precision It's the retrieved items ( such as : file 、 Web page, etc ) How much is accurate ,Recall It's how many of the exact entries have been retrieved .
Accuracy rate = The number of positive samples detected / Total number detected
Recall rate = The number of positive samples detected / Number of all positive samples
Let's take a new example .
Suppose there is a search engine , According to search engines , The results are as follows :
Search for 1 The total number of relevant samples is 5 individual : just , just , just , just , just
Rank1 | just | negative | just | negative | negative | just | negative | negative | just | just |
Recall | 1/5=0.2 | 1/5=0.2 | 2/5=0.4 | 2/5=0.4 | 2/5=0.4 | 3/5=0.6 | 3/5=0.6 | 3/5=0.6 | 4/5=0.8 | 5/5=1.0 |
Precision | 1/1=1.0 | 1/2=0.5 | 2/3=0.66 | 2/4=0.5 | 2/5=0.4 | 3/6=0.5 | 3/7=0.42 | 3/8=0.38 | 4/9=0.44 | 7/10=0.5 |
Precision From left to right 1/1, 1/2, 2/3, 2/4… And so on
Search for 2 There are a total of 3 individual , The following are the results returned by the search engine
Rank1 | just | negative | negative | just | just | negative | negative |
Recall | 0.33 | 0.33 | 0.33 | 0.66 | 1 | 1 | 1 |
Precision | 1.0 | 0.5 | 0.33 | 0.5 | 0.6 | 0.5 | 0.43 |
We put each positive sample corresponding to Precision Averaging
Search for 1 Of mAP:mAP = (1/1 + 2/3 + 3/6 + 4/9+ 5/10) / 5 = 0.72
Search for 2 Of mAP: mAP = (1/1 + 2/4 + 3/5) / 3 = 0.63
Holistic mAP = (0.72 + 0.63) /2 = 0.675
边栏推荐
- Global and Chinese market for antiviral coatings 2022-2028: Research Report on technology, participants, trends, market size and share
- STC-B学习板蜂鸣器播放音乐
- Expanded polystyrene (EPS) global and Chinese markets 2022-2028: technology, participants, trends, market size and share Research Report
- 几款开源自动化测试框架优缺点对比你知道吗?
- Functions: Finding Roots of equations
- 基于485总线的评分系统双机实验报告
- [HCIA continuous update] working principle of static route and default route
- Video scrolling subtitle addition, easy to make with this technique
- UCORE lab7 synchronous mutual exclusion experiment report
- Why can swing implement a form program by inheriting the JFrame class?
猜你喜欢
Fundamentals of digital circuits (III) encoder and decoder
软件测试面试要问的性能测试术语你知道吗?
Express
STC-B学习板蜂鸣器播放音乐2.0
CSAPP家庭作業答案7 8 9章
Nest and merge new videos, and preset new video titles
ucore lab1 系统软件启动过程 实验报告
Build your own application based on Google's open source tensorflow object detection API video object recognition system (II)
几款开源自动化测试框架优缺点对比你知道吗?
自动化测试你必须要弄懂的问题,精品总结
随机推荐
[HCIA continuous update] advanced features of routing
The number of reversing twice in leetcode simple question
Function: calculates the number of uppercase letters in a string
“Hello IC World”
If the position is absolute, touchablehighlight cannot be clicked - touchablehighlight not clickable if position absolute
Vysor uses WiFi wireless connection for screen projection_ Operate the mobile phone on the computer_ Wireless debugging -- uniapp native development 008
Numpy Quick Start Guide
[pointer] solve the last person left
Face and eye recognition based on OpenCV's own model
Why can swing implement a form program by inheriting the JFrame class?
[issue 18] share a Netease go experience
ucore lab2 物理内存管理 实验报告
Keil5-MDK的格式化代码工具及添加快捷方式
Nest and merge new videos, and preset new video titles
Public key box
Zhejiang University Edition "C language programming experiment and exercise guide (3rd Edition)" topic set
CSAPP Shell Lab 实验报告
Statistics 8th Edition Jia Junping Chapter 1 after class exercises and answers summary
Dlib detects blink times based on video stream
ucore lab8 文件系统 实验报告