当前位置:网站首页>[target detection] Introduction to yolor theory + practical test visdrone data set
[target detection] Introduction to yolor theory + practical test visdrone data set
2022-07-29 00:58:00 【zstar-_】
Preface
YOLOR yes 2021 An algorithm proposed in , One of its works Chien-Yao Wang( Taiwan ) At the same time, it has just come out recently YOLOv7 The first author of .
Address of thesis :https://arxiv.org/abs/2105.04206
Address of the thesis project :https:// github.com/WongKinYiu/yolor
A brief introduction to the theory
YOLOR Your paper is not easy to chew , Because it writes a lot of mathematical derivation concepts , I don't do in-depth derivation , Simply retell his thoughts .
YOLOR The core idea of is also based on the physiological structure of human brain . Human beings can actively learn through normal senses and experience , The knowledge learned in this part is called Explicit knowledge , meanwhile , Human beings can also learn through subconsciousness , This part of the knowledge learned is called Tacit knowledge .
In the traditional neural network model , It often extracts neuron features and discards the learning and application of implicit knowledge , The author defines directly observable knowledge as explicit knowledge , Knowledge hidden in neural networks and unobservable is defined as implicit knowledge .
therefore , The author has made the following research contributions to this point :
- A unified network that can complete various tasks is proposed , It learns general representation by integrating implicit knowledge and explicit knowledge , People can accomplish various tasks through this general expression . The proposed network at a very small additional cost ( Less than 1 The parameter quantity and calculation quantity of ten thousand ) The performance of the model is effectively improved .
- Align the kernel space 、 Prediction refinement and multi task learning are introduced into the implicit knowledge learning process , And verify their effectiveness .
- Using vectors 、 The method of modeling tacit knowledge by means of neural network and matrix decomposition , And its effectiveness is verified .
- It is proved that the proposed implicit representation can accurately correspond to specific physical features , And we also present it in a visual way .
The following specific content is too complex and boring , For in-depth understanding, you can see the original paper or blog [1] Summary mind map :

The following is the experimental results of the author using different models shown in the paper :


Practice testing
Is it good , It depends on the actual use .
Seeing the source code is basically based on YOLOv5 Changed , So read my previous blog 【 object detection 】YOLOv5 Run through VisDrone Data sets Our readers should be familiar with Runtong YOLOR I'm familiar with it .
I will use VisDrone Data set and yolor_csp_x_star This model runs 100 individual epoch, Retest , Here are the test results , Same as before 【 object detection 】TPH-YOLOv5: be based on transformer Improvement yolov5 UAV target detection Make an experimental model for comparison , give the result as follows :
| Algorithm | [email protected] | [email protected]:.95s |
|---|---|---|
| yolov5-5.0 | 34.9% | 20.6% |
| yolov5-6.1 | 33.1% | 18.7% |
| tph-yolov5 | 37.4% | 21.7% |
| yolov6 | 32.5% | 17.4% |
| yolor | 37.7% | 22.8% |
notes : It's just 100 individual epoch What you get best.pt Test results , Not achieving optimal performance , Usually the test results shown in the paper are 300 individual epoch above .
Just look mAP Numerical words ,YOLOR The value of is iterating 100 Student: the wheel case , The highest value , explain YOLOR Indeed, it has improved the effect .
Next, I import the small target human detection video I tested before , You can see YOLOR The detection effect of is not as ideal as expected , Explain the situation of small targets ,YOLOR Not very applicable ( This is a conclusion reached without enough experiments , For reference only ).
YOLOR_VisDrone Data set detection effect
Code backup
run YOLOR It is obvious that the author's eyes are not limited to AP, occasionally ,AR In some specific tasks , Than AP It is more important . The author optimized this point , After training , Unlike the original only save best.pt and last.pt, Instead, save multiple models .
It may be due to some mistakes of the author , The original code has some small at run time bug. I made a simple correction , The code is backed up as follows ( contain yolor_csp_x_star.pt Weight file and some results of my running ):
https://pan.baidu.com/s/1lXiu82q0sQPTCU0C4UmcAQ?pwd=8888
References
[1]https://blog.csdn.net/weixin_42917352/article/details/120050525
边栏推荐
- Necessary interview skills for Android (including interview questions and learning materials)
- Andriod6.0 low power mode (turn off WiFi, Bluetooth, GPS, screen brightness, etc.)
- B站“崩溃”始末 2021.07.13 我们是这样崩的(转载)
- Flask sends verification code in combination with Ronglian cloud
- What are the methods to track the real-time market of London Silver?
- 数学建模及其基础知识详解(化学常考知识点)
- Daniel guild Games: summary and future outlook of this year
- Cloud function realizes website automatic check-in configuration details [web function /nodejs/cookie]
- 异步模式之工作线程
- Tips for API interface optimization
猜你喜欢

PLATO上线LAAS协议Elephant Swap,用户可借此获得溢价收益

Five interesting magic commands in jupyter notebook

自制 | 纯手工自制一个16位RISC架构CPU

Method of converting inline elements to block elements

Have you seen the management area decoupling architecture? Can help customers solve big problems

【Web开发】Flask框架基础知识

Shell programming specifications and variables

云函数实现网站自动化签到配置详解【Web函数/Nodejs/cookie】

What opportunities does the London gold real-time market bring?

Consumer unit 消费单元
随机推荐
时序预测 | MATLAB实现TCN时间卷积神经网络的时间序列预测
B+ tree~
Talk about the cross end technical scheme
从零开始实现lmax-Disruptor队列(六)Disruptor 解决伪共享、消费者优雅停止实现原理解析
直流无刷电机控制器(换电机霍尔收费多少)
分类预测 | MATLAB实现TCN时间卷积神经网络的时序分类预测
The method of tracking the real-time market of London Silver
数仓搭建——ADS层
Armeabi-v7a architecture (sv7a)
PLATO上线LAAS协议Elephant Swap,用户可借此获得溢价收益
Data warehouse construction - DWT floor
【commons-lang3专题】005- ObjectUtils 专题
【commons-lang3专题】003- RandomStringUtils 专题
SDRAM控制器设计(数字控制器的两种设计方法)
How to explain JS' bind simulation implementation to your girlfriend
追踪伦敦银实时行情的方法有哪些?
从零开始实现lmax-Disruptor队列(六)Disruptor 解决伪共享、消费者优雅停止实现原理解析
andriod6.0低功耗模式(关闭wifi、蓝牙、gps、屏幕亮度等)
【树莓派】widows电脑如何与树莓派连接
云函数实现网站自动化签到配置详解【Web函数/Nodejs/cookie】