当前位置：网站首页>Asian Games countdown! AI target detection helps host the Asian Games!

Asian Games countdown! AI target detection helps host the Asian Games!

2022-07-03 01:51:00 【Computer Vision Research Institute】

Pay attention to the parallel stars

Never get lost

Institute of computer vision

official account ID｜ComputerVisionGzq

Study Group ｜ Scan the code to get the join mode on the homepage

Computer Vision Institute column

author ：Edison_G

object detection It is now the most popular research topic , At present, the most popular is Yolo Series frame , Recently, our computer vision research institute also shared a lot of work and practice of target detection , All are Yolo-Base frame , Today we share a modified version of Yolov5, Effect of real-time detection ！

Open source code ：https://github.com/ultralytics/yolov5/releases

Preface

object detection It is now the most popular research topic , Now there are more and more frameworks , But the novelty of technology has reached the bottleneck , At present, it is becoming popular Transformer Mechanism , And it can also be greatly improved in the field of target detection , It is also a hot spot in current research .

The upcoming Asian Games , More advanced technology will be invested , such as 3D imaging 、 Attitude estimation 、 object detection 、 Tracking and identification ！ Let's go to the world and see a different China , Different technology , Different Asian Games . Today, let's talk about the embodiment of target detection in the Asian Games ！

At present, the most popular is Yolo Series frame , Recently we “ Institute of computer vision ” Also shared a good understanding of target detection work and practice , All are Yolo-Base frame .

The new framework improves

Today we share a simple optimized Yolov5, Temporarily named ：Pad-YoloV5, stay IPad It can detect in real time ！ be based on YoloV5 frame , Familiar students should not have to explain more .

YoloV4 stay YoloV3 Based on the research results of recent two years , as follows ：

Input terminal adopts mosaic Data to enhance
Backbone I used CSPDarknet53、Mish Activation function 、Dropblock Methods such as .（cspnet It can reduce the amount of calculation and ensure the accuracy ）
Mish Function is ：
Neck Have adopted the SPP、FPN+PAN Structure ,
The output end adopts CIOU_Loss、DIOU_nms operation

YoloV5 Major changes , as follows ：

Input end ：Mosaic Data to enhance 、 Adaptive anchor frame calculation
Backbone：Focus structure ,CSP structure
Neck：FPN+PAN structure
Prediction：GIOU_Loss

This time, the main optimization is , yes YoloV5 When data is enhanced , Use random scaling 、 Random cutting 、 Random layout of the way to splice , The detection effect for small targets is still very friendly . It is found through experiments that , This random splicing and regular splicing , The final result is a little different .

First, I modify the data enhancement strategy , Start to make statistics on the overall data set （ That is, data preprocessing and analysis ）, I'm roughly divided into three areas . Randomly splice the largest and smallest , The final result is really better than the overall random effect ！

secondly , Slightly modified the adaptive image scaling strategy ,Yolov5 In the code datasets.py Of letterbox Function has been modified , Add the least black edge to the original image adaptively . I am the picture after adaptive scaling , I'll fill in the lower right corner , In fact, most of the data have not changed , Just change it casually , Because online is all about Yolo Add new strategies in recent years , Indeed, there was a certain increase in the effect of the final examination .

The last modification , It's hard work Transformer The mechanism is added YoloV5 In the basic framework of , Training has really accelerated , But for the results of training with notebooks , Still not obvious enough . This is also the first time recently to share some careful thoughts about the practical process , The specific details we “ Institute of computer vision ” Later, I will share with you through a detailed work article ！

THE END

Please contact the official account for authorization.

The learning group of computer vision research institute is waiting for you to join ！

Institute of computer vision Mainly involves Deep learning field , It's mainly about Face detection 、 Face recognition , Multi target detection 、 Target tracking 、 Image segmentation, etc Research direction . research institute Next, we will continue to share the latest papers, algorithms and new frameworks , The difference of our reform this time is , We need to focus on ” Research “. After that, we will share the practice process for the corresponding fields , Let's really understand Get rid of the theory The real scene of , Develop the habit of hands-on programming and brain thinking ！

Sweep code Focus on

Institute of computer vision

official account ID｜ComputerVisionGzq

Study Group ｜ Scan the code to get the join mode on the homepage

Previous recommendation