当前位置:网站首页>Asian Games countdown! AI target detection helps host the Asian Games!
Asian Games countdown! AI target detection helps host the Asian Games!
2022-07-03 01:51:00 【Computer Vision Research Institute】
Pay attention to the parallel stars
Never get lost
Institute of computer vision


official account ID|ComputerVisionGzq
Study Group | Scan the code to get the join mode on the homepage
Computer Vision Institute column
author :Edison_G
object detection It is now the most popular research topic , At present, the most popular is Yolo Series frame , Recently, our computer vision research institute also shared a lot of work and practice of target detection , All are Yolo-Base frame , Today we share a modified version of Yolov5, Effect of real-time detection !
Open source code :https://github.com/ultralytics/yolov5/releases
01

Preface

object detection It is now the most popular research topic , Now there are more and more frameworks , But the novelty of technology has reached the bottleneck , At present, it is becoming popular Transformer Mechanism , And it can also be greatly improved in the field of target detection , It is also a hot spot in current research .

The upcoming Asian Games , More advanced technology will be invested , such as 3D imaging 、 Attitude estimation 、 object detection 、 Tracking and identification ! Let's go to the world and see a different China , Different technology , Different Asian Games . Today, let's talk about the embodiment of target detection in the Asian Games !


At present, the most popular is Yolo Series frame , Recently we “ Institute of computer vision ” Also shared a good understanding of target detection work and practice , All are Yolo-Base frame .
02

The new framework improves

Today we share a simple optimized Yolov5, Temporarily named :Pad-YoloV5, stay IPad It can detect in real time ! be based on YoloV5 frame , Familiar students should not have to explain more .
YoloV4 stay YoloV3 Based on the research results of recent two years , as follows :
Input terminal adopts mosaic Data to enhance
Backbone I used CSPDarknet53、Mish Activation function 、Dropblock Methods such as .(cspnet It can reduce the amount of calculation and ensure the accuracy )
Mish Function is :

Neck Have adopted the SPP、FPN+PAN Structure ,
The output end adopts CIOU_Loss、DIOU_nms operation
YoloV5 Major changes , as follows :
Input end :Mosaic Data to enhance 、 Adaptive anchor frame calculation
Backbone:Focus structure ,CSP structure
Neck:FPN+PAN structure
Prediction:GIOU_Loss
This time, the main optimization is , yes YoloV5 When data is enhanced , Use random scaling 、 Random cutting 、 Random layout of the way to splice , The detection effect for small targets is still very friendly . It is found through experiments that , This random splicing and regular splicing , The final result is a little different .

First, I modify the data enhancement strategy , Start to make statistics on the overall data set ( That is, data preprocessing and analysis ), I'm roughly divided into three areas . Randomly splice the largest and smallest , The final result is really better than the overall random effect !

secondly , Slightly modified the adaptive image scaling strategy ,Yolov5 In the code datasets.py Of letterbox Function has been modified , Add the least black edge to the original image adaptively . I am the picture after adaptive scaling , I'll fill in the lower right corner , In fact, most of the data have not changed , Just change it casually , Because online is all about Yolo Add new strategies in recent years , Indeed, there was a certain increase in the effect of the final examination .
The last modification , It's hard work Transformer The mechanism is added YoloV5 In the basic framework of , Training has really accelerated , But for the results of training with notebooks , Still not obvious enough . This is also the first time recently to share some careful thoughts about the practical process , The specific details we “ Institute of computer vision ” Later, I will share with you through a detailed work article !
THE END
Please contact the official account for authorization.

The learning group of computer vision research institute is waiting for you to join !
Institute of computer vision Mainly involves Deep learning field , It's mainly about Face detection 、 Face recognition , Multi target detection 、 Target tracking 、 Image segmentation, etc Research direction . research institute Next, we will continue to share the latest papers, algorithms and new frameworks , The difference of our reform this time is , We need to focus on ” Research “. After that, we will share the practice process for the corresponding fields , Let's really understand Get rid of the theory The real scene of , Develop the habit of hands-on programming and brain thinking !

Sweep code Focus on
Institute of computer vision
official account ID|ComputerVisionGzq
Study Group | Scan the code to get the join mode on the homepage
Previous recommendation
Label,Verify,Correct: A simple Few Shot Target detection method
SPARSE DETR: Efficient end-to-end target detection with learnable sparsity ( Source code download )
Adaptive feature fusion is used for Single-Shot object detection ( With source code download )
object detection :SmartDet、Miti-DETR and Few-Shot Object Detection
Yolo-Z: The improved YOLOv5 For small target detection ( Attached is the original paper download )
Multigrid redundant bounding box annotation for accurate target detection
边栏推荐
- Network security - man in the middle attack
- Network security - vulnerabilities and Trojans
- [data mining] task 6: DBSCAN clustering
- Some functions of applet development
- Network security - DNS spoofing and phishing websites
- Network security - phishing
- 云原生题目整理(待更新)
- openresty 缓存
- Learn the five skills you need to master in cloud computing application development
- Steps to obtain SSL certificate private key private key file
猜你喜欢

串口抓包/截断工具的安装及使用详解

Vant implements a simple login registration module and a personal user center

CF1617B Madoka and the Elegant Gift、CF1654C Alice and the Cake、 CF1696C Fishingprince Plays With Arr

深度(穿透)选择器 ::v-deep/deep/及 > > >

【Camera专题】Camera dtsi 完全解析

Qtablewidget lazy load remaining memory, no card!
![[Appendix 6 Application of reflection] Application of reflection: dynamic agent](/img/e7/0ee42902b178b13e9a41385267e7b6.jpg)
[Appendix 6 Application of reflection] Application of reflection: dynamic agent
![[technology development-23]: application of DSP in future converged networks](/img/2e/f39543a18a8f58b1d341ce72cc4427.png)
[technology development-23]: application of DSP in future converged networks

【数据挖掘】任务5:K-means/DBSCAN聚类:双层正方形

The technology boss is ready, and the topic of position C is up to you
随机推荐
Leetcode skimming questions_ Sum of two numbers II - enter an ordered array
Qtablewidget lazy load remaining memory, no card!
VIM 9.0 is officially released! The execution speed of the new script can be increased by up to 100 times
Smart management of Green Cities: Digital twin underground integrated pipe gallery platform
Processing of tree structure data
网络安全-DNS欺骗与钓鱼网站
Learn the five skills you need to master in cloud computing application development
Network security - Information Collection
Everything file search tool
[untitled]
Summary of interval knowledge
Caused by: com. fasterxml. jackson. databind. exc.MismatchedInputException: Cannot construct instance o
Comment le chef de file gère - t - il l'équipe en cas d'épidémie? Contributions communautaires
Why is it not recommended to use BeanUtils in production?
Reprint some Qt development experience written by great Xia 6.5
Smart management of Green Cities: Digital twin underground integrated pipe gallery platform
Function definition and call, this, strict mode, higher-order function, closure, recursion
小程序開發的部分功能
How is the mask effect achieved in the LPL ban/pick selection stage?
Main features of transport layer TCP and TCP connection