当前位置:网站首页>Asian Games countdown! AI target detection helps host the Asian Games!
Asian Games countdown! AI target detection helps host the Asian Games!
2022-07-03 01:51:00 【Computer Vision Research Institute】
Pay attention to the parallel stars
Never get lost
Institute of computer vision
official account ID|ComputerVisionGzq
Study Group | Scan the code to get the join mode on the homepage
Computer Vision Institute column
author :Edison_G
object detection It is now the most popular research topic , At present, the most popular is Yolo Series frame , Recently, our computer vision research institute also shared a lot of work and practice of target detection , All are Yolo-Base frame , Today we share a modified version of Yolov5, Effect of real-time detection !
Open source code :https://github.com/ultralytics/yolov5/releases
01
Preface
object detection It is now the most popular research topic , Now there are more and more frameworks , But the novelty of technology has reached the bottleneck , At present, it is becoming popular Transformer Mechanism , And it can also be greatly improved in the field of target detection , It is also a hot spot in current research .
The upcoming Asian Games , More advanced technology will be invested , such as 3D imaging 、 Attitude estimation 、 object detection 、 Tracking and identification ! Let's go to the world and see a different China , Different technology , Different Asian Games . Today, let's talk about the embodiment of target detection in the Asian Games !
At present, the most popular is Yolo Series frame , Recently we “ Institute of computer vision ” Also shared a good understanding of target detection work and practice , All are Yolo-Base frame .
02
The new framework improves
Today we share a simple optimized Yolov5, Temporarily named :Pad-YoloV5, stay IPad It can detect in real time ! be based on YoloV5 frame , Familiar students should not have to explain more .
YoloV4 stay YoloV3 Based on the research results of recent two years , as follows :
Input terminal adopts mosaic Data to enhance
Backbone I used CSPDarknet53、Mish Activation function 、Dropblock Methods such as .(cspnet It can reduce the amount of calculation and ensure the accuracy )
Mish Function is :
Neck Have adopted the SPP、FPN+PAN Structure ,
The output end adopts CIOU_Loss、DIOU_nms operation
YoloV5 Major changes , as follows :
Input end :Mosaic Data to enhance 、 Adaptive anchor frame calculation
Backbone:Focus structure ,CSP structure
Neck:FPN+PAN structure
Prediction:GIOU_Loss
This time, the main optimization is , yes YoloV5 When data is enhanced , Use random scaling 、 Random cutting 、 Random layout of the way to splice , The detection effect for small targets is still very friendly . It is found through experiments that , This random splicing and regular splicing , The final result is a little different .
First, I modify the data enhancement strategy , Start to make statistics on the overall data set ( That is, data preprocessing and analysis ), I'm roughly divided into three areas . Randomly splice the largest and smallest , The final result is really better than the overall random effect !
secondly , Slightly modified the adaptive image scaling strategy ,Yolov5 In the code datasets.py Of letterbox Function has been modified , Add the least black edge to the original image adaptively . I am the picture after adaptive scaling , I'll fill in the lower right corner , In fact, most of the data have not changed , Just change it casually , Because online is all about Yolo Add new strategies in recent years , Indeed, there was a certain increase in the effect of the final examination .
The last modification , It's hard work Transformer The mechanism is added YoloV5 In the basic framework of , Training has really accelerated , But for the results of training with notebooks , Still not obvious enough . This is also the first time recently to share some careful thoughts about the practical process , The specific details we “ Institute of computer vision ” Later, I will share with you through a detailed work article !
THE END
Please contact the official account for authorization.
The learning group of computer vision research institute is waiting for you to join !
Institute of computer vision Mainly involves Deep learning field , It's mainly about Face detection 、 Face recognition , Multi target detection 、 Target tracking 、 Image segmentation, etc Research direction . research institute Next, we will continue to share the latest papers, algorithms and new frameworks , The difference of our reform this time is , We need to focus on ” Research “. After that, we will share the practice process for the corresponding fields , Let's really understand Get rid of the theory The real scene of , Develop the habit of hands-on programming and brain thinking !
Sweep code Focus on
Institute of computer vision
official account ID|ComputerVisionGzq
Study Group | Scan the code to get the join mode on the homepage
Previous recommendation
Label,Verify,Correct: A simple Few Shot Target detection method
SPARSE DETR: Efficient end-to-end target detection with learnable sparsity ( Source code download )
Adaptive feature fusion is used for Single-Shot object detection ( With source code download )
object detection :SmartDet、Miti-DETR and Few-Shot Object Detection
Yolo-Z: The improved YOLOv5 For small target detection ( Attached is the original paper download )
Multigrid redundant bounding box annotation for accurate target detection
边栏推荐
- 小程序開發的部分功能
- 网络安全-防火墙
- 网络安全-病毒
- [AUTOSAR cantp] -2.11-uds diagnostic response frame data segment data padding data filling and data optimization data optimization (Theory + configuration)
- Huakaiyun (Zhiyin) | virtual host: what is a virtual host
- Sweet talk generator, regular greeting email machine... Open source programmers pay too much for this Valentine's day
- View of MySQL
- Processing of tree structure data
- Network security - scan
- 【Camera专题】Camera dtsi 完全解析
猜你喜欢
Three core issues of concurrent programming - "deep understanding of high concurrent programming"
Introduction to flask tutorial
[camera topic] how to save OTP data in user-defined nodes
传输层 TCP主要特点和TCP连接
mysql
Learn BeanShell before you dare to say you know JMeter
[data mining] task 6: DBSCAN clustering
[shutter] animation animation (animatedbuilder animation use process | create animation controller | create animation | create components for animation | associate animation with components | animatio
Take you ten days to easily complete the go micro service series (II)
CF1617B Madoka and the Elegant Gift、CF1654C Alice and the Cake、 CF1696C Fishingprince Plays With Arr
随机推荐
小程序開發的部分功能
树形结构数据的处理
网络安全-破解系统密码
Smart management of Green Cities: Digital twin underground integrated pipe gallery platform
Rockchip3399 start auto load driver
Some functions of applet development
【Camera专题】Camera dtsi 完全解析
Network security - cracking system passwords
Network security - firewall
【数据挖掘】任务5:K-means/DBSCAN聚类:双层正方形
Take you ten days to easily complete the go micro service series (I)
英语常用词汇
Custom components, using NPM packages, global data sharing, subcontracting
Why is it not recommended to use BeanUtils in production?
The technology boss is ready, and the topic of position C is up to you
Niuniu's ball guessing game (dynamic planning + prefix influence)
Sweet talk generator, regular greeting email machine... Open source programmers pay too much for this Valentine's day
网络安全-浅谈安全威胁
Leetcode skimming questions_ Sum of two numbers II - enter an ordered array
CF1617B Madoka and the Elegant Gift、CF1654C Alice and the Cake、 CF1696C Fishingprince Plays With Arr