当前位置:网站首页>Pytoch deep learning and target detection practice notes
Pytoch deep learning and target detection practice notes
2022-07-03 14:52:00 【Sol-itude】
Original video link :PyTorch Deep learning target detection introduction practical series 【 Mound x Boolean number 】
download VOC Data sets
Download address :http://host.robots.ox.ac.uk/pascal/VOC/
What is in the dataset

- Annotations Label folder ( contain xml file , Various information of pictures )
- ImageSets Picture collection ( Main concern Main The contents of the folder )
- JPEGImages The data set contains pictures
- SegmentationClass Semantic segmentation picture
- SegmentationObject Instance split picture
What we need to pay attention to ImageSets
open ImageSets-Main-aeroplane_train.txt
You can see 32 and 33 by 1, stay JPEGImages in ,32 Picture of No 
open Annotations
open Annotations-000032.xml
<object>
<name>aeroplane</name>< Picture category name >
<pose>Frontal</pose>< The angle of the object , It can be seen that it was taken in front >
<truncated>0</truncated>< Whether it is truncated ,0 Yes no >
<difficult>0</difficult>< Is it difficult to identify ,0 It's not difficult >
<bndbox>
<xmin>104</xmin>< The minimum value of abscissa of the object on the picture >
<ymin>78</ymin>< The minimum value of the vertical coordinate of the object on the picture >
<xmax>375</xmax>< The maximum abscissa of the object on the picture >
<ymax>183</ymax>< The maximum value of the ordinate of the object on the picture >
</bndbox>
</object>
download COCO Data sets
Official website :https://cocodataset.org/#home
Reading Annotations

Scan from the first line , yes 46 Lattice pixel , White pixels do not appear until the third line , Line by line scanning
“counts”:[147,3,1…]
To be continued
边栏推荐
- Qt development - scrolling digital selector commonly used in embedded system
- C language dup2 function
- C language STR function
- Qt—绘制其他东西
- QT program font becomes larger on computers with different resolutions, overflowing controls
- Four data flows and cases of grpc
- Special research report on the market of lithium battery electrolyte industry in China (2022 Edition)
- Pyqt interface production (login + jump page)
- Zzuli:1042 sum of sequence 3
- tonybot 人形机器人 定距移动 代码编写玩法
猜你喜欢

Adobe Premiere Pro 15.4 has been released. It natively supports Apple M1 and adds the function of speech to text

Rasterization: a practical implementation (2)

4-33--4-35

C language fcntl function
![[ue4] material and shader permutation](/img/8f/7743ac378490fcd7b9ecc5b4c2ef2a.jpg)
[ue4] material and shader permutation
![[qingniaochangping campus of Peking University] in the Internet industry, which positions are more popular as they get older?](/img/f6/fe61c84f289f0e74a45946dac687a6.jpg)
[qingniaochangping campus of Peking University] in the Internet industry, which positions are more popular as they get older?

Niuke: crossing the river
![[ue4] geometry drawing pipeline](/img/30/9fcf83a665043fe57389d44c2e16a8.jpg)
[ue4] geometry drawing pipeline

Vs+qt multithreading implementation -- run and movetothread

QT - draw something else
随机推荐
NOI OPENJUDGE 1.3(06)
Chapter 14 class part 1
Dllexport and dllimport
QT program font becomes larger on computers with different resolutions, overflowing controls
Devaxpress: range selection control rangecontrol uses
【微信小程序】WXSS 模板样式
Optical cat super account password and broadband account password acquisition
Frequently asked questions: PHP LDAP_ add(): Add: Undefined attribute type in
C language DUP function
表单文本框的使用(一) 选择文本
Zzuli:1046 product of odd numbers
5-1 blocking / non blocking, synchronous / asynchronous
CentOS7部署哨兵Redis(带架构图,清晰易懂)
[opengl] face pinching system
Adobe Premiere Pro 15.4 has been released. It natively supports Apple M1 and adds the function of speech to text
C language fcntl function
[opengl] advanced chapter of texture - principle of flowmap
C language to realize mine sweeping
My QT learning path -- how qdatetimeedit is empty
Some concepts about agile