当前位置:网站首页>Introduction and download of common data sets for in-depth learning (with network disk link)
Introduction and download of common data sets for in-depth learning (with network disk link)
2022-06-13 02:38:00 【Infinite thoughts】
Abstract
This blog post summarizes the data sets commonly used by bloggers for in-depth learning , Contains commonly used classifications 、 Target detection and face recognition tasks , A brief introduction to each data set is given 、 Keywords obtained from the official website download website and official account . Because some data sets are large , The download speed of the official website may be slow , For the convenience of management , Here I put it into the personal official account platform , Available through search “AI Technology research and sharing ” Official account , You can get the network disk link by replying to the keywords of each data set in the background . Data set files are downloaded from the official website , Only for learning communication , Bloggers will continue to update , Welcome to your attention .

Catalog
Data sets :Person identification in TV series
Data sets :The CNBC Face Database
Data sets :Caltech-10K-WebFaces
1. Classification data set
| Data sets :MNISTHandwritten digital datasets , Contains a set of 60,000 A training set of examples and a set of 10,000 A test set of samples . |
| Official website address | |
| keyword ( It is recommended to copy ) | Mnist |
| Data sets :CIFAR-10Contains 10 Category 60,000 Images ( Each class is represented as a line in the figure above ). All in all 50,000 A training image and 10,000 A test image . |
| Official website address | |
| keyword ( It is recommended to copy ) | CIFAR-10 |
| Data sets :Fashion-MNISTcontain 60,000 A training image and 10,000 A test image . similar MNIST Fashion products database of . |
| Official website address | |
| keyword ( It is recommended to copy ) | fashion-mnist |
| Data sets :ImagenetImagenet The dataset has 1400 More than ten thousand pictures , cover 2 More than 10000 categories , About image classification 、 location 、 Detection and other research work are mostly based on this data set . |
| Official website address | |
| keyword ( It is recommended to copy ) | ImageNet |
2. Target detection data set
| Data sets :MS-COCOCOCO It is a large-scale and rich object detection , Segmentation and subtitle data sets .330K Images ,80 Object categories , Each image 5 Subtitles ,250,000 A person with key points . |
| Official website address | |
| keyword ( It is recommended to copy ) | MS-COCO |
| Data sets :PASCAL-VOCPASCAL VOC Challenge is a benchmark of visual object classification, recognition and detection , It provides standard image annotation data set and standard evaluation system for detection algorithm and learning performance , Include 20 A catalog . |
| Official website address | |
| keyword ( It is recommended to copy ) | PASCAL-VOC |
| Data sets :BDD100KAutonomous driving is commonly used in large and diverse data sets , Dimension exceeds 100,000 Zhang image , The category contains buses , Pedestrians , Bicycle , truck , The car , Trains and riders , For target detection 、 Full frame segmentation, etc . |
| Official website address | |
| keyword ( It is recommended to copy ) | BDD-100K |
| Data sets :Open-ImagesOpen Images Is a containing near 900 Ten thousand images URL Data set of . These images have been annotated with image level label borders for thousands of categories . |
| Official website address | |
| keyword ( It is recommended to copy ) | Open-Images |
3. Face recognition dataset
| Data sets :CASIA-WebFaceContains 10575 Individual 494414 Zhang image .CASIA-webface database , It contains 10000 personal , altogether 50 Ten thousand face pictures , All from the network . |
| Official website address | |
| keyword ( It is recommended to copy ) | CASIA-WebFace |
| Data sets :PubFigColumbia University's public face data set , contains 200 Individual 58k+ Face image , It is mainly used for face recognition in unrestricted scenes . |
| Official website address | |
| keyword ( It is recommended to copy ) | PubFig |
| Data sets :CelebAThe large face recognition data set published by Professor Tang Xiaoou's Laboratory of the Chinese University of Hong Kong . The dataset contains 200K Face pictures , Face attributes include 40 Varied , It is mainly used for face attribute recognition . |
| Official website address | |
| keyword ( It is recommended to copy ) | CelebA |
| Data sets :ColorFeretIt includes a general face database and a general test standard , It already contains 1000 Many people 10000 Multiple photos , Everyone includes different expressions , light , Photos of posture and age . |
| Official website address | https://www.nist.gov/itl/iad/image-group/color-feret-database |
| keyword ( It is recommended to copy ) | ColorFeret |
| Data sets :MTFLThe dataset contains nearly 13000 Face pictures , All from the network . |
| Official website address | |
| keyword ( It is recommended to copy ) | MTFL |
| Data sets :FaceDBContains 1521 The amplitude resolution is 384x286 Pixel gray image . Each image comes from 23 A different tester's frontal angle of the face . |
| Official website address | |
| keyword ( It is recommended to copy ) | FaceDB |
| Data sets :LFWIn order to study the problem of face recognition in unrestricted environment . This dataset contains more than 13,000 Face images , All collected in Internet. In order to study the problem of face recognition in unrestricted environment . This dataset contains more than 13,000 Face images , All collected in Internet. |
| Official website address | |
| keyword ( It is recommended to copy ) | LFW |
| Data sets :Person identification in TV seriesThe face photos selected in this dataset are from two well-known TV dramas ,《 Vampire Hunter Buffy 》 and 《 The big bang theory 》. |
| Official website address | |
| keyword ( It is recommended to copy ) | PITVS |
| Data sets :CMUVASC-PIECMU PIE The face database is built on 2000 year 11 month , It includes information from 68 Individual 40000 A picture , It includes everyone's 13 Kind of posture conditions ,43 Light conditions and 4 A picture with an expression . |
| Official website address | |
| keyword ( It is recommended to copy ) | CMUVASC-PIE |
| Data sets :CASIA-FaceV5The dataset contains data from 500 Individual 2500 Zhang Yaya's face . |
| Official website address | |
| keyword ( It is recommended to copy ) | CASIA-FaceV5 |
| Data sets :The CNBC Face DatabaseThis data set collects 200 Individuals in different states ( Different looks , Dress up , Hair style, etc ) Face photos of . |
| Official website address | |
| keyword ( It is recommended to copy ) | Face-Place |
| Data sets :IMDB-WIKIIMDB-WIKI The face database totals 523,051 Zhang face database , Each picture is marked with the person's age and gender , It is of great significance to the study of age recognition and gender recognition . |
| Official website address | |
| keyword ( It is recommended to copy ) | IMDB-WIKI |
| Data sets :FDDBFDDB yes UMass Data set of , Used for human face detection (Face Detection). This data set is relatively large , It's challenging . |
| Official website address | |
| keyword ( It is recommended to copy ) | FDDB |
| Data sets :Caltech-10K-WebFaces10k+ Face , Provide coordinate positions of eyes and mouth |
| Official website address | http://www.vision.caltech.edu/Image_Datasets/Caltech_10K_WebFaces/#Description |
| keyword ( It is recommended to copy ) | Caltech-10K |
| Data sets :JAFFEThe database is created by 10 Japanese women made various expressions according to instructions in the experimental environment . The whole database has 213 Zhang image ,10 personal , It's all women , Everyone makes 7 Kind of expression . |
| Official website address | |
| keyword ( It is recommended to copy ) | JAFFE |
| Data sets :AFLWfront 2000 individual AFLW Fitting of samples 3D Noodles , Can be used for 3D Face the assessment . |
| Official website address | http://www.cbsr.ia.ac.cn/users/xiangyuzhu/projects/3DDFA/main.htm |
| keyword ( It is recommended to copy ) | AFLW |
By searching “AI Technology research and sharing ” Official account , And in The background replies to each dataset keyword You can get the network disk link . Data set files are downloaded from the official website , Only for learning communication , Bloggers will continue to update , Welcome to your attention .

Share more , Coming soon !
边栏推荐
- PCR validation of basic biological experiments in [life sciences]
- [keras] data of 3D u-net source code analysis py
- Find the number of permutations
- Port mapping between two computers on different LANs (anydesk)
- GMM Gaussian mixture model
- [reading point paper] yolo9000:better, faster, stronger, (yolov2), integrating various methods to improve the idea of map and wordtree data fusion
- Introduction to armv8/armv9 - learning this article is enough
- What are the differences in cache/tlb?
- SANs证书生成
- Leetcode 450. 删除二叉搜索树中的节点 [二叉搜索树]
猜你喜欢

For loop instead of while loop - for loop instead of while loop

Huffman tree and its application

Gadgets: color based video and image cutting

Queuing theory, game theory, analytic hierarchy process

Mbedtls migration experience
![Leetcode 450. 删除二叉搜索树中的节点 [二叉搜索树]](/img/39/d5c4d424a160635791c4645d6f2e10.png)
Leetcode 450. 删除二叉搜索树中的节点 [二叉搜索树]

Chapter7-13_ Dialogue State Tracking (as Question Answering)

A real-time target detection model Yolo

Hstack, vstack and dstack in numpy

Superficial understanding of conditional random fields
随机推荐
Bai ruikai Electronic sprint Scientific Innovation Board: proposed to raise 360 million Funds, Mr. And Mrs. Wang binhua as the main Shareholder
Superficial understanding of conditional random fields
[dest0g3 520 orientation] dest0g3, which has been a long time since we got WP_ heap
Leetcode 926. Flip string to monotonically increasing [prefix and]
SANs证书生成
Think about the possibility of attacking secure memory through mmu/tlb/cache
微信云开发粗糙理解
[common tools] pyautogui tutorial
柏瑞凱電子沖刺科創板:擬募資3.6億 汪斌華夫婦為大股東
Paipai loan parent company Xinye quarterly report diagram: revenue of RMB 2.4 billion, net profit of RMB 530million, a year-on-year decrease of 10%
Port mapping between two computers on different LANs (anydesk)
[pytorch] kaggle large image dataset data analysis + visualization
Chapter7-11_ Deep Learning for Question Answering (2/2)
Opencvshare4 and vs2019 configuration
A real-time target detection model Yolo
冲刺强基计划数学物理专题一
Welcome to blog navigation
How did you spend your winter vacation perfectly?
重定向设置参数-RedirectAttributes
[pytorch]fixmatch code explanation - data loading























