【CVPR 2021】DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort
2022-06-26 09:24:00 【_ Summer tree】

Quick summary
DatasetGAN:
An automatic procedure for generating massive datasets of high-quality semantically segmented images with minimal human effort. Only a few labeled examples are needed to train a decoder that generalizes to the rest of the latent space, which yields an infinite annotated dataset generator. The generated datasets can then be used to train any computer vision architecture.
Annotation cost is the bottleneck on dataset scale.
Our goal is to synthesize large, high-quality labeled datasets from only a handful of labeled examples.
In our work, we show that the current state-of-the-art image generation models learn very powerful latent representations that can be leveraged for complex pixel-level tasks.
We introduce DatasetGAN, which generates massive datasets of high-quality semantically segmented images with minimal human effort.
Our approach rests on the observation that GANs trained to synthesize images must acquire rich semantic knowledge in order to render diverse and realistic examples of objects.
Our key insight is that training a successful decoder requires only a small number of labeled images, which yields an infinite annotated dataset generator.
Because only a few examples need to be labeled, we annotate the images in great detail and generate datasets with rich object and part segmentations.
We generate datasets for 7 image segmentation tasks, including pixel-level labels for 34 human face parts and 32 car parts. Our approach significantly outperforms all semi-supervised baselines and is on par with fully supervised methods, even though in some cases it uses two orders of magnitude less annotated data.
We also show animated 3D reconstructions of objects, for which we use our method to generate detailed part labels.

Figure 1: DatasetGAN synthesizes image-annotation pairs and can produce large, high-quality datasets with detailed pixel-level labels. The figure shows the 4 steps. (1, 2) Leverage StyleGAN and annotate only a handful of synthesized images, then train a highly efficient branch to generate labels. (3) Automatically generate a huge synthetic dataset of annotated images. (4) Train your favorite approach on the synthetic dataset and test it on real images.
Figure 2: Overall architecture of DatasetGAN. We upsample the feature maps from StyleGAN to the highest resolution to construct pixel-wise feature vectors for all pixels of the synthesized image. An ensemble of MLP classifiers is then trained to interpret the semantic knowledge in each pixel's feature vector into its part label.
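The feature construction in Figure 2 can be sketched in plain Python. This is only an illustration under stated assumptions: feature maps are nested lists indexed as `fmap[channel][y][x]`, nearest-neighbor upsampling stands in for whatever interpolation the real pipeline uses, and the ensemble is combined by majority vote. The names `upsample_nearest`, `pixel_features`, and `ensemble_predict` are hypothetical and do not come from the DatasetGAN code.

```python
from collections import Counter


def upsample_nearest(fmap, out_h, out_w):
    """Nearest-neighbor upsample of one feature map stored as fmap[c][y][x]."""
    in_h, in_w = len(fmap[0]), len(fmap[0][0])
    return [
        [[chan[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
         for y in range(out_h)]
        for chan in fmap
    ]


def pixel_features(feature_maps, out_h, out_w):
    """Upsample every map to (out_h, out_w) and concatenate along channels.

    Returns feats[y][x], one flat feature vector per output pixel.
    """
    upsampled = [upsample_nearest(f, out_h, out_w) for f in feature_maps]
    return [
        [[chan[y][x] for fmap in upsampled for chan in fmap]
         for x in range(out_w)]
        for y in range(out_h)
    ]


def ensemble_predict(classifiers, feature_vector):
    """Majority vote over per-pixel classifiers (an assumed combination rule)."""
    votes = Counter(clf(feature_vector) for clf in classifiers)
    return votes.most_common(1)[0][0]


# Toy usage: one coarse 2x2 map (1 channel) and one 4x4 map (2 channels).
coarse = [[[1.0, 2.0], [3.0, 4.0]]]
fine = [[[0.5] * 4 for _ in range(4)], [[-0.5] * 4 for _ in range(4)]]
feats = pixel_features([coarse, fine], 4, 4)

# Three stand-in "classifiers"; real ones would be trained MLPs.
toy_classifiers = [lambda v: 0, lambda v: 1, lambda v: 1]
label = ensemble_predict(toy_classifiers, feats[0][0])
```

Each pixel ends up with one vector whose length is the total channel count across all layers, which is why a small per-pixel classifier is enough to read off a part label.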
Figure 3: Small human-annotated face and car datasets. Most datasets for semantic segmentation (MS-COCO [33], ADE [56], Cityscapes [11]) are too large for a user to inspect every training image. This figure shows all labeled training examples for face (a-c) and car (d-f) segmentation. a) shows an example of a segmentation mask and the associated labels, b) shows the full set of training images (GAN samples), and c) shows the list of annotated parts and the number of instances in the dataset. As a fun fact, note that a single image contains more labels than there are images in the dataset.

Figure 4: Examples of synthesized images and labels from DatasetGAN for faces and cars. The StyleGAN backbone was trained on CelebA-HQ (faces) at 1024×1024 resolution and on LSUN CAR (cars) at 512×384 resolution. DatasetGAN was trained on 16 annotated examples. // Which labels do these annotations use?

Figure 5: Examples of synthesized images and labels from DatasetGAN for birds, cats, and bedrooms. StyleGAN was trained on NABirds (1024×1024 images), LSUN CAT (256×256 images), and LSUN Bedroom (256×256 images). DatasetGAN was trained on 30 annotated bird examples, 30 cat examples, and 40 bedroom examples.

Figure 6: Number of training examples vs. mIoU. We compare against baselines on the ADE-Car-12 test set. The red dashed line indicates the fully supervised method, which uses 2.6k training examples from ADE20k. // What is mIoU? Mean intersection over union: for each class, the overlap between predicted and ground-truth pixels divided by their union, averaged over classes.
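To make the metric in Figure 6 concrete, here is a minimal sketch of mIoU over flattened label maps. The function name `mean_iou` is hypothetical, and the choice to skip classes that appear in neither the prediction nor the ground truth is an assumption; evaluation scripts differ on that detail.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection over union for two equal-length lists of class ids."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # assumption: ignore classes absent from both maps
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Four pixels, two classes: class 0 has IoU 1/2, class 1 has IoU 2/3.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

Here `score` is the average of 1/2 and 2/3, which is 7/12.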

Method
The key insight of DatasetGAN is that generative models such as GANs that are trained to synthesize highly realistic images must acquire semantic knowledge in their high dimensional latent space.
DatasetGAN aims to utilize these powerful properties of image GANs. Intuitively, if a human provides a labeling corresponding to one latent code, we expect to be able to effectively propagate this labeling across the GAN's latent space.
Specifically, we synthesize a small number of images by utilizing a GAN architecture, StyleGAN in our paper, and record their corresponding latent feature maps.
By sampling latent codes z and passing each through the entire architecture, we have an infinite dataset generator!
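The sampling loop described above maps naturally onto a Python generator. The sketch below is only a toy under stated assumptions: `toy_generator` and `toy_label_branch` are hypothetical stand-ins for a trained StyleGAN and the trained label branch, and latent codes are drawn from a standard normal distribution.

```python
import random
from itertools import islice


def dataset_generator(generator, label_branch, latent_dim, seed=0):
    """Yield (image, labels) pairs forever by sampling latent codes z."""
    rng = random.Random(seed)
    while True:
        z = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
        image, features = generator(z)       # synthesize image + feature maps
        yield image, label_branch(features)  # decode features into labels


def toy_generator(z):
    # Hypothetical stand-in for a GAN: the "image" and "features" derive from z.
    return [abs(v) for v in z], z


def toy_label_branch(features):
    # Hypothetical stand-in for the label branch: one binary label per "pixel".
    return [1 if v > 0 else 0 for v in features]


# Take as many annotated samples as needed; the stream never ends on its own.
samples = list(islice(dataset_generator(toy_generator, toy_label_branch, 4), 3))
```

Because the generator is lazy, the size of the synthetic dataset is decided by the consumer, not by the labeling budget.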
This video explanation is quite good: https://www.bilibili.com/video/av502581865/