A Light CNN for Deep Face Representation with Noisy Labels

Last update: Nov 05, 2022

Related tags

Overview

A Light CNN for Deep Face Representation with Noisy Labels

Citation

If you use our models, please cite the following paper:

@article{wulight,
  title={A Light CNN for Deep Face Representation with Noisy Labels},
  author={Wu, Xiang and He, Ran and Sun, Zhenan and Tan, Tieniu}
  journal={arXiv preprint arXiv:1511.02683},
  year={2015}
}
@article{wu2015lightened,
  title={A Lightened CNN for Deep Face Representation},
  author={Wu, Xiang and He, Ran and Sun, Zhenan},
  journal={arXiv preprint arXiv:1511.02683},
  year={2015}
}
@article{wu2015learning,
  title={Learning Robust Deep Face Representation},
  author={Wu, Xiang},
  journal={arXiv preprint arXiv:1507.04844},
  year={2015}
}

Updates

Dec 16, 2016
- The MS-Celeb-1M clean list is uploaded: Baidu Yun, Google Drive.
Nov 08, 2016
- The prototxt and model C based on caffe-rc3 is updated. The accuracy on LFW achieves 98.80% and the [email protected]=0 obtains 94.97%.
- The performance of set 1 on MegaFace achieves 65.532% for rank-1 accuracy and 75.854% for [email protected]=10^-6.
Nov 26, 2015
- The prototxt and model B is updated and the accuracy on LFW achieves 98.13% for a single net without training on LFW.
Aug 13, 2015
- Evaluation of LFW for identification protocols is published.
Jun 11, 2015
- The prototxt and model A is released. The accuracy on LFW achieves 97.77%.

Overview

The Deep Face Representation Experiment is based on Convolution Neural Network to learn a robust feature for face verification task. The popular deep learning framework caffe is used for training on face datasets such as CASIA-WebFace, VGG-Face and MS-Celeb-1M. And the feature extraction is realized by python code caffe_ftr.py.

Structure

Code
- data pre-processing and evaluation code
Model
- caffemodel.
  - The model A and B is trained on CASIA-WebFace by caffe-rc.
  - The model C is trained on MS-Celeb-1M by caffe-rc3.
Proto
- Lightened CNN implementations by caffe
Results
- LFW features

Description

Data Pre-processing

Download face dataset such as CASIA-WebFace, VGG-Face and MS-Celeb-1M.
All face images are converted to gray-scale images and normalized to 144x144 according to landmarks.
According to the 5 facial points, we not only rotate two eye points horizontally but also set the distance between the midpoint of eyes and the midpoint of mouth(ec_mc_y), and the y axis of midpoint of eyes(ec_y) .

Dataset	size	ec_mc_y	ec_y
Training set	144x144	48	48
Testing set	128x128	48	40

Training

The model is trained by open source deep learning framework caffe.
The network configuration is showed in "proto" file and the trained model is showed in "model" file.

Evaluation

The model is evaluated on LFW which is a popular data set for face verification task.
The extracted features and lfw testing pairs are located in "results" file.
To evaluate the model, the matlab code or other ROC evaluation code can be used.
The model is also evaluated on MegaFace. The dataset and evaluation code can be downloaded from http://megaface.cs.washington.edu/

Results

The single convolution net testing is evaluated on unsupervised setting only computing cosine similarity for lfw pairs.

Model	100% - EER	[email protected]=1%	[email protected]=0.1%	[email protected]=0	Rank-1	[email protected]=1%
A	97.77%	94.80%	84.37%	43.17%	84.79%	63.09%
B	98.13%	96.73%	87.13%	64.33%	89.21%	69.46%
C	98.80%	98.60%	96.77%	94.97%	93.80%	84.40%

The details are published as a technical report on arXiv.

The released models are only allowed for non-commercial use.

A Light CNN for Deep Face Representation with Noisy Labels

Related tags

Overview

A Light CNN for Deep Face Representation with Noisy Labels

Citation

Updates

Overview

Structure

Description

Owner

Alfred Xiang Wu

This repository allows the user to automatically scale a 3D model/mesh/point cloud on Agisoft Metashape

Pseudo-Visual Speech Denoising

Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU

2021 Artificial Intelligence Diabetes Datathon

HODEmu, is both an executable and a python library that is based on Ragagnin 2021 in prep.

《LXMERT: Learning Cross-Modality Encoder Representations from Transformers》(EMNLP 2020)

Improving Machine Translation Systems via Isotopic Replacement

A GPT, made only of MLPs, in Jax

Codes for NeurIPS 2021 paper "On the Equivalence between Neural Network and Support Vector Machine".

AI Virtual Calculator: This is a simple virtual calculator based on Artificial intelligence.

Self-training for Few-shot Transfer Across Extreme Task Differences

This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.

AutoVideo: An Automated Video Action Recognition System

Cleaned up code for DSTC 10: SIMMC 2.0 track: subtask 2: multimodal coreference resolution

Source code of the paper PatchGraph: In-hand tactile tracking with learned surface normals.

Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)

Code for models used in Bashiri et al., "A Flow-based latent state generative model of neural population responses to natural images".

Event-forecasting - Event Forecasting Algorithms With Python

This is a work in progress reimplementation of Instant Neural Graphics Primitives

YOLOv7 - Framework Beyond Detection