Face detection: MTCNN
2022-06-12 23:01:00 【u013250861】
Paper: Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
Download: https://arxiv.org/abs/1604.02878
Code: https://github.com/kpzhang93/MT
Face detection is a computer vision problem: locating one or more faces in a photo.
Locating a face in a photo means finding the coordinates of the face in the image; the extent of the face is marked by a bounding box drawn around it.
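To make the bounding-box idea concrete, here is a minimal sketch in Python. The `Box` type and the `iou` helper are hypothetical names introduced for illustration, and the (x, y, width, height) layout is an assumption, although it is a common convention. Intersection over union (IoU) is not discussed above; it is included only as a common way to measure how well two boxes agree.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned bounding box: top-left corner plus size, in pixels."""
    x: int
    y: int
    width: int
    height: int

    @property
    def corners(self):
        """Return (x1, y1, x2, y2), where (x2, y2) is the bottom-right corner."""
        return (self.x, self.y, self.x + self.width, self.y + self.height)

    @property
    def area(self):
        return self.width * self.height


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes, a value in [0, 1]."""
    ax1, ay1, ax2, ay2 = a.corners
    bx1, by1, bx2, by2 = b.corners
    # Width and height of the overlap; zero when the boxes do not intersect.
    inter_w = max(0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a.area + b.area - inter
    return inter / union if union > 0 else 0.0
```

For example, `iou(Box(0, 0, 10, 10), Box(5, 5, 10, 10))` compares a detected box with a box shifted by half its size in each direction.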
Because faces are dynamic and their appearance varies widely, face detection is a hard problem in computer vision. Detection is affected by the face's orientation or angle, lighting, clothing, accessories, hair color, facial hair, makeup, age, and so on.
Face detection is an essential first step in a face recognition system; its purpose is to locate face regions and extract them from the background.
There are two main approaches to face detection:
- Feature-based approach: uses hand-crafted filters to search for and detect faces. This can be done with the cascade classifiers in the OpenCV library.
- Image-based approach: learns to extract faces from the entire image, treating it as a whole. This can be done with a multi-task cascaded CNN via the MTCNN library.
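The two approaches above can be sketched as follows. This is a rough illustration, not a reference implementation: it assumes the `opencv-python` and `mtcnn` Python packages are installed, and it assumes the Haar cascade file bundled with OpenCV (`haarcascade_frontalface_default.xml`). The function names, the `min_confidence` threshold, and the parameter values passed to `detectMultiScale` are illustrative choices. The third-party imports are placed inside the functions so that the pure helper at the end can be used without either package.

```python
def detect_faces_haar(image_path):
    """Feature-based detection with an OpenCV Haar cascade classifier."""
    import cv2  # assumption: the opencv-python package is installed

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    cascade = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    )
    # Each detection is (x, y, width, height) in pixels.
    faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    return [tuple(int(v) for v in face) for face in faces]


def detect_faces_mtcnn(image_path):
    """Image-based detection with the mtcnn package."""
    import cv2  # assumption: the opencv-python package is installed
    from mtcnn import MTCNN  # assumption: the mtcnn package is installed

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)  # the detector expects RGB
    # Each result is a dict with 'box', 'confidence' and 'keypoints' entries.
    return MTCNN().detect_faces(rgb)


def confident_boxes(results, min_confidence=0.9):
    """Keep the (x, y, width, height) boxes of sufficiently confident results."""
    return [tuple(r["box"]) for r in results if r["confidence"] >= min_confidence]
```

A typical call would be `confident_boxes(detect_faces_mtcnn("photo.jpg"))`, where `photo.jpg` stands for any local image file.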
MTCNN is a deep cascaded multi-task framework. It addresses face detection and alignment in unconstrained environments, where pose, lighting, and occlusion vary.
The framework exploits the inherent correlation between detection and alignment to improve the performance of both. It uses a cascaded architecture with three stages of carefully designed deep convolutional networks to predict face and landmark locations. In addition, a new online hard sample mining strategy further improves performance in practice.
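The online hard sample mining idea can be sketched in a few lines. In the paper, the losses of a mini-batch are sorted and only the hardest fraction (70% in the paper's setup) contributes to the gradient. The sketch below works on a plain list of per-sample loss values rather than on a real training loop; the function names and the `keep_ratio` parameter are hypothetical names chosen for illustration.

```python
def hard_sample_indices(losses, keep_ratio=0.7):
    """Return the indices of the hardest samples in a mini-batch.

    losses: per-sample loss values from the forward pass.
    keep_ratio: fraction of the batch kept as hard samples.
    """
    if not losses:
        return []
    keep = max(1, round(len(losses) * keep_ratio))
    # Rank sample indices by loss, largest (hardest) first.
    ranked = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    return sorted(ranked[:keep])


def hard_sample_loss(losses, keep_ratio=0.7):
    """Mean loss over the hard samples only; easy samples are ignored."""
    kept = hard_sample_indices(losses, keep_ratio)
    if not kept:
        return 0.0
    return sum(losses[i] for i in kept) / len(kept)
```

In a real training loop, the same selection would be applied to the per-sample loss tensor before averaging, so that easy samples contribute no gradient.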
Note: face alignment is a necessary preprocessing step in many face applications. Its purpose is to reduce the accuracy loss caused by variations in rotation, translation, and scale of the input image.
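As a rough illustration of what alignment has to estimate, the sketch below derives a rotation angle, a scale factor, and a center point from two eye landmarks. This is one simple approach under stated assumptions, not the method of the paper: `eye_alignment` and `target_distance` are hypothetical names, landmarks are assumed to be (x, y) pixel coordinates, and "left" means the eye that appears on the left side of the image.

```python
import math


def eye_alignment(left_eye, right_eye, target_distance=60.0):
    """Estimate the rotation, scale and center needed to normalize a face.

    Returns (angle_degrees, scale, center):
    - angle_degrees: tilt of the line through the eyes relative to the x axis
    - scale: factor that brings the eye distance to target_distance
    - center: midpoint between the eyes, usable as the rotation center
    """
    dx = right_eye[0] - left_eye[0]
    dy = right_eye[1] - left_eye[1]
    distance = math.hypot(dx, dy)
    if distance == 0:
        raise ValueError("eye landmarks coincide")
    angle = math.degrees(math.atan2(dy, dx))
    scale = target_distance / distance
    center = ((left_eye[0] + right_eye[0]) / 2, (left_eye[1] + right_eye[1]) / 2)
    return angle, scale, center
```

Rotating the image about `center` to cancel `angle`, resizing by `scale`, and then cropping around `center` addresses the rotation, scale, and translation variations in turn.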