当前位置:网站首页>Face detection: mtcnn
Face detection: mtcnn
2022-06-12 23:01:00 【u013250861】
The paper :Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
download :https://arxiv.org/abs/1604.02878
Code :https://github.com/kpzhang93/MT
Face detection is a problem in computer vision , That is, locate one or more faces in the photo .
Locating a face in a photo means finding the coordinates of the face in the image , The range of the face is divided by the bounding box around the face .
The face is dynamic , Its appearance is highly variable , Face detection becomes a difficult problem in computer vision . Such as face detection , Subject to its orientation or angle 、 Light level 、 clothing 、 Accessories 、 Hair color 、 Facial hair 、 Make up 、 Age and so on .
Face detection is an essential first step in face recognition system , Its purpose is to locate and extract face regions in the background .
There are two main methods of face recognition :
- Feature-based approach , Use manual filters to search and detect faces . have access to OpenCV Library cascade classifiers (cascade classifiers) Conduct .
- Image based approach , Learn how to extract faces from the whole image as a whole . Can pass MTCNN Libraries use multitasking cascading CNN To achieve .
MTCNN Is a deep cascading multitasking framework . The framework is used to solve various postures 、 Lighting and shielding , Face detection and alignment in an unconstrained environment .
The framework takes advantage of the inherent correlation between detection and alignment to improve their performance . The framework uses cascaded architecture and three stages of a well-designed deep convolution network to predict human faces and landmark Location . Besides , A new online hard sample mining Strategy , Further improve the performance in practice .
notes : Face alignment face alignment: It is a necessary preprocessing in many face applications , The purpose is to reduce the rotation of the input image (Rotation)、 translation (Translation)、 The zoom (Scale) Loss of accuracy due to change .
Reference material :
Face detection MTCNN brief introduction
Face algorithm series :MTCNN Detailed explanation of face detection
MTCNN working principle
边栏推荐
- Wechat applet withdrawal function
- Common rendering pipeline grooming
- Anti aliasing / anti aliasing Technology
- 在同花顺开户安全么 ,证券开户怎么开户流程
- China Aquatic Fitness equipment market trend report, technical innovation and market forecast
- 常见渲染管线整理
- C # reading table data in word
- Research Report on market supply and demand and strategy of tizanidine industry in China
- Lua date time
- Embedded pipeline out of the box
猜你喜欢

度量学习(Metric Learning)【AMSoftmax、Arcface】

Inventory of CV neural network models from 2021 to 2022

管线中的坐标变换

【Web技术】1348- 聊聊水印实现的几种方式
![LeetCode 890 查找和替换模式[map] HERODING的LeetCode之路](/img/a2/186439a6d50339ca7f299a46633345.png)
LeetCode 890 查找和替换模式[map] HERODING的LeetCode之路

Use js to listen for Keydown event

Wechat applet withdrawal function

设计消息队列存储消息数据的 MySQL 表格

ShardingSphere-proxy-5.0.0部署之分表实现(一)

应用最广泛的动态路由协议:OSPF
随机推荐
Introduction to Quaternion
Module 8 operation
【建议收藏】通俗易懂图解网络知识-第一篇
Is it safe to open an account in flush? How to open an account online to buy stocks
The "fourteenth five year plan" development plan and panoramic strategic analysis report of China's information and innovation industry 2022 ~ 2028
Lua date time
反走样/抗锯齿技术
The carrying capacity of L2 level ADAS increased by more than 60% year-on-year in January, and domestic suppliers "emerged"
Design of traceid in the project
Analysis report on the 14th five year development plan and operation mode of China's hazardous waste treatment industry from 2022 to 2028
What you must know about cloud computing
80 lines of code to realize simple rxjs
四元数简介
证券开户有风险吗?怎么开户安全呢?
【LeetCode】33. Search rotation sort array
度量学习(Metric Learning)【AMSoftmax、Arcface】
The most widely used dynamic routing protocol: OSPF
【LeetCode】33. 搜索旋转排序数组
Database system composition
C language: how to give an alias to a global variable?