Face Detection: MTCNN
2022-06-12 22:47:00 【u013250861】
Paper: Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
Download: https://arxiv.org/abs/1604.02878
Code: https://github.com/kpzhang93/MT
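Face detection is a computer vision problem: locating one or more faces in a photograph.
Locating a face in a photograph means finding the coordinates of the face in the image and marking its extent with a bounding box around it.
Faces are dynamic and their appearance is highly variable, which makes face detection a hard problem in computer vision. Detection is affected by, among other things, the orientation or angle of the face, lighting level, clothing, accessories, hair color, facial hair, makeup, and age.
Face detection is the essential first step in a face recognition system; its purpose is to locate and extract the face region from the background.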
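There are two main approaches to face detection:
- Feature-based methods use hand-crafted filters to search for and detect faces. They can be run with the cascade classifiers in the OpenCV library.
- Image-based methods learn holistically how to extract faces from the whole image. They can be implemented with a multi-task cascaded CNN through the MTCNN library.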
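As a rough illustration of the two approaches, the sketch below wraps one call to OpenCV's Haar cascade classifier and one call to the third-party `mtcnn` Python package. It is a minimal sketch, not the article's own code: it assumes `opencv-python` and `mtcnn` are installed, the function names and the `scaleFactor`/`minNeighbors` values are illustrative choices, and `box_to_corners` is a hypothetical helper that turns an `[x, y, width, height]` box into corner coordinates.

```python
def box_to_corners(box):
    """Convert an [x, y, width, height] box into (x1, y1, x2, y2) corners."""
    x, y, w, h = (int(v) for v in box)
    return (x, y, x + w, y + h)


def detect_with_cascade(image_path):
    """Feature-based detection with an OpenCV Haar cascade (assumes opencv-python)."""
    import cv2  # imported lazily so the helper above works without OpenCV

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Frontal-face cascade file that ships with opencv-python.
    classifier = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    )
    # scaleFactor and minNeighbors are illustrative values, not tuned settings.
    boxes = classifier.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    return [box_to_corners(b) for b in boxes]


def detect_with_mtcnn(image_path):
    """Image-based detection with the `mtcnn` package (assumes it is installed)."""
    import cv2
    from mtcnn import MTCNN

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)  # the detector expects RGB input
    detector = MTCNN()
    # Each result is a dict with 'box', 'confidence', and 'keypoints' entries.
    return [
        (box_to_corners(r["box"]), r["confidence"], r["keypoints"])
        for r in detector.detect_faces(rgb)
    ]
```

Both functions return boxes as corner coordinates so the two detectors can be compared on the same image; the MTCNN variant additionally returns a confidence score and facial landmarks for each face.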
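MTCNN is a deep cascaded multi-task framework. It addresses face detection and alignment in unconstrained environments, where varied poses, illumination, and occlusion make both tasks difficult.
The framework exploits the inherent correlation between detection and alignment to improve the performance of both. It uses a cascaded architecture with three stages of carefully designed deep convolutional networks to predict face and landmark locations in a coarse-to-fine manner. In addition, it proposes a new online hard sample mining strategy that further improves performance in practice.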
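The idea behind online hard sample mining can be sketched in a few lines: within each mini-batch, rank the samples by their loss and let only the hardest ones contribute to the update. The code below is a framework-free sketch under stated assumptions. The function names are hypothetical, and the default `keep_ratio=0.7` reflects the paper's description of keeping the top 70% of samples by loss; treat it as a tunable assumption rather than a fixed part of the method.

```python
def select_hard_samples(losses, keep_ratio=0.7):
    """Return the indices of the samples with the largest losses.

    losses     -- per-sample loss values from one mini-batch
    keep_ratio -- fraction of the batch treated as "hard" (assumed default)
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    if not losses:
        return []
    keep = max(1, round(len(losses) * keep_ratio))
    # Rank sample indices by loss, largest first, and keep the hardest ones.
    ranked = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    return sorted(ranked[:keep])


def hard_sample_loss(losses, keep_ratio=0.7):
    """Average the loss over the hard samples only; easy samples are ignored."""
    hard = select_hard_samples(losses, keep_ratio)
    if not hard:
        return 0.0
    return sum(losses[i] for i in hard) / len(hard)
```

In a real training loop the selected indices would mask the per-sample losses before backpropagation, so that gradients come only from the hard samples.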
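Note: face alignment is a necessary preprocessing step in many face applications. Its purpose is to reduce the accuracy loss caused by rotation, translation, and scale variation in the input image.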
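One common way to remove rotation, translation, and scale differences is to fit a similarity transform that moves detected landmarks onto fixed target positions. The sketch below uses only the two eye landmarks and represents 2D points as complex numbers, so the transform is `z' = s * z + t`. This is an illustrative simplification, not the alignment procedure from the paper, and all function names and target coordinates are assumptions.

```python
import cmath
import math


def estimate_similarity(left_eye, right_eye, target_left, target_right):
    """Fit z' = s * z + t mapping the detected eyes onto the target eyes.

    The complex factor s encodes rotation and scale; t encodes translation.
    """
    src_l, src_r = complex(*left_eye), complex(*right_eye)
    dst_l, dst_r = complex(*target_left), complex(*target_right)
    if src_l == src_r:
        raise ValueError("eye landmarks must be distinct")
    s = (dst_r - dst_l) / (src_r - src_l)
    t = dst_l - s * src_l
    return s, t


def apply_similarity(s, t, point):
    """Apply the fitted transform to a single (x, y) point."""
    z = s * complex(*point) + t
    return (z.real, z.imag)


def scale_and_angle(s):
    """Split s into its scale factor and rotation angle in degrees."""
    return abs(s), math.degrees(cmath.phase(s))
```

For example, with eyes detected at `(10, 10)` and `(30, 10)` and target positions `(30, 40)` and `(70, 40)`, the fitted transform has scale 2 and rotation 0, and it maps the point midway between the eyes, `(20, 10)`, to `(50, 40)`. Applying the same transform to every pixel (for instance with an image-warping routine) yields the aligned face crop.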