当前位置:网站首页>人脸检测:MTCNN
人脸检测:MTCNN
2022-06-12 22:47:00 【u013250861】
论文:Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
下载:https://arxiv.org/abs/1604.02878
代码:https://github.com/kpzhang93/MT
人脸检测是计算机视觉中的一个问题,即在照片中定位一张或多张人脸。
在照片中定位人脸是指在图像中找到人脸的坐标,并通过人脸周围的边界框来划分人脸的范围。
人脸是动态的,其外观具有高度的可变性,使得人脸检测成为计算机视觉中的一个难题。如检测人脸,受其其朝向或角度、光线水平、服装、配饰、头发颜色、面部毛发、化妆、年龄等等影响。
人脸检测是人脸识别系统中必不可少的第一步,其目的是在背景中定位和提取人脸区域。
有两种主要的人脸识别方法:
- 基于特征的方法,使用手工过滤器来搜索和检测人脸。可以使用OpenCV库级联分类器(cascade classifiers)进行。
- 基于图像的方法,从整体上学习如何从整个图像中提取人脸。可以通过MTCNN库使用多任务级联CNN来实现。
MTCNN是一个深度级联多任务框架。该框架用来解决由于各种姿势、照明和遮挡,在不受约束的环境中进行人脸检测和对齐的问题。
该框架利用检测和对齐之间的内在相关性来提高它们的性能。框架利用级联架构和精心设计的深度卷积网络的三个阶段以粗到细的方式预测人脸和landmark位置。此外,提出了一种新的online hard sample mining策略,进一步提高了实践中的性能。
注:人脸对齐 face alignment:是许多人脸应用中必要的前处理,目的是减少输入影像的旋转(Rotation)、平移(Translation)、缩放(Scale)变化而造成的精度损失。
边栏推荐
- Global and Chinese Melamine Industry Development Research and prospect trend report 2022-2028
- USB mechanical keyboard changed to Bluetooth Keyboard
- Go时间格式化 赋值
- Is it safe to open an account with new bonds? How should novices operate?
- Analysis report on investment and development trend of gap base of Chinese traditional medicine 2022 ~ 2028
- C#读取word中表格数据
- Introduction to Quaternion
- MySQL case when then function use
- List of open source alternative projects of world famous Cloud Service SaaS companies
- Zhengzhou University of light industry -- development and sharing of harmonyos pet health system
猜你喜欢

Flutter series part: detailed explanation of GridView layout commonly used in flutter

【建议收藏】通俗易懂图解网络知识-第一篇

Qt Quick 3D学习:鼠标拾取物体
![[web technology] 1348- talk about several ways to implement watermarking](/img/5f/c4f6ba6799202c79d1e9cb7a083952.png)
[web technology] 1348- talk about several ways to implement watermarking

设计消息队列存储消息数据的 MySQL 表格

2022 heavyweight: growth law - skillfully use digital marketing to break through enterprise difficulties

Audio and video technology development weekly 𞓜 234

Hostvars in ansible

QT quick 3D learning: mouse picking up objects

The programmer dedicated to promoting VIM has left. Father of vim: I will dedicate version 9.0 to him
随机推荐
QT quick 3D learning: mouse picking up objects
Theory + practice will help you master the dynamic programming method
Create a virtual thread using loom - David
Report on the "fourteenth five year plan" and strategic strategy recommendations for China's intellectual property protection industry 2022 ~ 2028
【LeetCode】数组中第K大的元素
PHP删除二维数组中相同项的数据
【LeetCode】5. 最长回文子串
C language: how to give an alias to a global variable?
China's alternative sports equipment market trend report, technology dynamic innovation and market forecast
JVM foundation - > three ⾊ mark
Mysql concat_ws、concat函数使用
China embolic coil market trend report, technical innovation and market forecast
生成小程序菊花码(生成菊花码、更换中间logo、更改图片尺寸,加文字)
Design of traceid in the project
JVM Basics - > how GC determines that an object can be recycled
Coordinate transformation in pipelines
项目里面的traceID的设计
Research Report on water sports shoes industry - market status analysis and development prospect forecast
Qrcodejs2 QR code generation JS
【LeetCode】209. 长度最小的子数组