当前位置:网站首页>人脸检测:MTCNN
人脸检测:MTCNN
2022-06-12 22:47:00 【u013250861】
论文:Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks
下载:https://arxiv.org/abs/1604.02878
代码:https://github.com/kpzhang93/MT
人脸检测是计算机视觉中的一个问题,即在照片中定位一张或多张人脸。
在照片中定位人脸是指在图像中找到人脸的坐标,并通过人脸周围的边界框来划分人脸的范围。
人脸是动态的,其外观具有高度的可变性,使得人脸检测成为计算机视觉中的一个难题。如检测人脸,受其其朝向或角度、光线水平、服装、配饰、头发颜色、面部毛发、化妆、年龄等等影响。
人脸检测是人脸识别系统中必不可少的第一步,其目的是在背景中定位和提取人脸区域。
有两种主要的人脸识别方法:
- 基于特征的方法,使用手工过滤器来搜索和检测人脸。可以使用OpenCV库级联分类器(cascade classifiers)进行。
- 基于图像的方法,从整体上学习如何从整个图像中提取人脸。可以通过MTCNN库使用多任务级联CNN来实现。
MTCNN是一个深度级联多任务框架。该框架用来解决由于各种姿势、照明和遮挡,在不受约束的环境中进行人脸检测和对齐的问题。
该框架利用检测和对齐之间的内在相关性来提高它们的性能。框架利用级联架构和精心设计的深度卷积网络的三个阶段以粗到细的方式预测人脸和landmark位置。此外,提出了一种新的online hard sample mining策略,进一步提高了实践中的性能。
注:人脸对齐 face alignment:是许多人脸应用中必要的前处理,目的是减少输入影像的旋转(Rotation)、平移(Translation)、缩放(Scale)变化而造成的精度损失。
边栏推荐
- The development trend of digital collections!
- [leetcode] sword finger offer II 020 Number of palindrome substrings
- China's new generation information technology industry "14th five year plan" special planning and innovation strategic direction report 2022 ~ 2028
- Zhengzhou University of light industry -- development and sharing of harmonyos pet health system
- Is there any risk in opening a securities account? How to open an account safely?
- 細數攻防演練中十大關鍵防守點
- The shutter library recommends sizer to help you easily create a responsive UI
- C language: how to give an alias to a global variable?
- The interface testing tool apipos3.0 is applicable to process testing and reference parameter variables
- [890. find and replace mode]
猜你喜欢

数据库每日一题---第10天:组合两个表

JVM Basics - > how GC determines that an object can be recycled

Su embedded training day13 - file IO

C#读取word中表格数据

Mysql concat_ WS, concat function use

数字藏品的发展趋势!

Modstartcms modular station building system v3.3.0 component function upgrade, event triggering enhancement

flutter系列之:flutter中常用的GridView layout详解

【Web技术】1348- 聊聊水印实现的几种方式

MySQL case when then function use
随机推荐
Use of map() function in JS
Afraid to write documents? AI plug-in for automatically generating code documents
China's elastic belt market trend report, technical dynamic innovation and market forecast
C # reading table data in word
反走样/抗锯齿技术
Research Report on market supply and demand and strategy of China's digital camera lens industry
基于51单片机的酒精检测仪
What you must know about cloud computing
四元数简介
Flutter series part: detailed explanation of GridView layout commonly used in flutter
flutter系列之:flutter中常用的GridView layout详解
Research Report on truffle fungus industry - market status analysis and development prospect forecast
lua 日期时间
China Aquatic Fitness equipment market trend report, technical innovation and market forecast
ImageView grayed, reflected, rounded, watermarked
Alcohol detector based on 51 single chip microcomputer
[leetcode] sword finger offer II 020 Number of palindrome substrings
(downloadable) Research Report on the development and utilization of government data (2021), a glimpse of the development of Government Office
LNMP platform docking redis service
Shardingsphere-proxy-5.0.0 deployment table implementation (I)