2D human pose estimation: DeepPose
2022-06-29 03:23:00 【light169】
【https://github.com/Naman-ntc/Pytorch-Human-Pose-Estimation】
Some early top-down deep learning methods used a neural network to directly predict the 2D coordinates of the human body keypoints.

DeepPose [1] is the classic representative of this kind of method. It uses a cascade of neural networks to predict the relative coordinates of each body keypoint. Each stage takes the coordinates output by the previous stage as input and predicts a more accurate position. Finally, the predicted normalized relative coordinates are converted into absolute coordinates.
DeepPose, proposed by Alexander Toshev and Christian Szegedy, was among the earliest works to apply a CNN (convolutional neural network) to human joint detection. It casts human pose estimation as a joint regression problem and shows how to apply a CNN to it: the whole image is fed into a 7-layer CNN that regresses the joint coordinates, and a cascade of CNN regressors then improves the localization accuracy.
On the LSP dataset, DeepPose reaches an average accuracy of 61% at PCP@0.5, which was state of the art at the time.
Regressing coordinates directly yields the keypoint positions without further post-processing, so prediction is usually fast. However, because human pose has many degrees of freedom, directly predicted coordinates are a hard target for a neural network to fit, which limits the achievable accuracy to some extent.
1、 First, to make training better behaved and more uniform, the authors want to describe the image with the person at its center, so they introduce a normalization of the joint coordinates.


The normalized joint coordinates (relative coordinates) are then given by:
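For reference, the normalization in the DeepPose paper can be written as below. The symbols follow the paper rather than this article: $y_i$ is the absolute position of joint $i$, and $b = (b_c, b_w, b_h)$ is a bounding box around the person, given by its center, width, and height.

```latex
N(y_i; b) =
\begin{pmatrix}
  1/b_w & 0 \\
  0 & 1/b_h
\end{pmatrix}
(y_i - b_c)
```

In words: translate each joint by the box center, then scale by the box size.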

The absolute position in the image of the joint coordinates predicted by the network is:
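The two conversions can be sketched in a few lines of Python. This is a minimal illustration, assuming the person box is given as (center x, center y, width, height); the names `normalize` and `denormalize` are chosen here for illustration and do not come from the paper or the linked repository. Going back to absolute coordinates simply inverts the normalization: multiply by the box size and add the box center.

```python
from typing import List, Tuple

Point = Tuple[float, float]
# Assumed box layout: (center_x, center_y, width, height)
Box = Tuple[float, float, float, float]


def normalize(joints: List[Point], box: Box) -> List[Point]:
    """Map absolute joint coordinates to coordinates relative to the box."""
    cx, cy, w, h = box
    return [((x - cx) / w, (y - cy) / h) for x, y in joints]


def denormalize(joints: List[Point], box: Box) -> List[Point]:
    """Map box-relative coordinates back to absolute image coordinates."""
    cx, cy, w, h = box
    return [(cx + u * w, cy + v * h) for u, v in joints]


box = (100.0, 200.0, 50.0, 100.0)
absolute = [(125.0, 250.0), (100.0, 200.0)]
relative = normalize(absolute, box)
restored = denormalize(relative, box)
```

With this convention a joint at the box center maps to (0, 0), and joints inside the box fall within [-0.5, 0.5] on each axis.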

However, I found that later pose estimation baseline papers do not seem to use this normalization. My guess is that the Batch Normalization paper (2015) had not yet been published, so DeepPose (2014) designed this normalization around the characteristics of pose estimation. In practice its effect may be similar to that of Batch Normalization, so if a later network already uses BN, this pose-specific normalization may no longer be needed.
2、 Basic network
This part is not complicated: it adopts AlexNet as a whole and changes only the output. The original output, a one-dimensional vector whose length is the number of classes, becomes a one-dimensional vector whose length is twice the number of joints. The length is doubled because each joint has two coordinates to predict, horizontal and vertical.
It is worth noting that coordinate prediction is a highly nonlinear regression task, and it does not make good use of the spatial information in the image that convolutional networks capture. So after the 2014 paper Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation proposed the heatmap method, mainstream networks have rarely predicted coordinates directly for pose estimation. The specific advantages and disadvantages of coordinate prediction and heatmaps will be covered in later articles.
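As a small illustration of the output layout, the sketch below splits a flat output vector of length 2k into k (x, y) pairs. The interleaved ordering (x1, y1, x2, y2, ...) and the function names are assumptions made here; the source does not state how the coordinates are ordered.

```python
from typing import List, Sequence, Tuple


def output_length(num_joints: int) -> int:
    """Each joint needs a horizontal and a vertical coordinate."""
    return 2 * num_joints


def split_output(vector: Sequence[float], num_joints: int) -> List[Tuple[float, float]]:
    """Reshape a flat network output into one (x, y) pair per joint.

    Assumes the interleaved layout x1, y1, x2, y2, ...
    """
    if len(vector) != output_length(num_joints):
        raise ValueError("expected a vector of length 2 * num_joints")
    return [(vector[2 * i], vector[2 * i + 1]) for i in range(num_joints)]


# Example with 3 joints: the network would emit 6 numbers.
raw_output = [0.1, -0.2, 0.0, 0.3, -0.4, 0.25]
joints = split_output(raw_output, num_joints=3)
```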
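To make the contrast concrete, here is a minimal sketch of the heatmap representation: a joint is encoded as a 2D map that peaks at the joint location and is decoded by taking the position of the maximum. The Gaussian shape, the `sigma` value, and the function names are assumptions for illustration and are not taken from the cited paper.

```python
import math
from typing import List, Tuple


def encode_heatmap(joint: Tuple[int, int], width: int, height: int,
                   sigma: float = 1.5) -> List[List[float]]:
    """Build a height x width map with a Gaussian bump centered on the joint."""
    jx, jy = joint
    return [
        [math.exp(-((x - jx) ** 2 + (y - jy) ** 2) / (2 * sigma ** 2))
         for x in range(width)]
        for y in range(height)
    ]


def decode_heatmap(heatmap: List[List[float]]) -> Tuple[int, int]:
    """Return the (x, y) position of the largest value in the map."""
    best_pos, best_val = (0, 0), float("-inf")
    for y, row in enumerate(heatmap):
        for x, val in enumerate(row):
            if val > best_val:
                best_pos, best_val = (x, y), val
    return best_pos


heatmap = encode_heatmap((5, 3), width=8, height=6)
decoded = decode_heatmap(heatmap)
```

Here the regression target keeps the 2D layout of the image, whereas direct coordinate regression collapses it into a handful of numbers.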

Network structure in the paper; the backbone is AlexNet.
3、 Advanced network
To observe local regions better, the authors also train additional AlexNets at a smaller scale for each joint. In effect, this step extracts a smaller image crop around each joint and then reapplies the normalization (module 1) and the DNN (module 2) to these local crops. The rationale is that zooming in on the target location helps improve accuracy.
More important than the implementation details, though, is the idea behind this module: repeatedly re-estimating the joint positions over several stages helps improve accuracy. This combination of local and global information was adopted by many later papers (Stacked Hourglass, CPM, ...) and has proved to be an effective idea.
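The staged refinement can be sketched as a simple loop: each stage looks at a box centered on the current estimate, predicts a displacement in that box's normalized coordinates, and the estimate is shifted accordingly. This is a simplified sketch under stated assumptions: the box size is fixed here, and each stage is a hypothetical callable standing in for a trained network. The toy stage below "knows" the true position and only serves to make the example runnable.

```python
from typing import Callable, Sequence, Tuple

Point = Tuple[float, float]
# Assumed box layout: (center_x, center_y, width, height)
Box = Tuple[float, float, float, float]
# Hypothetical stage interface: given a crop box, return a displacement
# expressed in that box's normalized coordinates.
StageFn = Callable[[Box], Point]


def refine_joint(initial: Point, stages: Sequence[StageFn], box_size: float) -> Point:
    """Refine one joint estimate by applying the stages in order."""
    x, y = initial
    for stage in stages:
        box = (x, y, box_size, box_size)   # crop centered on the current estimate
        dx, dy = stage(box)                # normalized displacement within the crop
        x, y = x + dx * box_size, y + dy * box_size  # back to absolute coordinates
    return (x, y)


def make_toy_stage(target: Point) -> StageFn:
    """Stand-in for a trained network: points toward the target, but can
    only move within the crop, so offsets are clipped to [-0.5, 0.5]."""
    def stage(box: Box) -> Point:
        cx, cy, w, h = box
        clip = lambda v: max(-0.5, min(0.5, v))
        return (clip((target[0] - cx) / w), clip((target[1] - cy) / h))
    return stage


target = (30.0, 10.0)
stages = [make_toy_stage(target), make_toy_stage(target)]
one_stage = refine_joint((0.0, 0.0), stages[:1], box_size=40.0)
two_stages = refine_joint((0.0, 0.0), stages, box_size=40.0)
```

In this toy setup a single stage cannot reach the target because it lies outside the first crop, while the second stage, centered on the improved estimate, does.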
See :
Human pose estimation: DeepPose (detailed explanation)
DeepPose comprehensive analysis (principles + code + Colab) - Zhihu