当前位置:网站首页>Video human behavior detection
Video human behavior detection
2022-07-27 22:44:00 【Nonzero factor】
theory
Video detection usually deals with ideas
| Ideas | Application scenarios | principle | shortcoming |
|---|---|---|---|
| Video behavior detection | Behavior recognition | adopt 3D Convolution kernel implementation , Combining spatial semantic information and motion information to recognize human behavior | ? |
| Body posture recognition | Used to detect whether a person falls or gets sick , Or for fitness 、 Automatic teaching of sports and dance | Based on key point detection , Combine time sequence information to judge human behavior | Distortion occurs when the human body is covered , The robustness is not high in complex scenes |
| Abnormal behavior detection | Abnormal behavior detection technology in video scene is the core technology of intelligent monitoring system | Predict the next frame based on the previous normal behavior , The predicted next frame is compared with the original video , Judge whether it is abnormal behavior |
Overview of video behavior detection
- Behavior recognition : Recognize actions in short videos ——what
- Sequential action detection : Locate the time period when a specific action appears in a long video ——what+when
- Spatiotemporal motion detection : Identify and locate people and actions in the video ——what+when+where
Pain points of video detection :
- How to represent the actions in the video
- How to efficiently process a large amount of video data
- How to reduce the labeling cost of video data

slowfast–2019ICCV
The author's creative ideas come from
SlowFast yes Facebook stay 2019 year ICCV A video recognition paper , Inspired by the types of retinal nerve cells in primates ( about 80% The cells (P-cells) Operate at low frequency , Can identify details ; And about 20% The cells (M-cells) It operates at high frequency , Sensitive to changes in time ). author A fast and slow network is proposed SlowFast framework , To realize the processing and analysis of time and space dimensions by two branches .
structure
SlowFast The whole algorithm consists of two convolution branches :
- Slow Branch : Fewer frames and larger channels learn spatial semantic information .
- Fast Branch : A larger number of frames and a smaller number of channels learn motion information
Data set features
.avi file You need to collect an image every few frames .jpg file
The length of the video is inconsistent
Model effect

mmaction2 Instructions
ubantu18.04
python == 3.8
torch == 1.8.0
cuda == 11.11
1、 download Data sets and Open source framework mmaction2
from github download mmaction2-init-master github:https://github.com/open-mmlab/mmaction2
download UCF101( Behavior recognition ) In dataset Data Set、Action Recognition https://www.crcv.ucf.edu/research/data-sets/ucf101/
UCF101 It's from YouTube Collected with 101 Action recognition of realistic action video of operation categories
2、 Install the open source framework
cd mkdir mmaction2
cd mmaction2
unzip mmaction2-master.zip
cd mmaction2-master
pip install -r requirements/build.txt
python setup.py develop
pip install decord -i https://pypi.douban.com/simple
https://github.com/jfzhang95/pytorch-video-recognition
Body posture recognition
Based on graph convolution

Video abnormal behavior detection
- Unsupervised learning : Predict the next frame based on the previous normal behavior , The predicted next frame is compared with the original video , Judge whether it is abnormal behavior
- Supervised learning : Expand exception samples
Application scenarios
Monitor security violence detection
边栏推荐
- An2021软件安装及基本操作(新建文件/导出)
- Relationship between DBM and VPP and Vpeak
- 视频直播源码,uni-app实现广告滚动条
- Live video source code, uni app to achieve advertising scroll bar
- 已有6名员工确诊!三星第三度关闭龟尾手机工厂!
- 2021年福建省职业院校技能大赛(中职组)网络安全竞赛任务书
- 阿里资深软件测试工程师推荐测试人员必学——安全测试入门介绍
- Vocational school Panyun network security competition ----- exploration of hidden information
- 舌簧继电器
- 技术生涯10年,那些让我心动的技术书
猜你喜欢

The purpose of DDD to divide domains, sub domains, core domains, and support domains

时间继电器

Drawing three coordinate (axis) diagram with MATLAB

Are Transformers Effective for Time Series Forecasting?| Pit filling

SQL injection less29 (parameter pollution bypasses WAF)

8000 word explanation of OBSA principle and application practice
![[illustration] shake hands three times and wave hands four times - it's enough to read this article carefully](/img/b1/af520cec44e849e8828a86fc7a2614.png)
[illustration] shake hands three times and wave hands four times - it's enough to read this article carefully

Matlab draws the statistical rose chart of wind speed and direction

Direct insertion sort of seven sorts

Iptables learning
随机推荐
Vs2019 release mode debugging: this expression has side effects and will not be evaluated.
EC code introduction
SQL injection less29 (parameter pollution bypasses WAF)
An2021软件安装及基本操作(新建文件/导出)
投资22亿美金!格科微12英寸CIS制造项目落户上海临港
Deepfake's face is hard to distinguish between true and false, and musk Fenke has disguised successfully
SSM integration process
High frequency relay
联发科携手三星推出全球首款支持Wi-Fi 6的8K电视
摩托罗拉诉海能达案一审结果出炉:海能达被判赔53亿元
直播软件app开发,uniapp scroll-view隐藏滚动条
【二叉树】统计二叉树中好节点的数目
刚培训完的中级测试工程师如何快速度过试用期
软件测试的就业前景到底怎么样?
蓝牙框架总结
技术生涯10年,那些让我心动的技术书
饿了么input输入框设置type=‘number‘时,去掉后面的上下按钮
[SQL] SQL optimization
Mimx8md6cvahzab i.MX 8mdual cortex-a53 - Microprocessor
Analysis on data collection and analysis of network security competition in national vocational college skill competition