当前位置:网站首页>Principle of line of sight tracking and explanation of the paper
Principle of line of sight tracking and explanation of the paper
2022-07-28 08:57:00 【@BangBang】
L2CS-Net: Fine-Grained Gaze Estimation in Unconstrained Environments

at present , Line of sight tracking technology has the following application platforms :
- The computer : Mainly used for human-computer interaction —— Computer communication and text input ( More efficient than a mouse , And it is more suitable for the disabled )
- TV : Select and navigate menus and switch channels
- Head gear : Apply to user attention 、 Cognitive research 、 psychoanalysis ; Or is it VR Partial rendering of , If - - It can estimate the direction of people's line of sight through the built-in camera in the helmet , You can render the scene locally , That is, only the scene within the scope of human line of sight is finely rendered , Thus, the hardware cost is greatly reduced .
- Car equipment :
Check whether the driver is tired and focused. - Handheld devices : brightness 、 Volume adjustment and other human-computer interaction functions .

Reference blog :
Eye tracking is used in various applications, such as human-computer interaction and virtual reality . lately , Convolutional neural networks (CNN) The method has made significant progress in predicting the direction of line of sight . However , Outdoor line of sight tracking is still a challenging problem , Due to the unique eye appearance , Light conditions , And the diversity of head posture and gaze direction .
In this project , We propose a method based on cnn To predict the direction of gaze
We suggest returning to each gaze angle separately , To improve the prediction accuracy of each angle , This will improve the overall gazing ability . Besides , We use two identical losses , One for each angle , To improve and increase the generalization of e-learning . We evaluated our model using two popular data sets , These data sets are collected with unconstrained settings . Our proposed model realizes advanced 3.92◦ Accuracy and 10.41◦ Yes MPIIGaze and Gaze360 Data sets .
Originally, it was a multi task way , Insufficient accuracy , Various losses are combined , It is difficult to make all parties' training satisfactory ., Improved use of multiple loss estimates 3D Line of sight tracking , Use two parallel full connection layers to predict yaw horn and pitch horn , And independent losses are used for both angles . Each loss includes bin Classification and regression , Use softmax And cross entropy estimation gaze angle (L2+ Cross entropy ).
There are mainly two ways to realize line of sight tracking :1. The conventional and CNN based Method :
- routine : Use regression , Build a specific mapping relationship with line of sight estimation , such as adaptive linear regression and gaussian process regression
For the sight effect with little change , But the range of sight change is relatively large , The effect is relatively poor - CNN: CNN Build a nonlinear mapping relationship between line of sight and image
Loss function


Most of them use L2 Loss of estimated line of sight direction yaw and pitch horn , We're dealing with two gaze Two independent loss functions are proposed from the angle of , Each loss function includes Cross entropy loss and Mean square error loss , According to the estimate softmax classification bin Probability , To calculate gaze bin The expectation of , Using this method, fine-grained optimization . Then use with the real ground truth The mean square error of improves the prediction accuracy of the output .
Network architecture

According to the proposed classification and regression loss , We built a simple network (L2CS-Net), Will recognize the face image feed To resnet50 backbone in , Preliminary extraction of network features . Compared with the previous return in a network gaze Of yaw and pich angle , We propose that each corner uses a fully connected network independently . These two full connection layers share one backbone Extracted features . At the same time, we define the loss function for each branch of the full connection layer .

Data sets

Conclusion

Source code :https://github.com/ahmednull/l2cs-net
边栏推荐
- Leetcode brushes questions. I recommend this video of the sister Xueba at station B
- NPM and yarn use (official website, installation, command line, uploading your own package, detailed explanation of package version number, updating and uninstalling package, viewing all versions, equ
- 阿里巴巴内部面试资料
- Analysis of model predictive control (MPC) (IX): numerical solution of quadratic programming (II)
- Dry goods semantic web, Web3.0, Web3, metauniverse, these concepts are still confused? (top)
- Basic syntax of jquey
- Will sqlserver CDC 2.2 generate table locks when extracting large tables from the source
- 微服务架构 Sentinel 的服务限流及熔断
- The cooperation between starfish OS and metabell is just the beginning
- C轮融资已完成!思迈特软件领跑国内BI生态赋能,产品、服务竿头一步
猜你喜欢
![[opencv] generate transparent PNG image](/img/0a/4afc9bda411634562f4b0f3915a7ba.png)
[opencv] generate transparent PNG image

Gbase appears in Unicom cloud Tour (Sichuan Station) to professionally empower cloud ecology

Business digitalization is running rapidly, and management digitalization urgently needs to start

谷歌 Material Design 的文本框为什么没人用?

NPM and yarn use (official website, installation, command line, uploading your own package, detailed explanation of package version number, updating and uninstalling package, viewing all versions, equ

【软考软件评测师】2013综合知识历年真题

Bluetooth technology | it is reported that apple, meta and other manufacturers will promote new wearable devices, and Bluetooth will help the development of intelligent wearable devices

After reading these 12 interview questions, the new media operation post is yours
![Detailed explanation of DHCP distribution address of routing / layer 3 switch [Huawei ENSP]](/img/9c/b4ebe608cf639b8348adc1f1cc71c8.png)
Detailed explanation of DHCP distribution address of routing / layer 3 switch [Huawei ENSP]

思迈特软件Smartbi完成C轮融资,推动国产BI加速进入智能化时代
随机推荐
Opengauss synchronization status query
Explain cache consistency and memory barrier
Line generation (matrix)
Solution: indexerror: index 13 is out of bounds for dimension 0 with size 13
CAT1 4g+ Ethernet development board 232 data is sent to the server through 4G module TCP
[opencv] generate transparent PNG image
Redis 基本知识,快来回顾一下
谷歌 Material Design 的文本框为什么没人用?
Flink window & time principle
After summarizing more than 800 kubectl aliases, I'm no longer afraid that I can't remember commands!
创建线程的3种方式
Blog Building 9: add search function to Hugo
说透缓存一致性与内存屏障
Customer first | domestic Bi leader, smart software completes round C financing
PostgreSQL:无法更改视图或规则使用的列的类型
Baidu AI Cloud Jiuzhou district and county brain, depicting a new blueprint for urban and rural areas!
Top all major platforms, 22 versions of interview core knowledge analysis notes, strong on the list
Machine learning how to achieve epidemic visualization -- epidemic data analysis and prediction practice
Bluetooth technology | it is reported that apple, meta and other manufacturers will promote new wearable devices, and Bluetooth will help the development of intelligent wearable devices
GB/T 41479-2022信息安全技术 网络数据处理安全要求 导图概览