Framework for the Complete Gaze Tracking Pipeline

The figure below shows a general representation of the camera-to-screen gaze tracking pipeline [1]. The webcam image is preprocessed to create a normalized image of the eyes and face, from left to right. These images are fed into a model, which predicts the 3D gaze vector. The predicted gaze vector can be projected onto the screen once the user’s head pose is known.
This framework allows for the implementation of a real-time approach to predict the viewing position on the screen based only on the input image.

pip install -r requirements.txt
If necessary, calibrate the camera using the provided interactive script python calibrate_camera.py, see Camera Calibration by OpenCV.
For higher accuracy, it is also advisable to calibrate the position of the screen as described by Takahashiet al., which provide an OpenCV and matlab implementation.
To make reliable predictions, the proposed model needs to be specially calibration for each user. A software is provided to collect this calibration data.
Train a model or download a pretrained model.
If all previous steps are fulfilled, python main.py --calibration_matrix_path=./calibration_matrix.yaml --model_path=./p00.ckpt can be executed and a "red laser pointer" should be visible on the screen. main.py also provides multiple visualization options like:
1. --visualize_preprocessing to visualize the preprocessed images
2. --visualize_laser_pointer to show the gaze point the person is looking at on the screen like a red laserpointer dot, see the right monitor on the image below
3. --visualize_3d to visualize the head, the screen, and the gaze vector in a 3D scene, see left monitor on the image below

[1] Amogh Gudi, Xin Li, and Jan van Gemert, “Efficiency in real-time webcam gaze tracking”, in Computer Vision - ECCV 2020 Workshops - Glasgow, UK, August 23-28, 2020, Proceedings, Part I, Adrien Bartoli and Andrea Fusiello, Eds., ser. Lecture Notes in Computer Science, vol. 12535, Springer, 2020, pp. 529–543. DOI : 10.1007/978-3-030-66415-2_34. [Online]. Available: https://doi.org/10.1007/978-3-030-66415-2_34.

Framework for the Complete Gaze Tracking Pipeline

Related tags

Overview

Framework for the Complete Gaze Tracking Pipeline

Owner

Pascal

Histogram specification using openCV in python .

Virtual Zoom Gesture using OpenCV

Using Opencv ,based on Augmental Reality(AR) and will show the feature matching of image and then by finding its matching

OpenMMLab Text Detection, Recognition and Understanding Toolbox

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

The code for CVPR2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment".

SceneCollisionNet This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more info

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

Contextual speed detection for python

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

Python Computer Vision Aim Bot for Roblox's Phantom Forces

Awesome anomaly detection in medical images

A tensorflow implementation of EAST text detector

Automatically download multiple papers by keywords in CVPR

Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Augmenting Anchors by the Detector Itself