Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Last update: Jan 07, 2023

Overview

Ego4D

EGO4D is the world's largest egocentric (first person) video ML dataset and benchmark suite, with 3,600 hrs (and counting) of densely narrated video and a wide range of annotations across five new benchmark tasks. It covers hundreds of scenarios (household, outdoor, workplace, leisure, etc.) of daily life activity captured in-the-wild by 926 unique camera wearers from 74 worldwide locations and 9 different countries. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. The approach to data collection was designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant.

Public Documentation/Start Here: Ego4D Docs

For the CLI readme (to download/access): CLI README

For a demo notebook: Annotation Notebook

For the visualization engine: Viz README

For feature extraction: Feature README

License

Ego4D is released under the MIT License.

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Related tags

Overview

Ego4D

License

Owner

Meta Research

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

Pyramid Pooling Transformer for Scene Understanding

Codes for paper "KNAS: Green Neural Architecture Search"

ObjectDetNet is an easy, flexible, open-source object detection framework

Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

Deep Reinforcement Learning based autonomous navigation for quadcopters using PPO algorithm.

This is a library for training and applying sparse fine-tunings with torch and transformers.

[RSS 2021] An End-to-End Differentiable Framework for Contact-Aware Robot Design

"3D Human Texture Estimation from a Single Image with Transformers", ICCV 2021

Colab notebook and additional materials for Python-driven analysis of redlining data in Philadelphia

Extreme Lightwegith Portrait Segmentation

Project repo for Learning Category-Specific Mesh Reconstruction from Image Collections

G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021)

Orange Chicken: Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation

3D-printable hand-strapped keyboard

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Label-Free Model Evaluation with Semi-Structured Dataset Representations

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Learning Representations that Support Robust Transfer of Predictors