Ego4D: Around the World in 3,000 Hours of Egocentric Video

  • 2021-10-13 22:19:32
  • Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Rui

Abstract

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,025 hours of daily-life activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 855 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant. Ego4D dramatically expands the volume of diverse egocentric video footage publicly available to the research community. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation, audio-visual conversation, and social interactions), and future (forecasting activities). By publicly sharing this massive annotated dataset and benchmark suite, we aim to push the frontier of first-person perception. Project page: https://ego4d-data.org/

 
