Computer Vision Meetup: EgoSchema: A Dataset for Truly Long-Form Video Understanding

preview_player
Показать описание
Introducing EgoSchema, a very long-form video question-answering dataset, and benchmark to evaluate long video understanding capabilities of modern vision and language systems. Derived from Ego4D, EgoSchema consists of over 5000 human curated multiple choice question answer pairs, spanning over 250 hours of real video data, covering a very broad range of natural human activity and behavior.

Speaker: Karttikeya Mangalam is a PhD student in Computer Science at the Department of Electrical Engineering & Computer Sciences (EECS) at University of California,

Scroll down on this page and join the Computer Vision Meetup friendliest to your timezone:

Recorded on Sept 7, 2023 at the virtual Computer Vision Meetup.

#computervision #machinelearning #datascience #ai #artificialintelligence
Рекомендации по теме