Open lecture within the Deep Learning course unit, by Tengda Han (Google DeepMind).
Abstract: Long video understanding is the task of recognizing and interpreting complex activities, events, or interactions that unfold over extended periods of time. It covers tasks such as temporal action detection, dense video captioning, video grounding, future video prediction, and video summarization, among others.
