Aula aberta no âmbito da unidade curricular de Aprendizagem Profunda, por Tengda Han (Google DeepMind).
Abstract: Long Video Understanding refers to the task of recognizing and understanding complex activities, events, or interactions that unfold over longer durations of time. This includes tasks like temporal action detection, dense video captioning, video grounding, future video prediction, and video summarization, among others.
