Video Scene Graph Generation (VidSGG) is a research field focused on converting visual input from video streams into structured knowledge representations.

These scene graphs contain nodes (objects) and edges (relationships) that help machines understand context and interactions within each frame over time.

Recent Publications