Topic 15: Visual Reasoning

LLMs watching film

Learning Goals

By the end of this lesson, students should be able to:

Tasks

Work on your final project

Lesson Plan and Slides

YouTube video for Demystifing Video Reasoning

Visual reasoning lesson plan

Visual reasoning slides

Mirage lesson plan

Mirage slides

Papers

Ruisi Wang, Zhongang Cai, Fanyi Pu, Junxiang Xu, Wanqi Yin, Maijunxian Wang, Ran Ji, Chenyang Gu, Bo Li, Ziqi Huang, Hokin Deng, Dahua Lin, Ziwei Liu, Lei Yang (2026) Demystifing Video Reasoning. https://arxiv.org/abs/2603.16870

Sara Ghazanfari, Francesco Croce, Nicolas Flammarion, Prashanth Krishnamurthy, Farshad Khorrami, Siddharth Garg (2025) Chain-of-Frames: Advancing Video Understanding in Multimodal LLMs via Frame-Aware Reasoning. https://arxiv.org/abs/2506.00318

MIRAGE: The Illusion of Visual Understanding. Mohammad Asadi, Jack W. O'Sullivan, Fang Cao, Tahoura Nedaee, Kamyar Rajabalifardi, Fei-Fei Li, Ehsan Adeli, Euan Ashley (2026). https://arxiv.org/abs/2603.21687