Visual Intelligence Beyond Closed Benchmarks
Video, actions, objects, scenes, and interaction under changing categories, contexts, and goals.
Collaborate
I am broadly open to collaborations, student projects, and thesis supervision across computer vision and AI. I am particularly interested in ambitious questions around visual intelligence, multimodal reasoning, embodied perception, and evaluation beyond closed benchmarks, while also welcoming ideas outside my current publication areas.
Directions
Video, actions, objects, scenes, and interaction under changing categories, contexts, and goals.
Links between vision, language, spatial reasoning, agents, robotics, physical environments, and human activity.
Robustness, uncertainty, imperfect supervision, generalization, datasets, and evaluation protocols that reveal what models really understand.
Fit
Research
A paper idea, methodological question, benchmark gap, dataset opportunity, or application domain where visual perception and AI could matter.
Supervision
Bachelor, master, or visiting student topics are welcome across computer vision and AI; the strongest topics usually connect a clear research question with a feasible prototype or evaluation.
Message