Emad Bahrami

I’m Emad, a PhD student in the Computer Vision Group at the University of Bonn, supervised by Prof. Jürgen Gall. My primary research area is video understanding, currently focused on enhancing temporal modeling in Multimodal LLMs. Previously, I worked on tasks such as temporal action segmentation and long-term dense anticipation. I’m currently a research intern at Microsoft, working on Multimodal Large Language Models (LLMs) for video perception and reasoning.
Before starting my PhD, I was a researcher at Deep MI, specializing in semantic segmentation of human brain MRI scans. Additionally, I spent time as a visiting researcher in the Computer Vision Group, focusing on action recognition and future frame prediction, also under the supervision of Prof. Gall. I completed my bachelor’s degree at the University of Tehran.
news
May 2025 | Glad to be recognized as an Outstanding Reviewer at CVPR 2025 🎉 |
---|---|
May 2025 | I’ve joined Microsoft as a Research Intern. |
selected publications [view all]
-
-
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense AnticipationIEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2025