Dezhao Luo

I am currently a final-year PhD student within the Computer Vision Group in the School of Electronic Engineering and Computer Science at Queen Mary University of London, supervised by Prof. Shaogang (Sean) Gong.

Before that, I received my M.S degree from IIE, Chinese Academy of Sciences in Beijing, China, under the supervision of Prof. Yu Zhou.

My research interest lies in computer vision and machine learning, focusing on learning dynamic visual environments through natural language for understanding, generating and reasoning.

Selected Publications

ViMo: A Generative Visual GUI World Model for App Agent
Dezhao Luo*, Bohan Tang*, Kang Li, Georgios Papoudakis, Jifei Song, Shaogang Gong, Jianye Hao, Jun Wang, Kun Shao
Under review, 2025.

Generative Video Diffusion for Unseen Cross-Domain Video Moment Retrieval
Dezhao Luo, Shaogang Gong, Jiabo Huang, Hailin Jin, Yang Liu
Proceedings of the AAAI Conference on Artificial Intelligence, 2025 (AAAI'25).

Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models
Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024: 5464–5473 (WACV'24).

Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023: 23045–23055 (CVPR'23).

Exploring Relations in Untrimmed Videos for Self-Supervised Learning
Dezhao Luo, Bo Fang, Yu Zhou, Yucan Zhou, Dayan Wu, Weiping Wang
ACM Transactions on Multimedia Computing, Communications, and Applications, 2022, 18(1s): 1–21 (TOMM'22).

Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
Dezhao Luo, Chang Liu, Yu Zhou, Dongbao Yang, Can Ma, Qixiang Ye, Weiping Wang
Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(07): 11701–11708 (AAAI'20, Oral presentation).

Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning
Yuan Yao, Chang Liu, Dezhao Luo, Yu Zhou, Qixiang Ye
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020: 6548–6557 (CVPR'20).