kuku — my post-training advisor; specializes in comfort priors, household physics, and alignment with human preferences.

Yuanzhi Liang

Mail: liangyzh18 [at] outlook [dot] com

Profiles: ORCID · DBLP · Google Scholar

About Me

I am a research scientist specializing in generative AI at the Institute of Artificial Intelligence (TeleAI), China Telecom. I received my Ph.D. from the University of Technology Sydney in 2024, advised by Dr. Linchao Zhu and Prof. Yi Yang.

I received a Master's degree from Xi'an Jiaotong University in 2020 and was a member of the SMILES LAB, advised by Prof. Xueming Qian and Prof. Li Zhu.

My research focuses on generative AI with an emphasis on post-training, alignment, and structured world understanding built on large-scale data-driven foundations. I am interested in how models develop coherent internal representations, respect real-world structure, and refine their behavior through principled signals—including physical priors, aesthetic objectives, and human-aligned guidance. The aim is to move toward generative systems that remain grounded in data while becoming more reliable, structurally consistent, and capable of progressive self-improvement.

More broadly, I am interested in learning frameworks where generative models improve through simulation, interaction, and self-directed refinement rather than static supervision alone. This includes the use of world models, controlled training environments, and structured feedback to provide richer learning signals—enabling models to test hypotheses, adapt, and grow through experience. The long-term goal is to support a trajectory toward generative intelligence that is grounded, aligned, and continuously advancing through both data and experience.

I am always looking for highly motivated research interns and long-term collaborators. We currently have multiple positions available, focusing on, but not limited to, multimodal large models, video generation/editing, and 3D generation. If you are interested in exploring these areas or discussing potential research collaborations, please feel free to contact me via email. (Applicants for internships are encouraged to include your CV.)

Work Experience

Jul 2021 - Dec 2021, Alibaba DAMO Academy

Research intern working on virtual human synthesis.

Jul 2020 - Jul 2021, Baidu Research

Research intern working on visual knowledge embedding, object recognition, and multi-modal representation.

Mar 2020 - Jun 2020, JD AI Research

Research intern working on product recognition.

Aug 2018 - Jun 2019, JD AI Research

Research intern working on visual-language representation learning.

Selected Honors

First place in AliProducts Challenge @ CVPR 2020 the RetailVision workshop.
First place in iMat Product Competition @ CVPR 2019 FGVC6 workshop.
First place in in Fieldguide Challenge: Moths & Butterflies @ CVPR 2019 FGVC6 workshop.
Second place in iFood Competition @ CVPR 2019 FGVC6 workshop.
Second place in iMet2020 Fine-grained Attributes Classification Competition @ CVPR 2020 FGVC7 workshop.
Kaggle Silver Medal in Deepfake Detection Challenge 2020.

Recent Preprints and Surveys (all publications)

From World Action Models to Embodied Brains: A Roadmap for Open-World Physical Intelligence Overview Link
Yuanzhi Liang, Xufeng Zhan*, Haibin Huang, Chi Zhang, Xuelong Li.
arXiv, 2026
TeleBoost: A Systematic Alignment Framework for High-Fidelity, Controllable, and Robust Video Generation Overview Homepage Link
Yuanzhi Liang, Xuan'er Wu, Yirui Liu, Yijie Fang*, Yizhen Fan, Ke Hao*, Rui Li*, Ruiying Liu*, Ziqi Ni*, Peng Yu, Yanbo Wang, Haibin Huang, Qizhen Weng, Chi Zhang, Xuelong Li.
arXiv, 2026
TeleWorld: Towards Dynamic Multimodal Synthesis with a 4D World Model Overview Link
Yabo Chen, Yuanzhi Liang, Jiepeng Wang, Tingxi Chen, Junfei Cheng, Zixiao Gu, Yuyang Huang, Zicheng Jiang, Wei Li, Tian Li, Weichen Li, Zuoxin Li, Guangce Liu, Jialun Liu, Junqi Liu, Haoyuan Wang, Qizhen Weng, Xuan'er Wu, Xunzhi Xiang, Xiaoyan Yang, Xin Zhang, Shiwen Zhang, Junyu Zhou, Chengcheng Zhou, Haibin Huang, Chi Zhang, Xuelong Li.
arXiv, 2025
Integrating Reinforcement Learning with Visual Generative Models: Foundations and Advances Overview Homepage Link
Yuanzhi Liang, Yijie Fang*, Rui Li*, Ziqi Ni*, Ruijie Su*, Chi Zhang.
Vicinagearth, 2026
VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation Overview Link
Chi Zhang, Yuanzhi Liang, Xi Qiu, Fangqiu Yi, Xuelong Li.
arXiv, 2024

Selected Publications

Learning to Credit the Right Steps: Objective-aware Process Optimization for Visual Generation Overview Link
Rui Li*^†, Ke Hao*^†, Yuanzhi Liang, Haibin Huang, Chi Zhang, Yun Gu, Xuelong Li.
Accepted by ACM MM 2026
Reward-Aware Trajectory Shaping for Few-step Visual Generation Overview Link
Rui Li*^†, Bingyu Li^†, Yuanzhi Liang, Haibin Huang, Chi Zhang, Xuelong Li.
Accepted by ACM MM 2026
Seeing What Matters: Visual Preference Policy Optimization for Visual Generation Overview Link
Ziqi Ni*, Yuanzhi Liang^†, Rui Li, Yi Zhou, Haibin Huang, Chi Zhang, Xuelong Li.
Accepted by CVPR 2026
Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation Overview Link
Ruiying Liu*, Yuanzhi Liang^†, Haibin Huang, Tianshu Yu, Chi Zhang.
Accepted by CVPR 2026
Rethinking Reward Signals in Video GRPO: When Scores Become Targets Overview Link
Rui Li*, Yuanzhi Liang^†, Ziqi Ni, Haibin Huang, Chi Zhang, Xuelong Li.
Accepted by ECCV 2026
LaxMotion: Rethinking Supervision Granularity for 3D Human Motion Generation Overview Link
Sheng Liu*, Yuanzhi Liang, Sidan Du.
Accepted by ECCV 2026
Uni-Inter: Unifying 3D Human Motion Synthesis Across Diverse Interaction Contexts Overview Link
Sheng Liu*, Yuanzhi Liang^†, Jiepeng Wang, Sidan Du, Chi Zhang, Xuelong Li.
SIGGRAPH Asia 2025
InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild Overview Homepage Link
Yiyi Ma*, Yuanzhi Liang, Xiu Li, Chi Zhang, Xuelong Li.
ICCV 2025
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Overview Homepage Link
Yu Lu, Yuanzhi Liang, Linchao Zhu, Yi Yang.
NeurIPS 2024
MAAL: Multimodality-Aware Autoencoder-Based Affordance Learning for 3D Articulated Objects. Overview Link
Yuanzhi Liang, Xiaohan Wang, Linchao Zhu, Yi Yang.
ICCV 2023
IcoCap: Improving Video Captioning by Compounding Images. Overview Link
Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang.
IEEE TMM 26, 2024 (online 2023)
A Simple Episodic Linear Probe Improves Visual Recognition in the Wild. Overview Link
Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang.
CVPR 2022
SEEG: Semantic Energized Co-Speech Gesture Generation. Overview Link
Yuanzhi Liang, Qianyu Feng, Linchao Zhu, Li Hu, Pan Pan, Yi Yang.
CVPR 2022
Penalizing the Hard Example But Not Too Much: A Strong Baseline for Fine-Grained Visual Classification. Overview Link
Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang.
IEEE TNNLS 35(5), 2024 (online 2022)
Removing Raindrops and Rain Streaks in One Go Overview Link
Ruijie Quan, Xin Yu, Yuanzhi Liang, Yi Yang.
CVPR 2021
Food and Ingredient Joint Learning for Fine-Grained Recognition Overview Link
Chengxu Liu, Yuanzhi Liang, Yao Xue, Xueming Qian, Jianlong Fu.
IEEE TCSVT 31(6), 2021 (online 2020)
VrR-VG: Refocusing Visually-Relevant Relationships. Overview Link
Yuanzhi Liang, Yalong Bai, Wei Zhang, Xueming Qian, Li Zhu, Tao Mei.
ICCV 2019

Note *: interns that I mentored. Note †: equal contribution.

Journal Reviewer

Reviewer for TPAMI and TIP.

Conference Reviewer / Program Committee Member

Reviewer for ICCV, CVPR, ICLR, NeurIPS, ECCV, MM, AAAI, IJCAI, ICME, and CAAI.

Others

Member of MNBVC (Massive Never-ending BT Vast Chinese corpus).