I am currently a Ph.D. student at the University of Technology Sydney, advised by Dr. Linchao Zhu and Prof. Yi Yang.
I received a Master's degree from Xi'an Jiaotong Univerisity in 2020. I was a member of the SMILES LAB, advised by Prof. Xueming Qian and Prof. Li Zhu.
My academic and professional journey has been driven by two kinds of curiosities: the development of machines capable of perceiving real-world scenarios and understanding semantics.
My primary research interests include:
1. Visual Perception in Real-world Scenarios: Tackling the challenges of visual perception in real-world scenarios and exploring their relevant applications.
2. Multi-modal Representation Learning: Investigating representation learning for multi-modal data, with a special focus on visual-language tasks.
Lately, my research trajectory has evolved:
1. Perception in Interactive Environments: I'm delving deeper into addressing perception challenges in interactive settings. A significant portion of this line is dedicated to improving the capabilities of robotic systems, enabling them to interpret and interact seamlessly within real-world environments.
2. Development of Humanoid AI Agents Using Large Models: I'm captivated by the possibilities large models offer in the creation of humanoid AI agents. The overarching objective is to mimic human-like intelligence and behavioral patterns more closely.