Welcome to my academic page!
I am currently a one-year visiting PhD student at the MReal Lab of Nanyang Technological University (NTU), and I expect to graduate from Xidian University in July 2024. I work with Prof. Zhang Hanwang, focusing on multi-modal representation learning, large vision-language models, diffusion models, and related tasks. I also work closely with Prof. Mingyuan Zhou from The University of Texas at Austin. My PhD advisor is Prof. Bo Chen. My long-term research goal is to build explainable machines that can learn from human prior knowledge and apply such common sense in their final predictions under the Bayesian framework.
Multi-modal representation learning: Vision-Language Model, ChatGPT, Diffusion model.
Optimal transport theory: Conditional transport, Embedding methods, Concept learning.
Bayesian statistics: Deep generative model, Variational inference, Knowledge representation.
Topics: deep generative model / optimal transport (* indicates equal contribution)