Research Summary
I am currently working on:
- large language models;
- video generation and video editing models.
Previously, I worked on:
- multi-modal learning, in particular, vision-and-language tasks;
- weakly-supervised learning.
I maintain my publication record on Google Scholar.
Source code for all first-author projects is available on GitHub.
LinkedIn profile. OpenReview profile.
I am actively seeking talented students for research projects. If you are interested, please contact me via the email address above.