刘 哲源

Zheyuan (David) Liu

profile picture

I obtained my PhD from the Australian National University in 2024, supervised by Prof. Stephen Gould and Dr. Damien Teney (Idiap Research Institute | AIML). My PhD surrounds multi-modal learning, in particular, vision-and-language, where I focused on a task termed composed image retrieval.

Prior to that, I obtained my bachelor degree in Electronics Engineering (R&D) with first-class honours from the Australian National University in 2018.


Research Summary

My research has focused on:

  • multi-modal learning, in particular, vision-and-language tasks
  • weakly-supervised learning
  • generative model

I maintain my publication record at Google Scholar. Source code for all first-author projects are available at GitHub.

A record of news and recent publications are kept below.




News

May 2024 I have joined AIML as a postdoctoral research fellow.

Apr 2024 I have received my award of PhD from ANU. A big thank you to my supervisors, my parents and all my friends for their kind support through this journey.


Recent Publications

Changsheng Lu, Zheyuan Liu, Piotr Koniusz. “OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection.” To appear in European Conference on Computer Vision (ECCV), 2024.

bib | paper coming soon | code

  coming soon

Zheyuan Liu. “Retrieving Images through Bi-modal Visual and Language Queries.” Thesis (PhD), Australian National University, 2024.

bib | ANU archive | pdf

  @phdthesis{liu2024retrieving,
  title={Retrieving Images through Bi-modal Visual and Language Queries},
  author={Liu, Zheyuan},
  school={The Australian National University},
  year={2024}
}

Zheyuan Liu, Weixuan Sun, Damien Teney, and Stephen Gould. “Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder.” Transactions on Machine Learning Research (TMLR), 2024.

bib | pdf | arXiv | code

  @article{liu2024candidate,
    title={Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder},
    author={Zheyuan Liu and Weixuan Sun and Damien Teney and Stephen Gould},
    journal={Transactions on Machine Learning Research},
    issn={2835-8856},
    year={2024},
    url={https://openreview.net/forum?id=fJAwemcvpL},
    note={}
  }

Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, and Stephen Gould. “Bi-directional training for composed image retrieval via text prompt learning.” IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.

bib | pdf | arXiv | code

  @InProceedings{Liu_2024_WACV,
    author={Liu, Zheyuan and Sun, Weixuan and Hong, Yicong and Teney, Damien and Gould, Stephen},
    title={Bi-Directional Training for Composed Image Retrieval via Text Prompt Learning},
    booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month={January},
    year={2024},
    pages={5753-5762}
  }

Weixuan Sun, Yanhao Zhang, Zhen Qin, Zheyuan Liu, Lin Cheng, Fanyi Wang, Yiran Zhong, and Nick Barnes. “All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation.” IEEE/CVF International Conference on Computer Vision (ICCV) Workshops. 2023.

bib | pdf | arXiv | code

  @InProceedings{Sun_2023_ICCV,
    author={Sun, Weixuan and Zhang, Yanhao and Qin, Zhen and Liu, Zheyuan and Cheng, Lin and Wang, Fanyi and Zhong, Yiran and Barnes, Nick},
    title={All-pairs Consistency Learning forWeakly Supervised Semantic Segmentation},
    booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
    month={October},
    year={2023},
    pages={826-837}
  }

Weixuan Sun, Jiayi Zhang, Jianyuan Wang, Zheyuan Liu, Yiran Zhong, Tianpeng Feng, Yandong Guo, Yanhao Zhang, and Nick Barnes. “Learning audio-visual source localization via false negative aware contrastive learning.” IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2023.

bib | pdf | arXiv | code

  @InProceedings{Sun_2023_CVPR,
    author={Sun, Weixuan and Zhang, Jiayi and Wang, Jianyuan and Liu, Zheyuan and Zhong, Yiran and Feng, Tianpeng and Guo, Yandong and Zhang, Yanhao and Barnes, Nick},
    title={Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning},
    booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month={June},
    year={2023},
    pages={6420-6429}
  }

Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, and Stephen Gould. “Image retrieval on real-life images with pre-trained vision-and-language models.” IEEE/CVF International Conference on Computer Vision (ICCV). 2021.

bib | pdf | arXiv | dataset | code

  @InProceedings{Liu_2021_ICCV,
    author={Liu, Zheyuan and Rodriguez-Opazo, Cristian and Teney, Damien and Gould, Stephen},
    title ={Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models},
    booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month={October},
    year={2021},
    pages={2125-2134}
  }