I obtained my PhD from the Australian National University in 2024, advised by Prof. Stephen Gould and Dr. Damien Teney (Idiap Research Institute | AIML). My PhD surrounds multi-modal learning, in particular, vision-and-language, where I focused on a task termed composed image retrieval.