I am a recent graduate of the University of Toronto’s Master of Science in Applied Computing programme. My research interests lie in LLM/MLLM post-training, reinforcement learning, and agentic systems.
I am currently a part-time research intern at Texas A&M University, working on LLM agents under the supervision of Prof. Yu Zhang. During my master’s studies, I was a full-time co-op applied research intern at ModiFace (L’Oréal’s AI Lab), focusing on controllable video generation. Prior to that, I was a part-time research intern at TIGER Lab under the supervision of Prof. Wenhu Chen, focusing on agentic-RL-based tool-calling LLMs. I was fortunate to work under the supervision of the wonderful Jiawang Cao and Wenbo Zhu during my research internship at Opus AI Research, where I worked on multimodal reasoning and MLLM benchmarking, with two papers published at top conferences.
ACL
TMLR
ACL
CVPR
AAAI
Last Updated: June 2026