I am a PhD candidate in Computer Science at UCLA working with Prof. Wei Wang. I’m currently a research intern at Genentech Prescient Design.
I earned my bachelor’s degree in Computing from The Hong Kong Polytechnic University with First Class Honours in 2018, advised by Prof. Qin Lu and Prof. Jiannong Cao. I studied as an exchange student at the University of Maryland in 2016. I’ve also spent time at Amazon Alexa AI (working with Dr. Jiun-Yu Kao and Dr. Tagyoung Chung), USC Information Sciences Institute (working with Prof. Nanyun (Violet) Peng and Prof. Muhao Chen), The Chinese University of Hong Kong (working with Prof. Helen Meng), UC Santa Cruz (working with Prof. Marilyn Walker) and MIT (working with Dr. Abel Sanchez and Prof. John R. Williams).
I’m interested in Natural Language Processing, Machine Learning and AI4Science. My research focuses on generative language models, especially in the clinical, medical, and science domains:
In InstructionalFingerprint, we present a pilot study on LLM fingerprinting as a form of very lightweight instruction tuning. The model publisher specifies a confidential private key and implants it as an instruction backdoor that causes the LLM to generate specific text when the key is present. Results on 11 widely used LLMs show that this approach is lightweight and does not affect the normal behavior of the model.
In BMBI, we propose mitigating the bias exhibited by QA models by observing a query instance’s influence on another instance, which enables bias mitigation with extremely low resources. With our method, bias levels in multiple bias categories can be reduced without using category-specific instance-level annotation.
Oral presentation of the conference paper: Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning. In this collaboration with Amazon Alexa AI, we introduce a dialogue state tracking model that tunes less than 1% of the LM parameters and achieves better low-resource performance with prompt tuning techniques.
Check out the course materials for the 2-week summer program “Data Science for ALL”, just delivered by our joint team from UCI and UCLA!