News

Obtained PhD and joined Prescient Design at Genentech/Roche

I’m excited to share that I defended my PhD in March and joined the Foundation Model team of Prescient Design at Genentech/Roche as a Senior Machine Learning Scientist!

Mar 21, 2025

New preprint on new output paradigm for LLMs

Our new preprint explores a new paradigm for expressing LLMs’ decisions without decoding concrete words. We formally define the paradigm and introduce a comprehensive evaluation set covering a few to thousands of candidates. We found that this new output paradigm can outperform full decoding while being 40x faster.

Jan 29, 2025

Presenting at AAAI 2025 🇺🇸

At AAAI 2025, I will present my new paper on equipping LLMs with clinical decision-making capabilities with crafted learning objective design and clinical knowledge memorization. See you in Philadelphia!

Jan 28, 2025

Call for papers for the AAAI Symposium on LLM Agents for Scientific Discovery

We are calling for papers and participation for the AAAI 2025 Spring Symposium on LLM Agents for Scientific Discovery.

Jan 20, 2025

Presenting at NeurIPS 2024 🇨🇦

Presenting papers on LLMs for health, diagnosis, and multimodal reasoning and graph reasoning at NeurIPS 2024 in Vancouver. Click for schedule and location details.

Dec 9, 2024

New preprint using KG structure as inspiration for reasoning

Our new preprint simulates the thinking process of the knowledge graph constructors and utilizes the KG structure as inspiration for reasoning. This is especially helpful when expert-curated KG is sparse, enabling significantly better performance on biomedical KG-based QA.

Nov 20, 2024

Awarded the J.P. Morgan Chase AI PhD Fellowship 🏆

I’m excited to be awarded the J.P. Morgan Chase AI PhD Fellowship!

Sep 29, 2024

Modeling unobservable susceptibility at EMNLP 2024

In a paper to be presented at EMNLP 2024, we present a computational approach to efficiently model users’ latent susceptibility levels guided by the supervision of people’s sharing behavior. The estimated susceptibility is significantly aligned with human judgments. This model enables large-scale susceptibility analysis for the first time.

Sep 19, 2024

Awarded the Amazon Fellowship 🏆

I’m excited to be awarded the Amazon Fellowship!

Jul 10, 2024

Presenting at NAACL 2024 🇲🇽

Presenting three papers on bias, fairness and safety of Large Language Models at NAACL 2024 in Mexico City, including detecting and mitigating bias in QA models with ground-truth bias labels, fingerprinting LLMs, and a pilot study on injecting backdoors by instruction tuning data poisoning. Click for schedule and location details.

Jun 15, 2024

Presenting at AAAI 2024 🇨🇦

Presenting a demo and a poster at AAAI 2024 in Vancouver, including a demo on information diffusion via community-level information pathways and a poster on improving low-resource information extraction by structure-to-text data generation with Large Language Models.

Feb 22, 2024

New preprint on LLM ownership protection

In InstructionalFingerprint, we present a pilot study on LLM fingerprinting as a form of very lightweight instruction tuning. Model publisher specifies a confidential private key and implants it as an instruction backdoor that causes the LLM to generate specific text when the key is present. Results on 11 popularly-used LLMs showed that this approach is lightweight and does not affect the normal behavior of the model.

Jan 21, 2024

New preprint on bias mitigation

In BMBI, we propose to mitigate bias exhibited in QA models by observing the query instance’s influence on another instance, enabling bias mitigation with extremely low resources. With our method, bias levels in multiple bias categories can be reduced without using category-specific instance-level annotation.

Oct 1, 2023

Presenting at INTERSPEECH 2023 🇮🇪

Oral presentation of the conf paper: Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning. In the collaboration work with Amazon Alexa AI, we introduce a dialogue state tracking model tuning less than 1% of LM parameters and achieves better low-resource performance with prompt tuning techniques.

Aug 1, 2023

Data Science for ALL

Check out course materials for the 2-week summer program “Data Science for ALL” that were just delivered by our joint team from UCI and UCLA!

Jul 10, 2023

Presenting at ACL 2023 🇨🇦

Presenting three papers at ACL 2023 in Toronto, including DICE: Data-Efficient Clinical Event Extraction with Generative Models, Multi-hop Evidence Retrieval for Cross-document Relation Extraction, and Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction? Click for schedule and location details.

Jul 1, 2023

New preprints on data generation with LLMs and LLM's backdoor attack

In STAR, we propose to synthesize training data by structure-to-text generation using Large Language Models and we show that the generated data is even more effective than human-curated data instances to boost the low-resource event extraction performance. In the new study about LLM’s backdoor attack, we demonstrate that an attacker can inject backdoors by issuing very few malicious instructions and control model behavior through data poisoning.

May 15, 2023

New INTERSPEECH paper 🇮🇪

New INTERSPEECH paper! In the collaboration work with Amazon Alexa AI, we introduce a dialogue state tracking model tuning less than 1% of LM parameters and achieves better low-resource performance with prompt tuning techniques.

May 1, 2023

New preprints on biomedical relation extraction and cross-document relation extraction

We propose to formulate biomedical relation extraction as the NLI task. We design a new evidence retrieval technique for cross-document relation extraction.

Dec 15, 2022

Attending NeurIPS

I’ll be at the NeurIPS ENLSP workshop.

Dec 1, 2022

Presenting at AKBC 2022 🇬🇧

In HyperVC, we use hyperbolic models with variable curvature to learn representations of temporal knowledge graphs for future fact prediction.

Sep 1, 2022

New preprint on clinical event extraction

In DICE, we introduce a data-efficient generative model for clinical event extraction and propose the first benchmark in the domain.

Aug 1, 2022

New preprint on indirect supervision for relation extraction

In Summarization as Indirect Supervision for Relation Extraction, we convert the relation extraction task into a summarization formulation.

May 1, 2022

Presenting at EMNLP 2021 🇩🇴

New EMNLP Findings paper. In HyperExpan, we use a more expressive hyperbolic space for taxonomy expansion.

Aug 1, 2021

Presenting at NAACL 2021 🌎

We present our demo system EventPlus at NAACL 2021, the live system and code is released. Check it out!

Jun 1, 2021

Handbook at ACL 2020 🌎

The handbooks for ACL2020 that I helped assemble are ready and they’re available in 24 timezones for the first time, check them out!

Jul 1, 2020