Mingyu Derek Ma

Mingyu Derek Ma

PhD Student

    UCLA Computer Science
derek.ma at ucla.edu        
he/his/him

Hi!

I am a PhD student in Computer Science at UCLA working with Prof. Wei Wang. I am interested in extracting structured knowledge from unstructed text using generative language models. Specifically, my research unbinds structure prediction models from constraints of data resources, label ontology, document structure, and domain.

I earned my bachelor’s degree in Computing from The Hong Kong Polytechnic University with First Class Honours in 2018, advised by Prof. Qin Lu and Prof. Jiannong Cao. I studied as an exchange student at the University of Maryland in 2016. I’ve also spent time at Amazon Alexa AI (working with Dr. Jiun-Yu Kao and Dr. Tagyoung Chung), USC Information Sciences Institute (working with Prof. Nanyun (Violet) Peng and Prof. Muhao Chen), The Chinese University of Hong Kong (working with Prof. Helen Meng), UC Santa Cruz (working with Prof. Marilyn Walker) and MIT (working with Dr. Abel Sanchez and Prof. John R. Williams).

Recent News

New preprint on bias mitigation
In BMBI, we propose to mitigate bias exhibited in QA models by observing the query instance’s influence on another instance, enabling bias mitigation with extremely low resources. With our method, bias levels in multiple bias categories can be reduced without using category-specific instance-level annotation.
Data Science for ALL
Check out course materials for the 2-week summer program “Data Science for ALL” that were just delivered by our joint team from UCI and UCLA!
New preprints on data generation with LLMs and LLM's backdoor attack
In STAR, we propose to synthesize training data by structure-to-text generation using Large Language Models and we show that the generated data is even more effective than human-curated data instances to boost the low-resource event extraction performance.
In the new study about LLM’s backdoor attack, we demonstrate that an attacker can inject backdoors by issuing very few malicious instructions and control model behavior through data poisoning.

Publications

MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways
From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media
Mitigating Bias for Question Answering Models by Tracking Bias Influence
STAR: Improving Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models
Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models
DICE: Data-Efficient Clinical Event Extraction with Generative Models
Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction?
Multi-hop Evidence Retrieval for Cross-document Relation Extraction
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
Summarization as Indirect Supervision for Relation Extraction
Bending the Future: Autoregressive Modeling of Temporal Knowledge Graphs in Curvature-Variable Hyperbolic Spaces
HyperExpan: Taxonomy Expansion with Hyperbolic Representation Learning
EventPlus: A Temporal Event Understanding Pipeline
Dual Memory Network Model for Sentiment Analysis of Review Text
Implicit Discourse Relation Identification for Open-domain Dialogues
Dual Memory Network Model for Biased Product Review Classification
BlocHIE: a BLOCkchain-based platform for Healthcare Information Exchange

Curriculum Vitae

Experience

Education

Awards

  • Outstanding Project Award - Best Capstone Project Award Competition, PolyU Dept. of Computing (1/100) , 2018

    PolyU Computing News

  • HKSAR Government Scholarship Fund Talent Development Scholarship , 2018
  • Silver Award - Hong Kong ICT (Information and Communication Technologies) Awards (website | wiki) Student Innovation Award (Tertiary or Above), Hong Kong Government , 2018

    PolyU News | PolyU Computing News | PolyU Computing News about InnoCarnival Exhibition | PolyU Tweet

  • Champion and Most Innovative Award (HKSAR) - Imagine Cup (website | wiki), Microsoft , 2017
  • Commercial Radio 50th Anniversary Scholarship, Hong Kong Commercial Broadcasting Company Limited & PolyU (1/400) , 2017
  • Winner - Hong Kong Techathon, PolyU and City University of Hong Kong , 2018

    PolyU Computing News | PolyU Tweet

  • CMA (The Chinese Manufacturers’ Association of Hong Kong) & Donors Scholarship (3/100) , 2018
  • Champion - PolyU Smart Computing Competition (website) , 2017

    PolyU Computing News

  • Best Creative Service Project - Youth Volunteer Service Conference (website) , 2017

    News by Office of Service-Learning, PolyU | News by HKSAR Gov Agency for Volunteer Service

  • PolyU Undergraduate Summer Research Abroad Sponsorship , 2017
  • PolyU Chinese Mainland and Overseas Activities Fund , 2016
  • Wong Tit-Shing Student Exchange Scholarship , 2016
  • PolyU Exchange Scholarship , 2016
  • Teaching

  • Teaching Associate, CS32 Introduction to Computer Science II (Summer 2023) with Prof. Edwin Ambrosio, UCLA
  • Teaching Assistant, Data Science for ALL, an ICS NSF-Funded summer program on data science and machine learning (Summer 2023), UC Irvine & UCLA
  • Teaching Assistant, CS35L Software Construction (Spring 2023) with Prof. Paul Eggert, UCLA
  • Teaching Assistant, CS188 Natural Language Processing (Winter 2022, later became CS162) with Prof. Nanyun (Violet) Peng, UCLA
  • Teaching Assistant, CS31 Introduction to Computer Science I (Fall 2021) with Prof. David Smallberg, UCLA