Yingjia Wan

Yingjia Wan

Student Researcher in Natural Language Processing

Language Technology Lab, University of Cambridge

About

Hi! I am Yingjia. I studied NLP at the Language Technology lab, University of Cambridge, as an MPhil student supervised by Prof Ivan Vulić. I am passionable about a wide range of NLP topics that I have been working on, including Large Language Model (LLM) Reasoning (in multiple domains), LLM Hallucination & Fact-Checking, Debiasing, Multimodality, and Cognition-Inspired NLP.

I grow from a interdisplinary background of humanties and computer science: I studied linguistics and neuroscience at University of Macau where I graduated top of the cohort as an undergraduate valedictorian.

I am currently looking for intern and PhD opportunities in NLP (and always open for collaboration). Feel free to drop me an email (yingjiawan.alisa[AT]gmail.com) if you are interested in working with me or just a chat!

Interests
  • Model Reasoning
  • Model Hallucination & Fact-Checking
  • Automated Theorem Proving
  • Multimodality
  • Cognition-Inspired NLP
Education
  • M.Phil. in Theoretical & Applied Linguistics (NLP track), 2022-2023

    University of Cambridge

  • Funded Exchange, 2019-2020

    University of Michigan

  • B.A. in English Studies, 2017-2021

    University of Macau

Recent Publications

(2024). Mr.Ben: A Comprehensive Meta-Reasoning Benchmark for Analyzing Large Language Models. Preprint available on arXiv:2406.13975..

Code Dataset Blog arXiv

(2024). Process-Driven Autoformalization in Lean 4. Preprint available at arXiv:2406.01940..

Code Dataset arxiv

(2024). AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation. Preprint available at arXiv:2405.16802..

Code arxiv

(2024). AUTOALIGN: Automated Alignment Evaluation in Autoformalization. Preprint provided upon request.

(2023). Multimodal Prompt Tuning for Cognition-Enhanced NLP. [MPhil Dissertation].

PDF Code

Research Experience

 
 
 
 
 
Hong Kong University of Science and Technology - Guangzhou
Full-Time Research Assistant
March 2024 – Present
Topic: LLMs for Formal Mathematics, Automated Thoerem Proving, LLM Reasoning.
 
 
 
 
 
Language Technology Lab (LTL), Uniersity of Cambridge
Full-Time Research Student
February 2023 – July 2023 Cambridge, UK
Topic: Intergrate Language Models with Human Cognition (Eye-tracking & EEG data) via Multi-Modal Instruction Tuninng.
 
 
 
 
 
Centre for Cognitive and Brain Sciences, Uniersity of Macau
Data Analysis Intern
January 2021 – February 2022 Macao S.A.R, China
Topic: Bilinguals’ Word-Level Attention During Listening Assessments.
 
 
 
 
 
Depts of Computer Science & of Psychology, University of Macau
Research Intern
July 2019 – December 2020 Macao S.A.R, China
Topic: Applying NLP in Developmental Psychology.
 
 
 
 
 
Department of Education & of Psychology, University of Michigan
Lab Member (Hybrid)
April 2020 – July 2021 Ann Arbor, MI, USA
Topic: A Cross-Cultural Perpective Survey at Universities in USA and China during the Pandemic.

Accomplish­ments

Macau Foundation
Fundação Macau Academic Prize
Represented the graduating class of 2021 at the congregation speech.
The begining of my NLP journey, igniting my academic enthusiasm.
Full waiver of tuition & college accomodation fees during four academic years, overseas exchange covered.

Contact