Yingjia Wan
Yingjia Wan
About
Publications
Experiences
Accomplishments
CV
Blogs
Contact
Light
Dark
Automatic
1
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
MR-BEN is a comprehensive process-based benchmark to evaluate advanced `meta-reasoning’ skills, where models are asked to locate and analyse errors in the provided CoT solutions. It comprises 5,975 multi-domain samples with annotated groundtruths.
Zhongshen Zeng
,
Yinhong Liu
,
Yingjia Wan
,
Jingyao Li
,
Pengguang Chen
,
Jianbo Dai
,
Yuxuan Yao
,
Rongwu Xu
,
Zehan Qi
,
Wanru Zhao
,
Linling Shen
,
Jianqiao Lu
,
Haochen Tan
,
Yukang Chen
,
Hao Zhang
,
Zhan Shi
,
Bailin Wang
,
Zhijiang Guo
,
Jiaya Jia
Code
Dataset
Blog
arXiv
AutoPSV: Automated Process-Supervised Verifier
AutoPSV proposes a simple, effective, and efficient method to automatically annotate reasoning steps (even without requiring grountruth answers).
Jianqiao Lu
,
Zhiyang Dou
,
Hongru Wang
,
Zeyu Cao
,
Jianbo Dai
,
Yingjia Wan
,
Yinya Huang
,
Zhijiang Guo
Code
arxiv
Reading-While-Listening vs. Reading-Only in A Second Language at Different Language Proficiencies: an Eye-Tracking Study
Reading-while-listening (R/L) has a facilitation effect on second language (L2) reading comprehension after longitudinal R/L training …
Yingjia Wan
,
Matthew Wallace
Pedagogy in a Pandemic: College Instructor Perspectives on Online Instruction during COVID-19 at Universities in USA and China
Higher education institutions globally saw a collective mandate to move classes online, where afforded, at the onset of the COVID- 19 …
Sarah Stilwell
,
Anjli Narwani
,
Jessica Pelton
,
Xi Zhang
,
Qi Zeng
,
Qi Zhao
,
Yingjia Wan
,
Kevin Miller
Cite
×