Zhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, Pengguang Chen, Jianbo Dai, Yuxuan Yao, Rongwu Xu, Zehan Qi, Wanru Zhao, Linling Shen, Jianqiao Lu, Haochen Tan, Yukang Chen, Hao Zhang, Zhan Shi, Bailin Wang, Zhijiang Guo, Jiaya Jia
(2024).
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs.
The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024).