Benyou Wang
Mail: wabyking@gmail.com
Currently, I am an assistant professor in the Chinese University of Hong Kong, Shenzhen (CUHKSZ). I got my phd degree from the University of Padova, Italy (very fortune to be supervised by Massimo Melucci and Emanuele Di Buiccio). See our lab on CUHKSZ LLM group.
I submit my thesis in Sep. 30th 2021 (Here is a draft version)and have it defended in March 2022.
I joined CUHKSZ as an assistant professor from June 1st 2022. Please send me emails if you are interested to work with me as a Ph.D., research associate, or post-doc. See JD here
News
- We won the Gold Medal in AIMO 2 (AI Mathematical Olympiad). 该项工作是和华为诺亚方舟实验室合作,Thank Lifeng and Xu Yan.
- TwinMarket was accepted as the only Best Paper in ICLR Advances in Financial AI workshop(1 out of 53 accepted papers) as well as NeurIPS 2025, this is done with Nanjing University.
- 2024年入选腾讯犀牛鸟项目, CFF-滴滴盖亚学者项目, 2024年入选华为AI百校计划
- 2023年获得华为火花奖
- I serve as a Website Chair in EMNLP 2023.
- I serve as a Publicity Chair in NLPCC 2023.
- A joking paper is released: Can we create a new creature?
- I was visiting University of Montreal from Dec. 2019 to Feb. 2020, hosted by Prof. Jian-Yun Nie. Montreal is a nice city for life and research.
- I visited University of Amsterdam for one week, hosted by Prof. Maarten de Rijke, to give a talk in the Meetup Search Engines Amsterdam.
- We won the Best explainable NLP paper in NAACL 2019 with 1000 dollars, present our paper together with BERT authors (Best Long paper winner). here
- I got Marie Curry Fellowship and came back to academia in 2018. Thanks for the generous funding from EU.
- Our book <推荐系统与深度学习> has been published, buy it in JD
- We (IRGAN) won the Best Paper honorable mention award in SIGIR 2017, one of the most-cited papers in SIGIR. See here
Awards
- The Best Paper for Advances in Financial AI Workshop in ICLR 2025
- The Best Paper in NLPCC 2022
- NAACL 2019 Best Explainable NLP Paper
- SIGIR 2017 Best Oaper Award Honorable Mention
- Selected as a Marie Curie Researcher of Quantum Information Access and Retrieval Theory (QUARTZ), a fellowship for Early-stage Researcher, funded by the European Union's Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 721321
Education
- 2018.10- 2022.3: Ph.D student: information engineering, University of Padua, Italy.
- 2014.9-2017.2: Master: pattern recognition and intelligent system, Tianjin University, China.
- 2010.9-2014.6: Bachelor: software engineer, Hubei University of Automotive Technology, China.
Work and Intern
- 2022.6-~: Assistant Professor, School of Data Science, the Chinese University of Hong Kong, Shenzhen.
- 2018.6-2021.6: Marie Curie Researcher, Department of Information Engineering, University of Padua, Italy.
- 2020.12: Visiting Scholar. Institute of Theoretical Physics, Chinese Academy of Science (CAS) in Dec. 2020 hosted by Pan Zhang
- 2019.11-2020-2: Visiting Student. University of Montreal from Dec. 2019 to Feb. 2020, hosted by Prof. Jian-Yun Nie.
- 2019.10: Visiting Student (one week). University of Amsterdam, hosted by Prof. Maarten de Rijke.
- 2019.9-2019.12: Visiting Student. DIKU IR Lab in University of Copenhagen, hosted by Prof. Christina Lioma.
- 2017.7-2018.6: Full-time Associate Researcher, Data Application Center, Tencent, China.
Professional Activities
After being a faculty. Phd students and Research Assistants under my supervision are underlined
- Yuzhe YANG, Yifei Zhang, Minghao Wu, Kaidi Zhang, Yunmiao Zhang, Honghai Yu, Yan Hu, Benyou Wang. TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets. NeurIPS 2025.
- Wanlong Liu, Junxiao Xu, Fei Yu, Yukang Lin, Ke Ji, Wenyu Chen, Lifeng Shang, Yasheng Wang, Yan Xu, Benyou Wang. Question-Free Fine-Tuning: Towards Efficient and Adaptive Reasoning in Large Language Models. NeurIPS 2025 (Spotlight).
- Kaituo Feng, Kaixiong Gong, Bohao Li, Zonghao Guo, Yibing Wang, Tianshuo Peng, Junfei Wu, Xiaoying Zhang, Benyou Wang, Xiangyu Yue. Video-R1: Reinforcing Video Reasoning in MLLMs. NeurIPS 2025.
- Ke Ji, Jiahao Xu, Tian Liang, Qiuzhi Liu, Zhiwei He, Xiaoyuan Liu, Xingyu Chen, Junying Chen, Benyou Wang, Zhaopeng Tu, Haitao Mi, Dong Yu. The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models. NeurIPS 2025.
- Chengpeng Li, Zhengyang Tang, Ziniu Li, Mingfeng Xue, Keqin Bao, Tian Ding, Ruoyu Sun, Benyou Wang, Xiang Wang, Junyang Lin, Dayiheng Liu. CoRT: Code-integrated Reasoning within Thinking. NeurIPS 2025.
- Wenya Xie, Qingying Xiao, Yu Zheng, Xidong Wang, Junying Chen, Ke Ji, Anningzhe Gao, Prayag Tiwari, Xiang Wan, Feng Jiang, Benyou Wang. Enabling Doctor-Centric Medical AI with LLMs through Workflow-Aligned Tasks and Benchmarks. npj Health Systems.
- Wanlong Liu, Junying Chen, Ke Ji, Li Zhou, Wenyu Chen, Benyou Wang. RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions. EMNLP 2025 (Oral).
- Xunlian Dai, Li Zhou, Benyou Wang, Haizhou Li. From Word to World: Evaluate and Mitigate Culture Bias in LLMs via Word Association Test. EMNLP 2025.
- Xu Wang, Zihao Li, Benyou Wang, Yan Hu, Difan Zou. Model Unlearning via Sparse Autoencoder Subspace Guided Projections. EMNLP 2025.
- Xidong Wang, Dingjie Song, Shunian Chen, Junying Chen, Zhenyang Cai, Chen Zhang, Lichao Sun, Benyou Wang. LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via a Hybrid Architecture. Findings of EMNLP 2025.
- Xu Wang, Yan Hu, Wenyu Du, Reynold Cheng, Benyou Wang, Difan Zou. Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis. ICML 2025.
- Yumou Liu, An Li, Chaojie Li, Fei Yu, Benyou Wang*. Periodical Moving Average Accelerates Gradient Accumulation for Post-Training. UAI 2025.
- Yiran Qin, Ao Sun, Hong Yuze, Benyou Wang, Ruimao Zhang. NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants. ICRA 2025.
- Jianqing Zhu, Huang Huang, Zhihang Lin, Juhao Liang, Zhengyang Tang, Khalid Almubarak, Mosen Alharthi, Bang An, Juncai He, Xiangbo Wu, Fei Yu, Junying Chen, MA Zhuoheng, Yuhao Du, He Zhang, Saied Alshahrani, Emad A. Alghamdi, Lian Zhang, Ruoyu Sun, Haizhou Li, Benyou Wang*, Jinchao Xu. Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion. ACL 2025.
- Zhenyang Cai, Junying Chen, Rongsheng Wang, Weihong Wang, Yonglin Deng, Dingjie Song, Yize Chen, Zixu Zhang, Benyou Wang. Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging. ACL 2025.
- Yuhao Zhang, Zhiheng Liu, Fan Bu, Ruiyu Zhang, Benyou Wang, Haizhou Li. Soundwave: Less is More for Speech-Text Alignment in LLMs. ACL 2025.
- Junying Chen, Chi Gui, Anningzhe Gao, Ke Ji, Xidong Wang, Xiang Wan, Benyou Wang. CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis. Findings of ACL 2025.
- Junying Chen, Zhenyang Cai, Ke Ji, Xidong Wang, Wanlong Liu, Rongsheng Wang, Benyou Wang. Towards Medical Complex Reasoning with LLMs through Medical Verifiable Problems. Findings of ACL 2025.
- Ke Ji, Junying Chen, Anningzhe Gao, Wenya Xie, Xiang Wan, Benyou Wang. Unlocking LLMs’ Self-Improvement Capacity with Autonomous Learning for Domain Adaptation. Findings of ACL 2025.
- Guorui Zheng, Xidong Wang, Juhao Liang, Nuo Chen, Yuping Zheng, Benyou Wang. Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts. ICLR 2025 .
- Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Chenghao Ma, Shanghaoran Quan, Liang Chen, Qingxiu Dong, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Ge Zhang, Lei Li, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang. Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models. ICLR 2025.
- Chenyu Huang, Zhengyang Tang, Shixi Hu, Ruoqing Jiang, Xin Zheng, Dongdong Ge, Benyou Wang, Zizhuo Wang. ORLM: A Customizable Framework in Training Large Models for Automated Optimization Modeling. Operations Research
- Junzhi Chen, Juhao Liang, Benyou Wang. Smurfs: Multi-Agent System using Context-Efficient DFSDT for Tool Planning. NAACL 2025.
- Chenghao Zhu, Nuo Chen, Yufei Gao, Yunyi Zhang, Prayag Tiwari, Benyou Wang. Is Your LLM Outdated? Evaluating LLMs at Temporal Generalization. NAACL 2025
- Wentao Ge, Shunian Chen, Guiming Hardy Chen, Junying Chen, Zhihong Chen, Nuo Chen, Wenya Xie, Shuo Yan, Chenghao Zhu, Ziyue Lin, Song Dingjie, Xidong Wang, Anningzhe Gao, Zhang Zhiyi, Jianquan Li, Xiang Wan, Benyou Wang. MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria. NAACL 2025.
- Dingjie Song, Wenjun Wang, Shunian Chen, Xidong Wang, Michael X. Guan, Benyou Wang. Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs. COLING 2025.
- Juhao Liang , Zhenyang Cai , Jianqing Zhu, Huang Huang, Kewei Zong, Bang An, Mosen Alharthi, Juncai He, Lian Zhang, Haizhou Li, Benyou Wang#, Jinchao Xu. Alignment at Pre-training! Towards Native Alignment for Arabic LLMs. NeurIPS 2024
- Pengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J Seibel, Junjun He, Yu Qiao. GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. NeurIPS 2024 Track Datasets and Benchmarks
- Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, GUOJUN XIONG, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Huang Jiajia, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang.
FinBen: An Holistic Financial Benchmark for Large Language Models. NeurIPS 2024 Track Datasets and Benchmarks
- Junying Chen , Chi Gui , Ruyi Ouyang , Anningzhe Gao, Shunian Chen , Guiming Hardy Chen , Xidong Wang , Ruifei Zhang , Zhenyang Cai , Ke Ji, Guangjun Yu, Xiang Wan, Benyou Wang. HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale. EMNLP 2024.
- Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang. Humans or LLMs as the Judge? A Study on Judgement Biases. EMNLP 2024
- Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu. VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment. EMNLP 2024
- Junying Chen, Xidong Wang, Ke Ji, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang. HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs. COLM 2024
- Song Dingjie, Shunian Chen, Guiming Hardy Chen, Fei Yu, Xiang Wan, Benyou Wang. MileBench: Benchmarking MLLMs in Long Context. COLM 2024
- Chen Zhang, Benyou Wang, Dawei Song. On Elastic Language Models. ACM TOIS
- Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang, and Benyou Wang. Large Language Model as a User Simulator. ACL 2024
- Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei. MathScale: Scaling Instruction Tuning for Mathematical Reasoning ICML 2024
- Xianghong Fang, Jian Li, Qiang Sun, Benyou Wang. Rethinking the Uniformity Metric in Self-Supervised Learning. ICLR 2024
- Xidong Wang , Guiming Hardy Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li. CMB: A Comprehensive Medical Benchmark in Chinese. 2023. NAACL 2024   Online leaderboard
- Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Juncai He, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu. AceGPT, Localizing Large Language Models in Arabic. NAACL 2024. ( HuggingFace downloading: 10K per month.)
- Fei Yu, Anningzhe Gao, Benyou Wang. Outcome-supervised verifiers for planning in mathematical reasoning. Findings of NAACL 2024. (The work brings 7B LLMs to the era with an accuracy of 0.8 and even 0.9 in GSM8K, see the leaderboard )
- Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang Haizhou Li. HuatuoGPT, towards Taming Language Model to Be a Doctor. 2023. Findings of EMNLP 2023 (GitHub stars: 1K; Online access : 400K+ ) )
- Zhihong Chen, Feng Jiang, Junying Chen, Tiannan Wang, Fei Yu, Guiming Chen, Hongbo Zhang, Juhao Liang, Chen Zhang, Zhiyi Zhang, Jianquan Li, Xiang Wan, Benyou Wang , Haizhou Li. "Phoenix: Democratizing chatgpt across languages". Arxiv code (Github stars: 3K) HuggingFace downloading: 4K per month
- Fei Yu, Hongbo Zhang, Prayag Tiwari, and Benyou Wang. Natural language reasoning, a survey. 2023. ACM Computing Survey.
- Zhongwei Wan, Che Liu, Mi Zhang, Jie Fu, Benyou Wang, Sibo Cheng, Lei Ma, César Quilodrán-Casas, Rossella Arcucci. Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias. NeurIPS 2023
- Yazhou Zhang, Yang Yu, Qing Guo, Benyou Wang , Dongming Zhao, Sagar Uprety, Dawei Song, Jing Qin, Qiuchi Li. All In One: A Chinese Multi-Modal Dataset for Multi-Affection Detection in Conversations. NeurIPS 2023 Track Datasets and Benchmarks
- Zhihong Chen, Shizhe Diao, Benyou Wang, Guanbin Li, and Xiang Wan. "Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts". ICCV 2023
- Zhongwei Wan, Xin Liu, Benyou Wang, Jiezhong Qiu, Boyu Li, Ting Guo, Guangyong Chen, Yang Wang. Spatio-Temporal Contrastive Learning Enhanced GNNs for Session-based Recommendation. Transactions on Information Systems (TOIS)
- Jianquan Li, Xiangbo Wu , Xiaokang Liu , Prayag Tiwari, Qianqian Xie, and Benyou Wang. "Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk". ACL 2023
- Yajiao LIU , Xin Jiang, Yichun Yin, Yasheng Wang, Fei Mi, Qun Liu, Xiang Wan, and Benyou Wang. One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems. ACL 2023
- Chen Zhang , Yang Yang, Jiahao Liu, Jingang Wang, Yunsen Xian, Benyou Wang and Dawei Song. Lifting the Curse of Capacity Gap in Distilling Language Models. ACL 2023
- Zhihong Chen , Guiming Hardy Chen , Shizhe Diao, Xiang Wan, and Benyou Wang. On the Difference of BERT-style and CLIP-style Text Encoders. Findings of ACL 2023
- Xiaokang Liu , Jianquan Li , Jingjing Mu, Min Yang, Ruifeng Xu, and Benyou Wang. "Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary". AAAI 2023
- Le Sun, Mingyang Zhang, Benyou Wang and Prayag Tiwari. Few-Shot Class-Incremental Learning for Medical Time Series Classification. IEEE Journal of Biomedical and Health Informatics . 2023
- Yaochen Liu, Qiuchi Li, Benyou Wang Yazhou Zhang, Dawei Song. "A survey of quantum-cognitively inspired sentiment analysis models". ACM Computing Surveys 2023
- Benyou Wang, Qianqian Xie, Jiahuan Pei, Prayag Tiwari, Zhao Li, and Fu Jie. "Pre-trained Language Models in Biomedical Domain: A Survey from Multiscale Perspective". ACM Computing Surveys, .
- Yi Yang, Chen Zhang, Benyou Wang and Dawei Song. "Doge Tickets: Uncovering Domain-general Language Models by Playing Lottery Tickets". NLPCC 2022 Best Paper 2022 .
- Sunzhu Li , Peng Zhang, Guobing Gan, Xiuqing Lv, Benyou Wang Junqiu Wei, Xin Jiang. "Hypoformer: Hybrid Decomposition Transformer for Edge-friendly Neural Machine Translation". EMNLP 2022 .
- Guobing Gan, Peng Zhang, Sunzhu Li, Xiuqing Lu, Benyou Wang. "MorphTE: Injecting morphology in tensorized embeddings". NeurIPS 2022 .