Benyou Wang

Mail: wabyking@gmail.com

I am currently an assistant professor at the Chinese University of Hong Kong, Shenzhen (CUHKSZ). I received my Ph.D. from the University of Padova, Italy, where I was very fortunate to be supervised by Massimo Melucci and Emanuele Di Buccio. See our lab: the CUHKSZ LLM group.

I submitted my thesis on Sep. 30th, 2021 (here is a draft version) and defended it in March 2022.

I joined CUHKSZ as an assistant professor on June 1st, 2022. Please email me if you are interested in working with me as a Ph.D. student, research associate, or post-doc. See the JD here.

News

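Our paper, a collaboration with Nanjing University, won the Best Paper award at the ICLR workshop Advances in Financial AI (1 out of 53 accepted papers).
In 2024, selected for the Tencent Rhino-Bird program, the CCF-DiDi GAIA Scholar program, and the Huawei AI 100 Schools Program.
In 2023, received the Huawei Spark Award.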
I served as a Website Chair for EMNLP 2023.
I served as a Publicity Chair for NLPCC 2023.
Our paper (Doge Tickets) won the Best Paper Award at NLPCC 2022; see here.
In September 2022, we had one paper accepted at NeurIPS (MorphTE) and another at EMNLP (Hypoformer). Both papers are about compressing transformer models (in either the embedding or the fully-connected layers).
In August 2022, one paper was accepted at COLING; it extends deep prompt tuning (DPT) to dense retrieval. With two additional strategies, DPT achieves performance comparable to fine-tuning.
A tongue-in-cheek paper has been released: Can we create a new creature?
A new paper was accepted at ICLR 2022; it compresses 12-layer BERT encoders down to 1.5M with only a slight performance drop. See "Exploring extreme parameter compression for pre-trained language models".
Our paper titled "Word2Fun: Modelling Words as Functions for Diachronic Word Representation", co-authored with my supervisors Massimo and Emanuele, was accepted at NeurIPS 2021. It introduces a new paradigm for time-specific word embeddings (e.g., the word "president" in 2018 and in 2021 generally refers to different people), with both theoretical advantages and empirical success. It is the kind of work that yields models that are smaller, better, and more interpretable.
Our paper titled "On position embeddings in BERT" was accepted at ICLR 2021. Try searching "position embeddings" on Google.
Our paper titled "Encoding word order in complex embeddings" was accepted with a spotlight presentation at ICLR 2020 (acceptance rate 6%). The code has been open-sourced. This is the first work on rotation-based position embeddings, whereas previous ones were translation-based.
We won the Best Explainable NLP Paper award at NAACL 2019 (with a 1000-dollar prize) and presented our paper alongside the BERT authors (Best Long Paper winners). See here.
I received a Marie Curie Fellowship to rejoin academia in 2018. Thanks for the generous funding.
Our book "Recommendation System and Deep Learning" (推荐系统与深度学习, in Chinese) has been published; buy it on JD.
We (IRGAN) won the Best Paper Honorable Mention award at SIGIR 2017; it is one of the most-cited papers in SIGIR. See here.

Awards

NLPCC 2022 Best Paper
NAACL 2019 Best Explainable NLP Paper
SIGIR 2017 Best Paper Award Honorable Mention
Selected as a Marie Curie Researcher of Quantum Information Access and Retrieval Theory (QUARTZ), a fellowship for Early-stage Researcher, funded by the European Union's Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No. 721321

Education

2018.10-2022.3: Ph.D. student: information engineering, University of Padua, Italy.
2014.9-2017.2: Master: pattern recognition and intelligent system, Tianjin University, China.
2010.9-2014.6: Bachelor: software engineering, Hubei University of Automotive Technology, China.

Work and Intern

2022.6-present: Assistant Professor, School of Data Science, the Chinese University of Hong Kong, Shenzhen.
2018.6-2021.6: Marie Curie Researcher, Department of Information Engineering, University of Padua, Italy.
2020.12: Visiting Scholar. Institute of Theoretical Physics, Chinese Academy of Sciences (CAS), hosted by Pan Zhang.
2019.11-2020.2: Visiting Student. University of Montreal from Dec. 2019 to Feb. 2020, hosted by Prof. Jian-Yun Nie.
2019.10: Visiting Student (one week). University of Amsterdam, hosted by Prof. Maarten de Rijke.
2019.9-2019.12: Visiting Student. DIKU IR Lab in University of Copenhagen, hosted by Prof. Christina Lioma.
2017.7-2018.6: Full-time Associate Researcher, Data Application Center, Tencent, China.

Professional Activities

Website Chair of EMNLP 2023
Publicity Chair of NLPCC 2023
PC member of the Diachronic Lexical Semantics evaluation task in the 7th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian
PC member of the First Workshop on Evaluation and Comparison of NLP Systems at EMNLP 2020
Co-organized the Kingston-Montreal NLP/IR workshop between researchers in the RALI lab, University of Montreal, and Queen's University.
Founding member of the Quantum Penguin Club at Tencent, the predecessor of Tencent Quantum Lab.
Served as an Executive Committee member of the CCF (China Computer Federation) Tianjin University Branch in 2015-2016, responsible for inviting scholars for academic exchange.
Reviewer: ICLR 2021/2020, NeurIPS 2021/2020, ICML 2022

Publications @ Google Scholar

After becoming a faculty member. Ph.D. students and research assistants under my supervision are underlined.

Xu Wang, Yan Hu, Wenyu Du, Reynold Cheng, Benyou Wang, Difan Zou. Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis. ICML 2025.
Yumou Liu, An Li, Chaojie Li, Fei Yu, Benyou Wang*. Periodical Moving Average Accelerates Gradient Accumulation for Post-Training. UAI 2025.
Yiran Qin, Ao Sun, Hong Yuze, Benyou Wang, Ruimao Zhang. NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants. ICRA 2025.
Jianqing Zhu, Huang Huang, Zhihang Lin, Juhao Liang, Zhengyang Tang, Khalid Almubarak, Mosen Alharthi, Bang An, Juncai He, Xiangbo Wu, Fei Yu, Junying Chen, MA Zhuoheng, Yuhao Du, He Zhang, Saied Alshahrani, Emad A. Alghamdi, Lian Zhang, Ruoyu Sun, Haizhou Li, Benyou Wang*, Jinchao Xu. Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion. ACL 2025.
Zhenyang Cai, Junying Chen, Rongsheng Wang, Weihong Wang, Yonglin Deng, Dingjie Song, Yize Chen, Zixu Zhang, Benyou Wang. Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging. ACL 2025.
Yuhao Zhang, Zhiheng Liu, Fan Bu, Ruiyu Zhang, Benyou Wang, Haizhou Li. Soundwave: Less is More for Speech-Text Alignment in LLMs. ACL 2025.
Junying Chen, Chi Gui, Anningzhe Gao, Ke Ji, Xidong Wang, Xiang Wan, Benyou Wang. CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis. Findings of ACL 2025.
Junying Chen, Zhenyang Cai, Ke Ji, Xidong Wang, Wanlong Liu, Rongsheng Wang, Benyou Wang. Towards Medical Complex Reasoning with LLMs through Medical Verifiable Problems. Findings of ACL 2025.
Ke Ji, Junying Chen, Anningzhe Gao, Wenya Xie, Xiang Wan, Benyou Wang. Unlocking LLMs’ Self-Improvement Capacity with Autonomous Learning for Domain Adaptation. Findings of ACL 2025.
Guorui Zheng, Xidong Wang, Juhao Liang, Nuo Chen, Yuping Zheng, Benyou Wang. Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts. ICLR 2025.
Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Chenghao Ma, Shanghaoran Quan, Liang Chen, Qingxiu Dong, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Ge Zhang, Lei Li, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang. Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models. ICLR 2025.
Chenyu Huang, Zhengyang Tang, Shixi Hu, Ruoqing Jiang, Xin Zheng, Dongdong Ge, Benyou Wang, Zizhuo Wang. ORLM: A Customizable Framework in Training Large Models for Automated Optimization Modeling. Operations Research
Junzhi Chen, Juhao Liang, Benyou Wang. Smurfs: Multi-Agent System using Context-Efficient DFSDT for Tool Planning. NAACL 2025.
Chenghao Zhu, Nuo Chen, Yufei Gao, Yunyi Zhang, Prayag Tiwari, Benyou Wang. Is Your LLM Outdated? Evaluating LLMs at Temporal Generalization. NAACL 2025
Wentao Ge, Shunian Chen, Guiming Hardy Chen, Junying Chen, Zhihong Chen, Nuo Chen, Wenya Xie, Shuo Yan, Chenghao Zhu, Ziyue Lin, Song Dingjie, Xidong Wang, Anningzhe Gao, Zhang Zhiyi, Jianquan Li, Xiang Wan, Benyou Wang. MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria. NAACL 2025.
Dingjie Song, Wenjun Wang, Shunian Chen, Xidong Wang, Michael X. Guan, Benyou Wang. Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs. COLING 2025.
Juhao Liang, Zhenyang Cai, Jianqing Zhu, Huang Huang, Kewei Zong, Bang An, Mosen Alharthi, Juncai He, Lian Zhang, Haizhou Li, Benyou Wang#, Jinchao Xu. Alignment at Pre-training! Towards Native Alignment for Arabic LLMs. NeurIPS 2024
Pengcheng Chen, Jin Ye, Guoan Wang, Yanjun Li, Zhongying Deng, Wei Li, Tianbin Li, Haodong Duan, Ziyan Huang, Yanzhou Su, Benyou Wang, Shaoting Zhang, Bin Fu, Jianfei Cai, Bohan Zhuang, Eric J Seibel, Junjun He, Yu Qiao. GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI. NeurIPS 2024 Track Datasets and Benchmarks
Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Huang Jiajia, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang. FinBen: An Holistic Financial Benchmark for Large Language Models. NeurIPS 2024 Track Datasets and Benchmarks
Junying Chen, Chi Gui, Ruyi Ouyang, Anningzhe Gao, Shunian Chen, Guiming Hardy Chen, Xidong Wang, Ruifei Zhang, Zhenyang Cai, Ke Ji, Guangjun Yu, Xiang Wan, Benyou Wang. HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale. EMNLP 2024.
Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang. Humans or LLMs as the Judge? A Study on Judgement Biases. EMNLP 2024
Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu. VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment. EMNLP 2024
Junying Chen, Xidong Wang, Ke Ji, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang. HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs. COLM 2024
Song Dingjie, Shunian Chen, Guiming Hardy Chen, Fei Yu, Xiang Wan, Benyou Wang. MileBench: Benchmarking MLLMs in Long Context. COLM 2024
Chen Zhang, Benyou Wang, Dawei Song. On Elastic Language Models. ACM TOIS
Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang, and Benyou Wang. Large Language Model as a User Simulator. ACL 2024
Zhengyang Tang, Xingxing Zhang, Benyou Wang, Furu Wei. MathScale: Scaling Instruction Tuning for Mathematical Reasoning. ICML 2024
Xianghong Fang, Jian Li, Qiang Sun, Benyou Wang. Rethinking the Uniformity Metric in Self-Supervised Learning. ICLR 2024
Xidong Wang, Guiming Hardy Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li. CMB: A Comprehensive Medical Benchmark in Chinese. 2023. NAACL 2024. Online leaderboard
Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Juncai He, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu. AceGPT, Localizing Large Language Models in Arabic. NAACL 2024. (HuggingFace downloads: 10K per month.)
Fei Yu, Anningzhe Gao, Benyou Wang. Outcome-supervised verifiers for planning in mathematical reasoning. Findings of NAACL 2024. (This work brings 7B LLMs to an accuracy of 0.8, and even 0.9, on GSM8K; see the leaderboard.)
Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li. HuatuoGPT, towards Taming Language Model to Be a Doctor. 2023. Findings of EMNLP 2023. (GitHub stars: 1K; online access: 400K+)
Zhihong Chen, Feng Jiang, Junying Chen, Tiannan Wang, Fei Yu, Guiming Chen, Hongbo Zhang, Juhao Liang, Chen Zhang, Zhiyi Zhang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li. "Phoenix: Democratizing chatgpt across languages". arXiv. Code (GitHub stars: 3K). HuggingFace downloads: 4K per month.
Fei Yu, Hongbo Zhang, Prayag Tiwari, and Benyou Wang. Natural language reasoning, a survey. 2023. ACM Computing Surveys.
Zhongwei Wan, Che Liu, Mi Zhang, Jie Fu, Benyou Wang, Sibo Cheng, Lei Ma, César Quilodrán-Casas, Rossella Arcucci. Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias. NeurIPS 2023
Yazhou Zhang, Yang Yu, Qing Guo, Benyou Wang, Dongming Zhao, Sagar Uprety, Dawei Song, Jing Qin, Qiuchi Li. All In One: A Chinese Multi-Modal Dataset for Multi-Affection Detection in Conversations. NeurIPS 2023 Track Datasets and Benchmarks
Zhihong Chen, Shizhe Diao, Benyou Wang, Guanbin Li, and Xiang Wan. "Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts". ICCV 2023
Zhongwei Wan, Xin Liu, Benyou Wang, Jiezhong Qiu, Boyu Li, Ting Guo, Guangyong Chen, Yang Wang. Spatio-Temporal Contrastive Learning Enhanced GNNs for Session-based Recommendation. ACM Transactions on Information Systems (TOIS)
Jianquan Li, Xiangbo Wu, Xiaokang Liu, Prayag Tiwari, Qianqian Xie, and Benyou Wang. "Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk". ACL 2023
Yajiao Liu, Xin Jiang, Yichun Yin, Yasheng Wang, Fei Mi, Qun Liu, Xiang Wan, and Benyou Wang. One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems. ACL 2023
Chen Zhang, Yang Yang, Jiahao Liu, Jingang Wang, Yunsen Xian, Benyou Wang and Dawei Song. Lifting the Curse of Capacity Gap in Distilling Language Models. ACL 2023
Zhihong Chen, Guiming Hardy Chen, Shizhe Diao, Xiang Wan, and Benyou Wang. On the Difference of BERT-style and CLIP-style Text Encoders. Findings of ACL 2023
Xiaokang Liu, Jianquan Li, Jingjing Mu, Min Yang, Ruifeng Xu, and Benyou Wang. "Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary". AAAI 2023
Le Sun, Mingyang Zhang, Benyou Wang and Prayag Tiwari. Few-Shot Class-Incremental Learning for Medical Time Series Classification. IEEE Journal of Biomedical and Health Informatics. 2023
Yaochen Liu, Qiuchi Li, Benyou Wang, Yazhou Zhang, Dawei Song. "A survey of quantum-cognitively inspired sentiment analysis models". ACM Computing Surveys 2023
Benyou Wang, Qianqian Xie, Jiahuan Pei, Prayag Tiwari, Zhao Li, and Fu Jie. "Pre-trained Language Models in Biomedical Domain: A Survey from Multiscale Perspective". ACM Computing Surveys.
Yi Yang, Chen Zhang, Benyou Wang and Dawei Song. "Doge Tickets: Uncovering Domain-general Language Models by Playing Lottery Tickets". NLPCC 2022, Best Paper.
Sunzhu Li, Peng Zhang, Guobing Gan, Xiuqing Lv, Benyou Wang, Junqiu Wei, Xin Jiang. "Hypoformer: Hybrid Decomposition Transformer for Edge-friendly Neural Machine Translation". EMNLP 2022.
Guobing Gan, Peng Zhang, Sunzhu Li, Xiuqing Lu, Benyou Wang. "MorphTE: Injecting morphology in tensorized embeddings". NeurIPS 2022.

Before becoming a faculty member

Benyou Wang, Yuxin Ren, Lifeng Shang, Xin Jiang, Qun Liu. "Exploring extreme parameter compression for pre-trained language models". ICLR 2022.
Peng Zhang, Wenjie Hui, Benyou Wang (corresponding), Donghao Zhao, Dawei Song, Christina Lioma, Jakob Grue Simonsen. "Complex-valued Neural Network-based Quantum Language Models". ACM Transactions on Information Systems.
Benyou Wang, Emanuele Di Buccio, Massimo Melucci. Word2Fun: Modelling Words as Functions for Diachronic Word Representation. NeurIPS 2021.
Benyou Wang, Lifeng Shang, Christina Lioma, Xin Jiang, Qun Liu, Jakob Grue Simonsen. On position embeddings in BERT. ICLR 2021.
Benyou Wang*, Donghao Zhao*, Christina Lioma, Qiuchi Li, Peng Zhang, Jakob Grue Simonsen. Encoding word order in complex embeddings. ICLR 2020, Spotlight paper (acceptance rate: 6%).
Qiuchi Li*, Benyou Wang*, Massimo Melucci. A Complex-valued Network for Matching. NAACL 2019, Best Explainable NLP Paper
Benyou Wang. Dynamic content monitoring and exploration using vector spaces. SIGIR 2019 doctoral consortium.
Benyou Wang*, Qiuchi Li*, Massimo Melucci, Dawei Song. Semantic Hilbert Space for Text Representation Learning. WWW 2019
Wei Zhao*, Benyou Wang*, Min Yang, Jianbo Ye, Zhou Zhao, Xiaojun Chen, Ying Shen. Leveraging Long and Short-term Information in Content-aware Movie Recommendation via Adversarial Training. IEEE Transactions on Cybernetics (TOC), 2019 (IF: 8.803)
Peng Zhang, Zhan Su, Lipeng Zhang, Benyou Wang, Dawei Song. A Quantum Many-body Wave Function Inspired Language Modeling Approach. CIKM 2018
Wei Zhao, Wang Benyou, Jianbo Ye, Yongqiang Gao, Min Yang, Xiaojun Chen. PLASTIC: Prioritize Long and Short-term Information in Top-n Recommendation using Adversarial Training. IJCAI 2018
Wei Zhao, Wang Benyou, Jianbo Ye, Min Yang, Zhou Zhao, Ruotian Luo, Yu Qiao. A Multi-task Learning Approach for Image Captioning. IJCAI 2018
Zhang Peng, Niu Jiabing, Su Zhan, Wang Benyou et al. End-to-End Quantum-like Language Models with Application to Question Answering. AAAI 2018
Wang Jun, Yu Lantao, Zhang Weinan, Gong Yu, Xu Yinghui, Wang Benyou, Zhang Peng, Zhang Dell. IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models. SIGIR 2017. Best Paper Award Honourable Mention. Zhihu link (in Chinese)
Wang Benyou, Niu Jiabing, Ma Liqun, Zhang Yuhua, Zhang Lipeng, Li Jinfei, Zhang Peng, Song Dawei. A Chinese Question Answering Approach Integrating Count-Based and Embedding-Based Features. ICCPOL-NLPCC, December 2016
Wang Benyou, Zhang Peng, Li Jinfei, Song Dawei, Hou Yuexian, Shang Zhenguo. Exploration of quantum interference in document relevance judgement discrepancy. Entropy, 18(4), 144. 2016. (IF: 1.821)
Chen Yongqiang, Zhang Peng, Song Dawei, Wang Benyou. A Real-Time Eye Tracking Based Query Expansion Approach via Latent Topic Modeling. CIKM 2015 (pp. 1719-1722). ACM. October, 2015

Book and book chapter

Huang Xin, Wei Zhao, Wang Benyou, Rui Zhao. Recommendation System and Deep Learning, Tsinghua University Press, in Chinese. I focused on the chapters related to "Learning to rank" and "Generative Adversarial Nets (GAN) for Recommendation". Online purchase links: JD and Dangdang.
Wang, B., Di Buccio, E., & Melucci, M. Representing words in vector space and beyond. In D. Aerts, A. Khrennikov, M. Melucci, & B. Toni (Eds.), Quantum-like models for information retrieval and decision-making. Springer.

Talks

"Discussion on the border of LLMs", the Practice from HuatuoGPT, Forum on Mechanism Analysis of Large Models, China Conference on Data Mining (CCDM 2024), Taian, Shandong, July 20th-30th 2024, invited by Prof. Chenliang Li
"Large Language Models for Healthcare", the Practice from HuatuoGPT, YSSNLP, Kunming, Yunnan, June 14th 2024, invited by Prof. Chenliang Li
"The practice and thinking of medical LLMs", workshop on LLM-empowered intelligent healthcare, May 6th 2024, Chongqing, invited by Prof. Hao Chen
"The progress of HuatuoGPT", guest lecture, Soutech, May 2024, invited by Prof. Bingyi Jin
"The present and the future of large language models", campus open day, April 2024
"The progress of the medical LLM HuatuoGPT", April 2024, Zhejiang University, invited by Prof. Guangyong Chen
"The progress of the medical LLM HuatuoGPT", April 2024, Shanghai AI Lab, invited by Dr. Huaxi Huang
"The progress of the medical LLM HuatuoGPT", April 2024, Peking University, invited by Prof. Xiaohua Zhou
"The practice and thinking of large language models", March 2024, Nankai University, invited by Prof. Jie Liu
"The progress of HuatuoGPT", HKUST (Guangzhou), April 2024, invited by Prof. Li Liu
"Large language models, the practice and the future", Huawei Noah's Ark Lab lecture series, hosted by Dr. Lifeng Shang, 2024
"What can quantum physics bring to NLP", Japan-China Joint Workshop on Quantum Information, invited by Prof. Masahito Hayashi, September 2023.
"AceGPT, the SOTA Arabic LLM", KAU talk invited by Prof. Eman, September 2023.
"Medical LLM, the practice from HuatuoGPT", online talk invited by Prof. Libo Qin in MLNLP, September 2023.
"Medical LLM, the practice from HuatuoGPT", CIPS tutorial in Jinan, September 2023.
"HuatuoGPT: taming language models to be doctors", IJCAI workshop hosted by Prof. Chen Chen, August 2023.
"Medical LLMs", CCF (China Computer Federation) Terminology Committee, hosted by Prof. Peng Zhang, August 2023.
"What should we do in the LLM era", talk to undergraduate students at SRIBD, hosted by the SRIBD Director Ping Lee, August 2023.
"LLM practice", KAUST, hosted by Prof. Jinchao Xu, August 2023.
"Vertical LLM", CIPS (Chinese Information Processing Society of China) tutorial, hosted by Prof. Min Yang, June 2023.
"Medical ChatGPT, our practice", Hong Kong Hospital Authority, May 2023.
"Medical ChatGPT, our practice", CCF (China Computer Federation) Tianjin Branch, hosted by Prof. Peng Zhang, May 2023.
"Large language model, our practice", KAUST, hosted by Prof. Jinchao Xu, May 2023.
"The interplay between Large language model and IR", Huawei lecture series, hosted by Prof. Rui Zhang, May 2023.
"Introduction to ChatGPT", talk to undergraduate students in SDS, hosted by Prof. Hongyuan Zha, April 2023.
"Introduction to ChatGPT", CIPS Youth Working Committee, hosted by Prof. Chenliang Ma, March 2023.
"Introduction to ChatGPT", Shanghai Jiao Tong University, hosted by Prof. Weinan Zhang, March 2023.
"Vertical LLM", Shenzhen Robot and AI lab, hosted by Prof. Hongyuan Zha, November 2022.
"Some proposals on biomedical NLP", online seminar, hosted by Prof. Xiang Wan, SRIBD, September 2022.
"Position embeddings", AI Times, hosted by Miss He Yun, July 2022.
"How and whether AI replaces Humans", Weibo live stream with more than 1M viewers, hosted by Mr Qingyi Gao, June 2022.
"Position embeddings", Renmin University of China, hosted by Prof. Yong Liu, April 2022.
"Quantum and AI: Opportunities, Challenges, and Future Trends", Huawei, Shenzhen, hosted by Prof. Qun Liu, March 2022.
"How quantum physics contributes to NLP", HIT(SZ), Shenzhen, hosted by Prof. Zenglin Xu, December 2021.
"Word2Fun, modeling words as functions for diachronic word embeddings", CSIRO, Australia, online, December 2021.
"Large-scale Pre-trained Language Models (PLMs): Potentials, Efficiency, and Future Trends", SUSTech, Shenzhen, hosted by Qi-Man Shao, December 2021.
"Bridging Quantum physics and NLP", Microsoft Research Asia (MSRA), Beijing, online, hosted by Nan Duan, November 2021.
"How physics and NLP help each other?", Institute of Theoretical Physics, Chinese Academy of Sciences (CAS), Beijing, December 2020.
"On Position embeddings", Alibaba, Beijing, China, December 2020.
"How quantum theory contributes to NLP", First workshop of quantum computing and AI, virtually, previously in Tianjin University, Tianjin, China, December 2020.
"Encoding Word Order in Complex Embeddings", Speech and Language Computing Group, Huawei Noah's Ark Lab, Shenzhen, China, December 2020.
"Investigating complex-valued representation in NLP", Montreal Institute for Learning Algorithms (MILA), Montreal, Canada, MILA NLP reading group, hosted by Alessandro Sordoni and Siva Reddy, January 21st, 2020.
"Beyond particles: modeling words as waves", RALI Département d’informatique et recherche opérationnelle, University of Montreal, Montreal, Canada, hosted by Jian-Yun Nie, December 7th, 2019.
Sequential Modeling in Vector Spaces, the Italian Information Retrieval Workshop, Sep. 2021
On Position embeddings, the China Student Symposium on NLP (CSSNLP), Beijing, December 2020
Invited lecture: Quantum theory and NLP, for bachelor students, Beijing Institute of Technology, Beijing, December 2020
Invited lecture: pretrained language model and its position embeddings, for bachelor students, Shandong University, Qingdao, Dec. 2020
Formulizing semantic shift detection as a distance between sets, EVALITA 2020 (Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian), Diachronic Lexical Semantics, online, December 17th 2020
How quantum theory contributes to NLP, First workshop of quantum computing and AI, virtually, previously in Tianjin University, Tianjin, China, 22 Nov. 2020
Encoding word order in complex embeddings, Speech and Language Computing Group, Huawei Noah's Ark Lab, Shenzhen, China, 23 April 2020
Dynamic Content Monitoring and Exploration using Vector Spaces, University of Bedfordshire, Luton, London, UK, 12 Feb. 2020
Quantum Mechanics meet Information Search and Retrieval – The QUARTZ Project, Great London Text Analytics meetup, London, UK, 12 Feb. 2020
Investigating complex-valued representation in NLP, Mila, Montreal, Canada, 20 Jan. 2020
Beyond particles: modeling words as waves, DIKU machine learning section, University of Copenhagen, Copenhagen, Denmark, Nov. 25th 2019
Quantum formulations for language: understand words as particles, meetup Search Engines Amsterdam, University of Amsterdam, Amsterdam, Netherlands, Oct. 25th 2019
Dynamic Content Monitoring and Exploration using Vector Spaces, SIGIR Doctoral Consortium, Paris, France, July 2019
2019 Joint Statistics Summer School by the Universities of Bolzano, Padova and Salzburg, Brixen, Italy, July 11, 2019
Quantum-inspired NLP/IR. Bytedance AI Lab, Hang Li's group, Beijing, China, June 28, 2019
Quantum-inspired NLP/IR. Tencent Cloud NLP team (Zhiwen Lab), Shenzhen, China, June 21, 2019
Representing and interpreting words in vector space inspired by Quantum theory. Quartz Workshop, University of Copenhagen
Tensor analysis for DL. Functional Analysis, University of Padova
Word embedding and beyond. IR group, University of Padova
Deep Learning in language: offline workshop in Padova

Teaching

CSC 6201/CIE 6021 Large Language Models, Fall 2023/2024. (This might be the first course on large language models in the world.)

CSC6052/DDA6307/MDS6002 Natural Language Processing, Spring 2024/2025

Open-Source code

Phoenix
Medical_NLP
HuatuoGPT
TextClassificationBenchmark
InstructionZoo
HuatuoGPT-II
ALLaVA
Huatuo-26M
crosstalk-generation
ReasoningNLP
Apollo
Evaluation-of-ChatGPT-on-Information-Extraction
CMB
OVM
AceGPT
HuatuoGPT-Vision
LongLLaVA

Co-supervised PhD Students

Fei Yu, Ph.D. (interned at Tencent AI Lab), starting from 2022 Fall
Zhengyang Tang, Ph.D. (interned at MSRA and Alibaba Qwen), starting from 2023 Spring
Bohao Li, Ph.D. (interned at MSRA), starting from 2023 Fall
Junying Chen, Ph.D. (previous RA), starting from 2023 Fall
Xidong Wang, Ph.D. (previous RA), starting from 2023 Fall
Juhao Liang, Ph.D. (previous RA, interned at Huawei), starting from 2023 Fall
Ke Ji, Ph.D. (previous RA, interned at Tencent AI Lab), starting from 2024 Fall
Shunian Chen (co-supervised by Prof. Yongtao Guan), Ph.D. (previous RA), starting from 2024 Fall

Alumni

Zhihong Chen, graduated Ph.D. (joined Stanford as a post-doc; now running a startup in medical imaging using LLMs in Silicon Valley)
Nuo Chen, RA (now a Ph.D. student at NUS)
Zhiyuan Fan, RA (now a Ph.D. student at HKUST)
Zhihan Zhang, RA (now a Ph.D. student at NUS)
Guiming Chen, RA (now a Ph.D. student at the University of Texas at Dallas)
Wenya Xie, RA (now a Ph.D. student at the University of Minnesota)
Dingjie Song, RA (now a Ph.D. student at Lehigh University)
Shuo Yan, RA (now a Ph.D. student at the University of Texas at Dallas)
Zhuoheng Ma, Master's student intern at CUHKSZ (now at Weixin)
Guorui Zheng, Master's student intern at CUHKSZ (now interning at DiDi)
Yiran Xie, Master's student intern at CUHKSZ (now interning at Weixin)
Ruoli Gan, Master's student intern at CUHKSZ (now interning at Huawei)
Yuheng Liu, undergraduate student intern at CUHKSZ (now a Master's student at New York University)