Weiwei Sun

I am a PhD student at LTI, Carnegie Mellon University, advised by Yiming Yang. Before that, I received my M.E. and B.E. from Shandong University, advised by Zhaochun Ren. My recent research interests focus on large language models, augmented language models, generative retrieval.

Email  /  Twitter  /  LinkedIn  /  Google Scholar  /  Github

Publications
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval
Weiwei Sun, Zhengliang Shi, Jiulong Wu, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren
EMNLP 2024
paper  /  data  /  code
TourRank: Utilizing Large Language Models for Documents Ranking with a Tournament-Inspired Strategy
Yiqun Chen, Qi Liu, Yi Zhang, Weiwei Sun, Daiting Shi, Jiaxin Mao, Dawei Yin
Preprint
paper  /  code
MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter
Jitai Hao, Weiwei Sun, Xin Xin, Qi Meng, Zhumin Chen, Pengjie Ren, Zhaochun Ren
ACL 2024
paper  /  code
Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering
Zhengliang Shi, Shuo Zhang, Weiwei Sun, Shen Gao, Pengjie Ren, Zhumin Chen, Zhaochun Ren
ACL 2024
Enhanced Generative Recommendation via Content and Collaboration Integration
Yidan Wang, Zhaochun Ren, Weiwei Sun, Jiyuan Yang, Zhixiang Liang, Xin Chen, Ruobing Xie, Su Yan, Xu Zhang, Pengjie Ren, Zhumin Chen, Xin Xin
CIKM 2024
paper
Improving the Robustness of Large Language Models via Consistency Alignment
Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Shuaiqiang Wang, Chong Meng, Zhicong Cheng, Zhaochun Ren, Dawei Yin
LREC-Coling 2024
paper
How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study
Tianjie Ju, Weiwei Sun, Wei Du, Xinwei Yuan, Zhaochun Ren, Gongshen Liu
LREC-Coling 2024
paper  /  code
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin
NAACL 2024
paper  /  code
Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers
Weiwei Sun, Zheng Chen, Xinyu Ma, Lingyong Yan, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
arXiv 2023, GenRec workshop at CIKM 2023
paper  /  code
Learning to Tokenize for Generative Retrieval
Weiwei Sun, Lingyong Yan, Zheng Chen, Shuaiqiang Wang, Haichao Zhu, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren.
NeurIPS 2023
paper  /  code
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren.
EMNLP 2023   (Outstanding Paper Award)
paper  /  code
DiQAD: A Benchmark Dataset for End-to-end Open-domain Dialogue Assessment
Yukun Zhao, Lingyong Yan, Weiwei Sun, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin.
Findings of EMNLP 2023
paper  /  code
Answering Ambiguous Questions via Iterative Prompting
Weiwei Sun, Hengyi Cai, Hongshen Chen, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren.
ACL 2023
paper  /  code
RADE: Reference-Assisted Dialogue Evaluation for Open-Domain Dialogue
Zhengliang Shi, Weiwei Sun, Shuo Zhang, Zhen Zhang, Pengjie Ren, Zhaochun Ren.
ACL 2023
paper  /  code
Towards Explainable Conversational Recommender Systems
Shuyu Guo, Shuo Zhang, Weiwei Sun, Pengjie Ren, Zhumin Chen, Zhaochun Ren.
SIGIR 2023
paper  /  code
Generative Knowledge Selection for Knowledge-Grounded Dialogues
Weiwei Sun, Pengjie Ren, Zhaochun Ren.
Findings of EACL 2023
paper  /  code
Contrastive Learning Reduces Hallucination in Conversations
Weiwei Sun, Zhengliang Shi, Shen Gao, Pengjie Ren, Maarten de Rijke, Zhaochun Ren.
AAAI 2023
paper  /  code
Metaphorical User Simulators for Evaluating Task-oriented Dialogue Systems
Weiwei Sun, Shuyu Guo, Shuo Zhang, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren.
ACM TOIS
paper  /  code (SimTester)  /  code (MetaSim)
Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems
Weiwei Sun, Shuo Zhang, Krisztian Balog, Zhaochun Ren, Pengjie Ren, Zhumin Chen, Maarten de Rijke.
SIGIR 2021
paper  /  code
Conversations Powered by Cross-Lingual Knowledge
Weiwei Sun, Chuan Meng, Qi Meng, Zhaochun Ren, Pengjie Ren, Zhumin Chen, Maarten de Rijke.
SIGIR 2021
paper  /  code
DukeNet: A Dual Knowledge Interaction Network for Knowledge-Grounded Conversation
Chuan Meng, Pengjie Ren, Zhumin Chen, Weiwei Sun, Zhaochun Ren, Zhaopeng Tu, Maarten de Rijke.
SIGIR 2020
paper  /  code
Education

PhD, Language Technologies Institute, Carnegie Mellon University, 2024.8 - present

M.E., Computer Science, Shandong University, 2021.9 - 2023.12

B.E., Computer Science, Shandong University, 2017.9 - 2021.6

Internship

Vector Institute, with Colin Raffel, 2024.7 - 2024.9

University of Amsterdam, IR Lab, with Maarten de Rijke, remote, 2024

Baidu, Search Science Team, with Lingyong Yan and Xinyu Ma, 2022.9 - 2023.12

JD.com, Data Science Lab, with Hongshen Chen and Hengyi Cai, 2021.3 - 2021.9

Shandong University, IR Lab, with Zhaochun Ren, 2019.9 - 2021.3

Award

2023 Baidu Scholarship, 2024.1

Outstanding Paper Award, EMNLP 2023, 2023.12

Presidential Scholarship, Shandong University, 2023.11

National Scholarship, Shandong University, 2023.11

Reviewer

EMNLP 2022, ACL 2023, SIGIR 2023, SIGIR-AP 2023, ECML/PKDD 2023, IPM 2023, WSDM 2023, EMNLP 2023, TALLIP 2023, SIGIR 2024, EMNLP 2024 (SAC), NeurIPS 2024, CIKM 2024, ICLR 2025


The design of this website is borrowed from here