Publications

See also: Google Scholar, DBLP

Open-source Code

TruthRL  WebAgent-R1  AdaDecode  InstructRAG  CasRel  BERT-NER 

Preprints / In Submission

  1. [Preprint] TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning [code]
    Zhepei Wei, Xiao Yang, Kai Sun, Jiaqi Wang, Rulin Shao, Sean Chen, Mohammad Kachuee, Teja Gollapudi, Tony Liao, Nicolas Scheffer, Rakesh Wanga, Anuj Kumar, Yu Meng, Wen-tau Yih, Xin Luna Dong

  2. [Preprint] Do LLM Evaluators Prefer Themselves for a Reason? [code]
    Wei-Lin Chen, Zhepei Wei, Xinyu Zhu, Shi Feng, Yu Meng

  3. [Preprint] Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents [code]
    Yiding Wang, Zhepei Wei, Xinyu Zhu, Yu Meng

Conference Papers

  1. [ACL 2026 Findings] PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time
    Weizhi Zhang, Xinyang Zhang, Chenwei Zhang, Liangwei Yang, Jingbo Shang, Zhepei Wei, Henry Peng Zou, Zijie Huang, Zhengyang Wang, Yifan Gao, Xiaoman Pan, Lian Xiong, Jingguo Liu, Philip S. Yu, Xian Li

  2. [ACL 2026] Aligning Large Language Models via Fully Self-Synthetic Data [code]
    Shangjian Yin, Zhepei Wei, Xinyu Zhu, Wei-Lin Chen, Yu Meng

  3. [NeurIPS 2025] The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning [code]
    Xinyu Zhu, Mengzhou Xia, Zhepei Wei, Wei-Lin Chen, Danqi Chen, Yu Meng

  4. [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning [code]
    Zhepei Wei, Wenlin Yao, Yao Liu, Weizhi Zhang, Qin Lu, Liang Qiu, Changlong Yu, Puyang Xu, Chao Zhang, Bing Yin, Hyokun Yun, Lihong Li

  5. [ICML 2025] AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism [code]
    Zhepei Wei, Wei-Lin Chen, Xinyu Zhu, Yu Meng. Previously presented at NeurIPS 2024 AFM Workshop (Oral: 8/157)

  6. [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales [code] [webpage]
    Zhepei Wei, Wei-Lin Chen, Yu Meng

  7. [ICLR 2024] Incentivized Truthful Communication for Federated Bandits
    Zhepei Wei, Chuanhao Li, Tianze Ren, Haifeng Xu, Hongning Wang

  8. [NeurIPS 2023] Incentivized Communication for Federated Bandits
    Zhepei Wei, Chuanhao Li, Haifeng Xu, Hongning Wang

  9. [EMNLP 2022] Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables
    Erxin Yu, Lan Du, Yuan Jin, Zhepei Wei, Yi Chang

  10. [AACL 2022] Towards Unified Representations of Knowledge Graph and Expert Rules for Machine Learning and Reasoning
    Zhepei Wei, Yue Wang, Jinnan Li, Zhining Liu, Erxin Yu, Yuan Tian, Xin Wang, Yi Chang

  11. [IJCAI 2022] AttExplainer: Explain Transformer via Attention by Reinforcement Learning [code]
    Runliang Niu, Zhepei Wei, Yan Wang, Qi Wang

  12. [ACL 2020] A Novel Cascade Binary Tagging Framework for Relational Triple Extraction [code]
    Zhepei Wei, Jianlin Su, Yue Wang, Yuan Tian, Yi Chang