default search action

combined dblp search
author search
venue search
publication search

ask others

Zhiheng Xi

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/DouZLGSXZWXFPZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/DouZLGSXZWXFPZZ24
Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin. ACL (1) 2024: 1932-1945
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Dou0JZXSHWFXZJZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Dou0JZXSHWFXZJZ24
Shihan Dou, Yan Liu, Haoxiang Jia, Enyu Zhou, Limao Xiong, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang:
StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback. ACL (1) 2024: 4571-4585
[c14]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/coling/ZhouCZXG0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhouCZXG0024
Yuhao Zhou, Wenxiang Chen, Rui Zheng, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
ORTicket: Let One Robust BERT Ticket Transfer across Different Tasks. LREC/COLING 2024: 12527-12538
[c13]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/coling/ZhangWXXGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhangWXXGZ024
Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang, Xuanjing Huang:
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions. LREC/COLING 2024: 14186-14203
[c12]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/coling/ZhengZXGZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/ZhengZXGZH24
Rui Zheng, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals. LREC/COLING 2024: 15410-15421
[c11]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/emnlp/WangZCXSZYGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangZCXSZYGZ024
Binghai Wang, Rui Zheng, Lu Chen, Zhiheng Xi, Wei Shen, Yuhao Zhou, Dong Yan, Tao Gui, Qi Zhang, Xuanjing Huang:
Reward Modeling Requires Automatic Adjustment Based on Data Quality. EMNLP (Findings) 2024: 4041-4064
[c10]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/emnlp/XiaGGX0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/XiaGGX0024
Han Xia, Songyang Gao, Qiming Ge, Zhiheng Xi, Qi Zhang, Xuanjing Huang:
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data. EMNLP (Findings) 2024: 8178-8188
[c9]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/emnlp/ChenZWJHYZZXGZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenZWJHYZZXGZ024
Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning. EMNLP 2024: 15270-15283
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ZhengSHLDZXWHG024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhengSHLDZXWHG024
Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang:
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning. ICLR 2024
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/XiCHJZHDLGWGSFZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/XiCHJZHDLGWGSFZ24
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang:
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning. ICML 2024
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/HeLZDLXGZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/HeLZDLXGZH24
Wei He, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models. NAACL-HLT (Findings) 2024: 3829-3845
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-06080
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-06080
Binghai Wang, Rui Zheng, Lu Chen, Yan Liu, Shihan Dou, Caishuang Huang, Wei Shen, Senjie Jin, Enyu Zhou, Chenyu Shi, Songyang Gao, Nuo Xu, Yuhao Zhou, Xiaoran Fan, Zhiheng Xi, Jun Zhao, Xiao Wang, Tao Ji, Hang Yan, Lixing Shen, Zhan Chen, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
Secrets of RLHF in Large Language Models Part II: Reward Modeling. CoRR abs/2401.06080 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-17221
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-17221
Xiaoran Fan, Tao Ji, Changhao Jiang, Shuo Li, Senjie Jin, Sirui Song, Junke Wang, Boyang Hong, Lu Chen, Guodong Zheng, Ming Zhang, Caishuang Huang, Rui Zheng, Zhiheng Xi, Yuhao Zhou, Shihan Dou, Junjie Ye, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
MouSi: Poly-Visual-Expert Vision-Language Models. CoRR abs/2401.17221 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-01391
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-01391
Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui:
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback. CoRR abs/2402.01391 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-05808
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05808
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang:
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning. CoRR abs/2402.05808 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-16431
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16431
Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang, Xuanjing Huang:
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions. CoRR abs/2402.16431 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-12171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12171
Weikang Zhou, Xiao Wang, Limao Xiong, Han Xia, Yingshuang Gu, Mingxu Chai, Fukang Zhu, Caishuang Huang, Shihan Dou, Zhiheng Xi, Rui Zheng, Songyang Gao, Yicheng Zou, Hang Yan, Yifan Le, Ruohui Wang, Lijun Li, Jing Shao, Tao Gui, Qi Zhang, Xuanjing Huang:
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models. CoRR abs/2403.12171 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-16176
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-16176
Rui Zheng, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals. CoRR abs/2403.16176 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-00884
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-00884
Wei He, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models. CoRR abs/2404.00884 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04151
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04151
Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-Gang Jiang:
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments. CoRR abs/2406.04151 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10977
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10977
Rui Zheng, Hongyi Guo, Zhihan Liu, Xiaoying Zhang, Yuanshun Yao, Xiaojun Xu, Zhaoran Wang, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang, Hang Li, Yang Liu:
Toward Optimal LLM Alignments Using Two-Player Games. CoRR abs/2406.10977 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-14874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-14874
Han Xia, Songyang Gao, Qiming Ge, Zhiheng Xi, Qi Zhang, Xuanjing Huang:
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data. CoRR abs/2408.14874 (2024)
2023
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhengXLLGZHMSG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhengXLLGZHMSG23
Rui Zheng, Zhiheng Xi, Qin Liu, Wenbin Lai, Tao Gui, Qi Zhang, Xuanjing Huang, Jin Ma, Ying Shan, Weifeng Ge:
Characterizing the Impacts of Instances on Robustness. ACL (Findings) 2023: 2314-2332
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/XiZZHWPSZG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiZZHWPSZG23
Zhiheng Xi, Rui Zheng, Yuansen Zhang, Xuanjing Huang, Zhongyu Wei, Minlong Peng, Mingming Sun, Qi Zhang, Tao Gui:
Connectivity Patterns are Task Embeddings. ACL (Findings) 2023: 11993-12013
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhouZXGFFYGZH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhouZXGFFYGZH23
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang:
RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms. EMNLP (Findings) 2023: 10262-10274
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/XiJZZGLGZH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/XiJZZGLGZH23
Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Jia Liu, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement. EMNLP (Findings) 2023: 11383-11406
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-14497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-14497
Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang:
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement. CoRR abs/2305.14497 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-04964
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-04964
Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang:
Secrets of RLHF in Large Language Models Part I: PPO. CoRR abs/2307.04964 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-01191
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-01191
Shihan Dou, Junjie Shan, Haoxiang Jia, Wenhao Deng, Zhiheng Xi, Wei He, Yueming Wu, Tao Gui, Yang Liu, Xuanjing Huang:
Towards Understanding the Capability of Large Language Models on Code Clone Detection: A Survey. CoRR abs/2308.01191 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07864
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07864
Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui:
The Rise and Potential of Large Language Model Based Agents: A Survey. CoRR abs/2309.07864 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-06762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-06762
Xiao Wang, Yuansen Zhang, Tianze Chen, Songyang Gao, Senjie Jin, Xianjun Yang, Zhiheng Xi, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xuanjing Huang:
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models. CoRR abs/2310.06762 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11227
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11227
Enyu Zhou, Rui Zheng, Zhiheng Xi, Songyang Gao, Xiaoran Fan, Zichu Fei, Jingting Ye, Tao Gui, Qi Zhang, Xuanjing Huang:
RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms. CoRR abs/2310.11227 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11971
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11971
Rui Zheng, Wei Shen, Yuan Hua, Wenbin Lai, Shihan Dou, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Haoran Huang, Tao Gui, Qi Zhang, Xuanjing Huang:
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning. CoRR abs/2310.11971 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09979
Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Jun Zhao, Wei Shen, Yuhao Zhou, Zhiheng Xi, Xiao Wang, Xiaoran Fan, Shiliang Pu, Jiang Zhu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment. CoRR abs/2312.09979 (2023)
2022
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/XiZGZH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/XiZGZH22
Zhiheng Xi, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
Efficient Adversarial Training with Robust Early-Bird Tickets. EMNLP 2022: 8318-8331
[i1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-07263
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-07263
Zhiheng Xi, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang:
Efficient Adversarial Training with Robust Early-Bird Tickets. CoRR abs/2211.07263 (2022)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.