Kianté Brantley
2020 – today
- 2024
- [j1] Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun: RL for Consistency Models: Reward Guided Text-to-Image Generation with Fast Inference. RLJ 4: 1656-1673 (2024)
- [c15] Jonathan D. Chang, Dhruv Sreenivas, Yingbing Huang, Kianté Brantley, Wen Sun: Adversarial Imitation Learning via Boosting. ICLR 2024
- [c14] My Phan, Kianté Brantley, Stephanie Milani, Soroush Mehri, Gokul Swamy, Geoffrey J. Gordon: When is Transfer Learning Possible? ICML 2024
- [c13] Aaron David Tucker, Kianté Brantley, Adam Cahall, Thorsten Joachims: Coactive Learning for Large Language Models using Implicit User Feedback. ICML 2024
- [c12] Kianté Brantley, Zhichong Fang, Sarah Dean, Thorsten Joachims: Ranking with Long-Term Constraints. WSDM 2024: 47-56
- [i21] Zhaolin Gao, Kianté Brantley, Thorsten Joachims: Reviewer2: Optimizing Review Generation Through Prompt Generation. CoRR abs/2402.10886 (2024)
- [i20] Anne Wu, Kianté Brantley, Yoav Artzi: A Surprising Failure? Multimodal LLMs and the NLVR Challenge. CoRR abs/2402.17793 (2024)
- [i19] Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun: RL for Consistency Models: Faster Reward Guided Text-to-Image Generation. CoRR abs/2404.03673 (2024)
- [i18] Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun: Dataset Reset Policy Optimization for RLHF. CoRR abs/2404.08495 (2024)
- [i17] Jonathan D. Chang, Dhruv Sreenivas, Yingbing Huang, Kianté Brantley, Wen Sun: Adversarial Imitation Learning via Boosting. CoRR abs/2404.08513 (2024)
- [i16] Zhaolin Gao, Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Gokul Swamy, Kianté Brantley, Thorsten Joachims, J. Andrew Bagnell, Jason D. Lee, Wen Sun: REBEL: Reinforcement Learning via Regressing Relative Rewards. CoRR abs/2404.16767 (2024)
- [i15] Zhaolin Gao, Wenhao Zhan, Jonathan D. Chang, Gokul Swamy, Kianté Brantley, Jason D. Lee, Wen Sun: Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF. CoRR abs/2410.04612 (2024)
- [i14] Giovanni Monea, Antoine Bosselut, Kianté Brantley, Yoav Artzi: LLMs Are In-Context Reinforcement Learners. CoRR abs/2410.05362 (2024)
- 2023
- [c11] Anne Wu, Kianté Brantley, Noriyuki Kojima, Yoav Artzi: lilGym: Natural Language Visual Reasoning with Reinforcement Learning. ACL (1) 2023: 9214-9234
- [c10] Felix Faltings, Michel Galley, Kianté Brantley, Baolin Peng, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan: Interactive Text Generation. EMNLP 2023: 4450-4468
- [c9] Rajkumar Ramamurthy, Prithviraj Ammanabrolu, Kianté Brantley, Jack Hessel, Rafet Sifa, Christian Bauckhage, Hannaneh Hajishirzi, Yejin Choi: Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization. ICLR 2023
- [i13] Felix Faltings, Michel Galley, Baolin Peng, Kianté Brantley, Weixin Cai, Yizhe Zhang, Jianfeng Gao, Bill Dolan: Interactive Text Generation. CoRR abs/2303.00908 (2023)
- [i12] Jonathan D. Chang, Kianté Brantley, Rajkumar Ramamurthy, Dipendra Misra, Wen Sun: Learning to Generate Better Than Your LLM. CoRR abs/2306.11816 (2023)
- [i11] Kianté Brantley, Zhichong Fang, Sarah Dean, Thorsten Joachims: Ranking with Long-Term Constraints. CoRR abs/2307.04923 (2023)
- [i10] Ge Gao, Jonathan D. Chang, Claire Cardie, Kianté Brantley, Thorsten Joachims: Policy-Gradient Training of Language Models for Ranking. CoRR abs/2310.04407 (2023)
- 2022
- [i9] Rajkumar Ramamurthy, Prithviraj Ammanabrolu, Kianté Brantley, Jack Hessel, Rafet Sifa, Christian Bauckhage, Hannaneh Hajishirzi, Yejin Choi: Is Reinforcement Learning (Not) for Natural Language Processing?: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization. CoRR abs/2210.01241 (2022)
- [i8] Anne Wu, Kianté Brantley, Noriyuki Kojima, Yoav Artzi: lilGym: Natural Language Visual Reasoning with Reinforcement Learning. CoRR abs/2211.01994 (2022)
- 2021
- [b1] Kianté Brantley: Expert-in-the-Loop for Sequential Decisions and Predictions. University of Maryland, College Park, MD, USA, 2021
- [c8] Kianté Brantley, Soroush Mehri, Geoffrey J. Gordon: Successor Feature Sets: Generalizing Successor Representations Across Policies. AAAI 2021: 11774-11781
- [i7] Kianté Brantley, Soroush Mehri, Geoffrey J. Gordon: Successor Feature Sets: Generalizing Successor Representations Across Policies. CoRR abs/2103.02650 (2021)
- 2020
- [c7] Kianté Brantley, Hal Daumé III, Amr Sharaf: Active Imitation Learning with Noisy Guidance. ACL 2020: 2093-2105
- [c6] Kianté Brantley, Wen Sun, Mikael Henaff: Disagreement-Regularized Imitation Learning. ICLR 2020
- [c5] Kianté Brantley, Miroslav Dudík, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun: Constrained episodic reinforcement learning in concave-convex and knapsack settings. NeurIPS 2020
- [i6] Kianté Brantley, Amr Sharaf, Hal Daumé III: Active Imitation Learning with Noisy Guidance. CoRR abs/2005.12801 (2020)
- [i5] Kianté Brantley, Miroslav Dudík, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun: Constrained episodic reinforcement learning in concave-convex and knapsack settings. CoRR abs/2006.05051 (2020)
2010 – 2019
- 2019
- [c4] Kianté Brantley, Kyunghyun Cho, Hal Daumé III, Sean Welleck: Non-Monotonic Sequential Text Generation. WNLP@ACL 2019: 57-59
- [c3] Sean Welleck, Kianté Brantley, Hal Daumé III, Kyunghyun Cho: Non-Monotonic Sequential Text Generation. ICML 2019: 6716-6726
- [c2] Sobhan Miryoosefi, Kianté Brantley, Hal Daumé III, Miroslav Dudík, Robert E. Schapire: Reinforcement Learning with Convex Constraints. NeurIPS 2019: 14070-14079
- [i4] Sean Welleck, Kianté Brantley, Hal Daumé III, Kyunghyun Cho: Non-Monotonic Sequential Text Generation. CoRR abs/1902.02192 (2019)
- [i3] Sobhan Miryoosefi, Kianté Brantley, Hal Daumé III, Miroslav Dudík, Robert E. Schapire: Reinforcement Learning with Convex Constraints. CoRR abs/1906.09323 (2019)
- 2017
- [c1] Amr Sharaf, Shi Feng, Khanh Nguyen, Kianté Brantley, Hal Daumé III: The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task. WMT 2017: 667-673
- [i2] Amr Sharaf, Shi Feng, Khanh Nguyen, Kianté Brantley, Hal Daumé III: The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task. CoRR abs/1708.01318 (2017)
- 2015
- [i1] Ashwinkumar Ganesan, Kianté Brantley, Shimei Pan, Jian Chen: LDAExplore: Visualizing Topic Models Generated Using Latent Dirichlet Allocation. CoRR abs/1507.06593 (2015)
last updated on 2024-11-22 19:42 CET by the dblp team
all metadata released as open data under CC0 1.0 license