default search action
Yonathan Efroni
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c27]Anurag Koul, Shivakanth Sujit, Shaoru Chen, Ben Evans, Lili Wu, Byron Xu, Rajan Chari, Riashat Islam, Raihan Seraj, Yonathan Efroni, Lekan P. Molu, Miroslav Dudík, John Langford, Alex Lamb:
PcLast: Discovering Plannable Continuous Latent States. ICML 2024 - [c26]Jeongyeol Kwon, Yonathan Efroni, Shie Mannor, Constantine Caramanis:
Prospective Side Information for Latent MDPs. ICML 2024 - [i35]Caner Hazirbas, Alicia Sun, Yonathan Efroni, Mark Ibrahim:
The Bias of Harmful Label Associations in Vision-Language Models. CoRR abs/2402.07329 (2024) - [i34]Lili Wu, Ben Evans, Riashat Islam, Raihan Seraj, Yonathan Efroni, Alex Lamb:
Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs. CoRR abs/2404.14552 (2024) - [i33]Jeongyeol Kwon, Shie Mannor, Constantine Caramanis, Yonathan Efroni:
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation. CoRR abs/2406.01389 (2024) - [i32]Wenhao Zhan, Scott Fujimoto, Zheqing Zhu, Jason D. Lee, Daniel R. Jiang, Yonathan Efroni:
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank. CoRR abs/2410.01101 (2024) - 2023
- [j1]Alex Lamb, Riashat Islam, Yonathan Efroni, Aniket Rajiv Didolkar, Dipendra Misra, Dylan J. Foster, Lekan P. Molu, Rajan Chari, Akshay Krishnamurthy, John Langford:
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models. Trans. Mach. Learn. Res. 2023 (2023) - [c25]Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Rajiv Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford:
Principled Offline RL in the Presence of Rich Exogenous Information. ICML 2023: 14390-14421 - [c24]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reward-Mixing MDPs with Few Latent Contexts are Learnable. ICML 2023: 18057-18082 - [i31]Jeongyeol Kwon, Yonathan Efroni, Shie Mannor, Constantine Caramanis:
Prospective Side Information for Latent MDPs. CoRR abs/2310.07596 (2023) - [i30]Anurag Koul, Shivakanth Sujit, Shaoru Chen, Ben Evans, Lili Wu, Byron Xu, Rajan Chari, Riashat Islam, Raihan Seraj, Yonathan Efroni, Lekan P. Molu, Miro Dudík, John Langford, Alex Lamb:
PcLast: Discovering Plannable Continuous Latent States. CoRR abs/2311.03534 (2023) - [i29]Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Ürün Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu:
Pearl: A Production-ready Reinforcement Learning Agent. CoRR abs/2312.03814 (2023) - 2022
- [c23]Yonathan Efroni, Dylan J. Foster, Dipendra Misra, Akshay Krishnamurthy, John Langford:
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information. COLT 2022: 5062-5127 - [c22]Yonathan Efroni, Dipendra Misra, Akshay Krishnamurthy, Alekh Agarwal, John Langford:
Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics. ICLR 2022 - [c21]Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh:
Mirror Descent Policy Optimization. ICLR 2022 - [c20]Yonathan Efroni, Chi Jin, Akshay Krishnamurthy, Sobhan Miryoosefi:
Provable Reinforcement Learning with a Short-Term Memory. ICML 2022: 5832-5850 - [c19]Yonathan Efroni, Sham M. Kakade, Akshay Krishnamurthy, Cyril Zhang:
Sparsity in Partially Controllable Linear Systems. ICML 2022: 5851-5860 - [c18]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms. ICML 2022: 11772-11789 - [c17]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Tractable Optimality in Episodic Latent MABs. NeurIPS 2022 - [i28]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms. CoRR abs/2201.12700 (2022) - [i27]Yonathan Efroni, Chi Jin, Akshay Krishnamurthy, Sobhan Miryoosefi:
Provable Reinforcement Learning with a Short-Term Memory. CoRR abs/2202.03983 (2022) - [i26]Yonathan Efroni, Dylan J. Foster, Dipendra Misra, Akshay Krishnamurthy, John Langford:
Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information. CoRR abs/2206.04282 (2022) - [i25]Alex Lamb, Riashat Islam, Yonathan Efroni, Aniket Didolkar, Dipendra Misra, Dylan J. Foster, Lekan P. Molu, Rajan Chari, Akshay Krishnamurthy, John Langford:
Guaranteed Discovery of Controllable Latent States with Multi-Step Inverse Models. CoRR abs/2207.08229 (2022) - [i24]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reward-Mixing MDPs with a Few Latent Contexts are Learnable. CoRR abs/2210.02594 (2022) - [i23]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Tractable Optimality in Episodic Latent MABs. CoRR abs/2210.03528 (2022) - [i22]Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford:
Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information. CoRR abs/2211.00164 (2022) - 2021
- [c16]Yonathan Efroni, Nadav Merlis, Shie Mannor:
Reinforcement Learning with Trajectory Feedback. AAAI 2021: 7288-7295 - [c15]Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
Confidence-Budget Matching for Sequential Budgeted Learning. ICML 2021: 2937-2947 - [c14]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reinforcement Learning in Reward-Mixing MDPs. NeurIPS 2021: 2253-2264 - [c13]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
RL for Latent MDPs: Regret Guarantees and a Lower Bound. NeurIPS 2021: 24523-24534 - [c12]Alon Cohen, Yonathan Efroni, Yishay Mansour, Aviv Rosenberg:
Minimax Regret for Stochastic Shortest Path. NeurIPS 2021: 28350-28361 - [c11]Guy Tennenholtz, Uri Shalit, Shie Mannor, Yonathan Efroni:
Bandits with partially observable confounded data. UAI 2021: 430-439 - [i21]Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor:
Confidence-Budget Matching for Sequential Budgeted Learning. CoRR abs/2102.03400 (2021) - [i20]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
RL for Latent MDPs: Regret Guarantees and a Lower Bound. CoRR abs/2102.04939 (2021) - [i19]Alon Cohen, Yonathan Efroni, Yishay Mansour, Aviv Rosenberg:
Minimax Regret for Stochastic Shortest Path. CoRR abs/2103.13056 (2021) - [i18]Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor:
Reinforcement Learning in Reward-Mixing MDPs. CoRR abs/2110.03743 (2021) - [i17]Nadav Merlis, Yonathan Efroni, Shie Mannor:
Dare not to Ask: Problem-Dependent Guarantees for Budgeted Bandits. CoRR abs/2110.05724 (2021) - [i16]Yonathan Efroni, Sham M. Kakade, Akshay Krishnamurthy, Cyril Zhang:
Sparsity in Partially Controllable Linear Systems. CoRR abs/2110.06150 (2021) - [i15]Yonathan Efroni, Dipendra Misra, Akshay Krishnamurthy, Alekh Agarwal, John Langford:
Provable RL with Exogenous Distractors via Multistep Inverse Dynamics. CoRR abs/2110.08847 (2021) - 2020
- [c10]Lior Shani, Yonathan Efroni, Shie Mannor:
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs. AAAI 2020: 5668-5675 - [c9]Lior Shani, Yonathan Efroni, Aviv Rosenberg, Shie Mannor:
Optimistic Policy Optimization with Bandit Feedback. ICML 2020: 8604-8613 - [c8]Manan Tomar, Yonathan Efroni, Mohammad Ghavamzadeh:
Multi-step Greedy Reinforcement Learning Algorithms. ICML 2020: 9504-9513 - [c7]Yonathan Efroni, Mohammad Ghavamzadeh, Shie Mannor:
Online Planning with Lookahead Policies. NeurIPS 2020 - [i14]Yonathan Efroni, Lior Shani, Aviv Rosenberg, Shie Mannor:
Optimistic Policy Optimization with Bandit Feedback. CoRR abs/2002.08243 (2020) - [i13]Yonathan Efroni, Shie Mannor, Matteo Pirotta:
Exploration-Exploitation in Constrained MDPs. CoRR abs/2003.02189 (2020) - [i12]Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh:
Mirror Descent Policy Optimization. CoRR abs/2005.09814 (2020) - [i11]Guy Tennenholtz, Uri Shalit, Shie Mannor, Yonathan Efroni:
Bandits with Partially Observable Offline Data. CoRR abs/2006.06731 (2020) - [i10]Yonathan Efroni, Nadav Merlis, Shie Mannor:
Reinforcement Learning with Trajectory Feedback. CoRR abs/2008.06036 (2020)
2010 – 2019
- 2019
- [c6]Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor:
How to Combine Tree-Search Methods in Reinforcement Learning. AAAI 2019: 3494-3501 - [c5]Lior Shani, Yonathan Efroni, Shie Mannor:
Exploration Conscious Reinforcement Learning Revisited. ICML 2019: 5680-5689 - [c4]Chen Tessler, Yonathan Efroni, Shie Mannor:
Action Robust Reinforcement Learning and Applications in Continuous Control. ICML 2019: 6215-6224 - [c3]Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor:
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies. NeurIPS 2019: 12203-12213 - [i9]Chen Tessler, Yonathan Efroni, Shie Mannor:
Action Robust Reinforcement Learning and Applications in Continuous Control. CoRR abs/1901.09184 (2019) - [i8]Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor:
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies. CoRR abs/1905.11527 (2019) - [i7]Lior Shani, Yonathan Efroni, Shie Mannor:
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs. CoRR abs/1909.02769 (2019) - [i6]Yonathan Efroni, Mohammad Ghavamzadeh, Shie Mannor:
Multi-Step Greedy and Approximate Real Time Dynamic Programming. CoRR abs/1909.04236 (2019) - [i5]Manan Tomar, Yonathan Efroni, Mohammad Ghavamzadeh:
Multi-step Greedy Policies in Model-Free Deep Reinforcement Learning. CoRR abs/1910.02919 (2019) - 2018
- [c2]Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor:
Beyond the One-Step Greedy Approach in Reinforcement Learning. ICML 2018: 1386-1395 - [c1]Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor:
Multiple-Step Greedy Policies in Approximate and Online Reinforcement Learning. NeurIPS 2018: 5244-5253 - [i4]Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor:
Beyond the One Step Greedy Approach in Reinforcement Learning. CoRR abs/1802.03654 (2018) - [i3]Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor:
Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning. CoRR abs/1805.07956 (2018) - [i2]Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor:
How to Combine Tree-Search Methods in Reinforcement Learning. CoRR abs/1809.01843 (2018) - [i1]Lior Shani, Yonathan Efroni, Shie Mannor:
Revisiting Exploration-Conscious Reinforcement Learning. CoRR abs/1812.05551 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-06 21:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint