default search action

combined dblp search
author search
venue search
publication search

ask others

Yannis Flet-Berliac

Yannis Paul Raymond Flet-Berliac

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c10]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/emnlp/Flet-BerliacGSC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/Flet-BerliacGSC24
Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist:
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. EMNLP 2024: 21353-21370
[c9]
- no documents available
  - no references & citations available
- export record
  dblp key:
  - conf/rlc/BoigeFQFRP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rlc/BoigeFQFRP24
Raphaël Boige, Yannis Flet-Berliac, Lars C. P. M. Quaedvlieg, Arthur Flajolet, Guillaume Richard, Thomas Pierrot:
PASTA: Pretrained Action-State Transformer Agents. RLC 2024: 1511-1532
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17708
Allen Nie, Yash Chandak, Christina J. Yuan, Anirudhan Badrinath, Yannis Flet-Berliac, Emma Brunskill:
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators. CoRR abs/2405.17708 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19185
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19185
Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist:
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. CoRR abs/2406.19185 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19188
Nathan Grinsztajn, Yannis Flet-Berliac, Mohammad Gheshlaghi Azar, Florian Strub, Bill Wu, Eugene Choi, Chris Cremer, Arash Ahmadian, Yash Chandak, Olivier Pietquin, Matthieu Geist:
Averaging log-likelihoods in direct alignment. CoRR abs/2406.19188 (2024)
2023
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/DongFNB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/DongFNB23
Kefan Dong, Yannis Flet-Berliac, Allen Nie, Emma Brunskill:
Model-Based Offline Reinforcement Learning with Local Misspecification. AAAI 2023: 7423-7431
[c7]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/BadrinathFNB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BadrinathFNB23
Anirudhan Badrinath, Yannis Flet-Berliac, Allen Nie, Emma Brunskill:
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets. NeurIPS 2023
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-11426
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-11426
Kefan Dong, Yannis Flet-Berliac, Allen Nie, Emma Brunskill:
Model-based Offline Reinforcement Learning with Local Misspecification. CoRR abs/2301.11426 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-14069
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-14069
Anirudhan Badrinath, Yannis Flet-Berliac, Allen Nie, Emma Brunskill:
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets. CoRR abs/2306.14069 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-10936
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-10936
Raphaël Boige, Yannis Flet-Berliac, Arthur Flajolet, Guillaume Richard, Thomas Pierrot:
PASTA: Pretrained Action-State Transformer Agents. CoRR abs/2307.10936 (2023)
2022
[c6]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/NieFJSB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/NieFJSB22
Allen Nie, Yannis Flet-Berliac, Deon R. Jordan, William Steenbergen, Emma Brunskill:
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data. NeurIPS 2022
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/0009FB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/0009FB22
Yao Liu, Yannis Flet-Berliac, Emma Brunskill:
Offline policy optimization with eligible actions. UAI 2022: 1253-1263
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-09424
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-09424
Yannis Flet-Berliac, Debabrota Basu:
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics. CoRR abs/2204.09424 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-00632
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-00632
Yao Liu, Yannis Flet-Berliac, Emma Brunskill:
Offline Policy Optimization with Eligible Actions. CoRR abs/2207.00632 (2022)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-08642
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-08642
Allen Nie, Yannis Flet-Berliac, Deon R. Jordan, William Steenbergen, Emma Brunskill:
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data. CoRR abs/2210.08642 (2022)
2021
[b1]
- view
- export record
  dblp key:
  - phd/hal/FletBerliac21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/hal/FletBerliac21
Yannis Flet-Berliac:
Sample-Efficient Deep Reinforcement Learning for Control, Exploration and Safety. (Apprentissage par renforcement profond éfficace pour le contrôle, l'exploration et la sûreté). University of Lille, France, 2021
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/Flet-BerliacFPP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Flet-BerliacFPP21
Yannis Flet-Berliac, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist:
Adversarially Guided Actor-Critic. ICLR 2021
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/Flet-BerliacOMP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Flet-BerliacOMP21
Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux:
Learning Value Functions in Deep Policy Gradients using Residual Variance. ICLR 2021
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-04376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-04376
Yannis Flet-Berliac, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist:
Adversarially Guided Actor-Critic. CoRR abs/2102.04376 (2021)
2020
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/Flet-BerliacP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Flet-BerliacP20
Yannis Flet-Berliac, Philippe Preux:
Only Relevant Information Matters: Filtering Out Noisy Samples To Boost RL. IJCAI 2020: 2711-2717
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-04440
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04440
Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux:
Is Standard Deviation the New Standard? Revisiting the Critic in Deep Policy Gradients. CoRR abs/2010.04440 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-04025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04025
Yannis Flet-Berliac, Philippe Preux:
Samples are not all useful: Denoising policy gradient updates using variance. CoRR abs/1904.04025 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11939
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11939
Yannis Flet-Berliac, Philippe Preux:
High-Dimensional Control Using Generalized Auxiliary Tasks. CoRR abs/1909.11939 (2019)
2017
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/hci/JohansenFKSPPL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hci/JohansenFKSPPL17
Benjamin Johansen, Yannis Paul Raymond Flet-Berliac, Maciej Jan Korzepa, Per Sandholm, Niels Henrik Pontoppidan, Michael Kai Petersen, Jakob Eg Larsen:
Hearables in Hearing Care: Discovering Usage Patterns Through IoT Devices. HCI (9) 2017: 39-49

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.