Human irrationality: both bad and good for reward inference.

Human irrationality: both bad and good for reward inference - arXiv

Nov 12, 2021 · Title: Human irrationality: both bad and good for reward inference. Authors: Lawrence Chan, Andrew Critch, Anca Dragan. View a PDF of the paper ...

[PDF] Human irrationality: both bad and good for reward inference

www.semanticscholar.org › paper

Human irrationality: both bad and good for reward inference · 20 Citations · 39 References.

The impacts of known and unknown demonstrator irrationality on reward ...

openreview.net › forum

Oct 16, 2020 · We find that incorrectly assuming noisy-rationality for an irrational demonstrator can lead to remarkably poor reward inference accuracy.

Irrationality can help reward inference - OpenReview

The Trickle-down Impact of Reward Inconsistency on RLHF - OpenReview

Question(s) about irrationality : r/GAMETHEORY - Reddit

www.reddit.com › comments › questions...

Feb 12, 2024 · Occam's razor is insufficient to infer the preferences of irrational agents · Human irrationality: both bad and good for reward inference. I ...

‪Andrew Critch‬ - ‪Google Scholar‬

scholar.google.com › citations

Co-authors ; Human irrationality: both bad and good for reward inference. L Chan, A Critch, A Dragan. arXiv preprint arXiv:2111.06956, 2021. 24, 2021.

‪Lawrence Chan‬ - ‪Google Scholar‬

scholar.google.com › citations

Human irrationality: both bad and good for reward inference. L Chan, A Critch, A Dragan. arXiv preprint arXiv:2111.06956, 2021. 24, 2021. Optimal cost design ...

[PDF] IRRATIONALITY CAN HELP REWARD INFERENCE

openreview.net › pdf

Each irrationality uses the parameter value that is most informative. As discussed in section 3.2, different irrationality types have different slopes and ...

[PDF] The Effect of Modeling Human Rationality Level on Learning ...

ojs.aaai.org › AAAI › article › view

While demonstrations do provide the most information when the human is highly rational, comparisons gain an advantage when querying a more irrational human.

[PDF] KTO: Model Alignment as Prospect Theoretic Optimization - arXiv

arxiv.org › pdf

Sep 3, 2024 · Chan, L., Critch, A., and Dragan, A. Human irrationality: both bad and good for reward inference. arXiv preprint. arXiv:2111.06956, 2021. Chen, ...

CSCI 699 Robot Learning - USC Lira Lab

liralab.usc.edu › csci699

Chan et al., Human Irrationality: Both Bad and Good for Reward Inference (2021). Julian et al., Never Stop Learning: The Effectiveness of Fine-Tuning in ...