Steffen Udluft
2020 – today

2024
- [i21] Simon Eisenmann, Daniel Hein, Steffen Udluft, Thomas A. Runkler: Model-based Offline Quantum Reinforcement Learning. CoRR abs/2404.10017 (2024)
- [i20] Philipp Wissmann, Daniel Hein, Steffen Udluft, Volker Tresp: Why long model-based rollouts are no reason for bad Q-value estimates. CoRR abs/2407.11751 (2024)
- [i19] Steffen Limmer, Steffen Udluft, Clemens Otte: Neural-ANOVA: Model Decomposition for Interpretable Machine Learning. CoRR abs/2408.12319 (2024)

2023
- [j9] Simon Wiedemann, Daniel Hein, Steffen Udluft, Christian B. Mendl: Quantum Policy Iteration via Amplitude Estimation and Grover Search - Towards Quantum Advantage for Reinforcement Learning. Trans. Mach. Learn. Res. 2023 (2023)
- [c34] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: Automatic Trade-off Adaptation in Offline RL. ESANN 2023
- [c33] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: User-Interactive Offline Reinforcement Learning. ICLR 2023
- [c32] Volker Tresp, Steffen Udluft, Daniel Hein, Werner Hauptmann, Martin Leib, Christopher Mutschler, Daniel D. Scherer, Wolfgang Mauerer: Workshop Summary: Quantum Machine Learning. QCE 2023: 1-3
- [c31] Marc Weber, Phillip Swazinna, Daniel Hein, Steffen Udluft, Volkmar Sterzing: Learning Control Policies for Variable Objectives from Offline Data. SSCI 2023: 1674-1681
- [i18] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: Automatic Trade-off Adaptation in Offline RL. CoRR abs/2306.09744 (2023)
- [i17] Marc Weber, Phillip Swazinna, Daniel Hein, Steffen Udluft, Volkmar Sterzing: Learning Control Policies for Variable Objectives from Offline Data. CoRR abs/2308.06127 (2023)

2022
- [c30] Philipp Scholl, Felix Dietrich, Clemens Otte, Steffen Udluft: Safe Policy Improvement Approaches and Their Limitations. ICAART (Revised Selected Papers) 2022: 74-98
- [c29] Philipp Scholl, Felix Dietrich, Clemens Otte, Steffen Udluft: Safe Policy Improvement Approaches on Discrete Markov Decision Processes. ICAART (2) 2022: 142-151
- [i16] Phillip Swazinna, Steffen Udluft, Daniel Hein, Thomas A. Runkler: Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning. CoRR abs/2201.05433 (2022)
- [i15] Philipp Scholl, Felix Dietrich, Clemens Otte, Steffen Udluft: Safe Policy Improvement Approaches on Discrete Markov Decision Processes. CoRR abs/2201.12175 (2022)
- [i14] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: User-Interactive Offline Reinforcement Learning. CoRR abs/2205.10629 (2022)
- [i13] Simon Wiedemann, Daniel Hein, Steffen Udluft, Christian B. Mendl: Quantum Policy Iteration via Amplitude Estimation and Grover Search - Towards Quantum Advantage for Reinforcement Learning. CoRR abs/2206.04741 (2022)
- [i12] Philipp Scholl, Felix Dietrich, Clemens Otte, Steffen Udluft: Safe Policy Improvement Approaches and their Limitations. CoRR abs/2208.00724 (2022)

2021
- [j8] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: Overcoming model bias for robust offline deep reinforcement learning. Eng. Appl. Artif. Intell. 104: 104366 (2021)
- [c28] Phillip Swazinna, Steffen Udluft, Daniel Hein, Thomas A. Runkler: Behavior Constraining in Weight Space for Offline Reinforcement Learning. ESANN 2021
- [c27] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning. SSCI 2021: 1-8
- [i11] Phillip Swazinna, Steffen Udluft, Daniel Hein, Thomas A. Runkler: Behavior Constraining in Weight Space for Offline Reinforcement Learning. CoRR abs/2107.05479 (2021)
- [i10] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning. CoRR abs/2111.13461 (2021)

2020
- [i9] Phillip Swazinna, Steffen Udluft, Thomas A. Runkler: Overcoming Model Bias for Robust Offline Deep Reinforcement Learning. CoRR abs/2008.05533 (2020)
2010 – 2019

2019
- [c26] Daniel Hein, Steffen Udluft, Thomas A. Runkler: Generating interpretable reinforcement learning policies using genetic programming. GECCO (Companion) 2019: 23-24

2018
- [j7] Daniel Hein, Steffen Udluft, Thomas A. Runkler: Interpretable policies for reinforcement learning by genetic programming. Eng. Appl. Artif. Intell. 76: 158-169 (2018)
- [c25] Stefan Depeweg, José Miguel Hernández-Lobato, Steffen Udluft, Thomas A. Runkler: Sensitivity analysis for predictive uncertainty. ESANN 2018
- [c24] Daniel Hein, Steffen Udluft, Thomas A. Runkler: Generating interpretable fuzzy controllers using particle swarm optimization and genetic programming. GECCO (Companion) 2018: 1268-1275
- [c23] Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft: Decomposition of Uncertainty in Bayesian Deep Learning for Efficient and Risk-sensitive Learning. ICML 2018: 1192-1201
- [i8] Daniel Hein, Steffen Udluft, Thomas A. Runkler: Generating Interpretable Fuzzy Controllers using Particle Swarm Optimization and Genetic Programming. CoRR abs/1804.10960 (2018)

2017
- [j6] Daniel Hein, Alexander Hentschel, Thomas A. Runkler, Steffen Udluft: Particle swarm optimization for generating interpretable fuzzy reinforcement learning policies. Eng. Appl. Artif. Intell. 65: 87-98 (2017)
- [c22] Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft: Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks. ICLR (Poster) 2017
- [c21] Daniel Hein, Steffen Udluft, Michel Tokic, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing: Batch reinforcement learning on the industrial benchmark: First experiences. IJCNN 2017: 4214-4221
- [c20] Daniel Hein, Stefan Depeweg, Michel Tokic, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing: A benchmark environment motivated by industrial control problems. SSCI 2017: 1-8
- [i7] Daniel Hein, Steffen Udluft, Michel Tokic, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing: Batch Reinforcement Learning on the Industrial Benchmark: First Experiences. CoRR abs/1705.07262 (2017)
- [i6] Daniel Hein, Stefan Depeweg, Michel Tokic, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing: A Benchmark Environment Motivated by Industrial Control Problems. CoRR abs/1709.09480 (2017)
- [i5] Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft: Decomposition of Uncertainty for Active Learning and Reliable Reinforcement Learning in Stochastic Systems. CoRR abs/1710.07283 (2017)
- [i4] Daniel Hein, Steffen Udluft, Thomas A. Runkler: Interpretable Policies for Reinforcement Learning by Genetic Programming. CoRR abs/1712.04170 (2017)

2016
- [j5] Daniel Hein, Alexander Hentschel, Thomas A. Runkler, Steffen Udluft: Reinforcement Learning with Particle Swarm Optimization Policy (PSO-P) in Continuous State and Action Spaces. Int. J. Swarm Intell. Res. 7(3): 23-42 (2016)
- [i3] Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft: Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks. CoRR abs/1605.07127 (2016)
- [i2] Daniel Hein, Alexander Hentschel, Volkmar Sterzing, Michel Tokic, Steffen Udluft: Introduction to the "Industrial Benchmark". CoRR abs/1610.03793 (2016)
- [i1] Daniel Hein, Alexander Hentschel, Thomas A. Runkler, Steffen Udluft: Particle Swarm Optimization for Generating Fuzzy Reinforcement Learning Policies. CoRR abs/1610.05984 (2016)

2015
- [j4] Sigurd Spieckermann, Siegmund Düll, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler: Exploiting similarity in system identification tasks with recurrent neural networks. Neurocomputing 169: 343-349 (2015)

2014
- [c19] Sigurd Spieckermann, Siegmund Düll, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler: Exploiting similarity in system identification tasks with recurrent neural networks. ESANN 2014
- [c18] Sigurd Spieckermann, Siegmund Düll, Steffen Udluft, Thomas A. Runkler: Regularized Recurrent Neural Networks for Data Efficient Dual-Task Learning. ICANN 2014: 17-24

2013
- [c17] Siegmund Duell, Steffen Udluft: Ensembles for Continuous Actions in Reinforcement Learning. ESANN 2013

2012
- [j3] Thomas A. Runkler, Steffen Udluft, Siegmund Düll: Datenbasierte Optimalsteuerung mit neuronalen Netzen und dateneffizientem Reinforcement Learning. Autom. 60(10): 641-647 (2012)
- [c16] Siegmund Duell, Lina Weichbrodt, Alexander Hans, Steffen Udluft: Recurrent Neural State Estimation in Domains with Long-Term Dependencies. ESANN 2012
- [p1] Siegmund Duell, Steffen Udluft, Volkmar Sterzing: Solving Partially Observable Reinforcement Learning Problems with Recurrent Neural Networks. Neural Networks: Tricks of the Trade (2nd ed.) 2012: 709-733

2011
- [c15] Alexander Hans, Siegmund Duell, Steffen Udluft: Agent self-assessment: Determining policy quality without execution. ADPRL 2011: 84-90
- [c14] Alexander Hans, Steffen Udluft: Ensemble Usage for More Reliable Policy Identification in Reinforcement Learning. ESANN 2011

2010
- [c13] Alexander Hans, Steffen Udluft: Uncertainty Propagation for Efficient Exploration in Reinforcement Learning. ECAI 2010: 361-366
- [c12] Siegmund Duell, Alexander Hans, Steffen Udluft: The Markov Decision Process Extraction Network. ESANN 2010
- [c11] Alexander Hans, Steffen Udluft: Ensembles of Neural Networks for Robust Reinforcement Learning. ICMLA 2010: 401-406
2000 – 2009

2009
- [j2] Volkmar Sterzing, Steffen Udluft: Dateneffizientes Reinforcement-Learning. Künstliche Intell. 23(3): 19-22 (2009)
- [c10] Alexander Hans, Steffen Udluft: Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data. ICANN (1) 2009: 70-79

2008
- [j1] Anton Maximilian Schäfer, Steffen Udluft, Hans-Georg Zimmermann: Learning long-term dependencies with recurrent neural networks. Neurocomputing 71(13-15): 2481-2488 (2008)
- [c9] Alexander Hans, Daniel Schneegaß, Anton Maximilian Schäfer, Steffen Udluft: Safe exploration for reinforcement learning. ESANN 2008: 143-148
- [c8] Daniel Schneegaß, Steffen Udluft, Thomas Martinetz: Uncertainty propagation for quality assurance in Reinforcement Learning. IJCNN 2008: 2588-2595

2007
- [c7] Daniel Schneegaß, Steffen Udluft, Thomas Martinetz: Neural Rewards Regression for near-optimal policy identification in Markovian and partial observable environments. ESANN 2007: 301-306
- [c6] Anton Maximilian Schäfer, Steffen Udluft, Hans-Georg Zimmermann: The Recurrent Control Neural Network. ESANN 2007: 319-324
- [c5] Daniel Schneegaß, Steffen Udluft, Thomas Martinetz: Explicit Kernel Rewards Regression for data-efficient near-optimal policy identification. ESANN 2007: 337-342
- [c4] Daniel Schneegaß, Steffen Udluft, Thomas Martinetz: Improving Optimality of Neural Rewards Regression for Data-Efficient Batch Near-Optimal Policy Identification. ICANN (1) 2007: 109-118
- [c3] Anton Maximilian Schäfer, Daniel Schneegaß, Volkmar Sterzing, Steffen Udluft: A Neural Reinforcement Learning Approach to Gas Turbine Control. IJCNN 2007: 1691-1696

2006
- [c2] Daniel Schneegaß, Steffen Udluft, Thomas Martinetz: Kernel Rewards Regression: An Information Efficient Batch Policy Iteration Approach. Artificial Intelligence and Applications 2006: 428-433
- [c1] Anton Maximilian Schäfer, Steffen Udluft, Hans-Georg Zimmermann: Learning Long Term Dependencies with Recurrent Neural Networks. ICANN (1) 2006: 71-80
last updated on 2024-09-30 00:04 CEST by the dblp team
all metadata released as open data under CC0 1.0 license