The MineRL 2020 competition on sample efficient reinforcement learning using human priors
Guss et al., 2021
- Document ID
- 15720920398770271707
- Author
- Guss W
- Castro M
- Devlin S
- Houghton B
- Kuno N
- Loomis C
- Milani S
- Mohanty S
- Nakata K
- Salakhutdinov R
- Schulman J
- Shiroshita S
- Topin N
- Ummadisingu A
- Vinyals O
- Publication year
- 2021
- Publication venue
- arXiv preprint arXiv:2101.11071
Snippet
Although deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples, affording only a shrinking segment of the AI community access to their development. Resolution of these …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for a specific business sector, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/10—Control of the course of the game, e.g. start, progress, end
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F2300/00—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
- A63F2300/60—Methods for processing data by generating or executing the game program
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Guss et al. | The MineRL 2020 competition on sample efficient reinforcement learning using human priors | |
Guss et al. | The MineRL 2019 competition on sample efficient reinforcement learning using human priors | |
Risi et al. | From chess and Atari to StarCraft and beyond: How game AI is driving the world of AI | |
Noothigattu et al. | Teaching AI agents ethical values using reinforcement learning and policy orchestration | |
Li | Reinforcement learning applications | |
Nalepka et al. | Human social motor solutions for human–machine interaction in dynamical task contexts | |
Diamandis et al. | Bold: How to go big, create wealth and impact the world | |
Yannakakis et al. | A panorama of artificial and computational intelligence in games | |
McGonigal | Why I love bees: A case study in collective intelligence gaming | |
Gallego-Durán et al. | A guide for game-design-based gamification | |
Roohi et al. | Review of intrinsic motivation in simulation-based game testing | |
Milani et al. | Retrospective analysis of the 2019 MineRL competition on sample efficient reinforcement learning | |
Palanisamy | Hands-On Intelligent Agents with OpenAI Gym: Your guide to developing AI agents using deep reinforcement learning | |
Shah et al. | The MineRL BASALT competition on learning from human feedback | |
Stahlke et al. | Artificial players in the design process: Developing an automated testing tool for game level and world design | |
Jacob et al. | “it’s unwieldy and it takes a lot of time”—challenges and opportunities for creating agents in commercial games | |
Stefanidis et al. | Learning prosocial skills through multiadaptive games: A case study | |
Romero-Mendez et al. | The use of deep learning to improve player engagement in a video game through a dynamic difficulty adjustment based on skills classification | |
Bessant et al. | Entrepreneurship | |
Kruse et al. | Evaluation of a Multi-agent “Human-in-the-loop” Game Design System | |
Milani et al. | The MineRL competition on sample-efficient reinforcement learning using human priors: A retrospective | |
Eisenmann | The fail-safe startup: your roadmap for entrepreneurial success | |
Walther-Franks et al. | Robots, pancakes, and computer games: designing serious games for robot imitation learning | |
Sapio et al. | Developing and testing a new reinforcement learning toolkit with Unreal Engine | |
Gaudl | Building robust real-time game AI: simplifying & automating integral process steps in multi-platform design |