Fan et al., 2022 - Google Patents

Dras: Deep reinforcement learning for cluster scheduling in high performance computing

Fan et al., 2022

Document ID: 6224890545849754358
Author: Fan Y; Li B; Favorite D; Singh N; Childers T; Rich P; Allcock W; Papka M; Lan Z
Publication year: 2022
Publication venue: IEEE Transactions on Parallel and Distributed Systems

External Links

Cited by

Snippet

Cluster schedulers are crucial in high-performance computing (HPC). They determine when and which user jobs should be allocated to available system resources. Existing cluster scheduling heuristics are developed by human experts based on their experience with …

Continue reading at ieeexplore.ieee.org (PDF) (other versions)

230000002787 reinforcement 0 title abstract description 32

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
- G06F9/4887—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues involving deadlines, e.g. rate based, periodic
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G06Q10/063—Operations research or analysis
- G06Q10/0631—Resource planning, allocation or scheduling for a business operation
- G06Q10/06311—Scheduling, planning or task assignment for a person or group
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation, e.g. linear programming, "travelling salesman problem" or "cutting stock problem"
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models

Similar Documents

Publication	Publication Date	Title
Tuli et al.	2021	COSCO: Container orchestration using co-simulation and gradient based optimization for fog computing environments
Fan et al.	2021	Deep reinforcement agent for scheduling in HPC
Zhang et al.	2020	RLScheduler: an automated HPC batch job scheduler using reinforcement learning
Fan et al.	2022	Dras: Deep reinforcement learning for cluster scheduling in high performance computing
Fazel Zarandi et al.	2020	A state of the art review of intelligent scheduling
Shahidinejad et al.	2020	An elastic controller using Colored Petri Nets in cloud computing environment
Yan et al.	2021	HANSEL: Adaptive horizontal scaling of microservices using Bi-LSTM
Mahmoud et al.	2022	Multiobjective task scheduling in cloud environment using decision tree algorithm
Li et al.	2022	Weighted double deep Q-network based reinforcement learning for bi-objective multi-workflow scheduling in the cloud
Bridi et al.	2016	A constraint programming scheduler for heterogeneous high-performance computing machines
CN113641445B (en)	2024-03-26	Cloud resource self-adaptive configuration method and system based on depth deterministic strategy
Li et al.	2021	OKCM: improving parallel task scheduling in high-performance computing systems using online learning
Ye et al.	2021	SHWS: Stochastic hybrid workflows dynamic scheduling in cloud container services
Mohammadzadeh et al.	2023	Energy-aware workflow scheduling in fog computing using a hybrid chaotic algorithm
Jalali Khalil Abadi et al.	2024	A comprehensive survey on scheduling algorithms using fuzzy systems in distributed environments
Prado et al.	2011	Genetic fuzzy rule-based scheduling system for grid computing in virtual organizations
Sun et al.	2024	Multi-tree genetic programming hyper-heuristic for dynamic flexible workflow scheduling in multi-clouds
Jalali Khalil Abadi et al.	2024	Deep reinforcement learning-based scheduling in distributed systems: a critical review
Cui et al.	2018	Cloud workflow scheduling algorithm based on reinforcement learning
Saemi et al.	2023	Solving task scheduling problem in mobile cloud computing using the hybrid multi-objective Harris Hawks optimization algorithm
Baheri	2020	Mars: Multi-scalable actor-critic reinforcement learning scheduler
Perez et al.	2009	Responsive elastic computing
Fomperosa et al.	2022	Task scheduler for heterogeneous data centres based on deep reinforcement learning
Perez et al.	2010	Multi-objective reinforcement learning for responsive grids
Fan	2021	Intelligent Job Scheduling on High Performance Computing Systems