ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning

Jinkyoo Park; Junyoung Park; Sanjar Bakhtiyar

arxiv: 2106.03051 · v1 · pith:CPULOE2Lnew · submitted 2021-06-06 · 💻 cs.LG · cs.AI · cs.MA · cs.SY · eess.SY

classification 💻 cs.LG cs.AI cs.MA cs.SY eess.SY

keywords schedulenet, scheduling, tasks, multi-agent, problem, problems, agents, embeddings

We propose ScheduleNet, an RL-based real-time scheduler that can solve various types of multi-agent scheduling problems. We formulate these problems as a semi-MDP with an episodic reward (makespan) and learn ScheduleNet, a decentralized decision-making policy that can effectively coordinate multiple agents to complete tasks. The decision-making procedure of ScheduleNet includes: (1) representing the state of a scheduling problem as an agent-task graph, (2) extracting node embeddings for the agent and task nodes, which capture the important relational information among agents and tasks, by employing type-aware graph attention (TGA), and (3) computing the assignment probability from the computed node embeddings. We validate the effectiveness of ScheduleNet as a general learning-based scheduler for solving various types of multi-agent scheduling tasks, including the multiple traveling salesman problem (mTSP) and the job shop scheduling problem (JSP).
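The three-step decision procedure in the abstract can be illustrated with a small sketch. This is not the authors' architecture: the fully connected graph, the dot-product attention, the per-type projection matrices, the random weights, and every function name below (`type_aware_attention`, `assignment_probabilities`, and so on) are assumptions made for illustration only. The sketch shows the general shape of the idea: nodes carry a type (agent or task), each node is projected with weights chosen by its type, and an agent's embedding is scored against task embeddings to produce a probability distribution over assignments.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(matrix, vector):
    return [dot(row, vector) for row in matrix]


def random_matrix(rng, rows, cols):
    """Hypothetical stand-in for learned weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def type_aware_attention(features, types, weights):
    """One simplified attention round on a fully connected agent-task graph.

    Assumption: each node is projected with a matrix selected by its own
    type, then attends over all other nodes with dot-product attention.
    """
    n = len(features)
    projected = [matvec(weights[types[j]], features[j]) for j in range(n)]
    dim = len(projected[0])
    embeddings = []
    for i in range(n):
        neighbours = [j for j in range(n) if j != i]
        attn = softmax([dot(projected[i], projected[j]) for j in neighbours])
        aggregated = [
            sum(a * projected[j][k] for a, j in zip(attn, neighbours))
            for k in range(dim)
        ]
        # Residual-style update: own projection plus attended neighbours.
        embeddings.append([p + g for p, g in zip(projected[i], aggregated)])
    return embeddings


def assignment_probabilities(embeddings, agent, task_ids):
    """Score each candidate task against the agent and normalise."""
    scores = [dot(embeddings[agent], embeddings[t]) for t in task_ids]
    return dict(zip(task_ids, softmax(scores)))


rng = random.Random(0)
# Toy mTSP-like state: nodes 0-1 are agents, nodes 2-4 are unassigned tasks,
# and each node feature is a 2-D coordinate.
features = [[0.0, 0.0], [1.0, 1.0], [0.2, 0.9], [0.8, 0.1], [0.5, 0.5]]
types = ["agent", "agent", "task", "task", "task"]
weights = {"agent": random_matrix(rng, 4, 2), "task": random_matrix(rng, 4, 2)}

embeddings = type_aware_attention(features, types, weights)
probs = assignment_probabilities(embeddings, agent=0, task_ids=[2, 3, 4])
```

In a trained scheduler the weights would be learned with reinforcement learning against the makespan reward; here they are random, so `probs` only demonstrates that the output is a valid distribution over the candidate tasks for the chosen agent.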

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

GOAL: Graph-based Objective-Aligned Diffusion Solvers for Dynamic Multi-Objective Optimization
cs.NE 2026-05 unverdicted novelty 6.0

GOAL uses conditioned diffusion on relational graphs with typed edges to produce feasible multi-objective solutions for scheduling problems, reporting 100% feasibility and sub-0.2% MAPE on FSP, JSP, and FJSP up to 20 jobs.

AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling
cs.DC 2026-03 unverdicted novelty 5.0

AGMARL-DKS uses per-node multi-agent RL with GNN state representations and stress-aware lexicographical ordering to outperform the default Kubernetes scheduler on fault tolerance, utilization, and cost for batch and m...

Low-Cost Labels, Reliable Choices: Rollout-Calibrated Hyper-Heuristics for Job Shop Scheduling
cs.AI 2026-05 unverdicted novelty 4.0

Gated rollout-calibrated hyper-heuristic for JSSP achieves lowest mean RPD among learned selectors on synthetic instances while staying close to the best fixed rule.