pith. sign in

arxiv: 2106.03051 · v1 · pith:CPULOE2Lnew · submitted 2021-06-06 · 💻 cs.LG · cs.AI· cs.MA· cs.SY· eess.SY

ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning

classification 💻 cs.LG cs.AIcs.MAcs.SYeess.SY
keywords schedulenetschedulingtasksmulti-agentproblemproblemsagentsembeddings
0
0 comments X
read the original abstract

We propose ScheduleNet, a RL-based real-time scheduler, that can solve various types of multi-agent scheduling problems. We formulate these problems as a semi-MDP with episodic reward (makespan) and learn ScheduleNet, a decentralized decision-making policy that can effectively coordinate multiple agents to complete tasks. The decision making procedure of ScheduleNet includes: (1) representing the state of a scheduling problem with the agent-task graph, (2) extracting node embeddings for agent and tasks nodes, the important relational information among agents and tasks, by employing the type-aware graph attention (TGA), and (3) computing the assignment probability with the computed node embeddings. We validate the effectiveness of ScheduleNet as a general learning-based scheduler for solving various types of multi-agent scheduling tasks, including multiple salesman traveling problem (mTSP) and job shop scheduling problem (JSP).

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. GOAL: Graph-based Objective-Aligned Diffusion Solvers for Dynamic Multi-Objective Optimization

    cs.NE 2026-05 unverdicted novelty 6.0

    GOAL uses conditioned diffusion on relational graphs with typed edges to produce feasible multi-objective solutions for scheduling problems, reporting 100% feasibility and sub-0.2% MAPE on FSP, JSP, and FJSP up to 20 jobs.

  2. AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling

    cs.DC 2026-03 unverdicted novelty 5.0

    AGMARL-DKS uses per-node multi-agent RL with GNN state representations and stress-aware lexicographical ordering to outperform the default Kubernetes scheduler on fault tolerance, utilization, and cost for batch and m...

  3. Low-Cost Labels, Reliable Choices: Rollout-Calibrated Hyper-Heuristics for Job Shop Scheduling

    cs.AI 2026-05 unverdicted novelty 4.0

    Gated rollout-calibrated hyper-heuristic for JSSP achieves lowest mean RPD among learned selectors on synthetic instances while staying close to the best fixed rule.