KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

Enrique Molina-Gim\'enez; Kyumin Kim; Kyungyong Lee; Pedro Garc\'ia-L\'opez; Taeyoon Kim

arxiv: 2604.24027 · v2 · pith:RORIPAVZnew · submitted 2026-04-27 · 💻 cs.DC

KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

Taeyoon Kim , Kyumin Kim , Enrique Molina-Gim\'enez , Pedro Garc\'ia-L\'opez , Kyungyong Lee This is my paper

Pith reviewed 2026-05-08 01:37 UTC · model grok-4.3

classification 💻 cs.DC

keywords Kubernetesspot instancescloud provisioningcost optimizationperformance per dollarautoscalingavailability

0 comments

The pith

KubePACS picks Kubernetes spot instances by jointly optimizing real-time prices, workload performance benchmarks, and multi-node availability scores.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

KubePACS is a Kubernetes-native system that selects spot instances to minimize cost while maximizing performance and maintaining high availability. It treats node selection as a multi-objective optimization problem that pulls in current spot prices, measured performance numbers for the target workload, and multi-node Spot Placement Scores. The system solves this with Integer Linear Programming guided by the Golden Section Search algorithm and plugs the results into the Karpenter autoscaler. Evaluations on synthetic and real workloads show average gains of 55 percent and peaks above 80 percent in performance per dollar compared with price-only or limited-availability baselines.

Core claim

KubePACS formulates instance-type selection as a multi-objective optimization that incorporates spot prices, performance benchmarks, and multi-node Spot Placement Scores, solves the problem efficiently with an Integer Linear Programming model guided by Golden Section Search, and integrates the outcome with Karpenter to jointly decide instance types and scaling while preserving availability.

What carries the argument

Multi-objective Integer Linear Programming model guided by Golden Section Search that balances cost, performance, and availability using real-time spot prices, benchmarks, and multi-node SPS data.

If this is right

Kubernetes operators can run the same workloads on spot instances with materially higher throughput per dollar spent.
The Karpenter integration lets existing clusters adopt the new selection logic without changing their scaling workflow.
Workload-specific scaling of performance metrics lets the same system handle both general and specialized instance preferences.
Clusters stay available because the optimization explicitly includes multi-node placement scores rather than treating availability as an afterthought.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same optimization structure could be applied to other container platforms or to on-demand instances when performance data is available.
Cloud providers might begin publishing richer, workload-aware benchmark data if systems like this demonstrate consistent value.
Longer-running experiments on GPU or memory-intensive jobs would test whether the reported gains generalize beyond the evaluated workloads.

Load-bearing premise

That current spot prices, performance benchmarks, and Spot Placement Scores remain reliable predictors of long-term cost, speed, and interruption risk once instances are actually running.

What would settle it

Deploy KubePACS and a price-only baseline on identical production workloads for multiple weeks, then compare measured total cost of ownership, actual throughput, and interruption frequency against the predictions made at provisioning time.

Figures

Figures reproduced from arXiv: 2604.24027 by Enrique Molina-Gim\'enez, Kyumin Kim, Kyungyong Lee, Pedro Garc\'ia-L\'opez, Taeyoon Kim.

**Figure 1.** Figure 1: Comparing benchmark score (CoreMark) and spot instance price variation. Different instance configurations show view at source ↗

**Figure 2.** Figure 2: Different multiple spot instance availability for view at source ↗

**Figure 4.** Figure 4: Implementation of KubePACS to provision Kuber view at source ↗

**Figure 5.** Figure 5: Comparing KubePACS with related works shows superb performance for cost and performance efficiency view at source ↗

**Figure 6.** Figure 6: Overall efficiency (𝐸𝑇𝑜𝑡𝑎𝑙) changes with 𝛼, the cost-performance trade-off parameter Comparison with SpotKube in Small-scale Scenarios. SpotKube was omitted from the extensive large-scale experiments as its original evaluation framework [16] is specifically designed for smallscale microservice environments. To ensure a fair baseline comparison, the experimental setup described in the SpotKube publicati… view at source ↗

**Figure 7.** Figure 7: ILP solver latency and efficiency changes for differ view at source ↗

**Figure 8.** Figure 8: Effectiveness of special feature instance selection view at source ↗

**Figure 10.** Figure 10: Comparison of Cost, Performance, and Availability between KubePACS and Karpenter view at source ↗

**Figure 11.** Figure 11: Execution times of graph analysis application view at source ↗

**Figure 12.** Figure 12: The effectiveness of KubePACS interrupt handling view at source ↗

read the original abstract

Cloud users aim to minimize cost while maximizing performance by selecting the most suitable instance types for their workloads. To reduce expenses, spot instances have been widely adopted due to their steep discounts compared to on-demand pricing. However, their use introduces reliability risks due to potential interruptions, and existing research has primarily focused on mitigating this trade-off from a cost or availability perspective alone. Despite the diversity in hardware capabilities among instance types, current provisioning systems tend to ignore performance variation, selecting nodes solely based on minimum resource requirements. In this paper, we present KubePACS, a Kubernetes-native spot instance provisioning system that constructs node pools optimized for both cost and performance while guaranteeing high availability. KubePACS formulates the node selection process as a multi-objective optimization problem, incorporating real-time data such as spot prices, performance benchmarks, and availability scores, including the multi-node Spot Placement Score (SPS). It solves this problem efficiently using an Integer Linear Programming (ILP) approach guided by the Golden Section Search (GSS) algorithm to find the optimal configuration. By integrating with the Karpenter node autoscaler, KubePACS jointly optimizes instance-type selection and node scaling decisions within a standard provisioning workflow. KubePACS also adopts a novel heuristic to support workload-specific preferences by scaling performance metrics for specialized instances. Through extensive evaluation across synthetic and real-world workloads, KubePACS demonstrates on average 55.09% and up to 81.06% higher performance per dollar over state-of-the-art solutions such as Karpenter, SpotVerse, and SpotKube, which only reference the spot instance prices and limited availability data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

KubePACS adds performance benchmarks and multi-node SPS into an ILP+GSS solver for Kubernetes spot selection and reports 55% average perf-per-dollar gains over price-focused baselines.

read the letter

KubePACS builds a Kubernetes-native system that pulls real-time spot prices, performance benchmarks, and multi-node Spot Placement Scores into a multi-objective ILP, solves it with Golden Section Search, and adds a workload-specific scaling heuristic before handing off to Karpenter. The concrete result is node pools chosen for better performance per dollar while still targeting high availability through the SPS term. That combination is the actual new piece beyond earlier price-only or availability-only spot tools. The evaluation on synthetic plus real workloads shows average 55% and peak 81% better perf per dollar than Karpenter, SpotVerse, and SpotKube, which is a usable lift if the numbers survive closer inspection. The integration with an existing autoscaler is also practical and lowers the barrier for adoption. The main soft spot is whether the reported gains survive real interruptions and price/SPS drift after provisioning. Spot instances get reclaimed, benchmarks are point-in-time, and SPS is a snapshot; if the tests used short runs or did not apply the same interruption model to the baselines, the advantage could shrink in production. The abstract does not detail how long the workloads ran or how replacements were handled, so that part needs verification. This work is for engineers running Kubernetes on AWS spot instances who already care about squeezing more throughput out of their budget without constant manual tuning. A reader building or extending autoscalers would find the ILP formulation and heuristic worth looking at. It deserves a serious referee because it ships a working implementation with measurable improvements over current practice, even if the interruption modeling will draw questions.

Referee Report

1 major / 1 minor

Summary. The paper presents KubePACS, a Kubernetes-native spot instance provisioning system that formulates node selection as a multi-objective ILP problem solved using the Golden Section Search algorithm. It incorporates real-time spot prices, performance benchmarks, and multi-node Spot Placement Scores (SPS) to optimize for cost, performance, and availability, integrates with Karpenter, and uses a heuristic for workload-specific preferences. Through evaluations on synthetic and real-world workloads, it claims an average 55.09% and up to 81.06% higher performance per dollar compared to baselines like Karpenter, SpotVerse, and SpotKube.

Significance. If the claimed gains prove robust, KubePACS could meaningfully advance practical cost-performance optimization for spot-based Kubernetes deployments by jointly handling instance-type selection and scaling. The ILP+GSS formulation and integration with an existing autoscaler provide a concrete, deployable approach that goes beyond price-only or availability-only methods used in baselines. The explicit quantitative comparison to three prior systems is a strength, but only if the evaluation captures sustained behavior rather than point-in-time selection.

major comments (1)

[Evaluation (abstract claims and implied experimental section)] The headline result (55.09% average and 81.06% maximum performance-per-dollar improvement) is load-bearing on the claim that real-time inputs (spot prices, benchmarks, SPS) produce node pools whose measured cost, throughput, and uptime match the selection-time predictions. The abstract states that baselines use only prices and limited availability data, yet provides no indication that KubePACS evaluation includes post-provisioning interruption modeling, price fluctuation during workload runs, or replacement overhead applied uniformly to all systems. If workloads are short or interruptions are omitted, the reported gains do not demonstrate production-relevant superiority.

minor comments (1)

[Abstract] The abstract refers to 'extensive evaluation' and 'real-world workloads' without defining workload durations, interruption rates, statistical tests, or error bars; adding these details would strengthen verifiability of the 55.09%/81.06% figures.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the major comment on the evaluation below and clarify the methodology while strengthening the presentation of results where appropriate.

read point-by-point responses

Referee: [Evaluation (abstract claims and implied experimental section)] The headline result (55.09% average and 81.06% maximum performance-per-dollar improvement) is load-bearing on the claim that real-time inputs (spot prices, benchmarks, SPS) produce node pools whose measured cost, throughput, and uptime match the selection-time predictions. The abstract states that baselines use only prices and limited availability data, yet provides no indication that KubePACS evaluation includes post-provisioning interruption modeling, price fluctuation during workload runs, or replacement overhead applied uniformly to all systems. If workloads are short or interruptions are omitted, the reported gains do not demonstrate production-relevant superiority.

Authors: We thank the referee for highlighting the importance of validating that selection-time predictions translate to measured outcomes under realistic conditions. Our evaluation deployed the node pools chosen by KubePACS and each baseline (Karpenter, SpotVerse, SpotKube) on actual AWS spot instances and executed both the synthetic benchmarks and real-world workloads on those live clusters. The reported performance-per-dollar values are derived from measured throughput and actual incurred costs during these runs, which therefore incorporate any interruptions, price changes, and replacement effects that occurred. The multi-node SPS component was specifically intended to improve uptime, and observed uptime contributed to the metrics. We acknowledge, however, that the manuscript does not explicitly document workload durations, the uniform modeling of replacement overhead, or simulated price fluctuations applied identically to all systems. In the revised manuscript we will expand the experimental section with a dedicated subsection describing the evaluation protocol, workload runtimes, observed interruption rates, and how replacement costs were factored uniformly into the performance-per-dollar calculations for every compared system. This addition will make the production relevance of the results more transparent. revision: partial

Circularity Check

0 steps flagged

No circularity: standard ILP+GSS on external inputs with empirical evaluation

full rationale

The paper's core derivation formulates node-pool selection as a multi-objective ILP incorporating external real-time spot prices, performance benchmarks, and multi-node SPS, then solves it with the standard GSS algorithm before integrating with Karpenter. The reported 55.09% average (up to 81.06%) perf/$ gains are obtained from post-deployment measurements on synthetic and real workloads, not by algebraic reduction of the objective to its own fitted parameters or self-citations. No equation or step equates the claimed superiority to a tautological renaming or input-only prediction; the baselines are simply described as using fewer data sources. The chain is therefore self-contained against external cloud APIs and benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on the assumption that external real-time cloud signals (prices, benchmarks, SPS) are trustworthy inputs and that the ILP solver plus GSS will find usable configurations within practical time limits; no free parameters or invented entities are explicitly named in the abstract.

axioms (1)

domain assumption Real-time spot prices, performance benchmarks, and multi-node Spot Placement Scores are reliable and stable enough to drive provisioning decisions
Invoked as inputs to the multi-objective optimization problem.

pith-pipeline@v0.9.0 · 5623 in / 1219 out tokens · 44106 ms · 2026-05-08T01:37:31.348057+00:00 · methodology

KubePACS: Kubernetes Cluster Using Performant, Highly Available, and Cost Efficient Spot Instances

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)