An Agglomerative Clustering of Simulation Output Distributions Using Regularized Wasserstein Distance

David J. Eckman; Mohammadmahdi Ghasemloo

arxiv: 2407.12100 · v2 · pith:RJKATTVInew · submitted 2024-07-16 · 📊 stat.ME · stat.AP· stat.ML

An Agglomerative Clustering of Simulation Output Distributions Using Regularized Wasserstein Distance

Mohammadmahdi Ghasemloo , David J. Eckman This is my paper

classification 📊 stat.ME stat.APstat.ML

keywords clusteringdistributionsoutputsperformancesimulationagglomerativedistanceempirical

0 comments

read the original abstract

Using statistical learning methods to analyze stochastic simulation outputs can significantly enhance decision-making by uncovering relationships between different simulated systems and between a system's inputs and outputs. We focus on clustering multivariate empirical distributions of simulation outputs to identify patterns and trade-offs among performance measures. We present a novel agglomerative clustering algorithm that utilizes the regularized Wasserstein distance to cluster these multivariate empirical distributions. This framework has several important use cases, including anomaly detection, pre-optimization, and online monitoring. In numerical experiments involving a call-center model, we demonstrate how this methodology can identify staffing plans that yield similar performance outcomes and inform policies for intervening when queue lengths signal potentially worsening system performance.

This paper has not been read by Pith yet.

An Agglomerative Clustering of Simulation Output Distributions Using Regularized Wasserstein Distance

discussion (0)