VS-Bench is a new benchmark of ten visual multi-agent environments that measures VLMs on element recognition, next-action prediction, and normalized episode return, showing strong perception but large gaps in reasoning and decision-making with the best model at 46.6% prediction accuracy and 31.4% of
Stochastic games
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
A model-free RL methodology is developed to maximize the probability of LTL satisfaction in unknown stochastic games when the derived DRA has a single Rabin pair, with a generalization providing lower bounds for multiple pairs.
citing papers explorer
-
VS-Bench: Evaluating VLMs for Strategic Abilities in Multi-Agent Environments
VS-Bench is a new benchmark of ten visual multi-agent environments that measures VLMs on element recognition, next-action prediction, and normalized episode return, showing strong perception but large gaps in reasoning and decision-making with the best model at 46.6% prediction accuracy and 31.4% of
-
Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives
A model-free RL methodology is developed to maximize the probability of LTL satisfaction in unknown stochastic games when the derived DRA has a single Rabin pair, with a generalization providing lower bounds for multiple pairs.