Cohn, Nigel Shadbolt, and Michael Wooldridge

Malfa, E · 2023 · arXiv 2309.16573

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models

cs.CL · 2023-08-02 · conditional · novelty 8.0

XSTest is a benchmark for detecting exaggerated safety refusals in large language models on clearly safe prompts.

StarCoder 2 and The Stack v2: The Next Generation

cs.SE · 2024-02-29 · accept · novelty 6.0

StarCoder2-15B matches or beats CodeLlama-34B on code tasks despite being smaller, and StarCoder2-3B outperforms prior 15B models, with open weights and exact training data identifiers released.

End-to-end PDDL Planning with Hardcoded and Dynamic Agents

cs.AI · 2025-12-10 · unverdicted · novelty 5.0

An end-to-end LLM framework refines natural language into valid PDDL domains and problems via hardcoded and dynamic agents, generates plans with standard engines, and returns readable output.

citing papers explorer

Showing 3 of 3 citing papers.

XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models cs.CL · 2023-08-02 · conditional · none · ref 3
XSTest is a benchmark for detecting exaggerated safety refusals in large language models on clearly safe prompts.
StarCoder 2 and The Stack v2: The Next Generation cs.SE · 2024-02-29 · accept · none · ref 229
StarCoder2-15B matches or beats CodeLlama-34B on code tasks despite being smaller, and StarCoder2-3B outperforms prior 15B models, with open weights and exact training data identifiers released.
End-to-end PDDL Planning with Hardcoded and Dynamic Agents cs.AI · 2025-12-10 · unverdicted · none · ref 24
An end-to-end LLM framework refines natural language into valid PDDL domains and problems via hardcoded and dynamic agents, generates plans with standard engines, and returns readable output.

Cohn, Nigel Shadbolt, and Michael Wooldridge

fields

years

verdicts

representative citing papers

citing papers explorer