HalluWorld is a controlled benchmark using explicit reference world models to automatically label and disentangle hallucinations in LLMs across synthetic environments with varying complexity and observability.
Perry, Matthew R
6 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 2polarities
background 2representative citing papers
AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.
Single-diffractive cross sections in pp and pbar p collisions are described by a three-parameter fit in a dephasing Lindblad framework yielding a consistent decoherence factor φ ≈ 0.89 that favors CPT-invariant dephasing over CP-invariant.
Authors propose a four-stage framework to analyze opportunities and risks of generative AI across the health information journey from public sources to clinical care.
Theoretical calculations explain experimental magnetoelastic data in antiferromagnetic MnPt, linking magnetic structure to anisotropy energy and isotropic/anisotropic magnetostriction coefficients.
citing papers explorer
-
HalluWorld: A Controlled Benchmark for Hallucination via Reference World Models
HalluWorld is a controlled benchmark using explicit reference world models to automatically label and disentangle hallucinations in LLMs across synthetic environments with varying complexity and observability.
-
AlphaEvolve: A coding agent for scientific and algorithmic discovery
AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.
-
Decoherence, Perturbations and Symmetry in Lindblad Dynamics -- Implications for Diffractive Dissociation
Single-diffractive cross sections in pp and pbar p collisions are described by a three-parameter fit in a dephasing Lindblad framework yielding a consistent decoherence factor φ ≈ 0.89 that favors CPT-invariant dephasing over CP-invariant.
-
Opportunities and Risks of Generative AI through the Health Information Journey
Authors propose a four-stage framework to analyze opportunities and risks of generative AI across the health information journey from public sources to clinical care.
-
Magnetoelasticity - magnetic structure interrelation - tetragonal MnPt system study
Theoretical calculations explain experimental magnetoelastic data in antiferromagnetic MnPt, linking magnetic structure to anisotropy energy and isotropic/anisotropic magnetostriction coefficients.
- Just Type It in Isabelle! AI Agents Drafting, Mechanizing, and Generalizing from Human Hints