Vehicle enables compositional verification of neural controllers in discrete and continuous cyber-physical systems across Rocq, Isabelle/HOL, Agda, and Imandra, including the first infinite time-horizon safety proof for a continuous medical device in a general-purpose ITP.
hub Canonical reference
Davis, Guillaume Baudart, and Louis Mandel
Canonical reference. 89% of citing Pith papers cite this work as background.
hub tools
citation-role summary
citation-polarity summary
fields
cs.SE 6 cs.CV 4 cs.HC 3 cs.AI 2 cs.CR 2 cs.CY 2 cs.DC 2 cs.LG 2 cond-mat.mes-hall 1 cond-mat.stat-mech 1roles
background 9representative citing papers
Malicious LLM API routers actively perform payload injection and secret exfiltration, with 9 of 428 tested routers showing malicious behavior and further poisoning risks from leaked credentials.
A Yoneda lemma for categorical supermaps gives a concrete representation via channel-state duality whenever the theory has it, yielding stable definitions for boxworld and real quantum theory.
LLM2Ltac mines symbolic tactics from 11,725 Coq theorems using LLMs and integrates them into CoqHammer, improving proof rates by 23.87% on 6,199 theorems from four large verification projects.
Manifold k-NN generalizes DP-NNS to k-NN queries on manifold point clouds via a recursive successor-list property, delivering 1-10x speedups and full dynamic support.
A complete, axiom-free Lean 4 formalization of combinatorial curve neighborhoods for A1(1) has been achieved by encoding them as maximal vertices in degree-bounded reachable sets of the infinite dihedral moment graph.
Embedding CUDA Graphs in UCX for multi-path intra-node GPU communication yields up to 2.95x bandwidth improvement over single-path UCX on a four-GPU node for large messages.
GraphQLify automates REST-to-GraphQL migration via static source code analysis, delivering 100% type-safe conversions on 834 APIs and 2-4x faster performance than REST for multi-call workflows.
SoulNote enables multi-session GenAI songwriting for DHH users, producing measurable gains in self-insight, emotion regulation, and self-care attitudes.
Users entangle their lived experiences with AI predictions in menstrual tracking apps, leading to self-fulfilling prophecies, limited critical awareness from UI, and isolation for non-normative users.
A machine-checked Lean 4 formalization of Stokes' theorem on smooth singular cubes with true Fréchet pullback, chain-level extensions, and comparison to prior HOL Light work.
Fusing chart visualizations with raw time series improves or maintains classification accuracy on UCR datasets when the visuals add non-redundant information.
Reproduction confirms PAG boosts generative retrieval effectiveness, but its look-ahead planning signal collapses under intent-preserving typos and query mismatches, reverting performance to unguided decoding.
Checksum Count Vectors enable robust similarity search to identify duplicate and variant legacy media recordings with high accuracy despite substantial data damage.
A minimal agentic system achieves competitive performance in automated theorem proving with a simpler design and lower cost than state-of-the-art methods.
Zero-shot inversion-free flow method de-identifies skin images in under 20 seconds while preserving pathological features with IoU stability exceeding 0.67 using segment-by-synthesis and CIELAB decoupling.
13 participants became convinced AI understands human values after chatbot interactions evaluated with the VAPT toolkit.
A data-driven framework reduces particle-based transfer operators via concentration projection, geometric manifold, and finite-state discretization to reproduce clustering transitions and metastable states from simulation data.
The authors predict that measuring increased frequency variance in a megahertz resonator due to fluctuating nuclear spin polarization enables single nuclear spin detection with existing devices.
The authors define Attribute-Authenticated Continuous Group Key Agreement (AA-CGKA), prove security properties of requirement integrity, unforgeability and unlinkability, and report an implementation with performance comparable to certificate-based MLS.
A new bias-aware benchmark for duplicate bug report detection shows simpler techniques outperform recent sophisticated methods on most projects and match industry tools.
Proves that most bounded-modification problems for making transition systems implementable by flip-flop nets are NP-complete.
Microstate tokenizer from clustered EEG signals provides universal representations that outperform traditional time- and frequency-domain features across sleep staging, emotion recognition, and motor imagery tasks.
U-Schema acts as a common pivot representation that reduces the number of direct mappings needed for schema and data migration across heterogeneous database paradigms while preserving structure and query behavior.
citing papers explorer
-
Compositional Neural-Cyber-Physical System Verification in the Interactive Theorem Prover of Your Choice
Vehicle enables compositional verification of neural controllers in discrete and continuous cyber-physical systems across Rocq, Isabelle/HOL, Agda, and Imandra, including the first infinite time-horizon safety proof for a continuous medical device in a general-purpose ITP.
-
Your Agent Is Mine: Measuring Malicious Intermediary Attacks on the LLM Supply Chain
Malicious LLM API routers actively perform payload injection and secret exfiltration, with 9 of 428 tested routers showing malicious behavior and further poisoning risks from leaked credentials.
-
Supermaps on generalised theories
A Yoneda lemma for categorical supermaps gives a concrete representation via channel-state duality whenever the theory has it, yielding stable definitions for boxworld and real quantum theory.
-
A Learning Method for Symbolic Systems Using Large Language Models
LLM2Ltac mines symbolic tactics from 11,725 Coq theorems using LLMs and integrates them into CoqHammer, improving proof rates by 23.87% on 6,199 theorems from four large verification projects.
-
Manifold k-NN: Accelerated k-NN Queries for Manifold Point Clouds
Manifold k-NN generalizes DP-NNS to k-NN queries on manifold point clouds via a recursive successor-list property, delivering 1-10x speedups and full dynamic support.
-
Formalizing $A_1^{(1)}$ Curve Neighborhoods in Lean 4
A complete, axiom-free Lean 4 formalization of combinatorial curve neighborhoods for A1(1) has been achieved by encoding them as maximal vertices in degree-bounded reachable sets of the infinite dihedral moment graph.
-
Accelerating Intra-Node GPU-to-GPU Communication Through Multi-Path Transfers with CUDA Graphs
Embedding CUDA Graphs in UCX for multi-path intra-node GPU communication yields up to 2.95x bandwidth improvement over single-path UCX on a four-GPU node for large messages.
-
GraphQLify: Automated and Type Safety-Preserving GraphQL API Adoption
GraphQLify automates REST-to-GraphQL migration via static source code analysis, delivering 100% type-safe conversions on 834 APIs and 2-4x faster performance than REST for multi-call workflows.
-
From Daily Song to Daily Self: Supporting Reflective Songwriting of Deaf and Hard-of-Hearing Individuals through Generative Music AI
SoulNote enables multi-session GenAI songwriting for DHH users, producing measurable gains in self-insight, emotion regulation, and self-care attitudes.
-
"It became a self-fulfilling prophecy": How Lived Experiences are Entangled with AI Predictions in Menstrual Cycle Tracking Apps
Users entangle their lived experiences with AI predictions in menstrual tracking apps, leading to self-fulfilling prophecies, limited critical awareness from UI, and isolation for non-normative users.
-
Stokes' Theorem for Smooth Singular Cubes in Lean 4: True Pullback, Bridges to mathlib4, and Chain-Level d^2=0
A machine-checked Lean 4 formalization of Stokes' theorem on smooth singular cubes with true Fréchet pullback, chain-level extensions, and comparison to prior HOL Light work.
-
VTBench: A Multimodal Framework for Time-Series Classification with Chart-Based Representations
Fusing chart visualizations with raw time series improves or maintains classification accuracy on UCR datasets when the visuals add non-redundant information.
-
Lost in Decoding? Reproducing and Stress-Testing the Look-Ahead Prior in Generative Retrieval
Reproduction confirms PAG boosts generative retrieval effectiveness, but its look-ahead planning signal collapses under intent-preserving typos and query mismatches, reverting performance to unguided decoding.
-
Prints in the Magnetic Dust: Robust Similarity Search in Legacy Media Images Using Checksum Count Vectors
Checksum Count Vectors enable robust similarity search to identify duplicate and variant legacy media recordings with high accuracy despite substantial data damage.
-
A Minimal Agent for Automated Theorem Proving
A minimal agentic system achieves competitive performance in automated theorem proving with a simpler design and lower cost than state-of-the-art methods.
-
Zero-Shot Generative De-identification: Inversion-Free Flow for Privacy-Preserving Skin Image Analysis
Zero-shot inversion-free flow method de-identifies skin images in under 20 seconds while preserving pathological features with IoU stability exceeding 0.67 using segment-by-synthesis and CIELAB decoupling.
-
AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations
13 participants became convinced AI understands human values after chatbot interactions evaluated with the VAPT toolkit.
-
Data-driven Reduction of Transfer Operators for Particle Clustering Dynamics
A data-driven framework reduces particle-based transfer operators via concentration projection, geometric manifold, and finite-state discretization to reproduce clustering transitions and metastable states from simulation data.
-
Near-resonant nuclear spin detection with megahertz mechanical resonators
The authors predict that measuring increased frequency variance in a megahertz resonator due to fluctuating nuclear spin polarization enables single nuclear spin detection with existing devices.
-
Attribute-Based Authentication in Secure Group Messaging for Distributed Environments and Safer Online Spaces
The authors define Attribute-Authenticated Continuous Group Key Agreement (AA-CGKA), prove security properties of requirement integrity, unforgeability and unlinkability, and report an implementation with performance comparable to certificate-based MLS.
-
Duplicate Bug Report Detection: How Far Are We?
A new bias-aware benchmark for duplicate bug report detection shows simpler techniques outperform recent sophisticated methods on most projects and match industry tools.
-
On the Complexity of Techniques That Make Transition Systems Implementable by Boolean Nets
Proves that most bounded-modification problems for making transition systems implementable by flip-flop nets are NP-complete.
-
Atoms of Thought: Universal EEG Representation Learning with Microstates
Microstate tokenizer from clustered EEG signals provides universal representations that outperform traditional time- and frequency-domain features across sleep staging, emotion recognition, and motor imagery tasks.
-
A Model-Driven Approach to Database Migration with a Unified Data Model
U-Schema acts as a common pivot representation that reduces the number of direct mappings needed for schema and data migration across heterogeneous database paradigms while preserving structure and query behavior.
-
Explaining Graph Neural Networks for Node Similarity on Graphs
Empirical comparison shows gradient-based explanations for GNN node similarities are actionable, consistent, and retain effects when sparsified, unlike mutual information explanations.
-
Correctness Notions for Petri Nets with Identifiers
Extends Petri nets with identifiers for object- and resource-aware systems and defines generalized correctness criteria with decidability analysis.
-
GenAI in Software Engineering: The Role of Technology Acceptance Models
UTAUT is suitable for studying individual barriers to GenAI use in software engineering when combined with Bayesian analysis, with three priorities for future research on construct refinement, operationalization, and statistical methods.
-
New Kids: An Architecture and Performance Investigation of Second-Generation Serverless Platforms
Second-generation serverless platforms using lightweight isolates and edge deployment achieve roughly 10 ms warm latency and negligible cold starts, according to architecture analysis of seven platforms and microbenchmarks totaling over 38 million function calls.
-
Reflections on the design, applications and implementations of the normative specification language eFLINT
eFLINT is a normative specification language combining declarative and procedural elements to support automated compliance reasoning before, during, and after software execution.
-
Community-Informed AI Models for Police Accountability
Proposes and illustrates a community-informed, multi-perspective approach to developing AI for analyzing LAPD body-worn camera footage of traffic stops.
-
Building Network Digital Twins Part II: Real-Time Adaptive PID for Enhanced State Synchronization
An adaptive PID controller is integrated into a Network Digital Twin to enhance real-time traffic state synchronization, with results demonstrated via an interactive user interface.
-
Towards the Development of Detection of Learned Helplessness in Mathematics: Design and Data Collection Challenges from a Developing Country Perspective
A descriptive report details how outdated devices, unreliable internet, short sessions, and administrative delays reduced usable data collection from 410 to 118 students for a planned learned helplessness detection model in linear equations tutoring.
-
Test Case Generation for Simulink Models: An Experience from the E-Bike Domain
An empirical evaluation found that HECATE generated failure-revealing test cases for 83% of 36 experiments on e-bike Simulink controllers, averaging 1 hour 17 minutes per run, with developer confirmation of the failures.
-
Open Problems in a Logic of Gossips
This paper compiles and discusses several open problems in the modal logic for gossip protocols that use epistemic formulas.
- The Forensic Cost of Watermark Removal: From Dedicated Attacks to Image Editing