
Model cards for model reporting

· 2019 · arXiv 7560.328759

Mixed citation behavior. Most common role is background (64%).

40 Pith papers citing it

Background 64% of classified citations


citation-role summary

background 13 extension 1

citation-polarity summary

background 9 support 3 extend 1 unclear 1

representative citing papers

Acceptance Cards: A Four-Diagnostic Standard for Safe Fine-Tuning Defense Claims

cs.CR · 2026-05-11 · unverdicted · novelty 8.0

Acceptance Cards is a new four-diagnostic standard for safe fine-tuning defense claims that requires statistical reliability, fresh semantic generalization, mechanism alignment, and cross-task transfer; under this protocol SafeLoRA fails the full-card pass on Gemma-2-2B-it.

Towards Measuring the Representation of Subjective Global Opinions in Language Models

cs.CL · 2023-06-28 · conditional · novelty 7.0

LLMs default to responses more similar to opinions from the USA and some European and South American countries; prompting for a country shifts alignment but can introduce stereotypes, while translation does not reliably match language speakers.

Diversed Model Discovery via Structured Table Discovery

cs.IR · 2026-05-21 · unverdicted · novelty 6.0

StructuredSemanticSearch uses table discovery operators and orientation-aware integration on model-card tables to improve evidence coverage and diversity in model recommendation queries over a semantic baseline.

The Open-Box Fallacy: Why AI Deployment Needs a Calibrated Verification Regime

cs.AI · 2026-05-11 · unverdicted · novelty 6.0

AI deployment in high-stakes areas requires domain-scoped calibrated verification with monitoring and revocation, using a proposed six-component Verification Coverage standard instead of mechanistic interpretability.

Can Agent Benchmarks Support Their Scores? Evidence-Supported Bounds for Interactive-Agent Evaluation

cs.AI · 2026-05-11 · unverdicted · novelty 6.0

Agent benchmarks can report evidence-supported score bounds instead of single misleading success rates by adding a layer that checks required artifacts for outcome verification.

CIVeX: Causal Intervention Verification for Language Agents

cs.AI · 2026-05-09 · unverdicted · novelty 6.0

CIVeX maps agent tool calls to structural causal queries, checks identifiability, and issues auditable verdicts to prevent false executions while preserving utility on confounded benchmarks.

Auditable Agents

cs.AI · 2026-04-07 · unverdicted · novelty 6.0

No agent system can be accountable without auditability, which requires five dimensions (action recoverability, lifecycle coverage, policy checkability, responsibility attribution, evidence integrity) and mechanisms to detect, enforce, and recover.

AI Disclosure with DAISY

cs.HC · 2026-04-03 · conditional · novelty 6.0

DAISY is a structured form tool that generates more complete AI disclosure statements for research papers without reducing author comfort levels.

A Human-Centric Framework for Data Attribution in Large Language Models

cs.CY · 2026-02-11 · unverdicted · novelty 6.0

Introduces a parameter-driven framework for data attribution in LLMs that enables negotiation among creators, users, and intermediaries to meet stakeholder goals within the data economy.

Industrial AI Robustness Card for Time Series Models

cs.CY · 2025-12-05 · unverdicted · novelty 6.0

The paper proposes the IARC-TS protocol that combines drift monitoring, uncertainty quantification, and stress tests to generate reproducible robustness evidence for industrial time series models mapped to EU AI Act obligations.

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

cs.AI · 2024-08-01 · conditional · novelty 6.0

Empirical analysis shows scaling inference compute via strategies like tree search can be more efficient than scaling model parameters, with 7B models plus novel search outperforming 34B models.

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

cs.CL · 2022-11-09 · unverdicted · novelty 6.0

BLOOM is a 176B-parameter open-access multilingual language model trained on the ROOTS corpus that achieves competitive performance on benchmarks, with improved results after multitask prompted finetuning.

PaLM: Scaling Language Modeling with Pathways

cs.CL · 2022-04-05 · accept · novelty 6.0

PaLM 540B demonstrates continued scaling benefits by setting new few-shot SOTA results on hundreds of benchmarks and outperforming humans on BIG-bench.

CTRL: A Conditional Transformer Language Model for Controllable Generation

cs.CL · 2019-09-11 · unverdicted · novelty 6.0

CTRL is a large conditional transformer language model that uses naturally occurring control codes to steer text generation style and content.

Ontological Knowledge Blocks: Executable Compliance and Profile-Based Validation for Trustworthy AI Systems

cs.AI · 2026-05-22 · conditional · novelty 5.0

Ontological Knowledge Blocks formalize regulatory obligations as 5-tuples linking RDF/OWL schemas, SHACL rules, evidence requirements and provenance, with a compiler enabling profile-based validation demonstrated in an HPC allocation scenario.

NIMROD-to-IMAS workflow for extended-magnetohydrodynamic data with reusable datasets and implications for IMAS schema development

physics.plasm-ph · 2026-05-22 · unverdicted · novelty 5.0

A NIMROD-to-IMAS conversion workflow preserves equilibrium, profile, perturbation and grid data from an edge harmonic oscillation simulation and identifies gaps in the IMAS schema for extended MHD.

The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents

cs.HC · 2026-05-20 · unverdicted · novelty 5.0

Empirical analysis of 1,524 AI incident reports shows 83% arise from worker-AI trait misalignments, with 74% of those traceable to developers prioritizing efficiency over precision or personalization.

The Agentic Economy: Humans, AI Agents, Robots, and the Measurable Transition toward Distributed Economic Action

econ.EM · 2026-05-18 · unverdicted · novelty 5.0

The agentic economy features distributed economic action across humans, AI agents, robots, protocols, and energy systems, with quantitative diagnostics from public data indicating accelerating AI adoption, robot capacity, and task reallocation rather than labor disappearance.

Beyond Model Readiness: Institutional Readiness for AI Deployment in Public Systems

cs.CY · 2026-05-17 · unverdicted · novelty 5.0

Introduces the Institutional Alignment Readiness (IAR) framework with five dimensions to evaluate institutional deployment readiness for AI in public systems, motivated by two anonymized education-sector cases.

Voices in the Loop: Mapping Participatory AI

cs.AI · 2026-05-16 · unverdicted · novelty 5.0 · 2 refs

Authors build a harmonized, geolocated atlas of participatory AI projects from existing and new sources, documenting geographic concentration and participation mostly at problem formulation and evaluation stages while providing update and governance mechanisms.

Mechanism Plausibility in Generative Agent-Based Modeling

cs.MA · 2026-05-12 · unverdicted · novelty 5.0 · 2 refs

Introduces the Mechanism Plausibility Scale, a four-level framework separating generative sufficiency from mechanistic plausibility in LLM-based agent-based models.

Exploring CoCo Challenges in ML Engineering Teams: Insights From the Semiconductor Industry

cs.SE · 2026-05-08 · unverdicted · novelty 5.0

Interviews in a semiconductor company reveal 16 collaboration and communication challenges in ML engineering teams, with unclear roles and responsibilities as the top issue, and list effective mitigation practices under hardware-driven constraints.

Recommender Systems as Control Systems

eess.SY · 2026-05-02 · unverdicted · novelty 5.0

Modeling recommender systems as control systems shows that time-optimized fairness interventions can improve overall long-term performance rather than merely trading off against utility.

Governing What the EU AI Act Excludes: Accountability for Autonomous AI Agents in Smart City Critical Infrastructure

cs.CY · 2026-05-01 · unverdicted · novelty 5.0

The EU AI Act narrows accountability for multi-agent AI in critical infrastructure by excluding safety components from key explanation and impact assessment rights, and the paper proposes AgentGov-SC, a three-layer architecture with 25 measures to address this through traceability to existing AI and

citing papers explorer

Showing 40 of 40 citing papers.

Acceptance Cards: A Four-Diagnostic Standard for Safe Fine-Tuning Defense Claims cs.CR · 2026-05-11 · unverdicted · none · ref 9
Acceptance Cards is a new four-diagnostic standard for safe fine-tuning defense claims that requires statistical reliability, fresh semantic generalization, mechanism alignment, and cross-task transfer; under this protocol SafeLoRA fails the full-card pass on Gemma-2-2B-it.
Towards Measuring the Representation of Subjective Global Opinions in Language Models cs.CL · 2023-06-28 · conditional · none · ref 79
LLMs default to responses more similar to opinions from the USA and some European and South American countries; prompting for a country shifts alignment but can introduce stereotypes, while translation does not reliably match language speakers.
Diversed Model Discovery via Structured Table Discovery cs.IR · 2026-05-21 · unverdicted · none · ref 48
StructuredSemanticSearch uses table discovery operators and orientation-aware integration on model-card tables to improve evidence coverage and diversity in model recommendation queries over a semantic baseline.
The Open-Box Fallacy: Why AI Deployment Needs a Calibrated Verification Regime cs.AI · 2026-05-11 · unverdicted · none · ref 22
AI deployment in high-stakes areas requires domain-scoped calibrated verification with monitoring and revocation, using a proposed six-component Verification Coverage standard instead of mechanistic interpretability.
Can Agent Benchmarks Support Their Scores? Evidence-Supported Bounds for Interactive-Agent Evaluation cs.AI · 2026-05-11 · unverdicted · none · ref 15
Agent benchmarks can report evidence-supported score bounds instead of single misleading success rates by adding a layer that checks required artifacts for outcome verification.
CIVeX: Causal Intervention Verification for Language Agents cs.AI · 2026-05-09 · unverdicted · none · ref 9
CIVeX maps agent tool calls to structural causal queries, checks identifiability, and issues auditable verdicts to prevent false executions while preserving utility on confounded benchmarks.
Auditable Agents cs.AI · 2026-04-07 · unverdicted · none · ref 8
No agent system can be accountable without auditability, which requires five dimensions (action recoverability, lifecycle coverage, policy checkability, responsibility attribution, evidence integrity) and mechanisms to detect, enforce, and recover.
AI Disclosure with DAISY cs.HC · 2026-04-03 · conditional · none · ref 40
DAISY is a structured form tool that generates more complete AI disclosure statements for research papers without reducing author comfort levels.
A Human-Centric Framework for Data Attribution in Large Language Models cs.CY · 2026-02-11 · unverdicted · none · ref 130
Introduces a parameter-driven framework for data attribution in LLMs that enables negotiation among creators, users, and intermediaries to meet stakeholder goals within the data economy.
Industrial AI Robustness Card for Time Series Models cs.CY · 2025-12-05 · unverdicted · none · ref 16
The paper proposes the IARC-TS protocol that combines drift monitoring, uncertainty quantification, and stress tests to generate reproducible robustness evidence for industrial time series models mapped to EU AI Act obligations.
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models cs.AI · 2024-08-01 · conditional · none · ref 219
Empirical analysis shows scaling inference compute via strategies like tree search can be more efficient than scaling model parameters, with 7B models plus novel search outperforming 34B models.
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022-11-09 · unverdicted · none · ref 282
BLOOM is a 176B-parameter open-access multilingual language model trained on the ROOTS corpus that achieves competitive performance on benchmarks, with improved results after multitask prompted finetuning.
PaLM: Scaling Language Modeling with Pathways cs.CL · 2022-04-05 · accept · none · ref 98
PaLM 540B demonstrates continued scaling benefits by setting new few-shot SOTA results on hundreds of benchmarks and outperforming humans on BIG-bench.
CTRL: A Conditional Transformer Language Model for Controllable Generation cs.CL · 2019-09-11 · unverdicted · none · ref 33
CTRL is a large conditional transformer language model that uses naturally occurring control codes to steer text generation style and content.
Ontological Knowledge Blocks: Executable Compliance and Profile-Based Validation for Trustworthy AI Systems cs.AI · 2026-05-22 · conditional · none · ref 5
Ontological Knowledge Blocks formalize regulatory obligations as 5-tuples linking RDF/OWL schemas, SHACL rules, evidence requirements and provenance, with a compiler enabling profile-based validation demonstrated in an HPC allocation scenario.
NIMROD-to-IMAS workflow for extended-magnetohydrodynamic data with reusable datasets and implications for IMAS schema development physics.plasm-ph · 2026-05-22 · unverdicted · none · ref 40
A NIMROD-to-IMAS conversion workflow preserves equilibrium, profile, perturbation and grid data from an edge harmonic oscillation simulation and identifies gaps in the IMAS schema for extended MHD.
The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents cs.HC · 2026-05-20 · unverdicted · none · ref 88
Empirical analysis of 1,524 AI incident reports shows 83% arise from worker-AI trait misalignments, with 74% of those traceable to developers prioritizing efficiency over precision or personalization.
The Agentic Economy: Humans, AI Agents, Robots, and the Measurable Transition toward Distributed Economic Action econ.EM · 2026-05-18 · unverdicted · none · ref 4
The agentic economy features distributed economic action across humans, AI agents, robots, protocols, and energy systems, with quantitative diagnostics from public data indicating accelerating AI adoption, robot capacity, and task reallocation rather than labor disappearance.
Beyond Model Readiness: Institutional Readiness for AI Deployment in Public Systems cs.CY · 2026-05-17 · unverdicted · none · ref 14
Introduces the Institutional Alignment Readiness (IAR) framework with five dimensions to evaluate institutional deployment readiness for AI in public systems, motivated by two anonymized education-sector cases.
Voices in the Loop: Mapping Participatory AI cs.AI · 2026-05-16 · unverdicted · none · ref 32 · 2 links
Authors build a harmonized, geolocated atlas of participatory AI projects from existing and new sources, documenting geographic concentration and participation mostly at problem formulation and evaluation stages while providing update and governance mechanisms.
Mechanism Plausibility in Generative Agent-Based Modeling cs.MA · 2026-05-12 · unverdicted · none · ref 59 · 2 links
Introduces the Mechanism Plausibility Scale, a four-level framework separating generative sufficiency from mechanistic plausibility in LLM-based agent-based models.
Exploring CoCo Challenges in ML Engineering Teams: Insights From the Semiconductor Industry cs.SE · 2026-05-08 · unverdicted · none · ref 16
Interviews in a semiconductor company reveal 16 collaboration and communication challenges in ML engineering teams, with unclear roles and responsibilities as the top issue, and list effective mitigation practices under hardware-driven constraints.
Recommender Systems as Control Systems eess.SY · 2026-05-02 · unverdicted · none · ref 81
Modeling recommender systems as control systems shows that time-optimized fairness interventions can improve overall long-term performance rather than merely trading off against utility.
Governing What the EU AI Act Excludes: Accountability for Autonomous AI Agents in Smart City Critical Infrastructure cs.CY · 2026-05-01 · unverdicted · none · ref 80
The EU AI Act narrows accountability for multi-agent AI in critical infrastructure by excluding safety components from key explanation and impact assessment rights, and the paper proposes AgentGov-SC, a three-layer architecture with 25 measures to address this through traceability to existing AI and
Fairness-First Design Thinking for Software Architecture cs.SE · 2026-04-20 · unverdicted · none · ref 46
A fairness-first Design Thinking method is proposed and tested in software architecture education to systematically address hidden fairness issues in digital systems.
Reckoning with the Political Economy of AI: Avoiding Decoys in Pursuit of Accountability cs.CY · 2026-04-17 · unverdicted · none · ref 103
AI accountability efforts are undermined by five decoys that create illusions of progress while co-constituting the extractive political economy of the AI Project.
Towards A Framework for Levels of Anthropomorphic Deception in Robots and AI cs.HC · 2026-04-16 · unverdicted · none · ref 41
A conceptual framework classifies anthropomorphic deception into four levels using humanlikeness, agency, and selfhood to guide ethical and practical decisions in HCI and HRI.
AI of the People, by the People, for the People: A Social Choice Approach to Collective Control of Artificial Intelligence cs.CY · 2026-04-14 · unverdicted · none · ref 91
Proposes applying social choice theory as a modeling language and axiomatic tool for incorporating collective input across the ML development pipeline.
Playing Games with My Heart: An Evaluation of AI Companion Apps cs.CY · 2026-04-08 · unverdicted · none · ref 58
All five AI companion apps use substantial dark patterns for monetization and engagement, prevalent erotica and gamification, and highly anthropomorphic designs that may foster parasocial relationships.
The Imbalanced User-AI Relationships as an Ethical Failure of Front-End Design in Healthcare AI cs.HC · 2026-03-24 · unverdicted · none · ref 11
Imbalanced user-AI relationships form a distinct front-end ethical failure in healthcare AI: design choices such as restricted inputs and suppressed uncertainty can undermine agency, while reciprocity offers a path to more balanced interactions.
What Is The Political Content in LLMs' Pre- and Post-Training Data? cs.CL · 2025-09-26 · unverdicted · none · ref 25
Training data for open LLMs is systematically left-leaning, with pre-training corpora containing more political material than post-training data and model stances aligning with data distributions.
StarCoder: may the source be with you! cs.CL · 2023-05-09 · accept · none · ref 290
StarCoderBase matches or beats OpenAI's code-cushman-001 on multi-language code benchmarks; the Python-fine-tuned StarCoder reaches 40% pass@1 on HumanEval while retaining other-language performance.
AIMBio-Mat: An AI-Native FAIR Platform for Closed-Loop Materials Discovery and Biomedical Translation physics.app-ph · 2026-05-20 · unverdicted · none · ref 27
AIMBio-Mat is a conceptual blueprint for an AI-native, FAIR, governance-aware decision layer that formulates biomedical-materials discovery as constrained multi-objective optimization under uncertainty.
FASE : A Fairness-Aware Spatiotemporal Event Graph Framework for Predictive Policing cs.LG · 2026-04-19 · unverdicted · none · ref 19
FASE pairs a spatiotemporal graph neural network and multivariate Hawkes process for crime prediction with a fairness-constrained linear program for patrol allocation, showing that allocation fairness holds in simulation but a 3.5 percentage point detection gap between minority and non-minority ZIPs
Mapping the Stochastic Penal Colony cs.CY · 2026-01-18 · unverdicted · none · ref 65
Content moderation operates as a stochastic penal colony that banishes users through the constant threat of account suspension, shown via auto-ethnographic case studies of Twitter, OpenAI DALL-E 2, and Pinterest.
Human-aligned AI Model Cards with Weighted Hierarchy Architecture cs.SE · 2025-10-08 · unverdicted · none · ref 4
Introduces CRAI-MCF, an eight-module framework distilling 217 parameters from 240 projects into a quantitative sufficiency criterion for cross-model LLM comparison grounded in Value Sensitive Design.
Building a Regional Data-Centric Materials Science Ecosystem for Processing-Rich Materials Innovation in the Great Plains cond-mat.mtrl-sci · 2026-05-19 · unverdicted · none · ref 53
Proposes a regional data-centric materials science ecosystem for the Great Plains, identifying five barriers to data sharing and outlining a staged roadmap illustrated by a high-purity germanium pilot.
AgriIR: A Scalable Framework for Domain-Specific Knowledge Retrieval cs.IR · 2026-03-17 · unverdicted · none · ref 28
AgriIR is a configurable RAG framework using modular stages and 1B-parameter models to deliver grounded, citable answers for Indian agricultural information access.
LLMs in Qualitative Research: Opportunities, Limitations, and Practical Considerations cs.HC · 2026-05-15 · unverdicted · none · ref 49
The paper outlines opportunities, limitations, and practical parameters for integrating LLMs into qualitative research while aligning with epistemological commitments like reflexivity and interpretive judgment.
Causal state binding predicts action control in language agents cs.AI · 2026-05-10 · unreviewed · ref 27 · 2 links
