archive

Every paper Pith has read. Search by title, abstract, or pith.

1373 papers in cs.CY · page 6

cs.SE 2026-05-07 reviewed

Developers put ethical rules for AI agents into repo files
Operationalizing Ethics for AI Agents: How Developers Encode Values into Repository Context Files

Christoph Treude +2
stat.ME 2026-05-07 reviewed

Marginal conformal coverage leaves 13-point subgroup gaps in survey data
Socio-Conformal Calibration in Complex Survey Data: Marginal Validity Is Not Enough for Subgroup Reliability

Amir Rafe +1
cs.AI 2026-05-07 reviewed

Compute rental rates now cap human cognitive wages
Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages

Siqi Zhu
cs.CL 2026-05-07 reviewed

Small legal model tops frontier LLMs on contract extraction
A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction

Nicole Lincoln +3
cs.CR 2026-05-06 reviewed

Cryptographic tokens enable private web age verification
Age Verification in the Web -- Holy Grail to Control Access to Restricted Content

Wojciech Wodo +2
cs.CY 2026-05-06 reviewed

Interpretability serves as model evaluation when it meets scientific standards
Rigorous Interpretation Is a Form of Evaluation

Isabelle Lee +6
cs.CY 2026-05-06 reviewed

AI errors become prompts for student analysis
The Pedagogy of AI Mistakes: Fostering Higher-Order Thinking

Hadi Hosseini
cs.CY 2026-05-06 reviewed

Chatbots may make people see their minds as language models
LLMorphism: When humans come to see themselves as language models

Valerio Capraro
cs.CY 2026-05-06 reviewed

Reviews of AI find inconsistent life cycle terms and limited CO2 metrics
From Cradle to Cloud: A Life Cycle Review of AI's Environmental Footprint

Katherine Lambert +1
cs.DC 2026-05-06 reviewed

Nine-dimension model explains root causes in five of twelve DeFi incidents
Toward a Risk Assessment Framework for Institutional DeFi: A Nine-Dimension Approach

Eva Oberholzer +3
cs.LG 2026-05-06 reviewed

High schoolers build ETF forecasts via AI workflow
Human-AI Co-Mentorship in Project-Based Learning: A Case Study in Financial Forecasting

Freyaa Chawla +4
cs.LG 2026-05-06 reviewed

Anomaly detection in EHR data flags errors with low false alerts
Conditional outlier detection for clinical alerting

Milos Hauskrecht +5
cs.CL 2026-05-06 reviewed

Concept fields score sentence transitions for groundedness and novelty
Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement

Nicholas S. Kersting +4
cs.CL 2026-05-06 reviewed

Reward models favor undesirable responses in social tests
Misaligned by Reward: Socially Undesirable Preferences in LLMs

Gayane Ghazaryan +1
q-fin.RM 2026-05-06 reviewed

AI risks fall into four distinct insurability tiers
The Insurability Frontier of AI Risk: Mapping Threats to Affirmative Coverage, Silent Exposures, and Exclusions

Alex Leung +4
cs.CL 2026-05-06 reviewed

Patent language shifts forecast future tech combinations decades ahead
Anticipating Innovation Using Large Language Models

Enrico Maria Fenoaltea +4
cs.CR 2026-05-06 reviewed

Smart fridge cooling depends on IT that may fail first
Long-Term Risks of IoT Devices: The Case of the Smart Fridge

Erik Buchmann
physics.soc-ph 2026-05-06 reviewed

Padua releases traffic dataset linking flows to city context
A City-Scale Dataset of Traffic Flows, Travel Times, and Urban Context

Riccardo Cappi +5
cs.CY 2026-05-06 reviewed

Digital twin trust maps to four integration patterns across domains
Trustworthiness in Digital Twin Systems: Systematic Review and Research Horizons

Chi Fai David Lam +3
cs.CY 2026-05-06 reviewed

19 guidelines shape AI for adult learners
Guidelines for Designing AI Technologies to Support Adult Learning

Jennifer M. Reddig +18
cs.MA 2026-05-06 reviewed

DAOs govern physical AI for community-run infrastructure
DAO-enabled decentralized physical AI: A new paradigm for human-machine collaboration

Mark C. Ballandies +3
cs.CY 2026-05-06 reviewed

Roblox moderation misses grooming and abuse in millions of chats
An Evaluation of Chat Safety Moderations in Roblox

Priya Kaushik +3
cs.CV 2026-05-06 reviewed

Evaluation cards give XAI metrics consistent reporting
Evaluation Cards for XAI Metrics

Rokas Gipiškis +1
cs.SI 2026-05-05 reviewed

LLMs conform more than humans when updating beliefs in networks
Can LLMs Emulate Human Belief Dynamics?

Adiba Mahbub Proma +5
cs.DL 2026-05-05 reviewed

Bluesky posts on retracted papers favor scrutiny over errors
Science discussions of retracted articles on Bluesky: public scrutiny or misinformation spreading?

Er-Te Zheng +4
cs.CY 2026-05-05 reviewed

AI chatbots provide mental health support without validation or standards
AI and Suicide Prevention: A Cross-Sector Primer

Emily Saltz +1
cs.CL 2026-05-05 reviewed

Meaning graphs keep watermarks intact through rephrasing
SWAN: Semantic Watermarking with Abstract Meaning Representation

Ziping Ye +9
cs.LG 2026-05-05 reviewed

Library unifies fairness and privacy tools for LMIC healthcare AI
FairHealth: An Open-Source Python Library for Trustworthy Healthcare AI in Low-Resource Settings

Farjana Yesmin
math.CO 2026-05-05 reviewed

Heuristic outperforms solver for high-conflict classroom seating
Conflict-Aware Seat Assignment in Classroom Environments

Bruna Cristina Braga Charytitsch +1
cs.CY 2026-05-05 reviewed

One-way model from AI patents to public response outperforms baselines
Coupled-NeuralHP: Directional Temporal Coupling Between AI Innovation Exposure and Public Response

Amir Rafe +1
stat.ML 2026-05-05 reviewed

Five structures cut AI survey error by 25.8%
Heterogeneous Ordinal Structure Learning with Bayesian Nonparametric Complexity Discovery

Amir Rafe +1
cs.CY 2026-05-05 reviewed

Median LLM paper evaluates models 10.85 ECI behind frontier
Frontier Lag: A Bibliometric Audit of Capability Misrepresentation in Academic AI Evaluation

David Gringras +1
cs.CY 2026-05-05 reviewed

NeurIPS should require reproducible evidence for AI safety claims
NeurIPS Should Require Reproducibility Standards for Frontier AI Safety Claims

Varad Vishwarupe +3
cs.CL 2026-05-05 reviewed

The study audits five LLMs on 374k gender-swapped MIMIC-IV emergency triage vignettes and…
EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage

Richard J. Young +1
physics.ed-ph 2026-05-05 reviewed

Dialogue fixes 82% of LLM errors on image-based physics problems
A Dialogue-Based Framework for Correcting Multimodal Errors in AI-Assisted STEM Education

Akshay Syal +4
cs.LG 2026-05-05 reviewed

Model collapse hits low-resource AI hardest
Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities

Devon Jarvis +4
cs.HC 2026-05-05 reviewed

Physical objects gain responsive AI while keeping their emotional bonds
Deco: Extending Personal Physical Objects into Pervasive AI Companion through a Dual-Embodiment Framework

Zhihan Jiang +16
cs.HC 2026-05-05 reviewed

Immersive video makes self-location the core of presence without a body
Bodyless Presence: Reconsidering the Minimal Self in Immersive Video

Koichi Toida
cs.SE 2026-05-05 reviewed

Better-connected U.S.…
Geographic Variation in Stack Overflow Code Quality: Evidence from a Cross-Regional Study of Coding Practices

Elijah Zolduoarrati +2
cs.GT 2026-05-05 reviewed

LLMs in security dilemmas reproduce multipolar conflict and unraveling
Multi-Agent Strategic Games with LLMs

Maxim Chupilkin
cs.CY 2026-05-05 reviewed

AI safety work overlooks deskilling and addiction from generative AI
Brainrot: Deskilling and Addiction are Overlooked AI Risks

Ilias Chalkidis +1
cs.CY 2026-05-05 reviewed

Ad allocation must block interpretive deprivation in protected groups
Beyond Distributive Justice: Hermeneutical Fairness in Ad Delivery

Camilla Quaresmini +5
cs.LG 2026-05-05 reviewed

The paper builds a spatio-temporal graph neural network on a subgraph of eight European…
Will the Carbon Border Adjustment Mechanism Impact European Electricity Prices? A GNN-Based Network Analysis

Jiachen Shen +3
cs.CY 2026-05-05 reviewed

Data annotation firms pitch AI expertise as cheaper than human expertise
Cheap Expertise: Mapping and Challenging Industry Perspectives in the Expert Data Gig Economy

Robert Wolfe +1
cs.HC 2026-05-05 reviewed

Three attention shifts unlock public cyberbullying intervention in LLM sim
Attention: What Prevents Young Adults from Speaking Up Against Cyberbullying in an LLM-Powered Social Media Simulation

Qian Yang +5
cs.LG 2026-05-04 reviewed

LLM criminal bias vanishes with tuning then returns via distillation
Moral Sensitivity in LLMs: A Tiered Evaluation of Contextual Bias via Behavioral Profiling and Mechanistic Interpretability

Yash Aggarwal +5