archive
Every paper Pith has read. Search by title, abstract, or pith.
1373 papers in cs.CY · page 6
-
Developers put ethical rules for AI agents into repo files
Operationalizing Ethics for AI Agents: How Developers Encode Values into Repository Context Files
-
Marginal conformal coverage leaves 13-point subgroup gaps in survey data
Socio-Conformal Calibration in Complex Survey Data: Marginal Validity Is Not Enough for Subgroup Reliability
-
Compute rental rates now cap human cognitive wages
Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages
-
Compute rental rates now bound human cognitive wages
Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages
-
Small legal model tops frontier LLMs on contract extraction
A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction
-
Cryptographic tokens enable private web age verification
Age Verification in the Web -- Holy Grail to Control Access to Restricted Content
-
Interpretability serves as model evaluation when it meets scientific standards
Rigorous Interpretation Is a Form of Evaluation
-
AI Errors Become Prompts for Student Analysis
The Pedagogy of AI Mistakes: Fostering Higher-Order Thinking
-
Chatbots may make people see their minds as language models
LLMorphism: When humans come to see themselves as language models
-
Reviews of AI find inconsistent life cycle terms and limited CO2 metrics
From Cradle to Cloud: A Life Cycle Review of AI's Environmental Footprint
-
Nine-dimension model explains root causes in five of twelve DeFi incidents
Toward a Risk Assessment Framework for Institutional DeFi: A Nine-Dimension Approach
-
High schoolers build ETF forecasts via AI workflow
Human-AI Co-Mentorship in Project-Based Learning: A Case Study in Financial Forecasting
-
Anomaly detection in EHR data flags errors with low false alerts
Conditional outlier detection for clinical alerting
-
Concept fields score sentence transitions for groundedness and novelty
Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement
-
Concept fields turn corpora into groundedness detectors
Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement
-
Reward models favor undesirable responses in social tests
Misaligned by Reward: Socially Undesirable Preferences in LLMs
-
AI risks fall into four distinct insurability tiers
The Insurability Frontier of AI Risk: Mapping Threats to Affirmative Coverage, Silent Exposures, and Exclusions
-
Patent language shifts forecast future tech combinations decades ahead
Anticipating Innovation Using Large Language Models
-
Smart fridge cooling depends on IT that may fail first
Long-Term Risks of IoT Devices: The Case of the Smart Fridge
-
Padua releases traffic dataset linking flows to city context
A City-Scale Dataset of Traffic Flows, Travel Times, and Urban Context
-
Digital twin trust maps to four integration patterns across domains
Trustworthiness in Digital Twin Systems: Systematic Review and Research Horizons
-
19 guidelines shape AI for adult learners
Guidelines for Designing AI Technologies to Support Adult Learning
-
DAOs govern physical AI for community-run infrastructure
DAO-enabled decentralized physical AI: A new paradigm for human-machine collaboration
-
Roblox moderation misses grooming and abuse in millions of chats
An Evaluation of Chat Safety Moderations in Roblox
-
Roblox moderation misses grooming and bullying chats
An Evaluation of Chat Safety Moderations in Roblox
-
XAI metrics to use evaluation cards for consistent reporting
Evaluation Cards for XAI Metrics
-
LLMs conform more than humans when updating beliefs in networks
Can LLMs Emulate Human Belief Dynamics?
-
Bluesky posts on retracted papers favor scrutiny over errors
Science discussions of retracted articles on Bluesky: public scrutiny or misinformation spreading?
-
AI chatbots provide mental health support without validation or standards
AI and Suicide Prevention: A Cross-Sector Primer
-
AI chatbots lack clinical standards for suicide prevention
AI and Suicide Prevention: A Cross-Sector Primer
-
Meaning graphs keep watermarks intact through rephrasing
SWAN: Semantic Watermarking with Abstract Meaning Representation
-
Library unifies fairness and privacy tools for LMIC healthcare AI
FairHealth: An Open-Source Python Library for Trustworthy Healthcare AI in Low-Resource Settings
-
Heuristic outperforms solver for high-conflict classroom seating
Conflict-Aware Seat Assignment in Classroom Environments
-
One-way model from AI patents to response outperforms baselines
Coupled-NeuralHP: Directional Temporal Coupling Between AI Innovation Exposure and Public Response
-
Five structures cut AI survey error by 25.8%
Heterogeneous Ordinal Structure Learning with Bayesian Nonparametric Complexity Discovery
-
Median LLM paper evaluates models 10.85 ECI behind frontier
Frontier Lag: A Bibliometric Audit of Capability Misrepresentation in Academic AI Evaluation
-
NeurIPS should require reproducible evidence for AI safety claims
NeurIPS Should Require Reproducibility Standards for Frontier AI Safety Claims
-
The study audits five LLMs on 374k gender-swapped MIMIC-IV emergency triage vignettes and…
EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage
-
Dialogue fixes 82% of LLM errors on image-based physics problems
A Dialogue-Based Framework for Correcting Multimodal Errors in AI-Assisted STEM Education
-
Model collapse hits low-resource AI hardest
Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities
-
Physical objects gain responsive AI while keeping their emotional bonds
Deco: Extending Personal Physical Objects into Pervasive AI Companion through a Dual-Embodiment Framework
-
Immersive video makes self-location the core of presence without a body
Bodyless Presence: Reconsidering the Minimal Self in Immersive Video
-
Better-connected U.S
Geographic Variation in Stack Overflow Code Quality: Evidence from a Cross-Regional Study of Coding Practices
-
LLMs in security dilemmas reproduce multipolar conflict and unraveling
Multi-Agent Strategic Games with LLMs
-
AI safety work overlooks deskilling and addiction from generative AI
Brainrot: Deskilling and Addiction are Overlooked AI Risks
-
Ad allocation must block interpretive deprivation in protected groups
Beyond Distributive Justice: Hermeneutical Fairness in Ad Delivery
-
The paper builds a spatio-temporal graph neural network on a subgraph of eight European…
Will the Carbon Border Adjustment Mechanism Impact European Electricity Prices? A GNN-Based Network Analysis
-
Data annotation firms pitch AI expertise as cheaper than human expertise
Cheap Expertise: Mapping and Challenging Industry Perspectives in the Expert Data Gig Economy
-
Three attention shifts unlock public cyberbullying intervention in LLM sim
Attention: What Prevents Young Adults from Speaking Up Against Cyberbullying in an LLM-Powered Social Media Simulation
-
LLM criminal bias vanishes with tuning then returns via distillation
Moral Sensitivity in LLMs: A Tiered Evaluation of Contextual Bias via Behavioral Profiling and Mechanistic Interpretability