archive
Every paper Pith has read. Search by title, abstract, or pith.
1373 papers in cs.CY · page 3
-
AI edits can steer collective opinions across networks
AI-Mediated Communication Can Steer Collective Opinion
-
MLB needed seven years to automate its clearest rule
Inside Baseball: The Automated Ball-Strike System as an Object Lesson in Technological Rule Enforcement
-
Bots drive higher compliance than humans after Reddit post removals
Who, Why, and How: Disentangling the Effects of Moderation Source, Context, and Language on Post-Removal Behavior
-
LTL-based methods outperform LLMs at auditing AI behavior
Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems
-
Value profiles from surveys cut LLM cross-country errors
Improving Cross-Cultural Survey Simulation with Calibrated Value Personas
-
Manufacturing ransomware recovery goes beyond backups
From Backup Restoration to Minimum Viable Factory Recovery: A Systematization of Ransomware Recovery in Manufacturing Systems
-
Live demo shows path to quantum-safe blockchains
Quantum Futures Interactive: A Live Demonstration of Post-Quantum Blockchain Security, Infrastructure Tradeoffs, and Sustainable Distributed Trust
-
Simulated annealing selects better climate year subsets for energy models
Bridging the climate to energy data gap: simulated annealing for representative climate year selection
-
Endogenous tokens fail as money on shared ledgers
Privacy is Fungibility: Why Endogenous Tokens Are Not Money
-
RL-timed GenAI access raises test scores and metacognitive accuracy
Access Timing as Scaffolding: A Reinforcement Learning Approach to GenAI in Education
-
Adversarial bandits let social agents adapt strategies online
ALSO: Adversarial Online Strategy Optimization for Social Agents
-
Replicated social studies measure AI agents match to human conclusions
Validated Hypotheses as a Lens for Human-Likeness Evaluation in AI Agents
-
Framework names eight digital wastes and vets AI first
GreenZ: A Sustainable UX Framework for Complex Digital Systems
-
Ghana AI legal tool handles 32,000 student queries in 30 months
Eskwai for Students: Generative AI Assistant for Legal Education in Ghana
-
WhatsApp AI bot offers science help to West African students
Adesua: Development and Feasibility Study of an AI WhatsApp Bot for Science Learning in West Africa
-
CelebA embeds beauty double standards in labels
Beyond Performance Disparities: A Three-Level Audit of Representational Harm in CelebA
-
New metric checks if model explanations are fair across groups
GESD: Beyond Outcome-Oriented Fairness
-
Standard rules understaff SNAP call centers by ignoring redials
Due Process on Hold: A Queueing Framework for Improving Access in SNAP
-
This paper uses data from 26 million U.S
Tradeoffs are Domain Dependent: Improving Accuracy and Fairness in Property Tax Assessments
-
ViMU benchmark tests video AI on hidden meanings
ViMU: Benchmarking Video Metaphorical Understanding
-
4B genome agent matches larger LLMs on microbial trait prediction
GGBound: A Genome-Grounded Agent for Microbial Life-Boundary Prediction
-
Moderate starters gain most in AI agent workshops
Computational Thinking Development in AI Agent Creation_A Mixed-Methods Study
-
Agent harnesses allow unsafe actions even with correct final outputs
Auditing Agent Harness Safety
-
Agent harnesses break rules mid-task despite safe final answers
Auditing Agent Harness Safety
-
AI benchmarks redefine capabilities to fit their own rules
The Evaluation Trap: Benchmark Design as Theoretical Commitment
-
Safety refusals rise with Korean language but drop with Korean context
ROK-FORTRESS: Measuring the Effect of Geopolitical Transcreation for National Security and Public Safety
-
Generative models automate social doing
Synthetic Sociality: How Generative Models Privatize the Social Fabric
-
Formal checks can keep AI legal reasoning inside the text
Bridging Legal Interpretation and Formal Logic: Faithfulness, Assumption, and the Future of AI Legal Reasoning
-
GraphRAG retrieval aligns LLM agents with social values
From Descriptive to Prescriptive: Uncover the Social Value Alignment of LLM-based Agents
-
AI Overviews appear in 14% of searches with 11% unsupported claims
Measuring Google AI Overviews: Activation, Source Quality, Claim Fidelity, and Publisher Impact
-
Election tweets on X rose to 93 percent original content in 2024 from 59 percent in 2016
Amplification to Synthesis: A Comparative Analysis of Cognitive Operations Before and After Generative AI
-
Canary tokens link scrapers to the LLMs they feed
Identifying AI Web Scrapers Using Canary Tokens
-
Fine-tuning plus hierarchical prompts strengthen propaganda detection
Fine-tuning with Hierarchical Prompting for Robust Propaganda Classification Across Annotation Schemas
-
Europe Needs Preparedness Plan for AGI by 2030-2040
Europe and the Geopolitics of AGI: The Need for a Preparedness Plan
-
Students rate AI slides equal to instructor ones
AI-Generated Slides: Are They Good? Can Students Tell?
-
3C framework links competition and networks to women's computing participation
3C: Competition, Competence, and Collaboration for Women in Computing
-
Bias audits for AI image generators must match use-case risks
Context Matters: Auditing Gender Bias in T2I Generation through Risk-Tiered Use-Case Profiles
-
Watermarking turns into entity monitoring via output aggregation
Watermarking Should Be Treated as a Monitoring Primitive
-
Aggregation turns watermarking into monitoring
Watermarking Should Be Treated as a Monitoring Primitive
-
Use 'anbao' for security, keep 'anquan' for safety in Chinese tech writing
Not All Anquan Is the Same: A Terminological Proposal for Chinese Computer Science and Engineering
-
Chinese tech writing needs separate terms for safety and security
Not All Anquan Is the Same: A Terminological Proposal for Chinese Computer Science and Engineering
-
GenAI flattens L2 writers' voices into uniform English
The Cost of Perfect English: Pragmatic Flattening and the Erasure of Authorial Voice in L2 Writing Supported by GenAI
-
KITE tutor raises simulated student accuracy on algorithm tasks
Retrieval-Augmented Tutoring for Algorithm Tracing and Problem-Solving in AI Education
-
87% of teachers quit AI agent creation weeks after training
An Activity-Theoretical Approach to Teacher Professional Development in Pedagogical AI Agent Design
-
The MIRACLE system uses multiple AI agents to guide students through planning
MIRACLE_Multi-Agent Intelligent Regulation to Advance Collaborative Learning Environment
-
AI-TPACK forms through thinking style and beliefs
Modeling AI-TPACK in Practice Insights from Teachers Multi-Agent Workflow Design
-
Clinical AI models passing accuracy tests can fail hidden deployment checks
RISED: A Pre-Deployment Evaluation Framework for High-Stakes AI Decision-Support Systems, with Application to Healthcare
-
Scale separates mechanistic explanation from reproduction in LLM models
Mechanism Plausibility in Generative Agent-Based Modeling
-
Four-level scale rates LLM agent models on mechanistic plausibility
Mechanism Plausibility in Generative Agent-Based Modeling
-
Synthetic dataset benchmarks AI for swim coaching
Synthesizing the Expert: A Validated Multimodal Dataset for Trustworthy AI-Assisted Swimming Coaching