pith. sign in

archive

Every paper Pith has read. Search by title, abstract, or pith.

1373 papers in cs.CY · page 4

  1. cs.LG 2026-05-12 reviewed
    Feature models cut error 22-33% on student effort forecasts

    From Heuristics to Analytics: Forecasting Effort and Progress in Online Learning

    Eric S. Qiu +4

  2. physics.ed-ph 2026-05-12 reviewed
    AI forces new rules for how universities change teaching

    A Framework for institutional change in the age of AI

    David Perl-Nussbaum +1

  3. cs.CL 2026-05-12 reviewed
    LLM simulators fix answers regardless of feedback relevance

    Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators

    Heejin Do +2

  4. cs.LG 2026-05-12 reviewed
    Outcome-fair models still reason differently for similar applicants

    Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions

    Gideon Popoola +1

  5. cs.CV 2026-05-12 reviewed
    Nobody knows the state of the art in geospatial foundation models

    No One Knows the State of the Art in Geospatial Foundation Models

    Isaac Corley +8

  6. cs.CY 2026-05-12 reviewed
    Multisector moves boost upward mobility for planning alumni

    Career Mobility of Planning Alumni in the United States: Evidence from Professional Profile Data using Large Language Models

    Yan Wang +1

  7. cs.AI 2026-05-12 reviewed
    Simulator trains AI agents on utility demand response

    Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

    Jose E. Aguilar Escamilla +3

  8. cs.CL 2026-05-12 reviewed
    LLM political discourse lacks real population variation in crises

    The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events

    Gunjan +2

  9. cs.CL 2026-05-12 reviewed
    Embedding geometry flags LLM rating disagreements

    Predicting Disagreement with Human Raters in LLM-as-a-Judge Difficulty Assessment without Using Generation-Time Probability Signals

    Yo Ehara

  10. cs.CY 2026-05-12 reviewed
    AI in exams makes judging solutions the new measure of learning

    Reimagining Assessment in the Age of Generative AI: Lessons from Open-Book Exams with ChatGPT

    Qusay H. Mahmoud

  11. cs.CY 2026-05-12 reviewed
    Culturally responsive outreach builds AI knowledge in Black youth

    Early AI Literacy in Culturally Responsive STEM Outreach for Black Youth

    Qusay H. Mahmoud +4

  12. cs.AI 2026-05-12 reviewed
    LLM arbitration cuts delays at signal-free intersections

    LIDSA: Cognitive Arbitration for Signal-Free Autonomous Intersection Management

    Abderrahmane Lakas +2

  13. cs.AI 2026-05-12 reviewed
    LLM arbitration cuts intersection delays by 89 percent

    LIDSA: Cognitive Arbitration for Signal-Free Autonomous Intersection Management

    Abderrahmane Lakas +2

  14. cs.CY 2026-05-12 reviewed
    Budget split cuts gender skew in ads without excluding unknowns

    Into the Unknown: Accounting for Missing Demographic Data when Mitigating Ad Delivery Skew

    Isabel Corpus +1

  15. cs.AI 2026-05-12 reviewed
    Same facts produce different conclusions when inference profiles differ

    Why Conclusions Diverge from the Same Observations: Formalizing World-Model Non-Identifiability via an Inference

    Toru Takahashi

  16. cs.LG 2026-05-12 reviewed
    Adaptive weights add feature selection to FGW distances

    Fused Gromov-Wasserstein Distance with Feature Selection

    Harlin Lee +3

  17. cs.CL 2026-05-12 reviewed
    Poetic prompts create separate processing paths that evade LLM safety

    Metaphor Is Not All Attention Needs

    Olga Sorokoletova +8

  18. cs.AI 2026-05-12 reviewed
    LLMs hide biases that flip mortgage decisions when reactivated

    Fair outputs, Biased Internals: Causal Potency and Asymmetry of Latent Bias in LLMs for High-Stakes Decisions

    Jagdish Tripathy +1

  19. cs.CY 2026-05-12 reviewed
    GDPR access requests expose contracts of African content moderators

    Auditing African Content Moderators' Working Conditions by Using the European General Data Protection Regulation (GDPR)

    Mariame Tighanimine +6

  20. q-fin.TR 2026-05-12 reviewed
    Polymarket shows single fill-side cluster for all addresses

    Fill-Side Non-Retail Trading on Polymarket: An Empirical Study of Behavioral Tiers and Microstructure Signatures Under Quote-Attribution Constraints

    Maksym Nechepurenko

  21. cs.AI 2026-05-12 reviewed
    The paper introduces the Evaluation Differential (ED) as a divergence in AI model…

    The Evaluation Differential: When Frontier AI Models Recognise They Are Being Tested

    Varad Vishwarupe +3

  22. cs.CY 2026-05-11 reviewed
    Dataset documentation tools miss reflexivity themes

    Evaluating Structured Documentation as a Tool for Reflexivity in Dataset Development

    Eshta Bhardwaj +2

  23. cs.GT 2026-05-11 reviewed
    Strategic questions cut majority bias in AI outputs

    When to Ask a Question: Understanding Communication Strategies in Generative AI Tools

    Charlotte Park +2

  24. cs.CY 2026-05-11 reviewed
    Differentiated roles in human-AI tutoring lift growth 61%

    Improving Hybrid Human-AI Tutoring by Differentiating Human Tutor Roles Based on Student Needs

    Ashish Gurung +8

  25. cs.CL 2026-05-11 reviewed
    Persona disagreement cuts LLM cultural misalignment by 10-24%

    Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

    Huynh Trung Kiet +7

  26. cs.CL 2026-05-11 reviewed
    Persona disagreements align LLMs to cultures without training

    Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

    Huynh Trung Kiet +7

  27. cs.NI 2026-05-11 reviewed
    Platforms let anyone measure mobile network security

    Democratizing Measurement of Critical Mobile Infrastructure: Security and Privacy in an Increasingly Centralized Communication Ecosystem

    Gabriel K. Gegenhuber

  28. cs.AI 2026-05-11 reviewed
    AI framework unifies campus surveys with mental health detection

    New AI-Driven Tools for Enhancing Campus Well-being: A Prevention and Intervention Approach

    Jinwen Tang

  29. cs.CY 2026-05-11 reviewed
    TikTok users struggle to keep unwanted videos out of their For You feed

    When 'For You' Isn't For You: Measuring User Agency in TikTok's Algorithmic Feed

    Levi Kaplan +4

  30. cs.LG 2026-05-11 reviewed
    Pareto frontier of fair decisions is group threshold rules

    Fairness vs Performance: Characterizing the Pareto Frontier of Algorithmic Decision Systems

    Mieke Wilms +1

  31. cs.CY 2026-05-11 reviewed
    LLMs generate harmful stereotypes that shift with prompt language

    StereoTales: A Multilingual Framework for Open-Ended Stereotype Discovery in LLMs

    Pierre Le Jeune +6

  32. cs.CY 2026-05-11 reviewed
    Every tested LLM produces harmful stereotypes in open-ended stories

    StereoTales: A Multilingual Framework for Open-Ended Stereotype Discovery in LLMs

    Pierre Le Jeune +6

  33. cs.CY 2026-05-11 reviewed
    LLM travel agents steer 3.5-7.7pp toward high commissions

    TourMart: A Parametric Audit Instrument for Commission Steering in LLM Travel Agents

    Yao Liu

  34. cs.CY 2026-05-11 reviewed
    Blueprints could rebalance science's generation and verification costs

    Toward an Engineering of Science: Rebalancing Generation and Verification in the Age of AI

    Jiaqi W. Ma

  35. cs.AI 2026-05-11 reviewed
    AI alignment needs positive goals for human flourishing

    Positive Alignment: Artificial Intelligence for Human Flourishing

    Ruben Laukkonen +15

  36. cs.AI 2026-05-11 reviewed
    AI Alignment Must Promote Human Flourishing

    Positive Alignment: Artificial Intelligence for Human Flourishing

    Ruben Laukkonen +15

  37. cs.CY 2026-05-11 reviewed
    LLMs under-allocate pensions by factor of three in budget tests

    Social Policy of Large Language Models: How GPT, Claude, DeepSeek and Grok Allocate Social Budgets in Spain and Germany

    Claudia Benavides Cantos +1

  38. cs.CV 2026-05-11 reviewed
    100k-image dataset improves fine-grained visual privacy detection

    VPD-100K: Towards Generalizable and Fine-grained Visual Privacy Protection

    Xiaobin Hu +9

  39. cs.LG 2026-05-11 reviewed
    Dataset reveals four strategies for LLM ad success

    NaiAD: Initiate Data-Driven Research for LLM Advertising

    Yihang Zhang +4

  40. cs.CL 2026-05-11 reviewed
    AI Agents May Game Conference Acceptance by Flooding Submissions

    Position: Academic Conferences are Potentially Facing Denominator Gaming Caused by Fully Automated Scientific Agents

    Rong Shan +8

  41. physics.soc-ph 2026-05-11 reviewed
    Humanity at 0.73 on cognitive Kardashev scale

    The Cognitive Kardashev Scale: Quantifying the Material Envelope of Civilisational Computation

    Sachin Sharma

  42. cs.AI 2026-05-11 reviewed
    One invariance rule unifies all AI explanation fairness metrics

    Fairness of Explanations in Artificial Intelligence (AI): A Unifying Framework, Axioms, and Future Direction toward Responsible AI

    Gideon Popoola +1

  43. cs.CY 2026-05-10 reviewed
    CS Students Rank Pay and Location Above Ethics in Job Searches

    Cost-of-Ethics Crisis: Beliefs, Decisions, and Justifications in the Job Searches of Computer Science Students in Canada and the United States

    Mohamed Abdalla +6

  44. cs.CY 2026-05-10 reviewed
    Metaverse requires hybrid governance across law and code layers

    The Metaverse Is Not a Place Apart: Law, Code, and the Recursive Governance of Digital Space (A Review Essay on Mark Findlay, Governing the Metaverse: Law, Order and Freedom in Digital Space (2025))

    Oren Perez

  45. physics.ed-ph 2026-05-10 reviewed
    AI-powered materials discovery requires workflow-aligned AI literacy

    Preparing Students for AI-Powered Materials Discovery: A Workflow-Aligned Framework for AI Literacy, Equity, and Scientific Judgment

    Dongming Mei +2

  46. cs.CL 2026-05-10 reviewed
    Vote entropy spots safe LLM debates but misses where debate helps

    Statistical Scouting Finds Debate-Safe but Not Debate-Useful Cases: A Matched-Ceiling Study of Open-Weight LLM Reasoning Protocols

    Julia Hu +2

  47. cs.AI 2026-05-10 reviewed
    LLMs hit 95-100% math accuracy but miss most human strategies

    Beyond Accuracy: Evaluating Strategy Diversity in LLM Mathematical Reasoning

    Xia Yang +3

  48. cs.CL 2026-05-10 reviewed
    Semantic search finds more hidden Locke receptions than word matching

    Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke

    Yu Wu +4

  49. cs.CL 2026-05-10 reviewed
    Semantic search finds more implicit Locke references than keywords

    Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke

    Yu Wu +4

  50. cs.LG 2026-05-09 reviewed
    Framework spots smartphone scams early from partial app streams

    ORACLE: Anticipating Scams from Partial Trajectories in Streaming App Usage

    Wenbo Gao +8