2 Pith papers cite this work.
Citing papers by year: 2026 (2)
citing papers explorer
- Geographic Variation in Stack Overflow Code Quality: Evidence from a Cross-Regional Study of Coding Practices
Stack Overflow code quality varies by US region, with readability violations most common overall and fewer issues in states with higher income, internet access, and equitable wealth distribution.
- AI Evaluation Should Require Standardized Item-Level Data Releases
AI benchmark evaluations require standardized item-level data releases as core infrastructure to support validity assessment, demonstrated via the OpenEval archive of 10M responses across 155k items.