pith. sign in

arxiv: 1401.5300 · v2 · pith:FV4Q5BZAnew · submitted 2014-01-21 · 💻 cs.SE · cs.PL

How are identifiers named in open source software? About popularity and consistency

classification 💻 cs.SE cs.PL
keywords identifiernamingsoftwareconsistencyconventionspopularitysourcewritten
0
0 comments X
read the original abstract

With the rapid increasing of software project size and maintenance cost, adherence to coding standards especially by managing identifier naming, is attracting a pressing concern from both computer science educators and software managers. Software developers mainly use identifier names to represent the knowledge recorded in source code. However, the popularity and adoption consistency of identifier naming conventions have not been revealed yet in this field. Taking forty-eight popular open source projects written in three top-ranking programming languages Java, C and C++ as examples, an identifier extraction tool based on regular expression matching is developed. In the subsequent investigation, some interesting findings are obtained. For the identifier naming popularity, it is found that Camel and Pascal naming conventions are leading the road while Hungarian notation is vanishing. For the identifier naming consistency, we have found that the projects written in Java have a much better performance than those written in C and C++. Finally, academia and software industry are urged to adopt the most popular naming conventions consistently in their practices so as to lead the identifier naming to a standard, unified and high-quality road.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Geographic Variation in Stack Overflow Code Quality: Evidence from a Cross-Regional Study of Coding Practices

    cs.SE 2026-05 unverdicted novelty 5.0

    Stack Overflow code quality varies by US region, with readability violations most common overall and fewer issues in states with higher income, internet access, and equitable wealth distribution.