pith. machine review for the scientific record. sign in

arxiv: 1707.00452 · v1 · submitted 2017-07-03 · 💻 cs.SE

Recognition: unknown

Attribution Required: Stack Overflow Code Snippets in GitHub Projects

Authors on Pith no claims yet
classification 💻 cs.SE
keywords snippetsattributioncodelicenseanswerattributedcopieddevelopers
0
0 comments X
read the original abstract

Stack Overflow (SO) is the largest Q&A website for developers, providing a huge amount of copyable code snippets. Using these snippets raises various maintenance and legal issues. The SO license requires attribution, i.e., referencing the original question or answer, and requires derived work to adopt a compatible license. While there is a heated debate on SO's license model for code snippets and the required attribution, little is known about the extent to which snippets are copied from SO without proper attribution. In this paper, we present the research design and summarized results of an empirical study analyzing attributed and unattributed usages of SO code snippets in GitHub projects. On average, 3.22% of all analyzed repositories and 7.33% of the popular ones contained a reference to SO. Further, we found that developers rather refer to the whole thread on SO than to a specific answer. For Java, at least two thirds of the copied snippets were not attributed.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.