Does social identity matter in software engineering? Assessing the case of research software engineers
Pith reviewed 2026-05-07 15:59 UTC · model grok-4.3
The pith
Research software engineers are forming a collective professional identity that shapes their wellbeing.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using computational linguistic analysis of social media posts and blogs together with inferential statistics on survey responses, the study shows that a collective RSE identity is emerging and that this identity influences professional wellbeing.
What carries the argument
The collective RSE identity, detected through linguistic markers in public posts and confirmed by survey responses, as the mechanism connecting group membership to individual wellbeing outcomes.
If this is right
- Strengthening community practices among RSEs could improve wellbeing indicators such as job satisfaction and retention.
- Other specialized software engineering roles may develop similar identities that affect daily work experience.
- Interventions aimed at professional community building in software engineering could be evaluated by tracking identity markers over time.
- The interdisciplinary link between social psychology and software engineering offers a new lens for studying career sustainability in technical fields.
Where Pith is reading between the lines
- Similar identity dynamics may operate in adjacent roles like data scientists or scientific programmers, warranting parallel studies.
- If identity formation is causal, early-career RSEs could be supported by structured community onboarding to accelerate wellbeing benefits.
- The linguistic methods used here could be applied to other online professional forums to detect emerging identities before formal surveys exist.
Load-bearing premise
The combination of linguistic analysis of posts and blogs with survey answers from 381 respondents accurately measures the existence and effects of social identity without major selection bias or unmeasured influences.
What would settle it
A replication study that finds no reliable association between strength of RSE group identification and wellbeing measures once other job and demographic factors are controlled.
Figures
read the original abstract
Social identity is a concept from psychology that refers to the part of an individual's identity that derives from their group membership(s). In this paper, we explore social identity in members of the professional community of Research Software Engineers (RSEs). Using a mixed-methods approach, our study combined computational linguistic analysis and inferential statistics to examine over 28,000 social media posts, 1,700 blogs, and survey responses from 381 professional RSEs. The findings highlight the emergence of a collective RSE identity and demonstrate its role in shaping professional wellbeing. This study contributes an interdisciplinary perspective by integrating social psychology and software engineering to show how a professional identity evolves and why it matters.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript explores the concept of social identity in the context of Research Software Engineers (RSEs) by employing a mixed-methods approach. This includes computational linguistic analysis of more than 28,000 social media posts and 1,700 blogs, alongside inferential statistics applied to survey responses collected from 381 professional RSEs. The central claims are that a collective RSE identity is emerging and that this identity plays a role in shaping the professional wellbeing of RSEs, thereby contributing an interdisciplinary lens combining social psychology and software engineering.
Significance. Should the empirical findings hold under scrutiny, this work would be significant for demonstrating the applicability of social identity theory to software engineering professional communities. It could provide insights into how group membership influences wellbeing in technical fields, potentially guiding community-building efforts and support structures for RSEs. The scale of the data collection (tens of thousands of posts and hundreds of survey responses) represents a substantial effort that, if analyzed rigorously, strengthens the potential contribution.
major comments (2)
- [Abstract] The abstract claims that the findings 'demonstrate its role in shaping professional wellbeing.' However, the described study design is purely observational, relying on self-selected social media content and voluntary survey participation. No details are provided on controls for potential confounders such as career stage or institutional factors, nor on methods to address reverse causation or selection bias. This is a load-bearing issue for the central claim, as correlational patterns do not suffice to establish a shaping (causal) role.
- [Abstract] The abstract describes the data sources and methods only at a high level and supplies no specific results, effect sizes, p-values, model specifications, or validation procedures. Without these, it is not possible to determine whether the data support the claims about the emergence of collective identity or its impact on wellbeing.
minor comments (1)
- The abstract could benefit from a brief mention of key limitations or the specific statistical techniques used to enhance transparency.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback on our manuscript. We address the major comments point by point below and have made revisions to the abstract to clarify the nature of our findings and provide more specific details where feasible.
read point-by-point responses
-
Referee: [Abstract] The abstract claims that the findings 'demonstrate its role in shaping professional wellbeing.' However, the described study design is purely observational, relying on self-selected social media content and voluntary survey participation. No details are provided on controls for potential confounders such as career stage or institutional factors, nor on methods to address reverse causation or selection bias. This is a load-bearing issue for the central claim, as correlational patterns do not suffice to establish a shaping (causal) role.
Authors: We agree that our study design is observational and does not permit causal claims. The linguistic analysis identifies patterns consistent with an emerging collective identity, and the survey data reveal associations between measures of social identity and wellbeing indicators. We do not have experimental controls or longitudinal data to address reverse causation or all potential confounders. Accordingly, we will revise the abstract to replace 'demonstrate its role in shaping professional wellbeing' with 'highlight the emergence of a collective RSE identity and its association with professional wellbeing.' We will also expand the discussion of limitations to explicitly address selection bias and the correlational nature of the results. The manuscript already includes some controls for career stage in the survey analysis, but we acknowledge this does not fully mitigate all confounds. revision: yes
-
Referee: [Abstract] The abstract describes the data sources and methods only at a high level and supplies no specific results, effect sizes, p-values, model specifications, or validation procedures. Without these, it is not possible to determine whether the data support the claims about the emergence of collective identity or its impact on wellbeing.
Authors: We recognize that the abstract provides a high-level summary, as is conventional due to length constraints. To address this, we will revise the abstract to include key quantitative details, such as the exact numbers of posts and blogs analyzed, the sample size of the survey, and main statistical outcomes (e.g., significant correlations or regression coefficients from the inferential analysis). Model specifications and validation procedures are detailed in the methods section, but we will add a brief mention of the primary analytical approaches in the abstract. This will allow readers to better assess the support for our claims without needing to read the full text. revision: yes
Circularity Check
No circularity: empirical mixed-methods study with no derivations or self-referential fits
full rationale
The paper conducts a mixed-methods empirical analysis combining NLP on 28k posts and 1.7k blogs with survey data from 381 RSEs to identify patterns in social identity and wellbeing. No equations, parameter fits, predictions, or uniqueness theorems appear in the provided abstract or description. All claims rest on external data collection and standard statistical/linguistic procedures rather than any self-definition, fitted-input renaming, or self-citation chain that reduces the central result to its own inputs by construction. The derivation chain is therefore self-contained and non-circular.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Bibek Acharya and Ricardo Colomo-Palacios. 2019. A systematic literature review on autonomous agile teams. In19th Int’l Conf. on Computational Science and Its Applications, 146–151
2019
-
[2]
Salter Ainsworth, Mary C
Mary D. Salter Ainsworth, Mary C. Blehar, Everett Waters, and Sally Wall. 1978.Patterns of attachment: A psychological study of the strange situation. Lawrence Erlbaum
1978
-
[3]
Alice Allen et al. 2017. Engineering Academic Software (Dagstuhl Perspectives Workshop 16252).Dagstuhl Manifestos, 6, 1, 1–20
2017
-
[4]
Amiot, Deborah J
Catherine E. Amiot, Deborah J. Terry, Dian. Wirawan, and Tim A. Grice. 2010. Changes in social identities over time: The role of coping and adaptation processes. en.British J. of Social Psychology, 49, 4, (Nov. 2010), 803–826. Does social identity matter in software engineering? EASE 2026, June 9–12, 2026, Glasgow, United Kingdom
2010
-
[5]
Arnold, Nick Turner, Julian Barling, E
Kathleen A. Arnold, Nick Turner, Julian Barling, E. Kevin Kelloway, and M. Catherine McKee. 2007. Transformational leadership and psychological well-being: the mediating role of meaningful work.J. of occupational health psychology, 12, 3, 193
2007
-
[6]
Arriaga, Mami Kumashiro, Jeffry A
Ximena B. Arriaga, Mami Kumashiro, Jeffry A. Simpson, and Nickola C. Over- all. 2018. Revising working models across time: relationship situations that enhance attachment security.Personality and Social Psychology Review, 22, 1, 71–96
2018
-
[7]
Emma A Bäck, Hanna Bäck, Marie Gustafsson Sendén, and Sverker Sikström
-
[8]
of Social and Political Psychology, 6, 1, 76–91
From i to we: group formation and linguistic adaption in an online xenophobic forum.J. of Social and Political Psychology, 6, 1, 76–91
-
[9]
Andreas Bäckevik, Erik Tholén, and Lucas Gren. 2019. Social identity in software development. In12th Int’l Workshop on Cooperative and Human Aspects of Software Engineering. IEEE, 107–114
2019
-
[10]
Horowitz
Kim Bartholomew and Leonard M. Horowitz. 1991. Attachment styles among young adults: a test of a four-category model.J. of personality and social psychology, 61, 2, 226
1991
-
[11]
Blader and Serena Yu
Steven L. Blader and Serena Yu. 2017. Are status and respect different or two sides of the same coin?Academy of Management Annals, 11, 2, 800–824
2017
-
[12]
Ronald Bledow, Jonas Kühnel, Michael Jin, and Julius Kuhl. 2021. Breaking the chains: the inverted-u-shaped relationship between action-state orientation and creativity under low job autonomy.J. of Management, 48, 4, 905–935
2021
-
[13]
1969.Attachment and loss, Vol
John Bowlby. 1969.Attachment and loss, Vol. I: Attachment. (Reprinted 1982). Basic Books
1969
-
[14]
R. L. Boyd. 2017. Psychological text analysis in the digital humanities. InData Analytics in Digital Humanities. Springer International Publishing, 161–189
2017
-
[15]
R. L. Boyd, A. Ashokkumar, S. Seraj, and J. W. Pennebaker. 2022. The develop- ment and psychometric properties of LIWC-22. Tech. rep. University of Texas at Austin. https://www.liwc.app
2022
-
[16]
Alice Boyes. 2020. How to overcome your fear of making mistakes.Harvard Business Review
2020
-
[17]
James A. Breaugh. 1985. The measurement of work autonomy.Human Rela- tions, 38, 6, 551–570
1985
-
[18]
Brooke, Daniel W
Paul P. Brooke, Daniel W. Russell, and James L. Price. 1988. Discriminant validation of measures of job satisfaction, job involvement, and organizational commitment.J. of applied psychology, 73, 2, 139
1988
-
[19]
Caldwell and Phillip R
Jessica G. Caldwell and Phillip R. Shaver. 2012. Exploring the cognitive- emotional pathways between adult attachment and ego-resiliency.Individual Differences Research, 10, 3
2012
-
[20]
Castillo
Jamie Cano and Julia X. Castillo. 2004. Factors explaining job satisfaction among faculty.J. of Agricultural education, 45, 3, 65–74
2004
-
[21]
Jeffrey C Carver, Richard P Kendall, Susan E Squires, and Douglass E Post
-
[22]
In29th Int’l Conf
Software development environments for scientific and engineering software: a series of case studies. In29th Int’l Conf. on Software Engineering. IEEE, 550–559
-
[23]
Jeffrey C. Carver. 2008. SE-CSE 2008: the first int’l workshop on software engineering for computational science and engineering. InCompanion of the 30th Int’l Conf. on Software Engineering. ACM, 1071–1072
2008
-
[24]
Edward Chen. 2026. AI is threatening science jobs. Which ones are most at risk? en.Nature, 651, 8104, (Feb. 2026), 19–20. doi:10.1038/d41586-026-00444-9
-
[25]
Cindy K Chung and James W Pennebaker. 2008. Revealing dimensions of thinking in open-ended self-descriptions: an automated meaning extraction method for natural language.J. of research in personality, 42, 1, 96–132
2008
-
[26]
Richard I Cook. 2020. Above the line, below the line.Communications of the ACM, 63, 3, 43–46
2020
-
[27]
Crittenden and Andrea Landini
Patricia M. Crittenden and Andrea Landini. 2011.Assessing adult attachment: A dynamic-maturational approach to discourse analysis. WW Norton & Co
2011
-
[28]
Davis, Stuart Jones, Ann M
William A. Davis, Stuart Jones, Ann M. Crowell-Kuhnberg, Devin O’Keeffe, Kelly M. Boyle, Stephen B. Klainer, and Steven Yule. 2017. Operative team communication during simulated emergencies: too busy to respond?Surgery, 161, 5, 1348–1356
2017
-
[29]
Diefendorff, Rebecca J
James M. Diefendorff, Rebecca J. Hall, Robert G. Lord, and Matthew L. Strean
-
[30]
of Applied Psychology, 85, 2, 250
Action–state orientation: construct validity of a revised measure and its relationship to work-related variables.J. of Applied Psychology, 85, 2, 250
-
[31]
Torgeir Dingsøyr, Magne Jørgensen, Frode Odde Carlsen, Lena Carlström, Jens Engelsrud, Kine Hansvold, Mari Heibø-Bagheri, Kjetil Røe, and Karl Ove Vika Sørensen. 2022. Enabling autonomous teams and continuous deployment at scale.IT Professional, 24, 6, 47–53
2022
-
[32]
Branscombe, Russell Spears, and Antony S
Bertjan Doosje, Nyla R. Branscombe, Russell Spears, and Antony S. R. Manstead
-
[33]
of personality and social psychology, 75, 4, 872
Guilty by association: when one’s group has a negative history.J. of personality and social psychology, 75, 4, 872
-
[34]
Anthony Downs. 1957. An economic theory of political action in a democracy. J. of political economy, 65, 2, 135–150
1957
-
[35]
Katz, David Klein, Mark Santcroos, Tobias Schlauch, Liz Sexton-Kennedy, and Anthony Truskinger
Stephan Druskat, Daniel S. Katz, David Klein, Mark Santcroos, Tobias Schlauch, Liz Sexton-Kennedy, and Anthony Truskinger. [n. d.] Policy. https://www.sof tware.ac.uk/blog/credit-and-recognition-research-software-current-state- practice-and-outlook. Accessed 2025-10-23. ()
2025
-
[36]
Fabian Fagerholm and Jürgen Münch. 2012. Developer experience: Concept and definition. InInt’l Conf. on software and system process. IEEE, 73–77
2012
-
[37]
César França, Fabio Q. B. da Silva, and Helen Sharp. 2020. Motivation and satisfaction of software engineers.IEEE Trans. on Software Engineering, 46, 2, 118–140
2020
-
[38]
César França, Helen Sharp, and Fabio Q. B. da Silva. 2014. Motivated software engineers are engaged and focused, while satisfied ones are happy. In8th Int’l Symposium on Empirical Software Engineering and Measurement(ESEM ’14) Article 32. ACM, 8 pages
2014
-
[39]
Ghaferi and Justin B
Amir A. Ghaferi and Justin B. Dimick. 2016. Importance of teamwork, commu- nication and culture on failure-to-rescue in the elderly.J. of British Surgery, 103, 2, e47–e51
2016
- [40]
-
[41]
Michaela Greiler, Margaret-Anne Storey, and Abi Noda. 2022. An actionable framework for understanding and improving developer experience.IEEE Trans. on Software Engineering, 49, 4, 1411–1425
2022
-
[42]
i” and “we
Marie Gustafsson Sendén, Torun Lindholm, and Sverker Sikström. 2014. Se- lection bias in choice of words: evaluations of “i” and “we” differ between contexts, but “they” are always worse.Journal of Language and Social Psy- chology, 33, 1, 49–67
2014
-
[43]
Itzhak Harpaz and Xiaocong Fu. 2002. The structure of the meaning of work: a relative stability amidst change.Human relations, 55, 6, 639–667
2002
-
[44]
Alexander Haslam, Genevieve Dingle, and Mei X
Catherine Haslam, Tegan Cruwys, S. Alexander Haslam, Genevieve Dingle, and Mei X. L. Chang. 2016. Groups 4 health: evidence that a social-identity intervention that builds and strengthens social group membership improves mental health.J. of Affective Disorders, 194, 188–195
2016
-
[45]
Alexander Haslam, Anjana Iyer, Jolanda Jetten, and W
Catherine Haslam, Alice Holme, S. Alexander Haslam, Anjana Iyer, Jolanda Jetten, and W. H. Williams. 2008. Maintaining group memberships: social identity continuity predicts well-being after stroke.Neuropsychological Reha- bilitation, 18, 671–691
2008
-
[46]
Dingle, and S
Catherine Haslam, Jolanda Jetten, Tegan Cruwys, Genevieve A. Dingle, and S. Alexander Haslam. 2018.The new psychology of health: Unlocking the social cure. Routledge
2018
-
[47]
Alexander Haslam
S. Alexander Haslam. 2004.Social identity in organizations: The social identity approach. Sage
2004
-
[48]
Alexander Haslam, Penelope J
S. Alexander Haslam, Penelope J. Oakes, Katherine J. Reynolds, and John C. Turner. 1999. Social identity salience and the emergence of stereotype consensus.Personality and social psychology bulletin, 25, 7, 809–818
1999
-
[49]
Simon Hettrick. 2016. A not-so-brief history of research software engineers. https://www.software.ac.uk/blog/2016-08-17-not-so-brief-history-researc h-software-engineers-0. Accessed 2025-06-25. (2016)
2016
-
[50]
Lorin Hochstein, Filippo Lanubile, Laura Nolan, and Rafael Prikladnicki. 2023. Developing Your Software Engineering Career: Words of Advice From Sea- soned Professionals.IEEE Software, 40, 05, 29–33
2023
-
[51]
Rashina Hoda, James Noble, and Stuart Marshall. 2013. Self-organizing roles on agile software development teams.IEEE Trans. on Software Engineering, 39, 3, 422–444
2013
-
[52]
Hogg and Scott A
Michael A. Hogg and Scott A. Reid. 2006. Social identity, self-categorization, and the communication of group norms.Communication Theory, 16, 7–30
2006
-
[53]
2011.Resilience Engineering and Safety Management
Erik Hollnagel. 2011.Resilience Engineering and Safety Management. Mines Paris Tech
2011
-
[54]
Erik Hollnagel. 2014. Resilience engineering and the built environment.Build- ing Research & Information, 42, 2, 221–228
2014
-
[55]
Hovenden, S.D
F.M. Hovenden, S.D. Walker, H.C. Sharp, and M. Woodman. 1996. Building quality into scientific software.Software Quality, 5, 25–32
1996
-
[56]
Alexander Haslam, and Catherine Haslam
Jolanda Jetten, S. Alexander Haslam, and Catherine Haslam. 2012. The case for a social identity analysis of health and well-being. InThe social cure: Identity, health and well-being. Psychology Press, 3–20
2012
-
[57]
Jost, Vasso Chaikalis-Petritsis, Dominic Abrams, Jim Sidanius, Jojan- neke Van Der Toorn, and Christopher Bratt
John T. Jost, Vasso Chaikalis-Petritsis, Dominic Abrams, Jim Sidanius, Jojan- neke Van Der Toorn, and Christopher Bratt. 2012. Why men (and women) do and don’t rebel: effects of system justification on willingness to protest. Personality and social psychology Bulletin, 38, 2, 197–208
2012
-
[58]
1996.The social psychology of collective action: Identity, injustice and gender
Caroline Kelly and Sara Breinlinger. 1996.The social psychology of collective action: Identity, injustice and gender. Taylor & Francis US
1996
-
[59]
Klohnen and Sara Bera
Eva C. Klohnen and Sara Bera. 1998. Behavioral and experiential patterns of avoidantly and securely attached women across adulthood: a 31-year longitu- dinal perspective.J. of personality and social psychology, 74, 1, 211
1998
-
[60]
Miriam Koschate, Elahe Naserian, Luke Dickens, Avelie Stuart, Alessandra Russo, and Mark Levine. 2021. Asia: automated social identity assessment using linguistic style.Behavior Research Methods, 53, 4, 1762–1781
2021
-
[61]
Julius Kuhl. 2000. A functional-design approach to motivation and self-regulation: the dynamics of personality systems interactions. InHandbook of self-regulation. Academic Press, 111–169
2000
- [62]
-
[63]
Jost, Richard Bonneau, Megan MacDuffee Metzger, Sharareh Noorbaloochi, and Duncan Penfold-Brown
Melanie Langer, John T. Jost, Richard Bonneau, Megan MacDuffee Metzger, Sharareh Noorbaloochi, and Duncan Penfold-Brown. 2019. Digital dissent: An analysis of the motivational contents of tweets from an Occupy Wall Street demonstration.Motivation Science, 5, 1, 14
2019
- [64]
-
[65]
Kay, and Gail J
Kelly Laurin, Aaron C. Kay, and Gail J. Fitzsimons. 2012. Reactance versus rationalization: divergent responses to policies that constrain freedom.Psy- chological Science, 23, 2, 205–209
2012
-
[66]
Tamara Lopez, Helen Sharp, Arosha Bandara, Thein Tun, Mark Levine, and Bashar Nuseibeh. 2023. Security responses in software development.ACM Trans. on Software Engineering and Methodology, 32, 3, 1–29
2023
-
[67]
Tamara Lopez, Helen Sharp, Michel Wermelinger, Melanie Langer, Mark Levine, Caroline Jay, Yijun Yu, and Bashar Nuseibeh. 2023. Accounting for socio-technical resilience in software engineering. In16th Int’l Conf. on Coop- erative and Human Aspects of Software Engineering. IEEE, 31–36
2023
-
[68]
T. T. Manning. 2003. Leadership across cultures: attachment style influences. J. of Leadership & Organizational Studies, 9, 3, 20–30
2003
-
[69]
G Marion et al. 2022. Modelling: understanding pandemics and how to control them. epidemics 39, 100588. (2022)
2022
-
[70]
Mario Mikulincer. 1997. Adult attachment style and information processing: individual differences in curiosity and cognitive closure.J. of personality and social psychology, 72, 5, 1217
1997
-
[71]
Mario Mikulincer and Phillip R. Shaver. 2003.The attachment behavioral system in adulthood: Activation, psychodynamics, and interpersonal processes
2003
-
[72]
Shaver, and Dalit Pereg
Mario Mikulincer, Phillip R. Shaver, and Dalit Pereg. 2003. Attachment theory and affect regulation: the dynamics, development, and cognitive consequences of attachment-related strategies.Motivation and emotion, 27, 2, 77–102
2003
-
[73]
Nils Brede Moe, Viktoria Stray, Darja Smite, and Marius Mikalsen. 2023. Attractive Workplaces: What are Engineers Looking For?IEEE Software
2023
-
[74]
Muller and Karl-Dieter Opp
Edward N. Muller and Karl-Dieter Opp. 1986. Rational choice and rebellious collective action.American Political Science Review, 80, 2, 471–487
1986
-
[75]
Nord, Arthur P
Walter R. Nord, Arthur P. Brief, Judith M. Atieh, and Elizabeth M. Doherty
-
[76]
Studying meanings of work: the case of work values. (1990)
1990
-
[77]
Elizabeth O’Carroll. 2021. Building a career path for research software engi- neers. https://research.princeton.edu/news/building-career-path-research- software-engineers. Accessed 2025-06-25. (June 2021)
2021
-
[78]
Chris Parr. 2013. Save your work–give software engineers a career track. Times Higher Education, 15
2013
-
[79]
Urszula Pawlicka-Deger. 2022. Digital humanities needs equality between humanists and technicians. https://www.timeshighereducation.com/blog /digital-humanities-needs-equality-between-humanists-and-technicians. Accessed 2025-06-25. (July 2022)
2022
-
[80]
Sophie Prentice. 2022. The fear of losing your job. InThe Future of Workplace Fear: How Human Reflex Stands in the Way of Digital Transformation. Apress, 81–97
2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.