Exploring Teachers' Perspectives on Using Conversational AI Agents for Group Collaboration
Pith reviewed 2026-05-16 06:00 UTC · model grok-4.3
The pith
Teachers see a voice-based AI agent as a useful spark for classroom group engagement but raise concerns about autonomy, trust, and fit with teaching goals.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Teachers appreciated Phoenix's capacity to stimulate engagement in face-to-face groups but expressed concerns around autonomy, trust, anthropomorphism, and pedagogical alignment, yielding empirical insights into educators' mental models of AI and core design tensions for group-facing agents.
What carries the argument
Phoenix, a voice-based conversational agent designed to function as a near-peer mediator in face-to-face group collaboration, analyzed through qualitative data from teacher playtesting sessions, surveys, and focus groups.
Load-bearing premise
That the perceptions collected from 33 teachers during controlled playtesting sessions, surveys, and focus groups accurately reflect how the agent would perform and be received in everyday classroom settings with diverse student groups.
What would settle it
Deploy Phoenix in ordinary K-12 classrooms for several weeks and compare observed student engagement levels, instances of student autonomy, and teacher-reported trust issues against matched groups that do not use the agent.
read the original abstract
Collaboration is a cornerstone of 21st-century learning, yet teachers continue to face challenges in supporting productive peer interaction. Emerging generative AI tools offer new possibilities for scaffolding collaboration, but their role in mediating in-person group work remains underexplored, especially from the perspective of educators. This paper presents findings from an exploratory qualitative study with 33 K12 teachers who interacted with Phoenix, a voice-based conversational agent designed to function as a near-peer in face-to-face group collaboration. Drawing on playtesting sessions, surveys, and focus groups, we examine how teachers perceived the agent's behavior, its influence on group dynamics, and its classroom potential. While many appreciated Phoenix's capacity to stimulate engagement, they also expressed concerns around autonomy, trust, anthropomorphism, and pedagogical alignment. We contribute empirical insights into teachers' mental models of AI, reveal core design tensions, and outline considerations for group-facing AI agents that support meaningful, collaborative learning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents findings from an exploratory qualitative study involving 33 K-12 teachers who interacted with Phoenix, a voice-based conversational AI agent designed to act as a near-peer in face-to-face group collaboration. Data were collected via playtesting sessions, surveys, and focus groups to examine teachers' perceptions of the agent's behavior, its influence on group dynamics, and its classroom potential. Key results indicate that many teachers appreciated the agent's ability to stimulate engagement, while also raising concerns about autonomy, trust, anthropomorphism, and pedagogical alignment. The authors contribute empirical insights into teachers' mental models of AI, identify core design tensions, and outline considerations for developing group-facing AI agents to support collaborative learning.
Significance. If the results hold, this work provides timely empirical data on educator perspectives regarding AI tools for mediating in-person collaboration, an area that remains underexplored in HCI and educational technology. The multi-method qualitative approach yields rich insights into design tensions such as trust and anthropomorphism that can inform future agent development. Strengths include direct engagement with practicing teachers and the identification of actionable considerations for group-facing agents. The exploratory framing appropriately tempers broad claims, though the controlled nature of the data collection constrains immediate applicability to authentic classroom environments.
major comments (2)
- [Methods] Methods section: The description of the qualitative analysis process (how themes were derived from playtesting sessions, surveys, and focus groups) lacks sufficient detail on coding procedures, theme identification, or measures to ensure rigor such as inter-coder reliability. This is load-bearing for the central claims about specific concerns (autonomy, trust, anthropomorphism, pedagogical alignment) and makes it difficult to evaluate whether the themes were derived systematically or influenced by researcher bias.
- [Discussion] Discussion / Abstract: The claims about the agent's 'classroom potential' and design considerations for group-facing agents rest on perceptions gathered exclusively from controlled playtesting sessions without students present, curriculum constraints, or repeated real-world exposure. The untested assumption that these artificial-session views predict authentic classroom reception and performance needs stronger qualification or additional evidence to support the paper's broader implications.
minor comments (1)
- [Abstract] Abstract: The abstract could briefly note the exploratory and controlled nature of the study to better set reader expectations for the scope of the findings.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed feedback, which helps us strengthen the transparency and framing of our exploratory study. We address each major comment below, indicating revisions to the manuscript where appropriate.
read point-by-point responses
-
Referee: [Methods] Methods section: The description of the qualitative analysis process (how themes were derived from playtesting sessions, surveys, and focus groups) lacks sufficient detail on coding procedures, theme identification, or measures to ensure rigor such as inter-coder reliability. This is load-bearing for the central claims about specific concerns (autonomy, trust, anthropomorphism, pedagogical alignment) and makes it difficult to evaluate whether the themes were derived systematically or influenced by researcher bias.
Authors: We agree that greater detail on the analysis process is warranted to support evaluation of the themes. In the revised manuscript, we will expand the Methods section to describe the thematic analysis in full: an inductive, iterative coding process applied across playtesting transcripts, open-ended survey responses, and focus group recordings; the steps for initial code generation, clustering into themes, and refinement through team discussion; and measures for rigor including dual independent coding of a data subset by two researchers followed by consensus resolution of discrepancies. This addition will clarify the systematic derivation of the reported concerns without altering the exploratory nature of the work. revision: yes
-
Referee: [Discussion] Discussion / Abstract: The claims about the agent's 'classroom potential' and design considerations for group-facing agents rest on perceptions gathered exclusively from controlled playtesting sessions without students present, curriculum constraints, or repeated real-world exposure. The untested assumption that these artificial-session views predict authentic classroom reception and performance needs stronger qualification or additional evidence to support the paper's broader implications.
Authors: We acknowledge the controlled setting of the playtesting sessions and the resulting limits on direct claims about classroom performance. As an explicitly exploratory study, our contributions center on surfacing teachers' perceptions and design tensions rather than validated predictions. We will revise the Discussion to more explicitly qualify these boundaries, stressing the artificial conditions and the necessity of future in-classroom validation studies. We will also adjust phrasing in the Abstract and Discussion to frame 'classroom potential' and design considerations as preliminary insights intended to guide subsequent development and research, thereby aligning claims more tightly with the data collected. revision: yes
Circularity Check
No circularity: empirical qualitative study with direct data collection
full rationale
The paper reports an exploratory qualitative study drawing on playtesting sessions, surveys, and focus groups with 33 K12 teachers interacting with the Phoenix agent. No equations, fitted parameters, derivations, or predictions exist that could reduce by construction to inputs or self-citations. Central claims about teachers' perceptions of engagement, autonomy, trust, anthropomorphism, and pedagogical alignment are presented as direct empirical observations rather than derived quantities. The analysis is self-contained against external benchmarks with no load-bearing self-citation chains or ansatz smuggling.
Axiom & Free-Parameter Ledger
invented entities (1)
-
Phoenix
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
exploratory qualitative study with 33 K-12 teachers... playtesting sessions, surveys, and focus groups... concerns around autonomy, trust, anthropomorphism, and pedagogical alignment
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Phoenix... voice-based conversational agent... LLM Prompt... GPT-4.1-mini
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
npj Science of Learning10(1), 1 (2025)
Ackermann, H., Henke, A., Chevalère, J., Yun, H.S., Hafner, V.V., Pinkwart, N., Lazarides, R.: Physical embodiment and anthropomorphism of ai tutors and their role in student enjoyment and performance. npj Science of Learning10(1), 1 (2025)
work page 2025
-
[2]
Thinking Skills and creativity27, 78–91 (2018)
Al-Samarraie, H., Hurmuzan, S.: A review of brainstorming techniques in higher education. Thinking Skills and creativity27, 78–91 (2018)
work page 2018
-
[3]
Amiot, C., Charoy, F., Dinet, J.: Chatbots in collaborative settings and their im- pact on virtual teamwork. Proc. ACM Hum.-Comput. Interact.9(2), 1–18 (2025)
work page 2025
-
[4]
Anderson, E., Lin, G.C., Farid, A., Fenech, M., Hanks, B., Klopfer, E., Doherty, E., Hirshfield, L., Ko, M.L.M., Foltz, P., et al.: Exploring genai technologies within collaborative learning. In: Proceedings of the 18th International Conference on Teachers’ Perspectives on Conversational AI Agents for Collaboration 13 Computer-Supported Collaborative Lear...
work page 2025
-
[5]
International Journal of Artificial Intelligence in Education pp
de Araujo, A., Papadopoulos, P.M., McKenney, S., de Jong, T.: Investigating the impact of a collaborative conversational agent on dialogue productivity and knowl- edge acquisition. International Journal of Artificial Intelligence in Education pp. 1–27 (2025)
work page 2025
-
[6]
Qualitative re- search in sport, exercise and health11(4), 589–597 (2019)
Braun, V., Clarke, V.: Reflecting on reflexive thematic analysis. Qualitative re- search in sport, exercise and health11(4), 589–597 (2019)
work page 2019
-
[7]
Computers & Education53(4), 1147–1154 (2009)
Casamayor, A., Amandi, A., Campo, M.: Intelligent assistance for teachers in collaborative e-learning environments. Computers & Education53(4), 1147–1154 (2009)
work page 2009
-
[8]
In: Proceedings of the 29th International Conference on Intelligent User Interfaces (IUI ’24)
Chiang, C.W., Lu, Z., Li, Z., Yin, M.: Enhancing ai-assisted group decision making through llm-powered devil’s advocate. In: Proceedings of the 29th International Conference on Intelligent User Interfaces (IUI ’24). pp. 103–119. Association for Computing Machinery, New York, NY, USA (2024)
work page 2024
-
[9]
Dyke, G., Adamson, D., Howley, I., Rosé, C.P.: Enhancing scientific reasoning and discussionwithconversationalagents.IEEETransactionsonLearningTechnologies 6(3), 240–247 (2013)
work page 2013
-
[10]
British Journal of Educational Technology56(2), 712–733 (2025)
Edwards, J., Nguyen, A., Lämsä, J., Sobocinski, M., Whitehead, R., Dang, B., Roberts, A.S., Järvelä, S.: Human-ai collaboration: Designing artificial agents to facilitate socially shared regulation among learners. British Journal of Educational Technology56(2), 712–733 (2025)
work page 2025
-
[11]
Evans, C.M.: Measuring student success skills: A review of the literature on col- laboration. 21st century skills. Tech. rep., National Center for the Improvement of Educational Assessment (2020)
work page 2020
-
[12]
Computers & Education 138, 1–12 (2019)
Hernández-Sellés, N., Muñoz-Carril, P.C., González-Sanmamed, M.: Computer- supported collaborative learning: An analysis of the relationship between interac- tion, emotional support and online collaborative tools. Computers & Education 138, 1–12 (2019)
work page 2019
-
[13]
In: Proceedings of the 30th International Conference on Intelligent User Interfaces
Houde, S., Brimijoin, K., Muller, M., Ross, S.I., Silva Moran, D.A., Gonzalez, G.E., Kunde, S., Foreman, M.A., Weisz, J.D.: Controlling ai agent participation in group conversations: A human-centered approach. In: Proceedings of the 30th International Conference on Intelligent User Interfaces. pp. 390–408 (2025)
work page 2025
-
[14]
Prentice Hall, Englewood Cliffs, NJ, 5th ed
Johnson, D.W., Johnson, F.P.: Joining Together: Group Theory and Group Skills. Prentice Hall, Englewood Cliffs, NJ, 5th ed. edn. (1991)
work page 1991
-
[15]
Proceedings of the ACM on human-computer interaction2(CSCW), 1–31 (2018)
Lau, J., Zimmerman, B., Schaub, F.: Alexa, are you listening? privacy perceptions, concerns and privacy-seeking behaviors with smart speakers. Proceedings of the ACM on human-computer interaction2(CSCW), 1–31 (2018)
work page 2018
-
[16]
Leong, J., Tang, J., Cutrell, E., Junuzovic, S., Baribault, G.P., Inkpen, K.: Dittos: Personalized, embodied agents that participate in meetings when you are unavail- able. Proc. ACM Hum.-Comput. Interact.8(CSCW2), 1–28 (2024)
work page 2024
-
[17]
In: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems
Liu, J., Yao, Y., An, P., Wang, Q.: Peergpt: Probing the roles of llm-based peer agents as team moderators and participants in children’s collaborative learning. In: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA (2024)
work page 2024
-
[18]
Education and Information Technologies pp
Lu, G., Ba, S., Yang, L.: Optimizing gai-assisted formative feedback: an experi- mental study on its effects on engagement, shared metacognition, and learning per- formance in online collaborative learning. Education and Information Technologies pp. 1–31 (2025) 14 P. Ravi et al
work page 2025
-
[19]
preprint arXiv:2601.17962 (2026)
Lyu, W., Wang, Y., Yue, M., Sun, Y., Suh, J., Kier, M., Yao, Z., Zhang, Y.: Designing ai peers for collaborative mathematical problem solving with middle school students: A participatory design study. preprint arXiv:2601.17962 (2026)
-
[20]
In: Proceedings of the Human Factors and Er- gonomics Society Annual Meeting
Mosier, K.L., Skitka, L.J., Burdick, M.D., Heers, S.T.: Automation bias, account- ability, and verification behaviors. In: Proceedings of the Human Factors and Er- gonomics Society Annual Meeting. vol. 40, pp. 204–208. SAGE Publications Sage CA: Los Angeles, CA (1996)
work page 1996
-
[21]
In: Proceedings of the SIGCHI conference on Human factors in computing systems
Nass, C., Steuer, J., Tauber, E.R.: Computers are social actors. In: Proceedings of the SIGCHI conference on Human factors in computing systems. pp. 72–78 (1994)
work page 1994
-
[22]
Pérez-Marín, D.: A review of the practical applications of pedagogic conversational agents to be used in school and university classrooms. Digital1(1), 18–33 (2021)
work page 2021
-
[23]
International Journal of Human-Computer Studies195, 103407 (2025)
Piercy, C.W., Montgomery-Vestecka, G., Lee, S.K.: Gender and accent stereotypes in communication with an intelligent virtual assistant. International Journal of Human-Computer Studies195, 103407 (2025)
work page 2025
-
[24]
PROJECT (Professional Journal of English Education)2(5), 594 (2019)
Rahmayanti, P., Saraswati, P.A., Bhuana, G.P., et al.: The use of ice breaker to improve students’ motivation in learning english at the tenth grade students of smk ypkkp. PROJECT (Professional Journal of English Education)2(5), 594 (2019)
work page 2019
-
[25]
Roberts, T.S., McInnerney, J.M.: Seven problems of online group learning (and their solutions). Journal of Educational Technology & Society10(4), 257–268 (2007), retrieved August 28, 2025 from https://www.learntechlib.org/p/74872/
work page 2007
-
[26]
In: International Conference on Artificial Intelligence in Education
Sankaranarayanan, S., Kandimalla, S.R., Hasan, S., An, H., Bogart, C., Murray, R.C., Hilton, M., Sakr, M., Rosé, C.: Agent-in-the-loop: Conversational agent support in service of reflection for learning during collaborative programming. In: International Conference on Artificial Intelligence in Education. pp. 273–278. Springer (2020)
work page 2020
- [27]
-
[28]
In: Proceedings of the 6th ACM Conference on Conversational User Interfaces
Sun, G., Zhan, X., Such, J.: Building better ai agents: A provocation on the util- isation of persona in llm-based conversational agents. In: Proceedings of the 6th ACM Conference on Conversational User Interfaces. pp. 1–6 (2024)
work page 2024
-
[29]
Computers and Edu- cation: Artificial Intelligence3, 100097 (2022)
Tan, S.C., Lee, A.V.Y., Lee, M.: A systematic review of artificial intelligence tech- niques for collaborative learning over the past two decades. Computers and Edu- cation: Artificial Intelligence3, 100097 (2022)
work page 2022
-
[30]
Pedagogies: An International Journal19(3), 493–503 (2024)
Tang, K.S., Cooper, G., Rappa, N., Cooper, M., Sims, C., Nonis, K.: A dialogic approach to transform teaching, learning & assessment with generative ai in sec- ondary education: A proof of concept. Pedagogies: An International Journal19(3), 493–503 (2024)
work page 2024
-
[31]
Zawacki-Richter, O., Marín, V.I., Bond, M., Gouverneur, F.: Systematic review of research on artificial intelligence applications in higher education–where are the educators? International journal of educational technology in higher education 16(1), 1–27 (2019)
work page 2019
-
[32]
Educational Psychology Review37(4), 106 (2025)
Zhang, H., Liu, Y., Jiang, M., Chen, J., Wang, M., Paas, F.: Emotional artifi- cial intelligence in education: A systematic review and meta-analysis. Educational Psychology Review37(4), 106 (2025)
work page 2025
-
[33]
Zhang, R., Duan, W., Flathmann, C., McNeese, N., Freeman, G., Williams, A.: Investigating ai teammate communication strategies and their impact in human- ai teams for effective teamwork. Proc. ACM Hum.-Comput. Interact.7(CSCW2) (2023)
work page 2023
-
[34]
Zhang, R., Duan, W., Flathmann, C., McNeese, N., Knijnenburg, B., Freeman, G.: Verbal vs. visual: How humans perceive and collaborate with ai teammates using different communication modalities in various human-ai team compositions. Proc. ACM Hum.-Comput. Interact.8(CSCW2) (Nov 2024) Teachers’ Perspectives on Conversational AI Agents for Collaboration 15
work page 2024
-
[35]
In: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems
Zhang, Z., Sun, B., An, P.: Breaking barriers or building dependency? exploring team-llm collaboration in ai-infused classroom debate. In: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems. CHI ’25, Association for Computing Machinery, New York, NY, USA (2025)
work page 2025
-
[36]
In: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems
Zheng, C., Wu, Y., Shi, C., Ma, S., Luo, J., Ma, X.: Competent but rigid: Identify- ing the gap in empowering ai to participate equally in group decision-making. In: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. pp. 1–19 (2023)
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.