Dynamic In-Group Persona Generation for Enhancing Human-AI Rapport

Inseo Jung; Jinkyu Kim; Jungbeom Lee; Minwoo Kang; Suhong Moon; Yoonseok Oh

arxiv: 2606.18256 · v1 · pith:MYXOV523new · submitted 2026-05-05 · 💻 cs.HC · cs.AI

Dynamic In-Group Persona Generation for Enhancing Human-AI Rapport

Yoonseok Oh , Inseo Jung , Jinkyu Kim , Jungbeom Lee , Minwoo Kang , Suhong Moon This is my paper

Pith reviewed 2026-07-01 00:28 UTC · model grok-4.3

classification 💻 cs.HC cs.AI

keywords in-group personahuman-AI rapportLLM chatbotspersona generationuser experiencecounselingpeer supportsynthetic persona

0 comments

The pith

In-group persona conditioning for LLMs significantly boosts perceived human-AI rapport.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper proposes generating dynamic in-group personas for LLM-based chatbots to improve rapport in sensitive conversations. The method extracts the user's primary concern and creates a synthetic persona that shares it while varying in other details like age or profession. A controlled human study compares it to no-persona and minimal-disclosure agents. The in-group approach led to higher scores on rapport, personal relevance, and engagement measures. Readers might care if this makes AI chatbots more effective for counseling and peer support by building trust through perceived similarity.

Core claim

The central discovery is that an agent using a dynamically generated in-group persona, which matches the user's primary concern but differs in background, produces significantly better perceived rapport, personal relevance, and positive user experience including higher engagement than either a conventional agent or one with minimal self-disclosure.

What carries the argument

Dynamic in-group persona generation, which identifies the primary concern from user input and synthesizes a persona sharing that concern with altered narrative details to condition the LLM.

If this is right

The in-group persona agent improves perceived rapport compared to baselines.
It increases personal relevance and yields more positive user experience.
Engagement is notably higher with the in-group approach.
The method targets domains like counseling where rapport is essential.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Longer interactions might benefit from updating the persona over time based on ongoing conversation.
Similar techniques could apply to other AI interaction domains such as education or customer service.
Measuring actual behavior change, not just questionnaires, would strengthen evidence for practical impact.

Load-bearing premise

Users perceive the LLM-generated personas as authentic in-group members who share their primary concern.

What would settle it

If users in a study rate the in-group persona agent no higher on rapport when the persona details are randomized rather than matched to their concern, the matching mechanism would be falsified.

Figures

Figures reproduced from arXiv: 2606.18256 by Inseo Jung, Jinkyu Kim, Jungbeom Lee, Minwoo Kang, Suhong Moon, Yoonseok Oh.

**Figure 1.** Figure 1: Dynamic Persona Generation for Rapport in Human-AI Dialogue. Comparison of three LLM agents responding to the same user utterance expressing life concern. No Persona, the chatbot LLM without additional persona conditioning, gives neutral, generic guidance; No Persona + Self-Disclosure adds a brief, generic self-disclosure that is not tailored to the user’s elicited context; finally, In-group Persona Agent … view at source ↗

**Figure 2.** Figure 2: Overview of the proposed In-Group Persona Agent. (a) Overall interaction flow: a pre-chat dialogue elicitation with a sufficiency check collects the user’s concern; the user then selects one of five candidate personas; the main conversation proceeds with the selected persona (see Section 3). (b) Persona generation process: traits are extracted from the pre-chat dialogue and partitioned into collected vs. n… view at source ↗

**Figure 3.** Figure 3: Group comparison of Rapport and UX items. Bars display mean ± 95% CI for visualization, although all statistical tests were performed on medians due to non-normal distributions (Shapiro–Wilk; see Appendix B). Rank-based one-sided Mann–Whitney contrasts tested the pair-wise ordered hypothesis NoP < NoPs < IPA. Bar colors denote the three conditions: NoP, NoPs, and IPA. Ours: In-group Persona Agent (IPA) Par… view at source ↗

**Figure 4.** Figure 4: Rubric Score comparison between Matched and Not Matched. Bar colors denote the condition Not Matched and Matched [PITH_FULL_IMAGE:figures/full_fig_p015_4.png] view at source ↗

**Figure 5.** Figure 5: First page of survey. Participants were distinguished by their Participant ID (PID), which ensured [PITH_FULL_IMAGE:figures/full_fig_p025_5.png] view at source ↗

**Figure 6.** Figure 6: Consent form page. Participants could download this form as a PDF. [PITH_FULL_IMAGE:figures/full_fig_p026_6.png] view at source ↗

**Figure 7.** Figure 7: Left: Initial state of Pre-chat stage. Middle: Example of an insufficient dialogue for sufficiency check. Right: Example of a dialogue with sufficient information to end the chat. 26 [PITH_FULL_IMAGE:figures/full_fig_p026_7.png] view at source ↗

**Figure 8.** Figure 8: Participants could select one of five personas to chat with(not shown to groups NoP and NoPs). [PITH_FULL_IMAGE:figures/full_fig_p027_8.png] view at source ↗

**Figure 9.** Figure 9: Post-chat page. Participants could chat with the agent. [PITH_FULL_IMAGE:figures/full_fig_p027_9.png] view at source ↗

**Figure 10.** Figure 10: Questionnaire page. Participants could respond the questionnaire about the experience with the agent. [PITH_FULL_IMAGE:figures/full_fig_p028_10.png] view at source ↗

read the original abstract

LLM-based chatbots are increasingly applied in interpersonal domains such as counseling and peer support, where establishing human-AI rapport is crucial yet remains challenging. In this work, we introduce a novel approach for conditioning LLMs with in-group personas, which (i) first identifies a user's primary concern and brief personal context (e.g., a computer science undergraduate worried about future career prospects), and (ii) generates a synthetic in-group persona that shares a similar primary concern while differing in background and narrative details, such as age or profession (e.g., a junior researcher at an AI startup). Furthermore, we conduct a human-subject study to systematically evaluate the effectiveness of in-group persona agents in enhancing human-AI rapport. We compare our approach against two baseline conditions: a conventional agent without persona conditioning and an agent exhibiting minimal self-disclosure (e.g., "I've felt that too"). Results from post-task questionnaires assessing rapport and user experience indicate that the in-group persona agent significantly improves perceived rapport and personal relevance compared to the baselines, and also yields more positive user experience-most notably higher engagement.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The in-group persona generation method is a straightforward idea but the study provides no sample size, stats, or methods to back the rapport claims.

read the letter

The paper's main point is a two-step process for LLMs: pull the user's primary concern from their message, then generate a persona that matches on that concern but changes details like age or job. They compare this in-group version to a plain agent and one with minimal self-disclosure, claiming better rapport and engagement on questionnaires.

The generation step itself is concrete and easy to follow. Matching only on the core issue while varying the rest is a sensible way to create perceived similarity without obvious copying. That targets a real need in counseling-style chatbots.

The problem is the study evidence. The abstract states statistically significant gains but gives zero information on participant count, recruitment, scales, randomization, conversation controls, or any tests. Without those basics the result cannot be assessed.

This would mainly interest people building conversational agents for support or mental health applications. A reader looking for practical persona techniques might try the pipeline even if the evaluation stays thin.

The work deserves peer review. Referees can request the full methods and data, check for demand effects, and see whether the differences hold up. The idea is worth testing properly.

Referee Report

3 major / 2 minor

Summary. The paper introduces a dynamic in-group persona generation technique for LLMs that first extracts a user's primary concern and then synthesizes a persona sharing that concern but differing in background details; it evaluates the approach via a human-subject study comparing the in-group condition against a no-persona baseline and a minimal self-disclosure baseline, claiming statistically significant gains in rapport, personal relevance, and engagement from post-task questionnaires.

Significance. If the empirical results can be substantiated with standard methodological reporting, the work would offer a concrete, implementable technique for improving rapport in counseling-style chatbots. The dynamic, concern-driven persona synthesis is a clear contribution over static personas, and the study design (three-condition comparison) is appropriate in principle for isolating the in-group effect.

major comments (3)

[Abstract and Human-Subject Study section] The abstract and results description claim that the in-group persona agent 'significantly improves perceived rapport' and yields 'more positive user experience,' yet supply no sample size, recruitment method, randomization procedure, exact questionnaire items or scales, statistical tests (e.g., ANOVA or t-tests), p-values, effect sizes, or exclusion criteria. This evidentiary gap is load-bearing for the central claim.
[Evaluation / Results] The evaluation does not address potential confounds such as demand characteristics, differences in response length or verbosity across conditions, or whether participants perceived the synthetic personas as authentic in-group members rather than artificial constructs; without these checks the reported gains cannot be attributed to the in-group mechanism.
[Method / Human-Subject Study] The paper provides no information on conversation length controls, task instructions given to participants, or how the primary concern was elicited and used to generate personas, all of which are required to interpret the questionnaire outcomes.

minor comments (2)

[Abstract] The abstract uses the phrase 'most notably higher engagement' without defining the engagement metric or reporting its statistical result.
[Approach] Notation for the persona generation pipeline (e.g., how the primary concern is formalized before LLM prompting) is introduced informally and would benefit from a short pseudocode or diagram.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments on our manuscript. We agree that the current reporting of the human-subject study is insufficient to substantiate the central claims and will undertake a major revision to address all methodological gaps identified.

read point-by-point responses

Referee: [Abstract and Human-Subject Study section] The abstract and results description claim that the in-group persona agent 'significantly improves perceived rapport' and yields 'more positive user experience,' yet supply no sample size, recruitment method, randomization procedure, exact questionnaire items or scales, statistical tests (e.g., ANOVA or t-tests), p-values, effect sizes, or exclusion criteria. This evidentiary gap is load-bearing for the central claim.

Authors: We acknowledge the evidentiary gap in reporting. The revised manuscript will expand both the abstract and the Human-Subject Study section to include the sample size, recruitment method, randomization procedure, exact questionnaire items and scales, statistical tests performed, p-values, effect sizes, and exclusion criteria. revision: yes
Referee: [Evaluation / Results] The evaluation does not address potential confounds such as demand characteristics, differences in response length or verbosity across conditions, or whether participants perceived the synthetic personas as authentic in-group members rather than artificial constructs; without these checks the reported gains cannot be attributed to the in-group mechanism.

Authors: We agree these confounds require explicit treatment. The revision will add a dedicated subsection discussing demand characteristics (including any debriefing measures), comparisons of response length and verbosity across conditions, and participant feedback on the perceived authenticity of the generated personas. revision: yes
Referee: [Method / Human-Subject Study] The paper provides no information on conversation length controls, task instructions given to participants, or how the primary concern was elicited and used to generate personas, all of which are required to interpret the questionnaire outcomes.

Authors: We will revise the Method section to provide the requested details on conversation length controls, the verbatim task instructions presented to participants, and the precise procedure used to elicit the primary concern and incorporate it into persona generation. revision: yes

Circularity Check

0 steps flagged

No significant circularity: empirical human-subject study with no derivations or fitted predictions

full rationale

The manuscript presents a method for generating synthetic in-group personas from user concerns and evaluates it through a human-subject study comparing questionnaire outcomes against two baselines. No equations, parameters, or derivation chains appear in the provided text or abstract. Claims rest on empirical questionnaire results rather than any self-referential fitting, self-citation load-bearing uniqueness theorems, or ansatzes smuggled via prior work. The reader's assessment of circularity score 0.0 is consistent with the absence of any load-bearing steps that reduce to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim depends on the untested domain assumption that LLM-generated personas sharing only a primary concern will be experienced as in-group by users. No free parameters are described. The synthetic persona itself is an invented construct without independent evidence of effectiveness outside this study.

axioms (1)

domain assumption Sharing a primary concern while differing in background details produces an effective in-group perception that increases rapport with an LLM agent.
This premise underpins the entire persona-generation approach and the interpretation of the study results.

invented entities (1)

Synthetic in-group persona no independent evidence
purpose: To condition the LLM so that it appears to share the user's primary concern while differing in other details.
Newly introduced conditioning mechanism with no external validation or falsifiable prediction provided in the abstract.

pith-pipeline@v0.9.1-grok · 5734 in / 1245 out tokens · 32802 ms · 2026-07-01T00:28:54.177694+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

13 extracted references · 4 canonical work pages · 1 internal anchor

[1]

Naif Alawi, Triparna De Vreede, and Gert-Jan De Vreede

Ai-powered recommendations: the roles of perceived similarity and psychological dis- tance on persuasion.International Journal of Advertising, 40(8):1366–1384. Naif Alawi, Triparna De Vreede, and Gert-Jan De Vreede. 2023. Accepting the familiar: the effect of perceived similarity with ai agents on intention to use and the mediating effect of it identity. ...

2023
[2]

Godfrey T Barrett-Lennard

Rapport-driven virtual agent: Rapport building dialogue strategy for improving user experience at first meeting.arXiv preprint arXiv:2406.09839. Godfrey T Barrett-Lennard. 1981. The empathy cycle: Refinement of a nuclear concept.Journal of counseling psychology, 28(2):91. Donn Erwin Byrne. 1972. The attraction paradigm. Behavior Therapy, 3(2):337–338. Rob...

work page arXiv 1981
[3]

GPT-4o System Card

Psychological, relational, and emotional effects of self-disclosure after conversations with a chatbot.Journal of Communication, 68(4):712– 733. Sture Holm. 1979. A simple sequentially rejective multiple test procedure.Scandinavian journal of statistics, pages 65–70. Aaron Hurst, Adam Lerer, Adam P Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Os...

work page internal anchor Pith review Pith/arXiv arXiv 1979
[4]

arXiv preprint arXiv:2407.18416 , year=

Exploring relationship development with social chatbots: A mixed-method study of replika.Computers in Human Behavior, 140:107600. Melanie Randle, Richard Eckersley, and Leonie Miller. 2017. Societal and personal concerns, their associations with stress, and the implica- tions for progress and the future.Futures, 93:68– 79. Byron Reeves and Clifford Nass. ...

work page arXiv 2017
[5]

arXiv preprint arXiv:2406.01171 , year=

“it happened to be the perfect thing”: ex- periences of generative ai chatbots for mental health.npj Mental Health Research, 3(1):48. Teunis J Terpstra. 1952. The asymptotic normality and consistency of kendall’s test against trend, when ties are present in one ranking.Indaga- tiones Mathematicae, 14(3):327–333. Linda Tickle-Degnen and Robert Rosenthal. 1...

work page arXiv 1952
[6]

Educational background: The user has a degree in computer science
[7]

Emotional context: The user feels scared and behind compared to peers
[8]

Observed in pre-chat:

Professional interest: The user is interested in AI research. Observed in pre-chat:
[9]

Educational background: The user studied computer science
[10]

Emotional context: The user feels scared and behind
[11]

collected information

Professional interest: The user is interested in AI research. Reason: The pre-chat provides sufficient context with three relevant and distinct elements: the user’s educational background, emotional context, and professional interest. These elements are crucial for understanding the user’s concern about career direction and feelings of being behind peers....

2024
[12]

Self-disclosure depth (0–3)
[13]

I understand

Empathy level (0–3) Definitions (apply to BOTH User and Assistant, but roles differ): SELF-DISCLOSURE DEPTH (0–3): Level 0 – No self-disclosure - The speaker does not talk about themselves at all. - No personal facts, no personal experiences, no personal feelings. Level 1 – Low / Peripheral self-disclosure - Basic, surface-level facts about the speaker: r...

2023

[1] [1]

Naif Alawi, Triparna De Vreede, and Gert-Jan De Vreede

Ai-powered recommendations: the roles of perceived similarity and psychological dis- tance on persuasion.International Journal of Advertising, 40(8):1366–1384. Naif Alawi, Triparna De Vreede, and Gert-Jan De Vreede. 2023. Accepting the familiar: the effect of perceived similarity with ai agents on intention to use and the mediating effect of it identity. ...

2023

[2] [2]

Godfrey T Barrett-Lennard

Rapport-driven virtual agent: Rapport building dialogue strategy for improving user experience at first meeting.arXiv preprint arXiv:2406.09839. Godfrey T Barrett-Lennard. 1981. The empathy cycle: Refinement of a nuclear concept.Journal of counseling psychology, 28(2):91. Donn Erwin Byrne. 1972. The attraction paradigm. Behavior Therapy, 3(2):337–338. Rob...

work page arXiv 1981

[3] [3]

GPT-4o System Card

Psychological, relational, and emotional effects of self-disclosure after conversations with a chatbot.Journal of Communication, 68(4):712– 733. Sture Holm. 1979. A simple sequentially rejective multiple test procedure.Scandinavian journal of statistics, pages 65–70. Aaron Hurst, Adam Lerer, Adam P Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Os...

work page internal anchor Pith review Pith/arXiv arXiv 1979

[4] [4]

arXiv preprint arXiv:2407.18416 , year=

Exploring relationship development with social chatbots: A mixed-method study of replika.Computers in Human Behavior, 140:107600. Melanie Randle, Richard Eckersley, and Leonie Miller. 2017. Societal and personal concerns, their associations with stress, and the implica- tions for progress and the future.Futures, 93:68– 79. Byron Reeves and Clifford Nass. ...

work page arXiv 2017

[5] [5]

arXiv preprint arXiv:2406.01171 , year=

“it happened to be the perfect thing”: ex- periences of generative ai chatbots for mental health.npj Mental Health Research, 3(1):48. Teunis J Terpstra. 1952. The asymptotic normality and consistency of kendall’s test against trend, when ties are present in one ranking.Indaga- tiones Mathematicae, 14(3):327–333. Linda Tickle-Degnen and Robert Rosenthal. 1...

work page arXiv 1952

[6] [6]

Educational background: The user has a degree in computer science

[7] [7]

Emotional context: The user feels scared and behind compared to peers

[8] [8]

Observed in pre-chat:

Professional interest: The user is interested in AI research. Observed in pre-chat:

[9] [9]

Educational background: The user studied computer science

[10] [10]

Emotional context: The user feels scared and behind

[11] [11]

collected information

Professional interest: The user is interested in AI research. Reason: The pre-chat provides sufficient context with three relevant and distinct elements: the user’s educational background, emotional context, and professional interest. These elements are crucial for understanding the user’s concern about career direction and feelings of being behind peers....

2024

[12] [12]

Self-disclosure depth (0–3)

[13] [13]

I understand

Empathy level (0–3) Definitions (apply to BOTH User and Assistant, but roles differ): SELF-DISCLOSURE DEPTH (0–3): Level 0 – No self-disclosure - The speaker does not talk about themselves at all. - No personal facts, no personal experiences, no personal feelings. Level 1 – Low / Peripheral self-disclosure - Basic, surface-level facts about the speaker: r...

2023