PEMANT: Persona-Enriched Multi-Agent Negotiation for Travel

Chia-yu Wu; Mustafa Sameen; Xilei Zhao; Yaotian Zhang; Yuran Sun

arxiv: 2604.10475 · v1 · submitted 2026-04-12 · 💻 cs.AI

PEMANT: Persona-Enriched Multi-Agent Negotiation for Travel

Yuran Sun , Mustafa Sameen , Yaotian Zhang , Chia-yu Wu , Xilei Zhao This is my paper

Pith reviewed 2026-05-10 16:16 UTC · model grok-4.3

classification 💻 cs.AI

keywords travel demand modelingmulti-agent LLMsbehavioral theoryhousehold decision makingpersona modelingtrip generationnegotiation simulation

0 comments

The pith

PEMANT turns household demographics into narrative personas and simulates multi-agent negotiations to predict trip generation more accurately.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Household trip generation drives demand forecasting, traffic estimation, and urban planning. Classical machine learning models had limited predictive power, and recent LLM methods omitted behavioral theory and intra-household dynamics. PEMANT first converts sociodemographic attributes into coherent narrative profiles via the Household-Aware Chain-of-Planned-Behavior framework, explicitly encoding attitudes, subjective norms, and perceived controls. It then runs these personas through a structured two-phase multi-agent conversation with a persona-alignment control to model real negotiation. The result is consistent outperformance on both national and regional household travel survey datasets.

Core claim

PEMANT integrates behavioral theory for individualized persona modeling by transforming static sociodemographic attributes into coherent narrative profiles that explicitly encode household-level attitudes, subjective norms, and perceived behavioral controls following the Household-Aware Chain-of-Planned-Behavior framework. Building on these theory-grounded personas, PEMANT captures real-world household decision negotiation via a structured two-phase multi-agent conversation framework with a novel persona-alignment control mechanism, and this approach outperforms state-of-the-art benchmarks across datasets.

What carries the argument

The HA-CoPB framework that creates narrative personas from sociodemographics combined with the two-phase multi-agent LLM conversation structure and persona-alignment control that simulates household negotiations.

If this is right

Higher accuracy in household-level trip generation forecasts for transportation planning.
Explicit inclusion of attitudes, norms, and perceived controls in collective travel decisions.
More reliable inputs for traffic flow estimation and urban system models.
A scalable template for simulating other intra-group decisions using the same persona-negotiation structure.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same persona-plus-negotiation pattern could be tested on collective choices outside travel, such as household energy use or spending decisions.
Validation could involve side-by-side comparison of model outputs with video or diary records of real families negotiating trips.
Dynamic updates to personas from new preference data might extend the framework to evolving household situations over time.

Load-bearing premise

Narrative personas derived from sociodemographic attributes via HA-CoPB and structured multi-agent LLM conversations accurately capture real intra-household negotiation dynamics and behavioral controls.

What would settle it

A dataset or experiment that records actual observed household negotiation processes and trip choices, then directly compares PEMANT predictions against those recorded outcomes for mismatch.

Figures

Figures reproduced from arXiv: 2604.10475 by Chia-yu Wu, Mustafa Sameen, Xilei Zhao, Yaotian Zhang, Yuran Sun.

**Figure 1.** Figure 1: Methodological Framework for PEMANT (Persona-Enriched Multi-Agent Negotiation for Travel). LLM-based agents can improve their collective performance (Zhang et al., 2024a; Yan et al., 2025; Wu et al., 2024). A common design paradigm assigns explicit roles or personas to agents, which enables them to collaboratively solve tasks through structured communication (Abbasiantaeb et al., 2024; Wu et al., 2024; Li … view at source ↗

**Figure 2.** Figure 2: Structural Correlation Alignment. Diagonal split heatmap comparing demographic-opinion correlations. The upperleft triangle represents Human Ground Truth (NHTS), while the lower-right triangle shows Synthesized Personas. The high alignment score (ρ = 0.699) indicates strong preservation of latent sociological structure. mance on objective traits. For Health Status, the personas achieve 85.0% Soft Accura… view at source ↗

**Figure 3.** Figure 3: Detailed Distributional Comparison. Comparison of synthesized persona response distributions (Green) vs. Human Ground Truth (Black). Mathematical Formulations of Metrics Accuracy within a Tolerance (±1 Scale Point) We adapt the tolerance-based accuracy metric defined previously (Eq. 12) to the context of Likert scales. A prediction is considered accurate if |yi,v − yˆi,v| ≤ 1. This ±1 tolerance accounts fo… view at source ↗

read the original abstract

Modeling household-level trip generation is fundamental to accurate demand forecasting, traffic flow estimation, and urban system planning. Existing studies were mostly based on classical machine learning models with limited predictive capability, while recent LLM-based approaches have yet to incorporate behavioral theory or intra-household interaction dynamics, both of which are critical for modeling realistic collective travel decisions. To address these limitations, we propose a novel LLM-based framework, named Persona-Enriched Multi-Agent Negotiation for Travel (PEMANT), which first integrates behavioral theory for individualized persona modeling and then conducts household-level trip planning negotiations via a structured multi-agent conversation. Specifically, PEMANT transforms static sociodemographic attributes into coherent narrative profiles that explicitly encode household-level attitudes, subjective norms, and perceived behavioral controls, following our proposed Household-Aware Chain-of-Planned-Behavior (HA-CoPB) framework. Building on these theory-grounded personas, PEMANT captures real-world household decision negotiation via a structured two-phase multi-agent conversation framework with a novel persona-alignment control mechanism. Evaluated on both national and regional household travel survey datasets, PEMANT consistently outperforms state-of-the-art benchmarks across datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper proposes PEMANT, an LLM-based framework for household-level trip generation modeling. It first applies the Household-Aware Chain-of-Planned-Behavior (HA-CoPB) to convert static sociodemographic attributes into narrative personas that encode attitudes, subjective norms, and perceived behavioral controls. These personas then drive a structured two-phase multi-agent conversation with a persona-alignment control mechanism to simulate intra-household negotiation over trip planning. The manuscript claims that PEMANT consistently outperforms state-of-the-art benchmarks on both national and regional household travel survey datasets.

Significance. If the claimed performance gains are reproducible and the personas faithfully reflect real behavioral dynamics, PEMANT would represent a meaningful step toward theory-grounded LLM applications in transportation demand modeling. It directly addresses gaps in classical ML (limited behavioral fidelity) and prior LLM work (lack of intra-household interaction) by integrating established behavioral constructs with multi-agent negotiation. The explicit use of HA-CoPB and the alignment control mechanism are concrete contributions that could improve realism in collective decision forecasting for urban planning.

major comments (3)

[Abstract] Abstract: the central claim of consistent outperformance across national and regional datasets is stated without any quantitative metrics, baseline specifications, error bars, statistical tests, or evaluation protocol details. This absence prevents assessment of effect sizes or whether gains are robust.
[Method (HA-CoPB and multi-agent negotiation)] Method section describing HA-CoPB and the multi-agent framework: the performance advantage is asserted to arise from theory-grounded personas and structured negotiation, yet no external validation is provided (e.g., alignment of generated personas or dialogues with observed household decision logs, expert ratings of realism, or ablation isolating HA-CoPB versus generic prompting). Without such checks, superior end-task accuracy could result from LLM priors or prompt design rather than the proposed behavioral components.
[Evaluation] Evaluation section: the manuscript reports results on household travel surveys but supplies no details on how trip-generation targets are defined, how household-level aggregation is performed, or whether the persona-alignment control mechanism measurably improves negotiation fidelity over uncontrolled multi-agent baselines.

minor comments (2)

[Abstract] Abstract: the acronym HA-CoPB is introduced without immediate parenthetical expansion, which may hinder immediate comprehension for readers outside behavioral modeling.
[Method] Notation: the distinction between 'persona-alignment control mechanism' and standard LLM prompting constraints is not clearly delineated in the high-level description, making it hard to isolate its contribution.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive review. We appreciate the recognition that PEMANT addresses important gaps in theory-grounded LLM applications for transportation modeling. Below we respond point-by-point to the major comments. We will revise the manuscript to incorporate additional quantitative details, clarifications, and supporting analyses as outlined.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim of consistent outperformance across national and regional datasets is stated without any quantitative metrics, baseline specifications, error bars, statistical tests, or evaluation protocol details. This absence prevents assessment of effect sizes or whether gains are robust.

Authors: We agree that the abstract would be strengthened by including key quantitative results. In the revised manuscript we will add concise performance metrics (e.g., relative improvements in MAE or accuracy over the strongest baselines on both the national and regional datasets), a brief statement of the evaluation protocol, and a note that full tables with error bars and statistical significance tests appear in Section 4 and the supplementary material. Space constraints preclude listing every baseline and test in the abstract itself, but the central effect sizes will be reported. revision: yes
Referee: [Method (HA-CoPB and multi-agent negotiation)] Method section describing HA-CoPB and the multi-agent framework: the performance advantage is asserted to arise from theory-grounded personas and structured negotiation, yet no external validation is provided (e.g., alignment of generated personas or dialogues with observed household decision logs, expert ratings of realism, or ablation isolating HA-CoPB versus generic prompting). Without such checks, superior end-task accuracy could result from LLM priors or prompt design rather than the proposed behavioral components.

Authors: We acknowledge that direct external validation of the personas and negotiation dialogues would further isolate the contribution of HA-CoPB. The primary evidence we provide is the consistent downstream improvement in household trip-generation accuracy across two independent datasets, which would be unlikely if the personas were merely generic LLM outputs. Nevertheless, we will add an ablation study in the revision that replaces HA-CoPB personas with generic prompting while keeping the multi-agent structure fixed. We will also include qualitative examples of generated personas and dialogues. Full alignment with granular household decision logs is not feasible because standard travel surveys do not record intra-household negotiation traces; we will explicitly discuss this data limitation. revision: partial
Referee: [Evaluation] Evaluation section: the manuscript reports results on household travel surveys but supplies no details on how trip-generation targets are defined, how household-level aggregation is performed, or whether the persona-alignment control mechanism measurably improves negotiation fidelity over uncontrolled multi-agent baselines.

Authors: We will expand the evaluation section to provide these missing details. Specifically, we will define the trip-generation targets (daily trips by purpose and mode for each household member), describe the aggregation procedure from individual agent outputs to household-level predictions, and add a controlled comparison that isolates the persona-alignment mechanism against an otherwise identical multi-agent baseline without the alignment control. These elements were summarized in the supplementary material; they will be moved into the main text with accompanying quantitative results. revision: yes

Circularity Check

0 steps flagged

No significant circularity; claims rest on external empirical evaluation

full rationale

The paper introduces PEMANT by first defining a new HA-CoPB framework to convert sociodemographic inputs into narrative personas encoding attitudes/norms/controls, then applies structured multi-agent LLM dialogues for household trip planning, and finally reports accuracy on independent national/regional household travel survey datasets against external benchmarks. No quoted equations, fitted parameters, or self-citation chains reduce the reported outperformance to the inputs by construction; the central results are falsifiable predictions on held-out survey data rather than tautological renamings or self-referential definitions. The framework draws on standard external behavioral theory (Theory of Planned Behavior) without smuggling ansatzes or uniqueness theorems from prior author work.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 2 invented entities

Review limited to abstract; ledger entries reflect components explicitly named but without implementation details or independent validation.

axioms (2)

domain assumption Behavioral theory (attitudes, subjective norms, perceived behavioral control) can be reliably encoded from sociodemographic attributes into narrative personas
Basis for HA-CoPB framework
domain assumption Structured multi-agent LLM conversations with persona-alignment control can simulate realistic household trip-planning negotiations
Core mechanism of PEMANT

invented entities (2)

Household-Aware Chain-of-Planned-Behavior (HA-CoPB) no independent evidence
purpose: Transform static attributes into coherent narrative profiles encoding household attitudes, norms, and controls
Proposed framework for persona creation
persona-alignment control mechanism no independent evidence
purpose: Maintain consistency across agents during multi-agent negotiation
Novel control introduced for the conversation framework

pith-pipeline@v0.9.0 · 5501 in / 1361 out tokens · 76635 ms · 2026-05-10T16:16:05.618777+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

15 extracted references · 15 canonical work pages

[1]

Brick, J

1976. Brick, J. M. Unit nonresponse and weighting adjustments: A critical review.Journal of Official Statistics, 29(3):329– 353, 2013. doi: 10.2478/jos-2013-0026. URL https: //doi.org/10.2478/jos-2013-0026. Chen, R., Wang, C., Sun, Y ., Zhao, X., and Xu, S. From perceptions to decisions: Wildfire evacuation decision prediction with behavioral theory-infor...

work page doi:10.2478/jos-2013-0026 1976
[2]

URL https://www.sciencedirect.com/ science/article/pii/S0965856424003021

doi: https://doi.org/10.1016/j.tra.2024.104254. URL https://www.sciencedirect.com/ science/article/pii/S0965856424003021. Naseralavi, S., Soltanirad, M., Ranjbar, E., Lucero, M., Gorzin, F., Hakiminejad, Y ., Azimi, S., Baghersad, M., and Mazaheri, A. Machine learning modeling of house- hold trip generation by state using nhts data.Urban Science, 9(9):353...

work page doi:10.1016/j.tra.2024.104254 2024
[3]

Judgment under Uncertainty: Heuristics and Biases,

URL https://www.science.org/doi/ abs/10.1126/science.185.4157.1124. Wang, J., Jiang, R., Yang, C., Wu, Z., Onizuka, M., Shibasaki, R., Koshizuka, N., and Xiao, C. Large lan- guage models as urban residents: An llm agent frame- work for personal mobility generation.Advances in Neu- ral Information Processing Systems, 37:124547–124574, 2024a. Wang, Y . and ...

work page doi:10.1126/science.185.4157.1124 2007
[4]

Behavioral Tendency

Attitude (Individual Utility): - Does this agent view travel as a utility (work/school) or a burden? - How does the "Behavioral Tendency" (e.g., Tech-Savvy) shift their travel demand?

work page
[5]

Subjective Norms (Household Obligations): - What role-based obligations exist (e.g., "Parent"→Escort trips)? - Are there coordinated trips required by other members?

work page
[6]

Phase 3: Comparison Baselines A.3.1

Perceived Behavioral Control (Resource Competition): -Crucial: Is the shared vehicle actually available, or is it claimed by another driver? - Do financial or location constraints (e.g., Rural) limit their autonomy? [OUTPUT] Rationale:<Step-by-step TPB analysis resolving the conflicts above> Final Answer:<integer> A.3. Phase 3: Comparison Baselines A.3.1....

work page
[7]

Behavioral Tendencies

Aggregated Needs (Attitude): - Sum the mandatory trips (work/school) for all members. - Estimate discretionary trips based on the "Behavioral Tendencies" listed above

work page
[8]

Does this limit the total volume? -Coordination: If a parent drives a child, count it as 2 Person-Trips

Shared Constraints (PBC): -Bottleneck Check: You have{veh count}vehicles for{driver count}drivers. Does this limit the total volume? -Coordination: If a parent drives a child, count it as 2 Person-Trips

work page
[9]

- Adjust based on the net balance of Needs vs

Synthesis: - Start with the Anchor ({anchor}). - Adjust based on the net balance of Needs vs. Constraints. [OUTPUT] Respond STRICTLY in this format: Needs Analysis:<Who needs to travel?> Constraint Logic:<How do vehicle limits reduce the total?> Final Answer:<integer> B. Behavioral Theory Implementation B.1. Behavioral Anchor Logic To calibrate the LLM’s ...

work page 2017
[10]

• Subjective Norm (SN ): Redefined from abstract social pressure to concrete household role obligations (e.g., ”I must drive my child to school”)

Construct Definitions • Attitude (A): We utilize the Attitudinal Imputation strategy (Mokhtarian, 2024) to map marker variables (e.g., Age) to specific psychometric statements (e.g., ”Tech-Savvy”). • Subjective Norm (SN ): Redefined from abstract social pressure to concrete household role obligations (e.g., ”I must drive my child to school”). • Perceived ...

work page 2024
[11]

loves driving

Priority Rules • Constraint Precedence (PBC ≻ A ): Resource constraints strictly override attitudinal preferences. For example, an agent who “loves driving” (A) but lacks access to a vehicle (PBC) is forced to generate zero drive trips. 17 PEMANT: Persona-Enriched Multi-Agent Negotiation for Travel Table 6.Attitudinal Constructs and Psychometric Statement...

work page
[12]

Household-level removal:If any household member contained invalid or missing values for critical variables, the entire household was excluded to maintain consistency in household-level aggregation

work page
[13]

For example, negative codes for rideshare usage were treated as no rideshare use

Rule-based recoding:Selected missing or negative-coded responses were recoded using domain-informed rules. For example, negative codes for rideshare usage were treated as no rideshare use

work page
[14]

I prefer to live in a community with mixed land uses (homes, shops, work) so I can walk to places

Median/mode imputation:For remaining variables with missing values, numeric features were imputed using the median and categorical features were imputed using the mode to reduce data loss while preserving overall distributions. D.3. Variable Definitions Table 7 lists the variables used in the experiments. Variables marked with “*” are included in both the...

work page 2017
[15]

A prediction is considered accurate if |yi,v −ˆyi,v| ≤1

to the context of Likert scales. A prediction is considered accurate if |yi,v −ˆyi,v| ≤1 . This ±1 tolerance accounts for the inherent subjectivity of self-reported attitudes (e.g., the subtle distinction between ”Agree” and ”Strongly Agree”), capturing whether the persona’s sentiment is directionally correct. Quadratic Weighted Kappa (QWK)To measure indi...

work page 1968

[1] [1]

Brick, J

1976. Brick, J. M. Unit nonresponse and weighting adjustments: A critical review.Journal of Official Statistics, 29(3):329– 353, 2013. doi: 10.2478/jos-2013-0026. URL https: //doi.org/10.2478/jos-2013-0026. Chen, R., Wang, C., Sun, Y ., Zhao, X., and Xu, S. From perceptions to decisions: Wildfire evacuation decision prediction with behavioral theory-infor...

work page doi:10.2478/jos-2013-0026 1976

[2] [2]

URL https://www.sciencedirect.com/ science/article/pii/S0965856424003021

doi: https://doi.org/10.1016/j.tra.2024.104254. URL https://www.sciencedirect.com/ science/article/pii/S0965856424003021. Naseralavi, S., Soltanirad, M., Ranjbar, E., Lucero, M., Gorzin, F., Hakiminejad, Y ., Azimi, S., Baghersad, M., and Mazaheri, A. Machine learning modeling of house- hold trip generation by state using nhts data.Urban Science, 9(9):353...

work page doi:10.1016/j.tra.2024.104254 2024

[3] [3]

Judgment under Uncertainty: Heuristics and Biases,

URL https://www.science.org/doi/ abs/10.1126/science.185.4157.1124. Wang, J., Jiang, R., Yang, C., Wu, Z., Onizuka, M., Shibasaki, R., Koshizuka, N., and Xiao, C. Large lan- guage models as urban residents: An llm agent frame- work for personal mobility generation.Advances in Neu- ral Information Processing Systems, 37:124547–124574, 2024a. Wang, Y . and ...

work page doi:10.1126/science.185.4157.1124 2007

[4] [4]

Behavioral Tendency

Attitude (Individual Utility): - Does this agent view travel as a utility (work/school) or a burden? - How does the "Behavioral Tendency" (e.g., Tech-Savvy) shift their travel demand?

work page

[5] [5]

Subjective Norms (Household Obligations): - What role-based obligations exist (e.g., "Parent"→Escort trips)? - Are there coordinated trips required by other members?

work page

[6] [6]

Phase 3: Comparison Baselines A.3.1

Perceived Behavioral Control (Resource Competition): -Crucial: Is the shared vehicle actually available, or is it claimed by another driver? - Do financial or location constraints (e.g., Rural) limit their autonomy? [OUTPUT] Rationale:<Step-by-step TPB analysis resolving the conflicts above> Final Answer:<integer> A.3. Phase 3: Comparison Baselines A.3.1....

work page

[7] [7]

Behavioral Tendencies

Aggregated Needs (Attitude): - Sum the mandatory trips (work/school) for all members. - Estimate discretionary trips based on the "Behavioral Tendencies" listed above

work page

[8] [8]

Does this limit the total volume? -Coordination: If a parent drives a child, count it as 2 Person-Trips

Shared Constraints (PBC): -Bottleneck Check: You have{veh count}vehicles for{driver count}drivers. Does this limit the total volume? -Coordination: If a parent drives a child, count it as 2 Person-Trips

work page

[9] [9]

- Adjust based on the net balance of Needs vs

Synthesis: - Start with the Anchor ({anchor}). - Adjust based on the net balance of Needs vs. Constraints. [OUTPUT] Respond STRICTLY in this format: Needs Analysis:<Who needs to travel?> Constraint Logic:<How do vehicle limits reduce the total?> Final Answer:<integer> B. Behavioral Theory Implementation B.1. Behavioral Anchor Logic To calibrate the LLM’s ...

work page 2017

[10] [10]

• Subjective Norm (SN ): Redefined from abstract social pressure to concrete household role obligations (e.g., ”I must drive my child to school”)

Construct Definitions • Attitude (A): We utilize the Attitudinal Imputation strategy (Mokhtarian, 2024) to map marker variables (e.g., Age) to specific psychometric statements (e.g., ”Tech-Savvy”). • Subjective Norm (SN ): Redefined from abstract social pressure to concrete household role obligations (e.g., ”I must drive my child to school”). • Perceived ...

work page 2024

[11] [11]

loves driving

Priority Rules • Constraint Precedence (PBC ≻ A ): Resource constraints strictly override attitudinal preferences. For example, an agent who “loves driving” (A) but lacks access to a vehicle (PBC) is forced to generate zero drive trips. 17 PEMANT: Persona-Enriched Multi-Agent Negotiation for Travel Table 6.Attitudinal Constructs and Psychometric Statement...

work page

[12] [12]

Household-level removal:If any household member contained invalid or missing values for critical variables, the entire household was excluded to maintain consistency in household-level aggregation

work page

[13] [13]

For example, negative codes for rideshare usage were treated as no rideshare use

Rule-based recoding:Selected missing or negative-coded responses were recoded using domain-informed rules. For example, negative codes for rideshare usage were treated as no rideshare use

work page

[14] [14]

I prefer to live in a community with mixed land uses (homes, shops, work) so I can walk to places

Median/mode imputation:For remaining variables with missing values, numeric features were imputed using the median and categorical features were imputed using the mode to reduce data loss while preserving overall distributions. D.3. Variable Definitions Table 7 lists the variables used in the experiments. Variables marked with “*” are included in both the...

work page 2017

[15] [15]

A prediction is considered accurate if |yi,v −ˆyi,v| ≤1

to the context of Likert scales. A prediction is considered accurate if |yi,v −ˆyi,v| ≤1 . This ±1 tolerance accounts for the inherent subjectivity of self-reported attitudes (e.g., the subtle distinction between ”Agree” and ”Strongly Agree”), capturing whether the persona’s sentiment is directionally correct. Quadratic Weighted Kappa (QWK)To measure indi...

work page 1968