Beyond Isolated Behaviors: Hierarchical User Modeling for LLM Personalization

Liang Wang; Tiannan Wang; Xiaoyou Liu; Xinyi Mou; Yuqing Wang; Zhongyu Wei

arxiv: 2606.02300 · v1 · pith:3NUBRSYUnew · submitted 2026-06-01 · 💻 cs.CL

Pith reviewed 2026-06-28 14:48 UTC · model grok-4.3

classification 💻 cs.CL

keywords LLM personalization · hierarchical user modeling · Practice-Habitus-Field · Bourdieu theory · LaMP benchmark · behavioral structures · model-agnostic methods

The pith

Hierarchical structures from practices, habitus, and fields improve LLM personalization over flat behavior aggregation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes that user behaviors with LLMs should be organized into three levels drawn from Bourdieu's theory rather than treated as isolated events. Individual actions become practices, their accumulation over time forms stable dispositions called habitus, and regularities across similar users create fields. This structure is turned into a lightweight system called PHF_Compass that runs on a frozen LLM and produces measurable gains on the LaMP benchmark while making the resulting patterns easier to inspect. A reader would care because it replaces simple history collection with a model that aims to capture how personal tendencies form and persist, potentially yielding more consistent personalization across sessions.

Core claim

PHF reconceptualizes LLM personalization through three hierarchical levels: individual behaviors as practices, their temporal accumulation into stable dispositions as habitus, and shared regularities across similar users as fields. Instantiated via PHF_Compass on a frozen LLM, this yields consistent improvements on the LaMP benchmark and validates the interpretability and extensibility of the learned behavioral structures.

What carries the argument

The PHF framework, which maps Bourdieu's Theory of Practice onto user-LLM interaction sequences to produce the three levels of practices, habitus, and fields.

If this is right

Personalization performance improves consistently across diverse tasks on the LaMP benchmark.
The learned behavioral structures become more interpretable through the habitus and field levels.
The approach extends to new tasks and users while remaining model-agnostic.
A frozen LLM suffices for the implementation, avoiding the need for task-specific fine-tuning.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Long-term tracking of habitus evolution could support personalization that adapts as user tendencies change rather than resetting per session.
Grouping users into fields might surface collaborative effects where similar users indirectly improve each other's models.
The same three-level decomposition could be tested on sequential interaction logs from dialogue systems or recommendation platforms.

Load-bearing premise

Bourdieu's Theory of Practice can be directly mapped onto sequences of user-LLM interactions to produce stable, hierarchical behavioral structures that causally improve personalization performance.

What would settle it

If the PHF_Compass implementation produces no consistent gains over standard flat aggregation methods when evaluated on the LaMP benchmark tasks, or if the extracted habitus and field structures show no temporal stability, the central claim would be falsified.
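The temporal-stability half of this test can be made concrete with a split-half check. The sketch below is a minimal illustration, not the paper's protocol: it assumes practices are already embedded as vectors in chronological order, builds a habitus from the early and late halves by plain averaging, and compares them by cosine similarity. The function names and the toy vectors are hypothetical.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / n for i in range(dim)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def split_half_stability(practice_embeddings):
    """Similarity between habitus built from the early and late halves
    of a chronologically ordered history (assumed stability measure)."""
    if len(practice_embeddings) < 2:
        raise ValueError("need at least two practices to split the history")
    mid = len(practice_embeddings) // 2
    early = mean_vector(practice_embeddings[:mid])
    late = mean_vector(practice_embeddings[mid:])
    return cosine(early, late)


# Toy histories: one user with steady behavior, one whose behavior flips.
stable_user = [[1.0, 0.0], [1.0, 0.1], [1.0, 0.0], [0.9, 0.0]]
drifting_user = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
print(split_half_stability(stable_user))
print(split_half_stability(drifting_user))
```

A score near 1 for most users would be consistent with stable dispositions; scores scattered around 0 would count against the habitus level. What threshold counts as "stable" is a judgment the sketch leaves open.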

Figures

Figures reproduced from arXiv: 2606.02300 by Liang Wang, Tiannan Wang, Xiaoyou Liu, Xinyi Mou, Yuqing Wang, Zhongyu Wei.

**Figure 2.** Overview of the PHF framework and the PHF_Compass implementation. (1) Raw behaviors are abstracted into a denoised practice essence via semantic ID extraction. (2) Practices are temporally aggregated into stable habitus representations. (3) Users are organized into latent fields through a group router. (4) Habitus and field embeddings are combined with the query embedding to condition the LLM for personalized gene…
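Step (1) of the Figure 2 caption, semantic ID extraction, can be pictured as a nearest-codebook lookup. The sketch below is a minimal illustration under that assumption; the text available here does not specify the quantizer, and `nearest_code`, `to_semantic_ids`, and the toy codebook are hypothetical.

```python
def nearest_code(vector, codebook):
    """Index of the codebook entry closest to `vector` (squared Euclidean)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)), key=lambda i: sq_dist(vector, codebook[i]))


def to_semantic_ids(behavior_embeddings, codebook):
    """Map each raw behavior embedding to a discrete ID.

    Assumption: behaviors that land on the same ID are treated as the same
    practice, which is one way to read "denoised practice essence".
    """
    return [nearest_code(v, codebook) for v in behavior_embeddings]


# Toy codebook with two entries and three behavior embeddings.
codebook = [[0.0, 0.0], [1.0, 1.0]]
behaviors = [[0.1, -0.1], [0.9, 1.2], [0.4, 0.4]]
print(to_semantic_ids(behaviors, codebook))  # [0, 1, 0]
```

In a learned system the codebook would be trained rather than fixed by hand; the lookup step itself stays the same.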

**Figure 3.** The influence of field granularity.

… that both habitus and field contribute meaningfully to personalization, with habitus playing the dominant role. The effect of implementation choices of PHF_Compass: replacing temporal weighting (w/o Temporal) with uniform averaging mainly affects tasks involving evolving user preferences, suggesting that temporal aggregation provides complementary gains beyond habitus …

**Figure 4.** Impact of codebook configuration. The no…

**Figure 5.** Performance comparison across LaMP-2 to 5.

**Figure 6.** The impact of contrastive learning on the…

Original abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse domains, yet personalizing their outputs to individual users remains an open challenge. Existing approaches predominantly adopt a flat behavioral paradigm, aggregating user behaviors without an explicit account of how they are organized into deeper behavioral structures. In this work, we draw on Pierre Bourdieu's Theory of Practice to propose PHF (Practice-Habitus-Field), a sociologically grounded framework that reconceptualizes LLM personalization through three hierarchical levels: individual behaviors as practices, their temporal accumulation into stable dispositions as habitus, and shared regularities across similar users as fields. We instantiate PHF through $\mathrm{PHF}_{\text{Compass}}$, a lightweight and model-agnostic implementation based on a frozen LLM. Experiments on the Language Model Personalization (LaMP) benchmark demonstrate consistent improvements across diverse tasks, while further analyses validate the interpretability and extensibility of the learned behavioral structures.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note

A private letter to a colleague.

PHF applies Bourdieu's hierarchy to LLM user modeling and reports gains on LaMP, but the abstract gives no ablations isolating the hierarchy from plain structured prompting.

The main takeaway is that this paper maps Bourdieu's Practice-Habitus-Field theory onto LLM personalization, treating single interactions as practices, accumulated patterns as habitus, and cross-user regularities as fields, then implements it in a frozen-LLM system called PHF_Compass that improves on the LaMP benchmark.

The new element is the explicit sociological framing and the named three-level structure. Prior work already does hierarchical or multi-scale user modeling, but this version tries to import the specific concepts of temporal disposition-building and field-level shared norms. The lightweight, model-agnostic design is practical and the claim of improved interpretability is worth checking.

The soft spot is exactly the one in the stress-test note. The abstract says the gains come from instantiating the hierarchy, yet it supplies no ablation that keeps the same LLM and input format while flattening the levels or removing the habitus/field construction steps. Without that isolation, any structured history encoding could explain the result. There are also no numbers, no baseline list, and no description of how the frozen model actually encodes the temporal accumulation or shared fields. The theory-to-implementation link therefore stays untested.

This is for people working on LLM adaptation and user modeling who are open to sociological lenses. A reader who wants concrete evidence that the hierarchy drives performance will come away unsatisfied. The paper deserves a serious referee to examine the full experiments and ablations; the central claim is testable and the framing is coherent enough to review rather than desk-reject.

Referee Report

2 major / 2 minor

Summary. The paper proposes PHF (Practice-Habitus-Field), a hierarchical framework for LLM personalization grounded in Bourdieu's Theory of Practice. Individual user behaviors are modeled as practices, their temporal accumulation as habitus, and shared patterns across users as fields. The framework is instantiated in PHF_Compass, a lightweight implementation using a frozen LLM, and evaluated on the LaMP benchmark where it reports consistent improvements across tasks along with analyses supporting interpretability and extensibility of the learned structures.

Significance. If the hierarchical structures can be shown to drive gains beyond standard user-history prompting, the work would provide a sociologically motivated alternative to flat aggregation methods in personalization, with potential benefits for interpretability. The use of a frozen LLM and model-agnostic design is a practical strength.

major comments (2)

[§4] §4 (Experiments): The reported consistent improvements on LaMP are not accompanied by an ablation that removes the habitus and field levels (e.g., a flat practice-only variant with identical LLM, history encoding, and input format). Without this isolation, the central claim that gains arise specifically from the Bourdieu-derived hierarchy rather than any structured prompting remains unsupported.
[§3] §3 (PHF_Compass): The implementation details do not specify how the habitus level (temporal accumulation into stable dispositions) and field level (shared regularities across users) are explicitly constructed or enforced inside the frozen-LLM pipeline, as opposed to emerging implicitly from prompt formatting. This is load-bearing for validating the three-level hierarchy.
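The isolation asked for in the first major comment can be laid out as a small grid of variants that differ only in which levels reach the frozen LLM. The sketch below is one hypothetical way to organize such a comparison; the variant names, the `conditioning_inputs` helper, and the rule that practices are always passed through are assumptions, not the paper's setup.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    name: str
    use_habitus: bool
    use_field: bool


# Hypothetical ablation grid; "flat_practice_only" is the referee's baseline.
VARIANTS = [
    Variant("full", True, True),
    Variant("no_field", True, False),
    Variant("no_habitus", False, True),
    Variant("flat_practice_only", False, False),
]


def conditioning_inputs(variant, practices, habitus_vec, field_vec):
    """Select the signals passed to the frozen LLM for one variant.

    The LLM, the history encoding, and the prompt format are assumed to be
    held fixed outside this function, so only the hierarchy levels vary.
    """
    inputs = {"practices": practices}
    if variant.use_habitus:
        inputs["habitus"] = habitus_vec
    if variant.use_field:
        inputs["field"] = field_vec
    return inputs


def consistent_gain(scores_full, scores_flat):
    """True if the full model beats the flat baseline on every task.

    Assumes higher is better; error-style metrics would need their sign
    flipped before being passed in.
    """
    return all(scores_full[task] > scores_flat[task] for task in scores_flat)
```

Reporting every variant on every task, rather than only the full model against external baselines, is what would let a reader attribute gains to the hierarchy itself.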

minor comments (2)

[Abstract] The abstract refers to 'further analyses' validating interpretability; the corresponding section should explicitly state the quantitative or qualitative metrics used (e.g., human evaluation protocol or clustering coherence scores).
[§2] Notation for the three levels (Practice, Habitus, Field) should be introduced with a clear diagram or pseudocode in §2 to aid readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback. We address each major comment below and commit to revisions that directly strengthen the evidential basis for the hierarchical claims.

Referee: [§4] §4 (Experiments): The reported consistent improvements on LaMP are not accompanied by an ablation that removes the habitus and field levels (e.g., a flat practice-only variant with identical LLM, history encoding, and input format). Without this isolation, the central claim that gains arise specifically from the Bourdieu-derived hierarchy rather than any structured prompting remains unsupported.

Authors: We agree that the current experiments do not isolate the contribution of the hierarchy. In the revised manuscript we will add a flat practice-only ablation that uses the identical frozen LLM, history encoding, and input format. Results from this comparison will be reported in §4 to test whether gains are attributable to the Bourdieu-derived levels rather than structured prompting alone.
Revision: yes

Referee: [§3] §3 (PHF_Compass): The implementation details do not specify how the habitus level (temporal accumulation into stable dispositions) and field level (shared regularities across users) are explicitly constructed or enforced inside the frozen-LLM pipeline, as opposed to emerging implicitly from prompt formatting. This is load-bearing for validating the three-level hierarchy.

Authors: We accept that §3 requires greater explicitness. The revised version will detail the explicit mechanisms: habitus is formed by a defined temporal aggregation operator over practice representations, and fields are instantiated via similarity-based clustering of habitus vectors whose centroids are injected as additional prompt context. Algorithmic pseudocode and input-construction examples will be added to demonstrate explicit enforcement within the frozen-LLM pipeline.
Revision: yes
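The two mechanisms named in the second response, a temporal aggregation operator and similarity-based field assignment, can be sketched in a few lines. This is an illustration under stated assumptions rather than the authors' operator: the exponential decay with a half-life, the cosine nearest-centroid rule, and all function names are hypothetical, and the centroids are taken as given instead of being learned by clustering.

```python
import math


def habitus(practices, timestamps, now, half_life=30.0):
    """Recency-weighted mean of practice vectors.

    Assumed operator: each practice is weighted by 0.5 ** (age / half_life),
    so a practice one half-life old counts half as much as a current one.
    """
    weights = [0.5 ** ((now - t) / half_life) for t in timestamps]
    total = sum(weights)
    dim = len(practices[0])
    return [
        sum(w * p[i] for w, p in zip(weights, practices)) / total
        for i in range(dim)
    ]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def assign_field(habitus_vec, centroids):
    """Index of the field centroid most similar to the user's habitus."""
    return max(range(len(centroids)), key=lambda k: cosine(habitus_vec, centroids[k]))


# Toy example: an old practice along axis 0 and a recent one along axis 1.
h = habitus([[1.0, 0.0], [0.0, 1.0]], timestamps=[0.0, 30.0], now=30.0)
print(h)  # approximately [0.333, 0.667]
print(assign_field(h, [[1.0, 0.0], [0.0, 1.0]]))  # 1
```

Setting every weight to 1 recovers uniform averaging, which gives a direct point of comparison for any temporal weighting scheme.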

Circularity Check

0 steps flagged

No significant circularity; framework is externally grounded

The paper's core move is to import Bourdieu's Theory of Practice (an external sociological source) and map it onto user-LLM interaction sequences to define the three-level PHF hierarchy. This mapping is presented as a modeling choice rather than a derivation from prior equations or self-citations. PHF_Compass is described as a lightweight implementation on a frozen LLM, with performance evaluated on the independent LaMP benchmark. No equations, fitted parameters renamed as predictions, self-citation load-bearing steps, or uniqueness theorems from the same authors appear in the provided text. The central claim therefore rests on experimental outcomes and the external theory rather than reducing to its own inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 1 invented entity

The central claim rests on the domain assumption that Bourdieu's sociological constructs translate directly to digital interaction logs. The only invented entity visible in the abstract is the PHF_Compass implementation. No free parameters are described.

axioms (1)

domain assumption: Bourdieu's Theory of Practice applies to sequences of user interactions with LLMs and yields useful hierarchical behavioral structures.
The entire PHF framework is built on this mapping as stated in the abstract.

invented entities (1)

PHF_Compass: no independent evidence
purpose: Lightweight, model-agnostic implementation of the PHF framework using a frozen LLM
Introduced in the abstract as the concrete system that instantiates the three-level model.

Reference graph

Works this paper leans on

113 extracted references · 52 canonical work pages · 13 internal anchors

[1]

arXiv preprint arXiv:2511.13593 , year=

O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents , author=. arXiv preprint arXiv:2511.13593 , year=

work page arXiv
[2]

SpeechMedAssist: Efficiently and Effectively Adapting Speech Language Models for Medical Consultation

SpeechMedAssist: Efficiently and Effectively Adapting Speech Language Models for Medical Consultation , author=. arXiv preprint arXiv:2601.04638 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[3]

2013 , publisher=

Principles of topological psychology , author=. 2013 , publisher=

2013
[4]

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=

Retrieval augmented generation with collaborative filtering for personalized text generation , author=. Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages=
[5]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

Proper: A progressive learning framework for personalized large language models with group-level adaptation , author=. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=
[6]

CCF International Conference on Natural Language Processing and Chinese Computing , pages=

FinTeam: A Multi-agent Collaborative Intelligence System for Comprehensive Financial Scenarios , author=. CCF International Conference on Natural Language Processing and Chinese Computing , pages=. 2025 , organization=

2025
[7]

arXiv preprint arXiv:2507.04037 , year=

Ready Jurist One: Benchmarking Language Agents for Legal Intelligence in Dynamic Environments , author=. arXiv preprint arXiv:2507.04037 , year=

work page arXiv
[8]

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

Triviaqa: A large scale distantly supervised challenge dataset for reading comprehension , author=. arXiv preprint arXiv:1705.03551 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[9]

Electronics , volume=

A survey of recommendation systems: recommendation models, techniques, and application fields , author=. Electronics , volume=. 2022 , publisher=

2022
[10]

Available at SSRN 5453594 , year=

TiLLM-Rec: Temporal-Interval-Aware Large Language Model for Sequential Recommendation under Irregular User Interactions , author=. Available at SSRN 5453594 , year=
[11]

Data Science and Engineering , volume=

Spatio-temporal representation learning with social tie for personalized poi recommendation , author=. Data Science and Engineering , volume=. 2022 , publisher=

2022
[12]

2024 IEEE 40th International Conference on Data Engineering (ICDE) , pages=

Adapting large language models by integrating collaborative semantics for recommendation , author=. 2024 IEEE 40th International Conference on Data Engineering (ICDE) , pages=. 2024 , organization=

2024
[13]

Advances in Neural Information Processing Systems , volume=

Recommender systems with generative retrieval , author=. Advances in Neural Information Processing Systems , volume=
[14]

Ieee Access , volume=

A survey of recommender systems based on deep learning , author=. Ieee Access , volume=. 2018 , publisher=

2018
[15]

arXiv preprint arXiv:2510.12563 , year=

HardcoreLogic: Challenging Large Reasoning Models with Long-tail Logic Puzzle Games , author=. arXiv preprint arXiv:2510.12563 , year=

work page arXiv
[16]

arXiv preprint arXiv:2509.25106 , year=

Towards personalized deep research: Benchmarks and evaluations , author=. arXiv preprint arXiv:2509.25106 , year=

work page arXiv
[17]

arXiv preprint arXiv:2412.03563 , year=

From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents , author=. arXiv preprint arXiv:2412.03563 , year=

work page arXiv
[18]

Frontiers of Computer Science , volume=

A survey on large language model based autonomous agents , author=. Frontiers of Computer Science , volume=. 2024 , publisher=

2024
[19]

Science China Information Sciences , volume=

The rise and potential of large language model based agents: A survey , author=. Science China Information Sciences , volume=. 2025 , publisher=

2025
[20]

arXiv preprint arXiv:2406.01171 , year=

Two tales of persona in llms: A survey of role-playing and personalization , author=. arXiv preprint arXiv:2406.01171 , year=

work page arXiv
[21]

Proceedings of the AAAI Conference on Artificial Intelligence , year=

Simulation-free hierarchical latent policy planning for proactive dialogues , author=. Proceedings of the AAAI Conference on Artificial Intelligence , year=
[22]

arXiv preprint arXiv:2311.00262 , year=

Plug-and-play policy planner for large language model powered dialogue agents , author=. arXiv preprint arXiv:2311.00262 , year=

work page arXiv
[23]

Companion Proceedings of the ACM on Web Conference 2025 , pages=

User-llm: Efficient llm contextualization with user embeddings , author=. Companion Proceedings of the ACM on Web Conference 2025 , pages=

2025
[24]

arXiv preprint arXiv:2408.00960 , year=

PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting , author=. arXiv preprint arXiv:2408.00960 , year=

work page arXiv
[25]

Agent hospital: A simulacrum of hospital with evolvable medical agents.arXiv preprint arXiv:2405.02957, 2024

Agent hospital: A simulacrum of hospital with evolvable medical agents , author=. arXiv preprint arXiv:2405.02957 , year=

work page arXiv
[26]

2025 , url=

Sicheng Yang and Zhaohu Xing and Lei Zhu , booktitle=. 2025 , url=

2025
[27]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=

Regularized vector quantization for tokenized image synthesis , author=. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages=
[28]

arXiv preprint arXiv:2511.17467 , year=

PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM , author=. arXiv preprint arXiv:2511.17467 , year=

work page arXiv
[29]

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Prefix-Tuning: Optimizing Continuous Prompts for Generation , author=. arXiv preprint arXiv:2101.00190 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[30]

arXiv preprint arXiv:2411.13902 , year=

Piors: Personalized intelligent outpatient reception based on large language model with multi-agents medical scenario simulation , author=. arXiv preprint arXiv:2411.13902 , year=

work page arXiv
[31]

arXiv preprint arXiv:2408.11779 , year=

Personality alignment of large language models , author=. arXiv preprint arXiv:2408.11779 , year=

work page arXiv
[32]

arXiv preprint arXiv:2408.10075 , year=

Personalizing reinforcement learning from human feedback with variational preference learning , author=. arXiv preprint arXiv:2408.10075 , year=

work page arXiv
[33]

arXiv preprint arXiv:2504.14439 , year=

LoRe: Personalizing LLMs via Low-Rank Reward Modeling , author=. arXiv preprint arXiv:2504.14439 , year=

work page arXiv
[34]

Advances in Neural Information Processing Systems , volume=

Lima: Less is more for alignment , author=. Advances in Neural Information Processing Systems , volume=
[35]

arXiv preprint arXiv:2402.05133 , year=

Personalized language modeling from personalized human feedback , author=. arXiv preprint arXiv:2402.05133 , year=

work page arXiv
[36]

arXiv preprint arXiv:2304.11406 , year=

Lamp: When large language models meet personalization , author=. arXiv preprint arXiv:2304.11406 , year=

work page arXiv
[37]

arXiv preprint arXiv:2404.18231 , year=

From persona to personalization: A survey on role-playing language agents , author=. arXiv preprint arXiv:2404.18231 , year=

work page arXiv
[38]

arXiv preprint arXiv:2503.02614 , year=

Personalized generation in large model era: A survey , author=. arXiv preprint arXiv:2503.02614 , year=

work page arXiv
[39]

Bert: Pre-training of deep bidirectional transformers for language understanding , author=. Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) , pages=

2019
[40]

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension , author=. arXiv preprint arXiv:1910.13461 , year=

work page internal anchor Pith review Pith/arXiv arXiv 1910
[41]

Journal of machine learning research , volume=

Exploring the limits of transfer learning with a unified text-to-text transformer , author=. Journal of machine learning research , volume=
[42]

arXiv preprint arXiv:2401.04858 , year=

User embedding model for personalized language prompting , author=. arXiv preprint arXiv:2401.04858 , year=

work page arXiv
[43]

Advances in neural information processing systems , volume=

Visual instruction tuning , author=. Advances in neural information processing systems , volume=
[44]

International conference on machine learning , pages=

Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models , author=. International conference on machine learning , pages=. 2023 , organization=

2023
[45]

IEEE Transactions on Knowledge and Data Engineering , year=

How to Bridge the Gap between Modalities: Survey on Multimodal Large Language Model , author=. IEEE Transactions on Knowledge and Data Engineering , year=
[46]

Advances in neural information processing systems , volume=

Neural discrete representation learning , author=. Advances in neural information processing systems , volume=
[47]

European Conference on Computer Vision , pages=

Unicode: Learning a unified codebook for multimodal large language models , author=. European Conference on Computer Vision , pages=. 2024 , organization=

2024
[48]

arXiv preprint arXiv:2404.03565 , year=

Personalized llm response generation with parameterized memory injection , author=. arXiv preprint arXiv:2404.03565 , year=

work page arXiv
[49]

, author=

Lora: Low-rank adaptation of large language models. , author=. ICLR , volume=
[50]

arXiv preprint arXiv:2407.02345 , year=

Morpheus: Modeling role from personalized dialogue history by exploring and utilizing latent space , author=. arXiv preprint arXiv:2407.02345 , year=

work page arXiv
[51]

First Conference on Language Modeling , year=

Factual and Tailored Recommendation Endorsements using Language Models and Reinforcement Learning , author=. First Conference on Language Modeling , year=
[52]

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Roberta: A robustly optimized bert pretraining approach , author=. arXiv preprint arXiv:1907.11692 , year=

work page internal anchor Pith review Pith/arXiv arXiv 1907
[53]

arXiv preprint arXiv:2503.15463 , year=

From 1,000,000 users to every user: Scaling up personalized preference for user-level alignment , author=. arXiv preprint arXiv:2503.15463 , year=

work page arXiv
[54]

arXiv e-prints , pages=

The llama 3 herd of models , author=. arXiv e-prints , pages=
[55]

Qwen2.5 Technical Report

Qwen2 technical report , author=. arXiv preprint arXiv:2412.15115 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[56]

international semantic web conference , pages=

Dbpedia: A nucleus for a web of open data , author=. international semantic web conference , pages=. 2007 , organization=

2007
[57]

Kim, Byeongchang and Kim, Hyunwoo and Kim, Gunhee , title = "
[58]

Personafeedback: A large-scale human-annotated benchmark for personalization.arXiv preprint arXiv:2506.12915,

PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization , author=. arXiv preprint arXiv:2506.12915 , year=

work page arXiv
[59]

Decoupled Weight Decay Regularization

Decoupled weight decay regularization , author=. arXiv preprint arXiv:1711.05101 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[60]

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation , author=. arXiv preprint arXiv:2410.13848 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[61]

Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning

Tan, Zhaoxuan and Zeng, Qingkai and Tian, Yijun and Liu, Zheyuan and Yin, Bing and Jiang, Meng. Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024

2024
[62]

arXiv preprint arXiv:2407.19412 , year=

Identity-driven hierarchical role-playing agents , author=. arXiv preprint arXiv:2407.19412 , year=

work page arXiv
[63]

arXiv preprint arXiv:2210.01240 , year=

Language models are greedy reasoners: A systematic formal analysis of chain-of-thought , author=. arXiv preprint arXiv:2210.01240 , year=

work page arXiv
[64]

Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

Min, Sewon and Lyu, Xinxi and Holtzman, Ari and Artetxe, Mikel and Lewis, Mike and Hajishirzi, Hannaneh and Zettlemoyer, Luke. Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?. Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022

2022
[65]

and Lin, Kevin and Hewitt, John and Paranjape, Ashwin and Bevilacqua, Michele and Petroni, Fabio and Liang, Percy

Liu, Nelson F. and Lin, Kevin and Hewitt, John and Paranjape, Ashwin and Bevilacqua, Michele and Petroni, Fabio and Liang, Percy. Lost in the Middle: How Language Models Use Long Contexts. Transactions of the Association for Computational Linguistics. 2024

2024
[66]

arXiv preprint arXiv:2402.16333 , year=

Unveiling the truth and facilitating change: Towards agent-based large-scale social movement simulation , author=. arXiv preprint arXiv:2402.16333 , year=

work page arXiv
[67]

2024 , eprint=

OASIS: Open Agent Social Interaction Simulations with One Million Agents , author=. 2024 , eprint=

2024
[68]

arXiv preprint arXiv:2504.10157 , year=

Socioverse: A world model for social simulation powered by llm agents and a pool of 10 million real-world users , author=. arXiv preprint arXiv:2504.10157 , year=

work page arXiv
[69]

Proceedings of the 47th international ACM SIGIR conference on research and development in Information Retrieval , pages=

On generative agents in recommendation , author=. Proceedings of the 47th international ACM SIGIR conference on research and development in Information Retrieval , pages=
[70]

Faithful Persona-based Conversational Dataset Generation with Large Language Models

Jandaghi, Pegah and Sheng, Xianghai and Bai, Xinyi and Pujara, Jay and Sidahmed, Hakim. Faithful Persona-based Conversational Dataset Generation with Large Language Models. Findings of the Association for Computational Linguistics: ACL 2024. 2024

2024
[71]

arXiv preprint arXiv:2402.09660 , year=

User modeling and user profiling: A comprehensive survey , author=. arXiv preprint arXiv:2402.09660 , year=

work page arXiv
[72]

arXiv preprint arXiv:2302.11087 , year=

A survey on user behavior modeling in recommender systems , author=. arXiv preprint arXiv:2302.11087 , year=

work page arXiv
[73]

AI That Keeps Up: NeurIPS 2025 Workshop on Continual and Compatible Foundation Model Updates , year=

Embedding-to-Prefix: Continual Personalization with Large Language Models , author=. AI That Keeps Up: NeurIPS 2025 Workshop on Continual and Compatible Foundation Model Updates , year=

2025
[74]

LLM s + Persona-Plug = Personalized LLM s

Liu, Jiongnan and Zhu, Yutao and Wang, Shuting and Wei, Xiaochi and Min, Erxue and Lu, Yu and Wang, Shuaiqiang and Yin, Dawei and Dou, Zhicheng. LLM s + Persona-Plug = Personalized LLM s. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2025

2025
[75]

arXiv preprint arXiv:2405.20985 , year=

Deco: Decoupling token compression from semantic abstraction in multimodal large language models , author=. arXiv preprint arXiv:2405.20985 , year=

work page arXiv
[76]

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , pages=

Building and applying a concept hierarchy representation of a user profile , author=. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval , pages=
[77]

Proceedings of the 24th international conference on world wide web , pages=

A multi-view deep learning approach for cross domain user modeling in recommendation systems , author=. Proceedings of the 24th international conference on world wide web , pages=
[78]

Proceedings of the 57th annual meeting of the association for computational linguistics , pages=

Neural news recommendation with long-and short-term user representations , author=. Proceedings of the 57th annual meeting of the association for computational linguistics , pages=
[79]

Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining , pages=

Variational user modeling with slow and fast features , author=. Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining , pages=
[80]

Generative recommendation: Towards next-generation recommender paradigm. arXiv preprint arXiv:2304.03516

