IVIE: A Neuro-symbolic Approach to Incremental and Validated Generation of Interactive Fiction Worlds

Luis Chiruzzo; Micaela Vaucher; Santiago Góngora; Santiago Silveira

arxiv: 2606.13348 · v1 · pith:KI4QX6CQnew · submitted 2026-06-11 · 💻 cs.CL · cs.AI

Pith reviewed 2026-06-27 06:26 UTC · model grok-4.3

keywords interactive fiction · neuro-symbolic generation · large language models · world coherence · puzzle design · narrative generation · computational creativity · symbolic validation

The pith

A neuro-symbolic pipeline uses LLMs for creative choices and symbolic checks to produce coherent interactive fiction worlds with puzzles and goals.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces IVIE as an incremental four-stage system that assigns setting creation, character design, and puzzle construction to large language models while applying symbolic validation to the evolving world state. This setup seeks to combine the generative range of neural models with the consistency guarantees of symbolic reasoning for fully playable interactive fiction. Human evaluations indicate that the outputs achieve thematic coherence, immersion, and player engagement. The work argues that the hybrid method preserves creative freedom while grounding the narrative structure around a central goal. Remaining gaps include occasional inconsistencies that evade the symbolic layer and the lack of quantitative checks on goal reachability.

Core claim

IVIE implements a four-stage incremental generation pipeline on top of the PAYADOR neuro-symbolic framework. Large language models handle creative decisions such as setting and character creation plus puzzle design, while symbolic validation enforces consistency across interconnected locations, functional items, non-player characters, and goal-oriented puzzles. Human evaluation of the generated worlds shows them to be immersive and thematically coherent with high player engagement, supporting the claim that symbolic grounding can constrain LLM output without removing generative flexibility.

What carries the argument

The four-stage incremental generation pipeline that delegates creative decisions to LLMs while grounding the world state through symbolic validation.
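
The delegation pattern described here can be illustrated with a minimal generate-then-validate loop. This is a sketch under stated assumptions, not IVIE's or PAYADOR's implementation: the `World` schema, the `check_world` rules, the stage names, and the stub proposers standing in for LLM calls are all hypothetical, and only two stages are shown for brevity.

```python
import copy
from dataclasses import dataclass, field


@dataclass
class World:
    """Hypothetical symbolic world state; not the paper's schema."""
    locations: dict = field(default_factory=dict)  # name -> set of connected names
    items: dict = field(default_factory=dict)      # item name -> location name


def check_world(world):
    """Return a list of violations; an empty list means the state is consistent."""
    problems = []
    for loc, exits in world.locations.items():
        for target in exits:
            if target not in world.locations:
                problems.append(f"{loc} connects to unknown location {target}")
    for item, loc in world.items.items():
        if loc not in world.locations:
            problems.append(f"{item} placed in unknown location {loc}")
    return problems


def run_stage(world, propose, max_retries=3):
    """Ask the proposer to extend the world; commit only if validation passes."""
    feedback = []
    for _ in range(max_retries):
        candidate = propose(copy.deepcopy(world), feedback)
        feedback = check_world(candidate)
        if not feedback:
            return candidate
    raise RuntimeError(f"stage failed validation: {feedback}")


# Stub proposers standing in for LLM calls (assumption: real ones would prompt a model).
def propose_setting(world, feedback):
    world.locations = {"cellar": {"hall"}, "hall": {"cellar"}}
    return world


def propose_items(world, feedback):
    # The first attempt references a location that does not exist;
    # the retry reacts to the validator's feedback.
    world.items["key"] = "hall" if feedback else "attic"
    return world


world = World()
for stage in (propose_setting, propose_items):
    world = run_stage(world, stage)
```

The design choice worth noting is that each stage works on a copy and the committed world only ever advances to a validated state, so a rejected proposal cannot corrupt earlier stages.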

If this is right

Worlds can be produced with interconnected locations, functional items, non-player characters, and coherent puzzles structured around a central goal.
Symbolic validation can constrain LLM output while still allowing creative flexibility in narrative elements.
Human-evaluated outputs reach high levels of thematic coherence and player engagement.
Future neuro-symbolic storytelling systems can follow the same delegation pattern between neural creativity and symbolic grounding.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

Adding automated metrics for goal reachability and structural validity would make the validation claims more testable.
The pipeline might be applied to generate content for existing game engines rather than standalone text worlds.
Similar incremental validation steps could address coherence issues in other LLM-driven narrative tasks such as multi-character dialogue.
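
One way such a goal-reachability metric might be computed is a breadth-first search over (location, held items) states. The sketch below is illustrative only: the world encoding (`exits`, `items`, `locks`) is a hypothetical simplification, and it assumes the player picks up every item on entering a location and never loses one.

```python
from collections import deque


def goal_reachable(exits, items, locks, start, goal):
    """Breadth-first search over (location, held items) states.

    exits: location -> list of neighbouring locations
    items: location -> list of items found there
    locks: (from, to) -> item required to traverse that exit
    """
    start_state = (start, frozenset(items.get(start, [])))
    seen = {start_state}
    queue = deque([start_state])
    while queue:
        loc, held = queue.popleft()
        if loc == goal:
            return True
        for nxt in exits.get(loc, []):
            needed = locks.get((loc, nxt))
            if needed is not None and needed not in held:
                continue  # exit is locked and the player lacks the item
            state = (nxt, held | frozenset(items.get(nxt, [])))
            if state not in seen:
                seen.add(state)
                queue.append(state)
    return False


exits = {"hall": ["cellar", "vault"], "cellar": ["hall"], "vault": []}
locks = {("hall", "vault"): "key"}

solvable = goal_reachable(exits, {"cellar": ["key"]}, locks, "hall", "vault")
# Same map, but the key sits behind the door it is supposed to open.
unsolvable = goal_reachable(exits, {"vault": ["key"]}, locks, "hall", "vault")
```

Reporting the fraction of generated worlds for which a check of this kind succeeds would give a number that human ratings alone cannot provide.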

Load-bearing premise

Symbolic validation catches most LLM-generated inconsistencies, and human judgments alone are enough to confirm objective playability.

What would settle it

A generated world in which a player cannot reach the stated goal because of an inconsistency that passed the symbolic checks.

Figures

Figures reproduced from arXiv: 2606.13348 by Luis Chiruzzo, Micaela Vaucher, Santiago Góngora, Santiago Silveira.

**Figure 2.** IVIE’s incremental generation pipeline. Starting from optional user inspiration, the LLM generates worlds progres…

**Figure 3.** IVIE’s interface allows users to generate worlds in two modes: Generate (the LLM takes the creative responsibility)…

**Figure 4.** Three key dimensions evaluated: World Genera…

Original abstract

Computational creativity in Interactive Fiction faces a fundamental tension: Large Language Models (LLMs) may produce creative narratives but struggle with world coherence, while symbolic systems ensure consistency but lack creative flexibility. We present IVIE (Incremental & Validated Interactive Experiences), a neuro-symbolic approach to generating complete and playable interactive fiction worlds from scratch. Building upon PAYADOR's neuro-symbolic framework, IVIE implements a four-stage incremental generation pipeline that delegates creative decisions (setting and character creation, puzzle design) to LLMs while grounding the world state through symbolic validation. The system generates worlds with interconnected locations, functional items, non-player characters, and coherent puzzles, all structured around a central goal-oriented architecture. Human evaluation shows the approach generates immersive, thematically coherent worlds with high player engagement. Results seem to indicate that the neuro-symbolic approach successfully balances flexibility with narrative coherence: symbolic validation grounds LLM generation without eliminating generative freedom. However, challenges remain: LLM inconsistencies occasionally bypass puzzle constraints, and objective validation gaps allow some structurally impossible goals. We identify key design considerations for future neuro-symbolic interactive storytelling systems, particularly regarding LLM capabilities and their limitations.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

IVIE gives a concrete four-stage pipeline for neuro-symbolic IF world generation but its claims rest on thin, unquantified human evaluation.

The main thing to know: this paper describes IVIE, a four-stage incremental pipeline that hands creative tasks such as setting and puzzle design to LLMs while using symbolic validation to keep the world state consistent. The evidence offered for its success is limited to human judgments, without supporting numbers.

What is new is the specific delegation and validation structure built on top of the PAYADOR framework. The system generates interconnected locations, functional items, NPCs, and goal-oriented puzzles from scratch, with symbolic checks applied at each stage to catch inconsistencies that pure LLM generation tends to produce.

The paper does a reasonable job of laying out the architecture and being upfront about remaining problems, including cases where LLM outputs bypass puzzle constraints and where objective validation gaps allow structurally impossible goals.

The soft spot is the evaluation. The abstract reports that human evaluation shows immersive and thematically coherent worlds with high player engagement, yet it provides no details on study size, controls, or any objective measures such as goal-reachability rates or counts of dead-end states. The central claim that symbolic validation successfully balances flexibility and coherence therefore depends on an untested assumption that the admitted failures are rare enough not to matter.
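
To make "counts of dead-end states" concrete: given an explicit game-state graph, a dead end is a state the player can reach but from which no goal state is reachable. The sketch below assumes such a graph is available as a plain dictionary; the state names are invented for illustration and nothing here reflects how IVIE represents play states.

```python
def dead_end_states(transitions, start, goals):
    """Return states reachable from `start` from which no goal can be reached.

    transitions: state -> iterable of successor states
    """
    # Forward pass: everything the player can get into.
    forward, stack = set(), [start]
    while stack:
        state = stack.pop()
        if state in forward:
            continue
        forward.add(state)
        stack.extend(transitions.get(state, ()))

    # Backward pass over reversed edges: everything that can still win.
    reverse = {}
    for state, successors in transitions.items():
        for nxt in successors:
            reverse.setdefault(nxt, set()).add(state)
    can_win, stack = set(), list(goals)
    while stack:
        state = stack.pop()
        if state in can_win:
            continue
        can_win.add(state)
        stack.extend(reverse.get(state, ()))

    return forward - can_win


# Hypothetical example: dropping the key down a well is irreversible.
transitions = {
    "start": ["has_key", "key_lost"],
    "has_key": ["door_open"],
    "key_lost": [],
    "door_open": [],
}
dead = dead_end_states(transitions, "start", {"door_open"})
```

An empty result together with a reachable goal would indicate that the world cannot be put into an unwinnable state, which is a stronger property than goal reachability alone.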

This work is for researchers working on computational creativity or AI tools for interactive storytelling. A reader looking for practical hybrid pipelines could extract useful design considerations from the four-stage description.

It deserves peer review so that referees can examine the full implementation and any additional results that might address the metric gaps.

Referee Report

2 major / 2 minor

Summary. The paper presents IVIE, a neuro-symbolic system for generating complete and playable interactive fiction worlds from scratch. It extends the PAYADOR framework with a four-stage incremental pipeline that assigns creative tasks (setting/character creation, puzzle design) to LLMs while using symbolic validation to enforce world-state consistency, interconnected locations, functional items, NPCs, and goal-oriented puzzles. Human evaluation is cited as evidence that the generated worlds are immersive, thematically coherent, and engaging, supporting the claim that the approach balances generative flexibility with narrative coherence, though the authors note residual issues with LLM inconsistencies bypassing constraints and gaps in objective validation.

Significance. If the central claim holds under stronger scrutiny, the work would contribute a concrete neuro-symbolic pipeline for computational creativity in interactive fiction, identifying design considerations for future systems. The incremental, validated generation approach and explicit acknowledgment of LLM limitations are constructive. However, the absence of quantitative metrics for playability reduces the strength of the significance assessment.

major comments (2)

[Human Evaluation / Results] The central claim that the four-stage pipeline plus symbolic validation produces 'complete and playable' worlds rests on human evaluation alone. No quantitative results are reported for goal-reachability success rate, fraction of worlds containing dead-end states, or automated solvability checks, despite the abstract explicitly stating that 'LLM inconsistencies occasionally bypass puzzle constraints' and 'objective validation gaps allow some structurally impossible goals.'
[Pipeline Description / Evaluation] The assertion that 'symbolic validation grounds LLM generation without eliminating generative freedom' is load-bearing for the neuro-symbolic contribution, yet the manuscript provides no breakdown of how often validation catches (or fails to catch) inconsistencies, nor any comparison against a pure-LLM baseline on the same metrics.

minor comments (2)

[Abstract] The abstract uses the hedging phrase 'Results seem to indicate'; this could be replaced with a direct statement of the observed outcomes once the full evaluation numbers are presented.
[Method] Clarify whether the symbolic validator operates on an explicit state representation (e.g., PDDL-style or custom logic) and how it interfaces with the LLM outputs at each of the four stages.
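
For readers unfamiliar with what a "PDDL-style" explicit state representation would look like, here is a minimal sketch: the state is a set of ground facts, and each action lists preconditions, added facts, and deleted facts. The fact strings, action names, and `validate_solution` helper are hypothetical and are not taken from the paper; whether IVIE's validator works this way is exactly what the comment asks the authors to clarify.

```python
from typing import NamedTuple


class Action(NamedTuple):
    name: str
    preconditions: frozenset
    add: frozenset
    delete: frozenset


def validate_solution(initial, actions, goal):
    """Replay a proposed puzzle solution step by step over a set of facts."""
    state = set(initial)
    for act in actions:
        missing = act.preconditions - state
        if missing:
            return False, f"{act.name}: unmet preconditions {sorted(missing)}"
        state = (state - act.delete) | act.add
    unmet = set(goal) - state
    if unmet:
        return False, f"goal facts not achieved: {sorted(unmet)}"
    return True, "ok"


take_key = Action(
    "take_key",
    frozenset({"at(player,cellar)", "in(key,cellar)"}),
    frozenset({"has(player,key)"}),
    frozenset({"in(key,cellar)"}),
)
unlock_vault = Action(
    "unlock_vault",
    frozenset({"has(player,key)", "locked(vault)"}),
    frozenset({"open(vault)"}),
    frozenset({"locked(vault)"}),
)
initial = {"at(player,cellar)", "in(key,cellar)", "locked(vault)"}

ok_result = validate_solution(initial, [take_key, unlock_vault], {"open(vault)"})
bad_result = validate_solution(initial, [unlock_vault], {"open(vault)"})
```

In a setup like this, an LLM-proposed puzzle would be accepted only if its intended solution replays cleanly, and the returned message could be fed back to the model on rejection.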

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive comments. We address each major comment below.

Referee: [Human Evaluation / Results] The central claim that the four-stage pipeline plus symbolic validation produces 'complete and playable' worlds rests on human evaluation alone. No quantitative results are reported for goal-reachability success rate, fraction of worlds containing dead-end states, or automated solvability checks, despite the abstract explicitly stating that 'LLM inconsistencies occasionally bypass puzzle constraints' and 'objective validation gaps allow some structurally impossible goals.'

Authors: We agree that quantitative metrics for playability would strengthen the claims. The manuscript's evaluation centers on human judgments of immersion and coherence because these are the primary qualities of interest for interactive fiction. In revision we will add any available internal statistics on validation pass rates and explicitly discuss the difficulty of automated solvability checks for open-ended worlds. We already flag the relevant limitations in the abstract and will expand that discussion. revision: partial
Referee: [Pipeline Description / Evaluation] The assertion that 'symbolic validation grounds LLM generation without eliminating generative freedom' is load-bearing for the neuro-symbolic contribution, yet the manuscript provides no breakdown of how often validation catches (or fails to catch) inconsistencies, nor any comparison against a pure-LLM baseline on the same metrics.

Authors: We will add a breakdown of validation interventions (number of constraints triggered and rejected outputs) drawn from generation logs. A head-to-head pure-LLM baseline on the same quantitative metrics was not run; we will therefore expand the discussion section to articulate the design rationale for the neuro-symbolic split and acknowledge the lack of direct empirical comparison as a limitation. revision: partial
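
A breakdown of this kind could be tallied from generation logs along the following lines. The log format is an assumption made for illustration (one JSON record per validation attempt, with hypothetical `stage`, `outcome`, and `constraints` fields); the paper does not describe its logs.

```python
import json
from collections import Counter


def summarize_interventions(log_lines):
    """Tally validator outcomes per stage and per triggered constraint."""
    attempts = Counter()
    rejections = Counter()
    constraints = Counter()
    for line in log_lines:
        record = json.loads(line)
        attempts[record["stage"]] += 1
        if record["outcome"] == "rejected":
            rejections[record["stage"]] += 1
            constraints.update(record.get("constraints", []))
    # Fraction of attempts the validator rejected, per stage.
    rejection_rate = {stage: rejections[stage] / n for stage, n in attempts.items()}
    return rejection_rate, constraints


# Invented sample records, not data from the paper.
sample_log = [
    '{"stage": "items", "outcome": "rejected", "constraints": ["unknown_location"]}',
    '{"stage": "items", "outcome": "accepted"}',
    '{"stage": "puzzles", "outcome": "accepted"}',
]
rates, triggered = summarize_interventions(sample_log)
```

Per-stage rejection rates and per-constraint counts would show where the symbolic layer intervenes most, though they say nothing about inconsistencies the validator misses.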

standing simulated objections not resolved

A full empirical comparison against a pure-LLM baseline on automated playability metrics would require new experiments outside the scope of the submitted work.

Circularity Check

0 steps flagged

No circularity; system description is self-contained

The paper describes a four-stage neuro-symbolic pipeline for generating interactive fiction worlds, delegating creative elements to LLMs while applying symbolic validation, and reports human evaluation results for coherence and engagement. No equations, parameters, or derivations appear in the provided text. The reference to the prior PAYADOR framework supplies background context but does not serve as the sole justification for the central claim; the new pipeline stages and human-rated outcomes constitute independent content. No self-definitional reductions, fitted inputs presented as predictions, or load-bearing self-citations that collapse the result to its inputs are present.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only abstract available; no free parameters, axioms, or invented entities are specified in the provided text.

