MMoA: An AI-Agent framework with recurrence for Memoried Mixure-of-Agent
Pith reviewed 2026-05-20 10:16 UTC · model grok-4.3
The pith
A recurrent LSTM router lets Mixture-of-Agents match standard performance while activating fewer agents for better efficiency.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By replacing static routers with an LSTM-based recurrence router, MMoA adaptively modulates agent contributions based on current inputs and historical routing decisions. This enables more context-aware aggregation across layers. On standard benchmarks like AlpacaEval 2.0, the system achieves a 58.0% win rate compared to 59.8% for traditional MoA, while improving runtime efficiency by up to 4.6% through reduced agent activations.
What carries the argument
The LSTM-based gating mechanism, which serves as the recurrence router to capture temporal and contextual dependencies for adaptive agent selection.
If this is right
- MMoA achieves a win rate of 58.0% on AlpacaEval 2.0 versus 59.8% for MoA.
- Runtime efficiency improves by up to 4.6% by dynamically activating fewer agents.
- Comparable performance holds on MT-Bench and Arena-Hard benchmarks.
- The recurrence allows context-aware aggregation without full static routing overhead.
Where Pith is reading between the lines
- The LSTM memory might help maintain consistency in multi-turn conversations by recalling past agent choices.
- Extending the recurrence to deeper layers or more agents could further optimize efficiency in larger systems.
- This dynamic selection strategy may apply to other multi-component AI systems beyond LLMs, such as tool-using agents.
Load-bearing premise
The LSTM-based gating mechanism successfully captures temporal and contextual dependencies across aggregation layers and produces genuinely adaptive agent selection rather than merely adding overhead.
What would settle it
Running MMoA and standard MoA on the same inputs with profiling tools to count active agents per query and measure actual inference time, verifying if efficiency gains exceed the added LSTM computation cost.
Figures
read the original abstract
The Mixture-of-Agents (MoA) framework has shown promise in improving large language model (LLM) performance by aggregating outputs from multiple agents. However, existing MoA systems often rely on static routers that do not fully capture temporal and contextual dependencies across aggregation layers. To address this limitation, we propose MMoA, a recurrent MoA architecture that integrates LSTM-based gating into the agent selection process. The recurrence router adaptively modulates agent contributions based on both current inputs and historical routing decisions, enabling more context-aware aggregation. We evaluate MMoA on standard instruction-following benchmarks, including AlpacaEval 2.0, MT-Bench, and Arena-Hard. The results show that MMoA achieves comparable accuracy to traditional MoA while reducing computational overhead by dynamically activating fewer agents. For example, on AlpacaEval 2.0, MMoA achieves a win rate of 58.0%, compared with 59.8% for MoA, while improving runtime efficiency by up to 4.6%. These results suggest that MMoA provides a scalable and efficient approach for adaptive multi-agent LLM systems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes MMoA, a recurrent extension of the Mixture-of-Agents (MoA) framework that integrates an LSTM-based gating mechanism to adaptively select and modulate agent contributions based on current inputs and historical routing decisions. It evaluates the approach on instruction-following benchmarks including AlpacaEval 2.0, MT-Bench, and Arena-Hard, claiming comparable accuracy to static MoA (e.g., 58.0% vs. 59.8% win rate on AlpacaEval 2.0) while achieving up to 4.6% runtime efficiency gains through dynamic activation of fewer agents.
Significance. If the efficiency improvements can be rigorously verified as arising from adaptive agent selection rather than implementation artifacts, MMoA would represent a practical advance for scalable multi-agent LLM systems by addressing limitations of static routers in capturing temporal dependencies. The work builds on existing MoA literature but currently lacks the empirical grounding needed to establish its contribution clearly.
major comments (2)
- [Abstract] Abstract: the central efficiency claim (up to 4.6% runtime improvement via fewer activated agents) is load-bearing yet unsupported by any reported statistics on mean or distribution of agents selected per query, wall-clock/FLOPs breakdowns separating LSTM overhead from agent calls, or ablation replacing the recurrent gate with a static router of matched parameter count.
- [Abstract] Abstract: benchmark results such as the 58.0% win rate on AlpacaEval 2.0 are presented without error bars, variance estimates, or details on how agent activation counts were measured, rendering the 'comparable accuracy' and efficiency assertions unverifiable from the given text.
minor comments (1)
- [Title] Title: contains apparent typographical errors ('Memoried' for 'Memory', 'Mixure' for 'Mixture', and singular 'Agent' instead of 'Agents') that should be corrected for consistency with the abstract and standard terminology.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address each major comment below and will revise the manuscript to provide the requested empirical details and statistical rigor.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central efficiency claim (up to 4.6% runtime improvement via fewer activated agents) is load-bearing yet unsupported by any reported statistics on mean or distribution of agents selected per query, wall-clock/FLOPs breakdowns separating LSTM overhead from agent calls, or ablation replacing the recurrent gate with a static router of matched parameter count.
Authors: We agree that the efficiency claims require stronger empirical grounding. In the revised manuscript we will report the mean and distribution of the number of agents activated per query under MMoA versus the static baseline. We will also add wall-clock and FLOPs breakdowns that isolate LSTM gating overhead from agent inference costs, and include an ablation that replaces the recurrent gate with a static router of matched parameter count to isolate the contribution of recurrence. revision: yes
-
Referee: [Abstract] Abstract: benchmark results such as the 58.0% win rate on AlpacaEval 2.0 are presented without error bars, variance estimates, or details on how agent activation counts were measured, rendering the 'comparable accuracy' and efficiency assertions unverifiable from the given text.
Authors: We acknowledge that the current presentation lacks necessary statistical detail. The revised version will include error bars or standard deviations for all reported win rates (including the 58.0% on AlpacaEval 2.0) and will explicitly describe the measurement protocol for agent activation counts, including how dynamic selection thresholds and per-query counts were computed. revision: yes
Circularity Check
No circularity: empirical benchmark results with no self-referential derivations
full rationale
The paper introduces MMoA as an LSTM-augmented variant of Mixture-of-Agents and reports direct experimental outcomes on AlpacaEval 2.0, MT-Bench, and Arena-Hard. The central efficiency claim (58.0% vs 59.8% win rate with up to 4.6% runtime improvement) is presented as a measured result rather than a quantity derived from or defined in terms of itself. No equations, fitted parameters renamed as predictions, uniqueness theorems, or self-citation chains appear in the abstract or described architecture. The derivation chain consists of an architectural proposal followed by external-benchmark evaluation; these results remain independently falsifiable and do not reduce to the inputs by construction.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
recurrent gating module—implemented via an LSTM (or RNN)—that processes both the outputs of the agents and the hidden state from previous aggregation layers
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
By carrying forward a single LSTM hidden state instead of re-computing a fresh gating network for each agent at every layer, the recurrence router reduces the effective time complexity from O(nL) to roughly O(n+L)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
-
[2]
Publications Manual , year = "1983", publisher =
work page 1983
-
[3]
Ashok K. Chandra and Dexter C. Kozen and Larry J. Stockmeyer , year = "1981", title =. doi:10.1145/322234.322243
-
[4]
Andrew, Galen and Gao, Jianfeng , booktitle=. Scalable training of. 2007 , url=
work page 2007
-
[5]
Dan Gusfield , title =. 1997
work page 1997
-
[6]
Mohammad Sadegh Rasooli and Joel R. Tetreault , title =. Computing Research Repository , volume =. 2015 , url =
work page 2015
-
[7]
A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =
Ando, Rie Kubota and Zhang, Tong , Issn =. A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , Volume =. Journal of Machine Learning Research , Month = dec, Numpages =. 2005 , url=
work page 2005
-
[8]
Cooley, James W. and Tukey, John W. , journal=. An algorithm for the machine calculation of complex. 1965 , url=
work page 1965
-
[9]
arXiv preprint arXiv:2408.06793 , year=
Layerwise recurrent router for mixture-of-experts , author=. arXiv preprint arXiv:2408.06793 , year=
-
[10]
arXiv preprint arXiv:2310.07188 , year=
Adaptive gating in mixture-of-experts based language models , author=. arXiv preprint arXiv:2310.07188 , year=
-
[11]
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Outrageously large neural networks: The sparsely-gated mixture-of-experts layer , author=. arXiv preprint arXiv:1701.06538 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[12]
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Gshard: Scaling giant models with conditional computation and automatic sharding , author=. arXiv preprint arXiv:2006.16668 , year=
work page internal anchor Pith review Pith/arXiv arXiv 2006
-
[13]
Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022). 2022
work page 2022
-
[14]
A Systematic Survey of Text Worlds as Embodied Natural Language Environments
Jansen, Peter. A Systematic Survey of Text Worlds as Embodied Natural Language Environments. Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022). 2022. doi:10.18653/v1/2022.wordplay-1.1
-
[15]
A Minimal Computational Improviser Based on Oral Thought
Montfort, Nick and Bartlett Fernandez, Sebastian. A Minimal Computational Improviser Based on Oral Thought. Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022). 2022. doi:10.18653/v1/2022.wordplay-1.2
-
[16]
Volum, Ryan and Rao, Sudha and Xu, Michael and DesGarennes, Gabriel and Brockett, Chris and Van Durme, Benjamin and Deng, Olivia and Malhotra, Akanksha and Dolan, Bill. Craft an Iron Sword: Dynamically Generating Interactive Game Characters by Prompting Large Language Models Tuned on Code. Proceedings of the 3rd Wordplay: When Language Meets Games Worksho...
-
[17]
A Sequence Modelling Approach to Question Answering in Text-Based Games
Furman, Gregory and Toledo, Edan and Shock, Jonathan and Buys, Jan. A Sequence Modelling Approach to Question Answering in Text-Based Games. Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022). 2022. doi:10.18653/v1/2022.wordplay-1.4
-
[18]
Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents
Teodorescu, Laetitia and Yuan, Xingdi and C \^o t \'e , Marc-Alexandre and Oudeyer, Pierre-Yves. Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents. Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022). 2022. doi:10.18653/v1/2022.wordplay-1.5
-
[19]
Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022
work page 2022
-
[20]
Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing
Yuan, Shuzhou and Maronikolakis, Antonis and Sch. Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.1
-
[21]
Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions
Ashida, Mana and Komachi, Mamoru. Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.2
-
[22]
G rease V ision: Rewriting the Rules of the Interface
Datta, Siddhartha and Kollnig, Konrad and Shadbolt, Nigel. G rease V ision: Rewriting the Rules of the Interface. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.3
-
[23]
Ludwig, Florian and Dolos, Klara and Zesch, Torsten and Hobley, Eleanor. Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.4
-
[24]
`` Zo Grof ! '' : A Comprehensive Corpus for Offensive and Abusive Language in D utch
Ruitenbeek, Ward and Zwart, Victor and Van Der Noord, Robin and Gnezdilov, Zhenja and Caselli, Tommaso. `` Zo Grof ! '' : A Comprehensive Corpus for Offensive and Abusive Language in D utch. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.5
-
[25]
Counter- TWIT : An I talian Corpus for Online Counterspeech in Ecological Contexts
Goffredo, Pierpaolo and Basile, Valerio and Cepollaro, Bianca and Patti, Viviana. Counter- TWIT : An I talian Corpus for Online Counterspeech in Ecological Contexts. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.6
-
[26]
S tereo KG : Data-Driven Knowledge Graph Construction For Cultural Knowledge and Stereotypes
Deshpande, Awantee and Ruiter, Dana and Mosbach, Marius and Klakow, Dietrich. S tereo KG : Data-Driven Knowledge Graph Construction For Cultural Knowledge and Stereotypes. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.7
-
[27]
Lu, Christina and Jurgens, David. The subtle language of exclusion: Identifying the Toxic Speech of Trans-exclusionary Radical Feminists. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.8
-
[28]
Lost in Distillation: A Case Study in Toxicity Modeling
Chvasta, Alyssa and Lees, Alyssa and Sorensen, Jeffrey and Vasserman, Lucy and Goyal, Nitesh. Lost in Distillation: A Case Study in Toxicity Modeling. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.9
-
[29]
Cleansing & expanding the HURTLEX (el) with a multidimensional categorization of offensive words
Stamou, Vivian and Alexiou, Iakovi and Klimi, Antigone and Molou, Eleftheria and Saivanidou, Alexandra and Markantonatou, Stella. Cleansing & expanding the HURTLEX (el) with a multidimensional categorization of offensive words. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.10
-
[30]
Free speech or Free Hate Speech? Analyzing the Proliferation of Hate Speech in Parler
Israeli, Abraham and Tsur, Oren. Free speech or Free Hate Speech? Analyzing the Proliferation of Hate Speech in Parler. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.11
-
[31]
Resources for Multilingual Hate Speech Detection
Arango Monnar, Ayme and Perez, Jorge and Poblete, Barbara and Salda \ n a, Magdalena and Proust, Valentina. Resources for Multilingual Hate Speech Detection. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.12
-
[32]
Enriching Abusive Language Detection with Community Context
Saleem, Haji Mohammad and Kurrek, Jana and Ruths, Derek. Enriching Abusive Language Detection with Community Context. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.13
-
[33]
DeTox: A Comprehensive Dataset for G erman Offensive Language and Conversation Analysis
Demus, Christoph and Pitz, Jonas and Sch. DeTox: A Comprehensive Dataset for G erman Offensive Language and Conversation Analysis. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.14
-
[34]
Multilingual H ate C heck: Functional Tests for Multilingual Hate Speech Detection Models
R. Multilingual H ate C heck: Functional Tests for Multilingual Hate Speech Detection Models. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.15
-
[35]
Distributional properties of political dogwhistle representations in S wedish BERT
Hertzberg, Niclas and Cooper, Robin and Lindgren, Elina and R. Distributional properties of political dogwhistle representations in S wedish BERT. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.16
-
[36]
Hate Speech Criteria: A Modular Approach to Task-Specific Hate Speech Definitions
Khurana, Urja and Vermeulen, Ivar and Nalisnick, Eric and Van Noorloos, Marloes and Fokkens, Antske. Hate Speech Criteria: A Modular Approach to Task-Specific Hate Speech Definitions. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.17
-
[37]
Accounting for Offensive Speech as a Practice of Resistance
Diaz, Mark and Amironesei, Razvan and Weidinger, Laura and Gabriel, Iason. Accounting for Offensive Speech as a Practice of Resistance. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.18
-
[38]
Zheng, Joan and Friedman, Scott and Schmer-galunder, Sonja and Magnusson, Ian and Wheelock, Ruta and Gottlieb, Jeremy and Gomez, Diana and Miller, Christopher. Towards a Multi-Entity Aspect-Based Sentiment Analysis for Characterizing Directed Social Regard in Online Messaging. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:1...
-
[39]
Flexible text generation for counterfactual fairness probing
Fryer, Zee and Axelrod, Vera and Packer, Ben and Beutel, Alex and Chen, Jilin and Webster, Kellie. Flexible text generation for counterfactual fairness probing. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.20
-
[40]
Users Hate Blondes: Detecting Sexism in User Comments on Online R omanian News
Moldovan, Andreea and Cs. Users Hate Blondes: Detecting Sexism in User Comments on Online R omanian News. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.21
-
[41]
Targeted Identity Group Prediction in Hate Speech Corpora
Sachdeva, Pratik and Barreto, Renata and Von Vacano, Claudia and Kennedy, Chris. Targeted Identity Group Prediction in Hate Speech Corpora. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.22
-
[42]
Revisiting Queer Minorities in Lexicons
Ramesh, Krithika and Kumar, Sumeet and Khudabukhsh, Ashiqur. Revisiting Queer Minorities in Lexicons. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.23
-
[43]
HATE - ITA : Hate Speech Detection in I talian Social Media Text
Nozza, Debora and Bianchi, Federico and Attanasio, Giuseppe. HATE - ITA : Hate Speech Detection in I talian Social Media Text. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. doi:10.18653/v1/2022.woah-1.24
-
[44]
Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[45]
Changes in Tweet Geolocation over Time: A Study with Carmen 2.0
Zhang, Jingyu and DeLucia, Alexandra and Dredze, Mark. Changes in Tweet Geolocation over Time: A Study with Carmen 2.0. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[46]
Extracting Mathematical Concepts from Text
Collard, Jacob and de Paiva, Valeria and Fong, Brendan and Subrahmanian, Eswaran. Extracting Mathematical Concepts from Text. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[47]
Ehghaghi, Malikeh and Rudzicz, Frank and Novikova, Jekaterina. Data-driven Approach to Differentiating between Depression and Dementia from Noisy Speech and Language Data. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[48]
Cross-Dialect Social Media Dependency Parsing for Social Scientific Entity Attribute Analysis
Eggleston, Chloe and O ' Connor, Brendan. Cross-Dialect Social Media Dependency Parsing for Social Scientific Entity Attribute Analysis. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[49]
Novikova, Jekaterina. Impact of Environmental Noise on A lzheimer ' s Disease Detection from Speech: Should You Let a Baby Cry?. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[50]
Exploring Multimodal Features and Fusion Strategies for Analyzing Disaster Tweets
Pranesh, Raj. Exploring Multimodal Features and Fusion Strategies for Analyzing Disaster Tweets. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[51]
NTULM : Enriching Social Media Text Representations with Non-Textual Units
Li, Jinning and Mishra, Shubhanshu and El-Kishky, Ahmed and Mehta, Sneha and Kulkarni, Vivek. NTULM : Enriching Social Media Text Representations with Non-Textual Units. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[52]
Robust Candidate Generation for Entity Linking on Short Social Media Texts
Hebert, Liam and Makki, Raheleh and Mishra, Shubhanshu and Saghir, Hamidreza and Kamath, Anusha and Merhav, Yuval. Robust Candidate Generation for Entity Linking on Short Social Media Texts. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[53]
T rans POS : Transformers for Consolidating Different POS Tagset Datasets
Li, Alex and Bankole-Hameed, Ilyas and Singh, Ranadeep and Ng, Gabriel and Gupta, Akshat. T rans POS : Transformers for Consolidating Different POS Tagset Datasets. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[54]
Fu, Xue-Yong and Chen, Cheng and Laskar, Md Tahmid Rahman and Tn, Shashi Bhushan and Corston-Oliver, Simon. An Effective, Performant Named Entity Recognition System for Noisy Business Telephone Conversation Transcripts. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[55]
Leveraging Semantic and Sentiment Knowledge for User-Generated Text Sentiment Classification
Khan, Jawad and Ahmad, Niaz and Alam, Aftab and Lee, Youngmoon. Leveraging Semantic and Sentiment Knowledge for User-Generated Text Sentiment Classification. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[56]
An Emotional Journey: Detecting Emotion Trajectories in D utch Customer Service Dialogues
Labat, Sofie and Hadifar, Amir and Demeester, Thomas and Hoste, Veronique. An Emotional Journey: Detecting Emotion Trajectories in D utch Customer Service Dialogues. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[57]
Supervised and Unsupervised Evaluation of Synthetic Code-Switching
Orlov, Evgeny and Artemova, Ekaterina. Supervised and Unsupervised Evaluation of Synthetic Code-Switching. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[58]
A rab G end: Gender Analysis and Inference on A rabic T witter
Mubarak, Hamdy and Chowdhury, Shammur Absar and Alam, Firoj. A rab G end: Gender Analysis and Inference on A rabic T witter. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[59]
Automatic Identification of 5 C Vaccine Behaviour on Social Media
Sampath Kumar, Ajay Hemanth and Shausan, Aminath and Demartini, Gianluca and Rahimi, Afshin. Automatic Identification of 5 C Vaccine Behaviour on Social Media. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[60]
Dimeski, Adam and Rahimi, Afshin. Automatic Extraction of Structured Mineral Drillhole Results from Unstructured Mining Company Reports. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[61]
S, Sumukh and Shrivastava, Manish. `` Kanglish alli names! '' Named Entity Recognition for K annada- E nglish Code-Mixed Social Media Data. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[62]
Span Extraction Aided Improved Code-mixed Sentiment Classification
S, Ramaneswaran and Benhur, Sean and Ghosh, Sreyan. Span Extraction Aided Improved Code-mixed Sentiment Classification. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[63]
A d BERT : An Effective Few Shot Learning Framework for Aligning Tweets to Superbowl Advertisements
Das, Debarati and Chenchu, Roopana and Abdollahi, Maral and Huh, Jisu and Srivastava, Jaideep. A d BERT : An Effective Few Shot Learning Framework for Aligning Tweets to Superbowl Advertisements. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[64]
Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data
Vielsted, Marcus and Wallenius, Nikolaj and van der Goot, Rob. Increasing Robustness for Cross-domain Dialogue Act Classification on Social Media Data. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[65]
Disfluency Detection for V ietnamese
Dao, Mai Hoang and Truong, Thinh Hung and Nguyen, Dat Quoc. Disfluency Detection for V ietnamese. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[66]
A multi-level approach for hierarchical Ticket Classification
Marcuzzo, Matteo and Zangari, Alessandro and Schiavinato, Michele and Giudice, Lorenzo and Gasparetto, Andrea and Albarelli, Andrea. A multi-level approach for hierarchical Ticket Classification. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[67]
Towards better structured and less noisy Web data: Oscar with Register annotations
Laippala, Veronika and Salmela, Anna and R. Towards better structured and less noisy Web data: Oscar with Register annotations. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[68]
True or False? Detecting False Information on Social Media Using Graph Neural Networks
Rode-Hasinger, Samyo and Kruspe, Anna and Zhu, Xiao Xiang. True or False? Detecting False Information on Social Media Using Graph Neural Networks. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[69]
Analyzing the Real Vulnerability of Hate Speech Detection Systems against Targeted Intentional Noise
Aggarwal, Piush and Zesch, Torsten. Analyzing the Real Vulnerability of Hate Speech Detection Systems against Targeted Intentional Noise. Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). 2022
work page 2022
-
[70]
Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022
work page 2022
-
[71]
Uncovering Surprising Event Boundaries in Narratives
Wang, Zhilin and Jafarpour, Anna and Sap, Maarten. Uncovering Surprising Event Boundaries in Narratives. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.1
-
[72]
Compositional Generalization for Kinship Prediction through Data Augmentation
Wei, Kangda and Ghosh, Sayan and Srivastava, Shashank. Compositional Generalization for Kinship Prediction through Data Augmentation. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.2
-
[73]
How to be Helpful on Online Support Forums?
Wang, Zhilin and Torres, Pablo E. How to be Helpful on Online Support Forums?. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.3
-
[74]
GPT -2-based Human-in-the-loop Theatre Play Script Generation
Rosa, Rudolf and Schmidtov \'a , Patr \' cia and Du s ek, Ond r ej and Musil, Tom \'a s and Mare c ek, David and Obaid, Saad and Nov \'a kov \'a , Marie and Voseck \'a , Kl \'a ra and Dole z al, Josef. GPT -2-based Human-in-the-loop Theatre Play Script Generation. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/...
-
[75]
G is P y: A Tool for Measuring Gist Inference Score in Text
Hosseini, Pedram and Wolfe, Christopher and Diab, Mona and Broniatowski, David. G is P y: A Tool for Measuring Gist Inference Score in Text. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.5
-
[76]
Stammbach, Dominik and Antoniak, Maria and Ash, Elliott. Heroes, Villains, and Victims, and GPT -3: Automated Extraction of Character Roles Without Training Data. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.6
-
[77]
Narrative Detection and Feature Analysis in Online Health Communities
Ganti, Achyutarama and Wilson, Steven and Ma, Zexin and Zhao, Xinyan and Ma, Rong. Narrative Detection and Feature Analysis in Online Health Communities. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.7
-
[78]
van Duijn, Max and van Dijk, Bram and Spruit, Marco. Looking from the Inside: How Children Render Character ' s Perspectives in Freely Told Fantasy Stories. Proceedings of the 4th Workshop of Narrative Understanding (WNU2022). 2022. doi:10.18653/v1/2022.wnu-1.8
-
[79]
Proceedings of the 2nd Workshop on Deriving Insights from User-Generated Text. 2022
work page 2022
-
[80]
Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion
Park, Seongmin and Lee, Jihwa. Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion. Proceedings of the 2nd Workshop on Deriving Insights from User-Generated Text. 2022. doi:10.18653/v1/2022.wit-1.1
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.