Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

Fan Wu; Fateme Dinmohammadi; Jingzhi Gong; Mari Ashiga; Paul Brookes; Vardan Voskanyan; Wei Jie; Zheng Wang

arxiv: 2503.13505 · v3 · pith:SDLYU6F6new · submitted 2025-03-13 · 💻 cs.CL · cs.AI· cs.LG

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

Mari Ashiga , Wei Jie , Fan Wu , Vardan Voskanyan , Fateme Dinmohammadi , Paul Brookes , Jingzhi Gong , Zheng Wang This is my paper

classification 💻 cs.CL cs.AIcs.LG

keywords ensemblegenerationllmstextcodelanguageapproachesfurther

0 comments

read the original abstract

Generative Pretrained Transformers (GPTs) are foundational Large Language Models (LLMs) for text generation. However, individual LLMs often produce inconsistent outputs and exhibit biases, limiting their representation of diverse language patterns. The closed-source nature of many powerful LLMs further restricts industry applications due to data privacy concerns. Inspired by successes in text generation, LLM ensemble techniques are now increasingly explored for code generation. This article reviews these emerging ensemble approaches to enhance understanding, encourage further research, and promote practical implementation in both text and code generation. We categorize LLM ensembles into seven main methods - weight merging, knowledge fusion, mixture-of-experts, reward ensemble, output ensemble, routing, and cascading - analyzing capabilities of those approaches. Our findings highlight key benefits such as improved diversity representation, enhanced output quality, and greater application flexibility. These insights aid model selection for real-world tasks and crucially, lay groundwork for extending ensemble strategies to multimodal LLMs.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Token-Level LLM Collaboration via FusionRoute
cs.AI 2026-01 unverdicted novelty 6.0

FusionRoute augments token-level expert routing with a trainable complementary logit generator to expand the policy class and recover optimal decoding under mild conditions, outperforming prior collaboration and mergi...