Journal of Machine Learning Research

Latent Dirichlet Allocation

8 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

cs.CL · 2020-12-31 · conditional · novelty 8.0

The Pile is a newly constructed 825 GiB dataset from 22 diverse sources that enables language models to achieve better performance on academic, professional, and cross-domain tasks than models trained on Common Crawl variants.

AI4BayesCode: From Natural Language Descriptions to Validated Modular Stateful Bayesian Samplers

stat.CO · 2026-05-18 · unverdicted · novelty 6.0

AI4BayesCode generates validated modular stateful MCMC samplers from natural language Bayesian model descriptions via LLM translation, modular blocks, and recursive stateful composition.

BoolXLLM: LLM-Assisted Explainability for Boolean Models

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

BoolXLLM augments an existing Boolean rule learner with LLMs for feature selection, discretization thresholds, and natural-language rule translation to improve interpretability while preserving accuracy.

Learning Mixtures of Nonparametric and Convolutional Measures on Effectively Low-dimensional Affine Spaces

math.ST · 2026-04-19 · unverdicted · novelty 6.0

Mixtures of convolutional measures on low-dimensional affine spaces admit unique identifiability in semi-parametric settings and posterior contraction rates under convex polytope support assumptions in a well-specified Bayesian regime.

ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models

cs.CL · 2024-02-18 · unverdicted · novelty 6.0

ALLaVA creates 1.3M GPT4V-synthesized samples enabling 4B VLMs to achieve competitive results on 17 benchmarks and match 7B/13B models on some tasks.

Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings

cs.CL · 2026-05-11 · unverdicted · novelty 5.0

Embeddings reliably capture authorial stylistic features in French literary texts, and these signals persist after LLM rewriting while showing model-specific patterns.

Graph-Augmented LLMs for Swiss MP Ideology Prediction

cs.CL · 2026-05-06 · unverdicted · novelty 5.0

Graph-augmented LLMs using a political knowledge graph improve ideology prediction accuracy for Swiss MPs by incorporating relational data beyond text alone.

The Shape of Testimony: A Scalable Framework for Oral History Archive Comparison

cs.AI · 2026-05-20 · unverdicted · novelty 4.0

Large-scale computational comparison of two major Holocaust oral history collections shows both expected differences and significant overlaps in interview structure, yielding a replicable framework for archive analysis.

citing papers explorer

Showing 8 of 8 citing papers.

The Pile: An 800GB Dataset of Diverse Text for Language Modeling
cs.CL · 2020-12-31 · conditional · none · ref 180

AI4BayesCode: From Natural Language Descriptions to Validated Modular Stateful Bayesian Samplers
stat.CO · 2026-05-18 · unverdicted · none · ref 28

BoolXLLM: LLM-Assisted Explainability for Boolean Models
cs.AI · 2026-05-12 · unverdicted · none · ref 30

Learning Mixtures of Nonparametric and Convolutional Measures on Effectively Low-dimensional Affine Spaces
math.ST · 2026-04-19 · unverdicted · none · ref 14

ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models
cs.CL · 2024-02-18 · unverdicted · none · ref 73

Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings
cs.CL · 2026-05-11 · unverdicted · none · ref 4

Graph-Augmented LLMs for Swiss MP Ideology Prediction
cs.CL · 2026-05-06 · unverdicted · none · ref 233

The Shape of Testimony: A Scalable Framework for Oral History Archive Comparison
cs.AI · 2026-05-20 · unverdicted · none · ref 27
