3 David Arthur and Sergei Vassilvitskii

Figure 7: Topic Prevalence by Time, Provider Type

Note · 2023 · arXiv 2008.09470

8 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1 · method: 1

citation-polarity summary

unclear: 1 · use method: 1

representative citing papers

Reasoning-Based Refinement of Unsupervised Text Clusters with LLMs

cs.CL · 2026-04-08 · unverdicted · novelty 6.0

LLM reasoning refines unsupervised text clusters via coherence checks, redundancy removal, and label grounding, yielding better coherence and human-aligned labels on social media data.

PRISM: LLM-Guided Semantic Clustering for High-Precision Topics

cs.LG · 2026-04-03 · unverdicted · novelty 6.0

PRISM distills sparse LLM labels into a fine-tuned embedding model for thresholded clustering that separates fine-grained topics better than prior local models or raw frontier embeddings.

A Comparative Evaluation of Structural Topic Models and BERTopic for Short, Open-Ended Survey Responses

cs.CL · 2026-05-21 · unverdicted · novelty 5.0

BERTopic with contextual augmentation outperforms STM on topic coherence and interpretability for short survey responses, but STM better supports inferential covariate analysis.

Traditional statistical representations outperform generative AI in identifying expert peer reviewers

cs.IR · 2026-05-18 · unverdicted · novelty 5.0

TF-IDF identifies labeled experts in the top 25 recommendations 79.5% of the time versus 51.5% for GPT-4o mini on an astronomy observatory dataset.

Evolution of Research Method Usage Across the Academic Careers of Library and Information Science Scholars

cs.DL · 2026-04-22 · unverdicted · novelty 5.0

Bibliometric methods rise from 19.61% to 31.81% usage as LIS scholars age, method diversity increases then declines, and scholars increasingly combine conventional and unconventional methods.

Granite Embedding Multilingual R2 Models

cs.IR · 2026-05-13 · unverdicted · novelty 4.0

Granite Embedding Multilingual R2 releases 311M and 97M parameter bi-encoder models that achieve state-of-the-art retrieval performance on multilingual text, code, long-document, and reasoning datasets.

Intelligent Knowledge Mining Framework: Bridging AI Analysis and Trustworthy Preservation

cs.DL · 2025-12-19 · unverdicted · novelty 3.0

IKMF introduces a dual-stream architecture that converts raw data into semantically rich knowledge via AI mining while maintaining integrity, provenance, and reproducibility through parallel archiving.

Much of Geospatial Web Search Is Beyond Traditional GIS

cs.IR · 2026-05-11

citing papers explorer

Showing 8 of 8 citing papers.

Reasoning-Based Refinement of Unsupervised Text Clusters with LLMs

cs.CL · 2026-04-08 · unverdicted · none · ref 3

LLM reasoning refines unsupervised text clusters via coherence checks, redundancy removal, and label grounding, yielding better coherence and human-aligned labels on social media data.

PRISM: LLM-Guided Semantic Clustering for High-Precision Topics

cs.LG · 2026-04-03 · unverdicted · none · ref 2

PRISM distills sparse LLM labels into a fine-tuned embedding model for thresholded clustering that separates fine-grained topics better than prior local models or raw frontier embeddings.

A Comparative Evaluation of Structural Topic Models and BERTopic for Short, Open-Ended Survey Responses

cs.CL · 2026-05-21 · unverdicted · none · ref 5

BERTopic with contextual augmentation outperforms STM on topic coherence and interpretability for short survey responses, but STM better supports inferential covariate analysis.

Traditional statistical representations outperform generative AI in identifying expert peer reviewers

cs.IR · 2026-05-18 · unverdicted · none · ref 75

TF-IDF identifies labeled experts in the top 25 recommendations 79.5% of the time versus 51.5% for GPT-4o mini on an astronomy observatory dataset.

Evolution of Research Method Usage Across the Academic Careers of Library and Information Science Scholars

cs.DL · 2026-04-22 · unverdicted · none · ref 31

Bibliometric methods rise from 19.61% to 31.81% usage as LIS scholars age, method diversity increases then declines, and scholars increasingly combine conventional and unconventional methods.

Granite Embedding Multilingual R2 Models

cs.IR · 2026-05-13 · unverdicted · none · ref 3

Granite Embedding Multilingual R2 releases 311M and 97M parameter bi-encoder models that achieve state-of-the-art retrieval performance on multilingual text, code, long-document, and reasoning datasets.

Intelligent Knowledge Mining Framework: Bridging AI Analysis and Trustworthy Preservation

cs.DL · 2025-12-19 · unverdicted · none · ref 50

IKMF introduces a dual-stream architecture that converts raw data into semantically rich knowledge via AI mining while maintaining integrity, provenance, and reproducibility through parallel archiving.

Much of Geospatial Web Search Is Beyond Traditional GIS

cs.IR · 2026-05-11 · unreviewed · ref 2
