hub Mixed citations

Warren, Lu Cheng, Haidar M

doi: 10 · 2024 · arXiv 2323.2024

Mixed citation behavior. Most common role is background (56%).

31 Pith papers citing it

Background 56% of classified citations

citation-role summary

background 7 · extension 1 · method 1

citation-polarity summary

background 5 · support 2 · extend 1 · use method 1

representative citing papers

Optical Music Recognition for Real-World Manuscripts with Synthetic Data

cs.CV · 2026-06-08 · unverdicted · novelty 7.0

Domain adaptation via synthetic manuscript images improves OMR performance on real-world piano manuscripts without requiring in-domain symbols.

PackSELL: A Sparse Matrix Format for Precision-Agnostic High-Performance SpMV

cs.DC · 2026-04-15 · unverdicted · novelty 7.0

PackSELL packs delta-encoded indices and values into single words with tunable bit allocation, delivering up to 1.63x faster FP16 SpMV and FP32-accurate performance exceeding FP16 cuSPARSE while reducing memory traffic.

Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis

cs.CL · 2026-03-20 · conditional · novelty 7.0

Seven clinician-informed safety criteria enable LLM-as-a-Judge to reach substantial agreement with human consensus (Cohen's κ up to 0.75) on evaluating LLM responses to users demonstrating psychosis.

Revisiting Forest Proximities via Sparse Leaf-Incidence Kernels

cs.LG · 2026-01-06 · conditional · novelty 7.0

Forest proximities admit an exact sparse factorization via separable weighted leaf-collision kernels that reduces computation to sparse linear algebra over leaf collisions.

AlphaEvolve: A coding agent for scientific and algorithmic discovery

cs.AI · 2025-06-16 · unverdicted · novelty 7.0

AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.

TabPATE: Differentially Private Tabular In-Context Learning Without Public Data

cs.LG · 2026-06-30 · unverdicted · novelty 6.0

TabPATE applies a PATE-style private aggregation to synthetic tabular queries generated from feature ranges, enabling private in-context learning with near-random membership inference success while keeping competitive utility.

Cross-Lingual Exploration for Parametric Knowledge

cs.CL · 2026-06-23 · unverdicted · novelty 6.0

Cross-lingual prompt exploration improves factual recall and consistency in LLMs across 17 languages more efficiently than native-language scaling.

PIPER: Content-Based Table Search via profiling and LLM-Generated Pseudoqueries

cs.IR · 2026-05-18 · unverdicted · novelty 6.0

PIPER retrieves and ranks tabular datasets by profiling their content and using LLM-generated queries for dense vector search, outperforming metadata baselines and TableQA methods in low-metadata settings.

Macro: Enhancing Multilingual Counterfactual Explanations through Alignment-as-Preference Optimization

cs.CL · 2026-05-12 · unverdicted · novelty 6.0

Macro uses DPO on composite preference pairs to raise validity of multilingual self-generated counterfactual explanations by 12.55% on average over chain-of-thought while preserving minimality.

Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization

cs.SE · 2026-05-08 · unverdicted · novelty 6.0

SPARK improves LLM-based test code fault localization by retrieving similar past faults and selectively annotating suspicious lines in new failing tests.

Make Any Collection Navigable: Methods for Constructing and Evaluating Hypergraph of Text

cs.IR · 2026-04-28 · unverdicted · novelty 6.0

Methods for constructing Hypergraphs of Text are proposed with a new effort ratio metric where TF-IDF baselines match LLM methods in experiments.

A Catalog of Data Errors

cs.DB · 2026-04-10 · unverdicted · novelty 6.0

A new catalog classifying 35 data error types into missing, incorrect, and redundant categories for tabular data, with definitions and examples to improve data quality management.

MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

cs.AI · 2026-04-09 · unverdicted · novelty 6.0

MONETA is the first multimodal benchmark for industry classification using text and geographic sources, with MLLM baselines at 62-74% accuracy and up to 22.8% gains from multi-turn context enrichment and explanations.

Counterfactual Modeling with Fine-Tuned LLMs for Health Intervention Design and Sensor Data Augmentation

cs.LG · 2026-01-21 · conditional · novelty 6.0

Fine-tuned LLMs produce plausible counterfactuals for health interventions and recover 20% F1 via data augmentation in label-scarce sensor datasets.

InsideOut: Measuring and Mitigating Insider-Outsider Bias in Interview Script Generation

cs.CL · 2025-09-25 · conditional · novelty 6.0

The paper introduces the InsideOut benchmark to quantify insider-outsider bias in LLM-generated interview scripts across 10 cultures and shows that multi-agent mitigation frameworks substantially reduce the bias on metrics like Cultural Alignment Gap.

When Summaries Distort Decisions: Information Fidelity in LLM-Compressed Financial Analysis

cs.AI · 2026-06-28 · unverdicted · novelty 5.0

LLM-based compression of financial source material can alter downstream investment decisions via decontextualization and model dependency, addressed by an agentic auditing approach that checks multiple compressions against the original.

AgentRx: A Benchmark Study of LLM Agents for Multimodal Clinical Prediction Tasks

cs.AI · 2026-05-11 · unverdicted · novelty 5.0

Single-agent LLM frameworks outperform naive multi-agent systems in multimodal clinical risk prediction tasks and are better calibrated.

NEURON: A Neuro-symbolic System for Grounded Clinical Explainability

cs.AI · 2026-05-02 · unverdicted · novelty 5.0 · 2 refs

NEURON integrates SNOMED CT, ML, and RAG LLM to raise AUC from 0.74-0.77 to 0.84-0.88 and human-aligned explainability scores from 0.50 to 0.85 on MIMIC-IV acute heart failure data.

Context-Mediated Domain Adaptation in Multi-Agent Sensemaking Systems

cs.HC · 2026-03-25 · unverdicted · novelty 5.0

Context-mediated domain adaptation treats user modifications to AI artifacts as implicit domain specifications that reshape LLM-powered multi-agent reasoning, demonstrated via the Seedentia system which extracted 46 domain knowledge entries from expert edits.

ML-Powered LDAP Reconnaissance Detection using Weak Supervision

cs.LG · 2026-06-27 · unverdicted · novelty 4.0

Weakly supervised ML classifier and hypothesis-testing signature mining detect LDAP reconnaissance at 65% TPR and 81.48% field precision.

Bridging the Smart City Cybersecurity Data Gap Through AI-Driven Synthetic Dataset Generation

cs.CR · 2026-06-10 · unverdicted · novelty 4.0

Proposes an AI-driven synthetic data generation framework to create realistic cybersecurity datasets for smart city research where real data is scarce or sensitive.

The social consequences of AI delegation

physics.soc-ph · 2026-06-09 · unverdicted · novelty 4.0

This perspective paper calls for a research program treating LLMs as consequential social actors whose outputs influence human decisions, norms, and collective dynamics.

A Theory-Guided LLM Pedagogical Agent for STEM+C Scaffolding Without Over-Reliance

cs.MA · 2026-05-28 · unverdicted · novelty 4.0

Copa is a theory-guided multimodal LLM agent that supports high school computational modeling through adaptive feedback, shown in a 33-dyad study to increase student confidence and conceptual verbalization without fostering dependence.

MedicalRec: Medical recommender system for image classification without retraining

cs.LG · 2026-05-23 · unverdicted · novelty 4.0

A transformer recommender system trained on a new benchmark of over 5,000 model performances from medical imaging papers achieves up to 75.5% HitRate@100.

citing papers explorer

Showing 25 of 25 citing papers after filters.

Optical Music Recognition for Real-World Manuscripts with Synthetic Data cs.CV · 2026-06-08 · unverdicted · none · ref 47
Domain adaptation via synthetic manuscript images improves OMR performance on real-world piano manuscripts without requiring in-domain symbols.
PackSELL: A Sparse Matrix Format for Precision-Agnostic High-Performance SpMV cs.DC · 2026-04-15 · unverdicted · none · ref 40
PackSELL packs delta-encoded indices and values into single words with tunable bit allocation, delivering up to 1.63x faster FP16 SpMV and FP32-accurate performance exceeding FP16 cuSPARSE while reducing memory traffic.
AlphaEvolve: A coding agent for scientific and algorithmic discovery cs.AI · 2025-06-16 · unverdicted · none · ref 38
AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.
TabPATE: Differentially Private Tabular In-Context Learning Without Public Data cs.LG · 2026-06-30 · unverdicted · none · ref 3
TabPATE applies a PATE-style private aggregation to synthetic tabular queries generated from feature ranges, enabling private in-context learning with near-random membership inference success while keeping competitive utility.
Cross-Lingual Exploration for Parametric Knowledge cs.CL · 2026-06-23 · unverdicted · none · ref 45
Cross-lingual prompt exploration improves factual recall and consistency in LLMs across 17 languages more efficiently than native-language scaling.
PIPER: Content-Based Table Search via profiling and LLM-Generated Pseudoqueries cs.IR · 2026-05-18 · unverdicted · none · ref 17
PIPER retrieves and ranks tabular datasets by profiling their content and using LLM-generated queries for dense vector search, outperforming metadata baselines and TableQA methods in low-metadata settings.
Macro: Enhancing Multilingual Counterfactual Explanations through Alignment-as-Preference Optimization cs.CL · 2026-05-12 · unverdicted · none · ref 5
Macro uses DPO on composite preference pairs to raise validity of multilingual self-generated counterfactual explanations by 12.55% on average over chain-of-thought while preserving minimality.
Similar Pattern Annotation via Retrieval Knowledge for LLM-Based Test Code Fault Localization cs.SE · 2026-05-08 · unverdicted · none · ref 46
SPARK improves LLM-based test code fault localization by retrieving similar past faults and selectively annotating suspicious lines in new failing tests.
Make Any Collection Navigable: Methods for Constructing and Evaluating Hypergraph of Text cs.IR · 2026-04-28 · unverdicted · none · ref 5
Methods for constructing Hypergraphs of Text are proposed with a new effort ratio metric where TF-IDF baselines match LLM methods in experiments.
A Catalog of Data Errors cs.DB · 2026-04-10 · unverdicted · none · ref 125
A new catalog classifying 35 data error types into missing, incorrect, and redundant categories for tabular data, with definitions and examples to improve data quality management.
MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems cs.AI · 2026-04-09 · unverdicted · none · ref 42
MONETA is the first multimodal benchmark for industry classification using text and geographic sources, with MLLM baselines at 62-74% accuracy and up to 22.8% gains from multi-turn context enrichment and explanations.
When Summaries Distort Decisions: Information Fidelity in LLM-Compressed Financial Analysis cs.AI · 2026-06-28 · unverdicted · none · ref 10
LLM-based compression of financial source material can alter downstream investment decisions via decontextualization and model dependency, addressed by an agentic auditing approach that checks multiple compressions against the original.
AgentRx: A Benchmark Study of LLM Agents for Multimodal Clinical Prediction Tasks cs.AI · 2026-05-11 · unverdicted · none · ref 56
Single-agent LLM frameworks outperform naive multi-agent systems in multimodal clinical risk prediction tasks and are better calibrated.
NEURON: A Neuro-symbolic System for Grounded Clinical Explainability cs.AI · 2026-05-02 · unverdicted · none · ref 104 · 2 links
NEURON integrates SNOMED CT, ML, and RAG LLM to raise AUC from 0.74-0.77 to 0.84-0.88 and human-aligned explainability scores from 0.50 to 0.85 on MIMIC-IV acute heart failure data.
Context-Mediated Domain Adaptation in Multi-Agent Sensemaking Systems cs.HC · 2026-03-25 · unverdicted · none · ref 50
Context-mediated domain adaptation treats user modifications to AI artifacts as implicit domain specifications that reshape LLM-powered multi-agent reasoning, demonstrated via the Seedentia system which extracted 46 domain knowledge entries from expert edits.
ML-Powered LDAP Reconnaissance Detection using Weak Supervision cs.LG · 2026-06-27 · unverdicted · none · ref 8
Weakly supervised ML classifier and hypothesis-testing signature mining detect LDAP reconnaissance at 65% TPR and 81.48% field precision.
Bridging the Smart City Cybersecurity Data Gap Through AI-Driven Synthetic Dataset Generation cs.CR · 2026-06-10 · unverdicted · none · ref 19
Proposes an AI-driven synthetic data generation framework to create realistic cybersecurity datasets for smart city research where real data is scarce or sensitive.
The social consequences of AI delegation physics.soc-ph · 2026-06-09 · unverdicted · none · ref 11
This perspective paper calls for a research program treating LLMs as consequential social actors whose outputs influence human decisions, norms, and collective dynamics.
A Theory-Guided LLM Pedagogical Agent for STEM+C Scaffolding Without Over-Reliance cs.MA · 2026-05-28 · unverdicted · none · ref 9
Copa is a theory-guided multimodal LLM agent that supports high school computational modeling through adaptive feedback, shown in a 33-dyad study to increase student confidence and conceptual verbalization without fostering dependence.
MedicalRec: Medical recommender system for image classification without retraining cs.LG · 2026-05-23 · unverdicted · none · ref 21
A transformer recommender system trained on a new benchmark of over 5,000 model performances from medical imaging papers achieves up to 75.5% HitRate@100.
Opportunities and Risks of Generative AI through the Health Information Journey cs.CY · 2026-05-21 · unverdicted · none · ref 92
Authors propose a four-stage framework to analyze opportunities and risks of generative AI across the health information journey from public sources to clinical care.
Modality vs. Morphology: A Framework for Time Series Classification for Biological Signals cs.LG · 2026-05-18 · unverdicted · none · ref 79
A review synthesizes evidence from EEG, EMG, ECG, PPG and ocular signals to argue that waveform morphology, rather than modality or model class, primarily determines TSC performance and interpretability.
Assessment of RAG and Fine-Tuning for Industrial Question-Answering-Applications cs.CL · 2026-05-10 · unverdicted · none · ref 2
RAG is more effective and cost-efficient than fine-tuning for industrial QA adaptation on automotive datasets.
Towards Enabling An Artificial Self-Construction Software Life-cycle via Autopoietic Architectures cs.SE · 2026-04-15 · unverdicted · none · ref 43
Proposes autopoietic architectures for self-constructing software as a fundamental shift in the SDLC, leveraging foundation models for autonomous evolution and maintenance.
"Skill issues": data-centric optimization of lakehouse agents cs.AI · 2026-05-31 · unverdicted · none · ref 29
Data-centric optimization of skills for agents on a branching lakehouse improves accuracy by 31.9% on 25 tasks via state-verification evaluation.
