Title resolution pending

Hu, Edward J · 2022

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

browse 9 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Conformal Selective Acting: Anytime-Valid Risk Control for RLVR-Trained LLMs

cs.LG · 2026-05-18 · conditional · novelty 8.0

Conformal Selective Acting (CSA) fills a gap in conformal methods by providing per-round, pathwise-valid selective risk bounds for adaptive RLVR LLM streams under predictable updates and isotonic calibration.

Embeddings for Preferences, Not Semantics

cs.AI · 2026-05-08 · unverdicted · novelty 7.0

Synthetic training data designed to break the correlation between semantic and preferential signals in text embeddings provably improves preference prediction across 11 online deliberation datasets.

GraSP-VL: Length as a Semantic Granularity Interface for Vision-Language Representations

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

GraSP-VL turns frozen VLM embedding length into a controllable semantic granularity interface via a learned shared prefix transform that creates a Semantic Matryoshka structure.

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play

cs.AI · 2026-05-16 · unverdicted · novelty 6.0

PopuLoRA shows that co-evolving populations of LoRA adapters through cross-evaluated self-play can outperform compute-matched single-agent baselines on multiple code and math reasoning benchmarks.

SCALE-LoRA: Auditing Post-Retrieval LoRA Composition with Residual Merging and View Reliability

cs.AI · 2026-05-02 · unverdicted · novelty 6.0

SCALE-LoRA proposes a post-retrieval audit framework using sparse residual composition and disagreement-based reliability signals to improve open-pool LoRA adapter reuse on tasks like BIG-Bench Hard.

LLM Benchmark Datasets Should Be Contamination-Resistant

cs.LG · 2026-05-19 · unverdicted · novelty 4.0

Authors call for contamination-resistant LLM benchmarks that exploit Transformer training-inference asymmetry and require new mathematical methods for cross-architecture interoperability.

Caraman at SemEval-2026 Task 8: Three-Stage Multi-Turn Retrieval with Query Rewriting, Hybrid Search, and Cross-Encoder Reranking

cs.CL · 2026-05-12 · unverdicted · novelty 4.0

A pipeline with LoRA-fine-tuned query rewriting, BM25+dense hybrid retrieval via RRF, and cross-encoder reranking reaches nDCG@5 of 0.531 on multi-turn retrieval across four domains.

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

cs.CL · 2026-05-06 · unverdicted · novelty 3.0

An ensemble of per-language fine-tuned Gemma 3 models with three synthetic data strategies and per-language threshold tuning achieves 2nd place overall in SemEval-2026 Task 9 with mean macro-F1 of 0.811.

PRISM: Preference-Aware Influence Function Based Data Selection Method for Efficient Fine-Tuning

cs.LG · 2026-05-20

citing papers explorer

Showing 9 of 9 citing papers.

Conformal Selective Acting: Anytime-Valid Risk Control for RLVR-Trained LLMs cs.LG · 2026-05-18 · conditional · none · ref 10
Conformal Selective Acting (CSA) fills a gap in conformal methods by providing per-round, pathwise-valid selective risk bounds for adaptive RLVR LLM streams under predictable updates and isotonic calibration.
Embeddings for Preferences, Not Semantics cs.AI · 2026-05-08 · unverdicted · none · ref 19
Synthetic training data designed to break the correlation between semantic and preferential signals in text embeddings provably improves preference prediction across 11 online deliberation datasets.
GraSP-VL: Length as a Semantic Granularity Interface for Vision-Language Representations cs.CV · 2026-05-18 · unverdicted · none · ref 17
GraSP-VL turns frozen VLM embedding length into a controllable semantic granularity interface via a learned shared prefix transform that creates a Semantic Matryoshka structure.
PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play cs.AI · 2026-05-16 · unverdicted · none · ref 30
PopuLoRA shows that co-evolving populations of LoRA adapters through cross-evaluated self-play can outperform compute-matched single-agent baselines on multiple code and math reasoning benchmarks.
SCALE-LoRA: Auditing Post-Retrieval LoRA Composition with Residual Merging and View Reliability cs.AI · 2026-05-02 · unverdicted · none · ref 1
SCALE-LoRA proposes a post-retrieval audit framework using sparse residual composition and disagreement-based reliability signals to improve open-pool LoRA adapter reuse on tasks like BIG-Bench Hard.
LLM Benchmark Datasets Should Be Contamination-Resistant cs.LG · 2026-05-19 · unverdicted · none · ref 41
Authors call for contamination-resistant LLM benchmarks that exploit Transformer training-inference asymmetry and require new mathematical methods for cross-architecture interoperability.
Caraman at SemEval-2026 Task 8: Three-Stage Multi-Turn Retrieval with Query Rewriting, Hybrid Search, and Cross-Encoder Reranking cs.CL · 2026-05-12 · unverdicted · none · ref 4
A pipeline with LoRA-fine-tuned query rewriting, BM25+dense hybrid retrieval via RRF, and cross-encoder reranking reaches nDCG@5 of 0.531 on multi-turn retrieval across four domains.
PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation cs.CL · 2026-05-06 · unverdicted · none · ref 7
An ensemble of per-language fine-tuned Gemma 3 models with three synthetic data strategies and per-language threshold tuning achieves 2nd place overall in SemEval-2026 Task 9 with mean macro-F1 of 0.811.
PRISM: Preference-Aware Influence Function Based Data Selection Method for Efficient Fine-Tuning cs.LG · 2026-05-20 · unreviewed · ref 25

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer