hub Canonical reference

URL https://openreview.net/pdf/9a7e7a9787d14ac8302215f8e4ef959606b78a94.pdf

Joanne Lin, Nantheera Anantrasirichai, David Bull · 2025 · arXiv 9660.2025

Canonical reference. 79% of citing Pith papers cite this work as background.

32 Pith papers citing it

Background 79% of classified citations

citation-role summary

background 11 · baseline 2 · method 1

citation-polarity summary

background 11 · baseline 2 · use method 1

representative citing papers

Evaluating the Search Agent in a Parallel World

cs.AI · 2026-03-05 · unverdicted · novelty 7.0

Mind-ParaWorld creates parallel worlds with atomic facts to evaluate search agents on future scenarios, showing they synthesize evidence well but struggle with collection, coverage, sufficiency judgment, and stopping decisions.

Symbolic recovery of PDEs from measurement data

cs.LG · 2026-02-17 · unverdicted · novelty 7.0

Symbolic rational-function networks recover an admissible PDE from noiseless complete measurements and select the regularization-minimizing parameterization within the architecture.

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

cs.IR · 2026-02-13 · unverdicted · novelty 7.0

SQuTR aggregates 37k queries from six text retrieval datasets, synthesizes speech from 200 speakers, adds 17 noise categories at varying SNR, and shows that even large retrieval models degrade sharply under extreme acoustic noise.

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

cs.CV · 2026-01-21 · unverdicted · novelty 7.0

DuFal combines global and local high-frequency Fourier neural operators with cross-attention fusion to recover fine anatomical structures in extremely sparse-view CBCT, outperforming prior methods on LUNA16 and ToothFairy data.

High Volume Rate 3D Ultrasound Reconstruction with Diffusion Models

eess.IV · 2025-05-28 · unverdicted · novelty 7.0 · 2 refs

Diffusion models reconstruct high-resolution 3D cardiac ultrasound volumes from heavily undersampled elevation planes and outperform traditional interpolation and supervised deep learning baselines.

TextTeacher: What Can Language Teach About Images?

cs.CV · 2026-05-21 · unverdicted · novelty 6.0

TextTeacher uses frozen text embeddings from captions as semantic anchors to guide vision model training, improving ImageNet accuracy by up to 2.7 p.p. and transfer performance by 1.0 p.p. on average.

Connectionless Bluetooth LE Channel Sounding via PAwR for Scalable and Energy-Efficient Ranging

eess.SP · 2026-05-16 · unverdicted · novelty 6.0

A connectionless Bluetooth LE Channel Sounding system via PAwR eliminates connection overhead, cuts energy use by up to 88% under partner switching, and supports 16384 devices per train.

Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

Transcoda achieves state-of-the-art zero-shot OMR with an 18.46% OMR-NED error rate on synthetic scores and 63.97% on historical Polish scans using a 59M model trained in 6 hours via synthetic data, kern normalization, and grammar decoding.

Communicating Sound Through Natural Language

cs.LG · 2026-05-09 · unverdicted · novelty 6.0

Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.

SAGE: Signal-Amplified Guided Embeddings for LLM-based Vulnerability Detection

cs.CR · 2026-04-21 · unverdicted · novelty 6.0

SAGE uses sparse autoencoders to boost vulnerability signals in LLMs, raising internal SNR 12.7x and delivering up to 318% MCC gains on vulnerability detection benchmarks.

Sonata: A Hybrid World Model for Inertial Kinematics under Clinical Data Scarcity

cs.LG · 2026-04-20 · unverdicted · novelty 6.0

Sonata is a small hybrid world model pre-trained to predict future IMU states that outperforms autoregressive baselines on clinical discrimination, fall-risk prediction, and cross-cohort transfer while fitting on-device wearables.

LLM-Codec: Neural Audio Codec Meets Language Model Objectives

cs.SD · 2026-04-20 · unverdicted · novelty 6.0

LLM-Codec augments audio codec training with multi-step token prediction and contrastive semantic alignment to improve both waveform reconstruction and autoregressive predictability for speech language models.

A Case Study on the Impact of Anonymization Along the RAG Pipeline

cs.CR · 2026-04-17 · unverdicted · novelty 6.0

Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.

SyncBreaker: Stage-Aware Multimodal Adversarial Attacks on Audio-Driven Talking Head Generation

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

SyncBreaker jointly attacks image and audio streams with Multi-Interval Sampling and Cross-Attention Fooling to degrade speech-driven talking head generation more than single-modality baselines.

MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

cs.AI · 2026-04-09 · unverdicted · novelty 6.0

MONETA is the first multimodal benchmark for industry classification using text and geographic sources, with MLLM baselines at 62-74% accuracy and up to 22.8% gains from multi-turn context enrichment and explanations.

Leveraging Artist Catalogs for Cold-Start Music Recommendation

cs.IR · 2026-04-08 · unverdicted · novelty 6.0

ACARec attends over artist catalogs to generate CF embeddings for new tracks, more than doubling recall and NDCG versus content-only baselines in music recommendation.

TADA! Tuning Audio Diffusion Models through Activation Steering

cs.SD · 2026-02-12 · unverdicted · novelty 6.0

Activation steering at a semantic bottleneck in audio diffusion models achieves state-of-the-art control over musical attributes such as instruments, vocals, and genres.

zea: A Toolbox for Cognitive Ultrasound Imaging

eess.SP · 2025-12-01 · unverdicted · novelty 6.0 · 2 refs

zea is a Python toolbox that supplies a modular differentiable pipeline for ultrasound imaging and signal processing, built on Keras 3 to support TensorFlow, PyTorch, and JAX backends.

SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

cs.SD · 2025-08-05 · unverdicted · novelty 6.0

SonicMaster is a text-conditioned flow-matching generative model for unified music restoration and mastering, trained on a dataset of simulated degradations across equalization, dynamics, reverb, amplitude, and stereo.

Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline

cs.CV · 2025-04-16 · unverdicted · novelty 6.0 · 2 refs

A self-supervised Degradation Estimation Network estimates parameters for physics-informed noise distributions to generate realistic synthetic low-light data, showing gains on noise replication, enhancement, and detection tasks.

Articulatory movements influence electromagnetic wave transmission through the vocal tract

physics.app-ph · 2026-04-21 · conditional · novelty 5.0 · 2 refs

Articulatory configurations during vowel production create distinct electromagnetic transmission patterns through the vocal tract, confirmed by qualitative agreement between finite-element simulations and scattering-matrix measurements on two subjects.

Training-inference input alignment outweighs framework choice in longitudinal retinal image prediction

cs.CV · 2026-04-18 · unverdicted · novelty 5.0

Training-inference input alignment outweighs framework choice for longitudinal retinal image prediction, with deterministic regression matching complex models when acquisition variability dominates disease progression.

AI Models for Depressive Disorder Detection and Diagnosis: A Review

cs.AI · 2025-08-16 · accept · novelty 5.0

A systematic review of AI for depressive disorder detection that introduces a novel hierarchical taxonomy organized by clinical task, data modality, and model class.

Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models

cs.AI · 2025-03-12 · unverdicted · novelty 5.0

The paper unifies perspectives on Long CoT in reasoning LLMs by introducing a taxonomy, detailing characteristics of deep reasoning and reflection, and discussing emergence phenomena and future directions.

citing papers explorer

Showing 32 of 32 citing papers.

Evaluating the Search Agent in a Parallel World cs.AI · 2026-03-05 · unverdicted · none · ref 8
Mind-ParaWorld creates parallel worlds with atomic facts to evaluate search agents on future scenarios, showing they synthesize evidence well but struggle with collection, coverage, sufficiency judgment, and stopping decisions.
Symbolic recovery of PDEs from measurement data cs.LG · 2026-02-17 · unverdicted · none · ref 41
Symbolic rational-function networks recover an admissible PDE from noiseless complete measurements and select the regularization-minimizing parameterization within the architecture.
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise cs.IR · 2026-02-13 · unverdicted · none · ref 19
SQuTR aggregates 37k queries from six text retrieval datasets, synthesizes speech from 200 speakers, adds 17 noise categories at varying SNR, and shows that even large retrieval models degrade sharply under extreme acoustic noise.
DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction cs.CV · 2026-01-21 · unverdicted · none · ref 8
DuFal combines global and local high-frequency Fourier neural operators with cross-attention fusion to recover fine anatomical structures in extremely sparse-view CBCT, outperforming prior methods on LUNA16 and ToothFairy data.
High Volume Rate 3D Ultrasound Reconstruction with Diffusion Models eess.IV · 2025-05-28 · unverdicted · none · ref 37 · 2 links
Diffusion models reconstruct high-resolution 3D cardiac ultrasound volumes from heavily undersampled elevation planes and outperform traditional interpolation and supervised deep learning baselines.
TextTeacher: What Can Language Teach About Images? cs.CV · 2026-05-21 · unverdicted · none · ref 26
TextTeacher uses frozen text embeddings from captions as semantic anchors to guide vision model training, improving ImageNet accuracy by up to 2.7 p.p. and transfer performance by 1.0 p.p. on average.
Connectionless Bluetooth LE Channel Sounding via PAwR for Scalable and Energy-Efficient Ranging eess.SP · 2026-05-16 · unverdicted · none · ref 5
A connectionless Bluetooth LE Channel Sounding system via PAwR eliminates connection overhead, cuts energy use by up to 88% under partner switching, and supports 16384 devices per train.
Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training cs.CV · 2026-05-11 · unverdicted · none · ref 14
Transcoda achieves state-of-the-art zero-shot OMR with an 18.46% OMR-NED error rate on synthetic scores and 63.97% on historical Polish scans using a 59M model trained in 6 hours via synthetic data, kern normalization, and grammar decoding.
Communicating Sound Through Natural Language cs.LG · 2026-05-09 · unverdicted · none · ref 12
Lexical acoustic coding lets LLMs transmit audio waveforms as editable natural-language sentences that another LLM can parse and reconstruct into sound.
SAGE: Signal-Amplified Guided Embeddings for LLM-based Vulnerability Detection cs.CR · 2026-04-21 · unverdicted · none · ref 26
SAGE uses sparse autoencoders to boost vulnerability signals in LLMs, raising internal SNR 12.7x and delivering up to 318% MCC gains on vulnerability detection benchmarks.
Sonata: A Hybrid World Model for Inertial Kinematics under Clinical Data Scarcity cs.LG · 2026-04-20 · unverdicted · none · ref 32
Sonata is a small hybrid world model pre-trained to predict future IMU states that outperforms autoregressive baselines on clinical discrimination, fall-risk prediction, and cross-cohort transfer while fitting on-device wearables.
LLM-Codec: Neural Audio Codec Meets Language Model Objectives cs.SD · 2026-04-20 · unverdicted · none · ref 14
LLM-Codec augments audio codec training with multi-step token prediction and contrastive semantic alignment to improve both waveform reconstruction and autoregressive predictability for speech language models.
A Case Study on the Impact of Anonymization Along the RAG Pipeline cs.CR · 2026-04-17 · unverdicted · none · ref 11
Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.
SyncBreaker: Stage-Aware Multimodal Adversarial Attacks on Audio-Driven Talking Head Generation cs.CV · 2026-04-09 · unverdicted · none · ref 29
SyncBreaker jointly attacks image and audio streams with Multi-Interval Sampling and Cross-Attention Fooling to degrade speech-driven talking head generation more than single-modality baselines.
MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems cs.AI · 2026-04-09 · unverdicted · none · ref 8
MONETA is the first multimodal benchmark for industry classification using text and geographic sources, with MLLM baselines at 62-74% accuracy and up to 22.8% gains from multi-turn context enrichment and explanations.
Leveraging Artist Catalogs for Cold-Start Music Recommendation cs.IR · 2026-04-08 · unverdicted · none · ref 40
ACARec attends over artist catalogs to generate CF embeddings for new tracks, more than doubling recall and NDCG versus content-only baselines in music recommendation.
TADA! Tuning Audio Diffusion Models through Activation Steering cs.SD · 2026-02-12 · unverdicted · none · ref 7
Activation steering at a semantic bottleneck in audio diffusion models achieves state-of-the-art control over musical attributes such as instruments, vocals, and genres.
zea: A Toolbox for Cognitive Ultrasound Imaging eess.SP · 2025-12-01 · unverdicted · none · ref 16 · 2 links
zea is a Python toolbox that supplies a modular differentiable pipeline for ultrasound imaging and signal processing, built on Keras 3 to support TensorFlow, PyTorch, and JAX backends.
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering cs.SD · 2025-08-05 · unverdicted · none · ref 7
SonicMaster is a text-conditioned flow-matching generative model for unified music restoration and mastering, trained on a dataset of simulated degradations across equalization, dynamics, reverb, amplitude, and stereo.
Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline cs.CV · 2025-04-16 · unverdicted · none · ref 10 · 2 links
A self-supervised Degradation Estimation Network estimates parameters for physics-informed noise distributions to generate realistic synthetic low-light data, showing gains on noise replication, enhancement, and detection tasks.
Articulatory movements influence electromagnetic wave transmission through the vocal tract physics.app-ph · 2026-04-21 · conditional · none · ref 26 · 2 links
Articulatory configurations during vowel production create distinct electromagnetic transmission patterns through the vocal tract, confirmed by qualitative agreement between finite-element simulations and scattering-matrix measurements on two subjects.
Training-inference input alignment outweighs framework choice in longitudinal retinal image prediction cs.CV · 2026-04-18 · unverdicted · none · ref 9
Training-inference input alignment outweighs framework choice for longitudinal retinal image prediction, with deterministic regression matching complex models when acquisition variability dominates disease progression.
AI Models for Depressive Disorder Detection and Diagnosis: A Review cs.AI · 2025-08-16 · accept · none · ref 75
A systematic review of AI for depressive disorder detection that introduces a novel hierarchical taxonomy organized by clinical task, data modality, and model class.
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models cs.AI · 2025-03-12 · unverdicted · none · ref 251
The paper unifies perspectives on Long CoT in reasoning LLMs by introducing a taxonomy, detailing characteristics of deep reasoning and reflection, and discussing emergence phenomena and future directions.
MSDS: Deep Structural Similarity with Multiscale Representation cs.CV · 2026-04-21 · unverdicted · none · ref 19
MSDS computes DeepSSIM at multiple pyramid scales and fuses the scores with learned weights, producing consistent improvements over single-scale DeepSSIM on IQA benchmarks with negligible extra cost.
SatBLIP: Context Understanding and Feature Identification from Satellite Imagery with Vision-Language Learning cs.CV · 2026-04-15 · unverdicted · none · ref 2
SatBLIP fine-tunes a satellite-adapted BLIP model on GPT-4o-generated captions to predict county-level SVI from satellite tiles and uses SHAP to highlight key features like roof condition and vegetation.
LLM4Log: A Systematic Review of Large Language Model-based Log Analysis cs.SE · 2026-03-18 · unverdicted · none · ref 205 · 2 links
Systematic review of 145 papers on LLM-based log analysis, providing a unified taxonomy, common design patterns, evaluation practices, and challenges for deployment under drift and limited labels.
The Prompt Engineering Report Distilled: Quick Start Guide for Life Sciences cs.CL · 2025-09-14 · unverdicted · none · ref 60
The paper reduces a broad set of prompt engineering techniques to six core approaches and applies them to life sciences use cases while addressing common LLM pitfalls.
Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding cs.CV · 2025-08-28 · unverdicted · none · ref 195
A literature survey on abstract concept recognition in videos that catalogs prior tasks and datasets while advocating for foundation models and reuse of decades of community experience.
Four Decades of Digital Waveguides eess.AS · 2026-04-14 · unverdicted · none · ref 214
Digital waveguide models enable efficient physically accurate sound synthesis and are now being optimized using classical, evolutionary, and neural methods.
A Review of Diffusion-based Simulation-Based Inference: Foundations and Applications in Non-Ideal Data Scenarios cs.LG · 2025-12-26 · accept · none · ref 78
A synthesis of diffusion-based simulation-based inference methods that address model misspecification, irregular observations, and missing data in scientific applications.
Secure Password Generator Based on Secure Pseudo-Random Number Generator cs.CR · 2025-08-25 · unverdicted · none · ref 16 · 2 links
A MAC-based PRNG for passwords is implemented and shown to meet NIST SP 800-90B entropy and IID criteria.
