Mixed citations

Title resolution pending

Repetti, A · 2020 · arXiv 0776.2020

Mixed citation behavior. Most common role is background (57%).

36 Pith papers citing it

Background 57% of classified citations

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 4 · dataset 2 · method 1
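The 57% background share appears to follow from these counts: it is taken over the 7 classified citations, not over all 36 citing papers. A minimal sketch of that arithmetic, assuming the percentage is a simple rounded ratio (the `role_shares` helper is hypothetical and not part of the hub):

```python
# Role counts as listed in the citation-role summary above.
role_counts = {"background": 4, "dataset": 2, "method": 1}


def role_shares(counts):
    """Return each role's share of classified citations as a whole-number percentage.

    Assumption: the hub rounds a plain count / total ratio; the actual
    rounding rule used by the page is not stated in the source.
    """
    total = sum(counts.values())
    return {role: round(100 * n / total) for role, n in counts.items()}


shares = role_shares(role_counts)
print(shares["background"])  # share of the most common role
```

Under this assumption the denominator is the number of classified citations, so the percentage can shift noticeably as more of the citing papers are classified.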

citation-polarity summary

background 4 · support 1 · use dataset 1 · use method 1

representative citing papers

FUTO Swipe: Layout-Agnostic Neural Swipe Decoding

cs.HC · 2026-06-24 · unverdicted · novelty 7.0

Neural swipe decoder trained with geometric augmentations on 1M+ swipes generalizes to unseen keyboard layouts by predicting per-point character locations and mapping via inference-time layout.

PolySpeech-100: A Large-Scale Benchmark for Speech Understanding Across 100+ Languages and Dialects

cs.CL · 2026-05-31 · unverdicted · novelty 7.0

PolySpeech-100 is a new benchmark for native-level speech comprehension across 110 linguistic variants that evaluates 22 models and reports E2E advantages on dialects, robustness gaps on low-resource languages, and degradation from Chain-of-Thought prompting.

A strongly annotated passive acoustic dataset for tropical bird monitoring

cs.SD · 2026-05-20 · accept · novelty 7.0 · 2 refs

PteroSet is a new strongly annotated dataset of 563 tropical bird recordings (73.62 h) containing 15,372 time-frequency labels for 168 species, released in COCO-style JSON with a binary bird detection baseline.

FLARE: Full-Modality Long-Video Audiovisual Retrieval Benchmark with User-Simulated Queries

cs.MM · 2026-05-11 · unverdicted · novelty 7.0

FLARE is a new benchmark with 399 long videos, 87k multimodal clips, and 275k user-style queries for testing audiovisual retrieval under caption and query regimes.

Geo2Sound: A Scalable Geo-Aligned Framework for Soundscape Generation from Satellite Imagery

cs.MM · 2026-04-16 · unverdicted · novelty 7.0

Geo2Sound generates geographically realistic soundscapes from satellite imagery via geospatial attribute modeling, semantic hypothesis expansion, and geo-acoustic alignment, achieving SOTA FAD of 1.765 on a new 20k-pair benchmark.

Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs

cs.CL · 2025-12-18 · unverdicted · novelty 7.0 · 2 refs

Cascaded systems remain the most reliable for speech translation overall, but recent SpeechLLMs match or outperform them in many conditions while standalone speech models lag.

Moshi: a speech-text foundation model for real-time dialogue

eess.AS · 2024-09-17 · accept · novelty 7.0

Moshi is the first real-time full-duplex spoken large language model that casts dialogue as speech-to-speech generation using parallel audio streams and an inner monologue of time-aligned text tokens.

Syntactic Belief Update as the Driver of Garden Path Processing Difficulty

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

Syntactic belief update via generalized Rényi divergence on syntactic trees predicts garden path reading times better than lexical surprisal.

BareWave: Waveform-Native Flow-Matching Text-to-Speech

eess.AS · 2026-06-08 · unverdicted · novelty 6.0

BareWave develops a waveform-native flow-matching framework for direct text-to-waveform TTS using representation alignment, staged noise scheduling, and velocity-aware perceptual alignment to achieve strong zero-shot voice cloning results.

MS-DKC: A Dataset Knowledge Card Framework for Designing and Adapting Medical Image Segmentation Models

cs.CV · 2026-06-04 · unverdicted · novelty 6.0

MS-DKC is a dataset knowledge card framework that maps image, morphology, supervision, context, and risk descriptors to design priors and failure modes, shown to produce dataset-specific model adaptations with improved metrics on DRIVE, ISIC2018, and ACDC.

Data-Driven Forecasting of three-Component Seismograms Using Transformer Architectures

astro-ph.IM · 2026-06-01 · unverdicted · novelty 6.0

SeismoGPT is a transformer autoregressive model achieving median normalized cross-correlation above 0.93 when forecasting synthetic three-component seismograms up to 240 s ahead from P- and S-wave context.

Parameter-efficient Dual-encoder Architecture with Differentiable Choquet Integral Fusion for Underwater Acoustic Classification

cs.SD · 2026-06-01 · unverdicted · novelty 6.0

A parameter-efficient dual-encoder model with differentiable Choquet integral fusion improves underwater acoustic classification accuracy over single-encoder baselines on DeepShip and ShipsEar datasets.

Reliable model selection in the presence of parameter non-identifiability

stat.ME · 2026-05-19 · unverdicted · novelty 6.0

Proposes adaptive multiple importance sampling for robust Bayesian model evidence estimation under parameter non-identifiability, shown to outperform deterministic methods on ecological case studies while being cheaper than MCMC.

Annotation-free deep learning for detection and segmentation of fetal germinal matrix-intraventricular hemorrhage in brain MRI

eess.IV · 2026-05-10 · conditional · novelty 6.0

FreeHemoSeg detects fetal GMH-IVH on T2-weighted MRI with high sensitivity and specificity and moderate segmentation accuracy using pseudo-image synthesis from normal scans, outperforming supervised and unsupervised baselines in internal and external validation.

Aspect-Aware Content-Based Recommendations for Mathematical Research Papers

cs.IR · 2026-05-05 · unverdicted · novelty 6.0

The authors introduce aspect-aware datasets GoldRiM and SilverRiM for math papers and AchGNN, a heterogeneous GNN that outperforms prior methods by jointly modeling textual semantics, citations, and author lineage across aspects.

Almost for Free: Crafting Adversarial Examples with Convolutional Image Filters

cs.LG · 2026-05-01 · conditional · novelty 6.0

Optimized 3x3 adversarial image filters based on edge detection generate transferable untargeted attacks on neural networks with 30-80% success using only one pass and far fewer parameters than prior methods.

CAHAL: Clinically Applicable resolution enHAncement for Low-resolution MRI scans

cs.CV · 2026-04-20 · unverdicted · novelty 6.0

CAHAL introduces a physics-informed mixture-of-experts super-resolution network for clinical MRI that conditions on resolution and anisotropy and uses edge-penalised, Fourier, and segmentation-guided losses to reduce hallucinations compared with prior generative methods.

SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation

cs.CL · 2025-12-24 · unverdicted · novelty 6.0 · 2 refs

SpidR-Adapt uses meta-learning with a first-order bi-level optimization heuristic to adapt speech representations to new languages with less than 1 hour of data, achieving 100x better efficiency than standard training.

Perceptual implications of automatic anonymization in pathological speech

eess.AS · 2025-05-01 · conditional · novelty 6.0 · 2 refs

Listeners detect automatic anonymization in pathological speech at 91-93% accuracy with a 30-point perceived quality drop, yet clinical severity ratings stay nearly unchanged for dysarthria, dysglossia, and dysphonia.

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

cs.CL · 2024-11-07 · conditional · novelty 6.0

MoT decouples non-embedding parameters by modality in transformers to match dense multi-modal performance with roughly one-third to one-half the FLOPs.

A Zero-shot Generalized Graph Anomaly Detection Framework via Node Reconstruction

cs.LG · 2026-06-10 · unverdicted · novelty 5.0

AlignGAD is a zero-shot generalized graph anomaly detection framework using a Global Unification Module, Clustering Module, and Node Discrepancy Scoring Module.

Speaker Group Encoding in Self-supervised Speech Recognition Models

cs.CL · 2026-06-09 · unverdicted · novelty 5.0

Self-supervised speech models encode multiple speaker group attributes, with ASR finetuning discarding phonetically variant information while retaining semantically variant information.

Quadratic integrate-and-fire neurons exhibit less fragmented loss landscapes and outperform leaky integrate-and-fire neurons in spike-based gradient descent

cs.NE · 2026-06-02 · unverdicted · novelty 5.0

QIF neurons outperform LIF neurons in spike-based gradient descent training of spiking neural networks by avoiding discontinuities that fragment the loss landscape.

High-Quality Synthetic Financial Time-Series using a GAN-Diffusion Framework

cs.LG · 2026-05-26 · unverdicted · novelty 5.0

Hybrid CoMeTS-GAN plus diffusion model generates multivariate financial time series claimed to better reproduce stylized facts and inter-asset correlations than prior generative methods.

citing papers explorer

Showing 1 of 1 citing paper after filters.

The Association of Transformer-based Sentiment Analysis with Symptom Distress and Deterioration in Routine Psychotherapy Care

cs.CL · 2026-05-11 · unverdicted · none · ref 33

Transformer-derived sentiment features from therapy sessions correlate with emotional-valence components of the OQ-45 and differ significantly between patients identified as at risk of deterioration by rational and empirical outcome models.
