JackZebra performs long-horizon route hijacking of vision-based AVs by converting adversarial patches into online-selected steering primitives via closed-loop control from an attacker vehicle.
super hub Mixed citations
librosa/librosa: 0.6.3
Mixed citation behavior. Most common role is background (54%).
hub tools
citation-role summary
citation-polarity summary
co-cited works
representative citing papers
Fetal-Gauge benchmark shows state-of-the-art vision-language models reach only 55% accuracy on fetal ultrasound tasks, well below clinical needs and highlighting the requirement for domain-adapted models.
Derives non-asymptotic 2-norm and infinity-norm error bounds for deterministic and stochastic variants of OPTQ and Qronos PTQ algorithms.
A parton-shower-inspired local subtraction scheme for double-real corrections in color singlet decays is introduced, with finiteness verified for the e+e- to qqbar remainder and phase-space integrals computed analytically and via sector decomposition.
ColumnKeeper provides the first mitigations for ColumnDisturb using per-subarray counters or probabilistic refresh, with low overheads at 1M and 128K thresholds.
Proves sharp threshold on mutation parameter χ for (1+1)-EA on Dynamic Binary Value and Uniform weight dynamic linear problems, yielding O(n log n) runtime below threshold and 2^Ω(n) above, plus a second stagnation-distance threshold for the former.
POPSICLE introduces benchmark datasets for cryoET segmentation and localization built from the CryoET Data Portal.
First unified benchmark finds GLR family has only 3x median slowdown over LR(1) on deterministic grammars and is the fastest among generalized parsers.
Introduces ESAS benchmark dataset using LLM-assisted event injection into acoustic scenes, showing significant performance drops in existing ASC models.
Optimal SSB frame origin for LGWA cuts sampling time by 10x and tightens chirp mass and sky position constraints for stellar-mass binaries beyond LVK performance.
Randomized experiment finds AI draft assistance raises feedback provision by teaching assistants 10.8 percentage points without harming quality.
A new greedy rebalancing algorithm for multi-constraint hypergraphs, integrated into Mt-KaHyPar, reduces geometric mean connectivity by 11.5% versus Metis while improving partition balance reliability.
Bayesian optimization identifies cement-salt hydrate composites achieving up to five times higher specific energy than prior cement-based TCES materials, with LiCl-based formulations reaching 458 kJ/kg.
FLDD learns non-Markovian marginal and posterior distributions for the forward process so a factorized reverse process can match the target better and produce higher-quality samples in fewer steps.
Human face perception aligns with neural networks trained on inverse-generative and naturalistic discriminative tasks, as these best predict human dissimilarity judgments on controversial and random face pairs.
An SMT-based active learning algorithm learns minimal nondeterministic weighted automata over arbitrary semirings, with partial correctness proofs, a sufficient termination condition, and experiments showing smaller models and fewer queries than baselines.
Rabi coupling allows a third component to join a self-bound binary quantum droplet in Bose gases, stabilized by finite detuning despite added repulsive forces.
FDA-QC combines functional data analysis of curves with quasi-conformal mappings to register and analyze both boundaries and interiors of planar biological shapes for morphing and variation studies.
Text-guided class-agnostic counting models exhibit significant weaknesses in grounding textual prompts to visual objects, as demonstrated by new negative-label and distractor tests on a multi-category dataset.
A new Java bytecode optimizer fuses map and filter into mapMulti to reduce stream overhead, sidestepping Streamliner's restrictions and delivering superior results in two of nine benchmarks while passing all 31,799 Kafka tests.
ContextualJailbreak uses evolutionary search over simulated primed dialogues with novel mutations to reach 90-100% attack success on open LLMs and transfers to some closed frontier models at 15-90% rates.
Vega-Video integrates video into Vega via synchronization, annotation, and transformation classes, using split signals and VOD repurposing for responsive mixed-modality visualizations.
Physics-informed transformer with sin^2(theta) encoding, physics-aware positional encoding, multi-task decoder, and three-stage curriculum classifies powder diffraction into 99 extinction groups, with structured errors on symmetry subgroup hierarchy.
BuyTheBy is a new annotated dataset of 18,710 paper mill advertisements containing 51,812 timestamped prices and 20,598 product positions.
citing papers explorer
-
FETAL-GAUGE: A Benchmark for Assessing Vision-Language Models in Fetal Ultrasound
Fetal-Gauge benchmark shows state-of-the-art vision-language models reach only 55% accuracy on fetal ultrasound tasks, well below clinical needs and highlighting the requirement for domain-adapted models.
-
Does it Really Count? Assessing Semantic Grounding in Text-Guided Class-Agnostic Counting
Text-guided class-agnostic counting models exhibit significant weaknesses in grounding textual prompts to visual objects, as demonstrated by new negative-label and distractor tests on a multi-category dataset.
-
Neuroscience-inspired Staged Representation Learning with Disentangled Coarse- and Fine-Grained Semantics for EEG Visual Decoding
A neuroscience-inspired staged framework with dual-level semantic disentanglement and semantic latent channels improves EEG visual decoding performance on the THINGS-EEG benchmark under zero-shot settings.
-
Transcoda: End-to-End Zero-Shot Optical Music Recognition via Data-Centric Synthetic Training
Transcoda achieves state-of-the-art zero-shot OMR with an 18.46% OMR-NED error rate on synthetic scores and 63.97% on historical Polish scans using a 59M model trained in 6 hours via synthetic data, kern normalization, and grammar decoding.
-
VerteNet -- A Multi-Context Hybrid CNN Transformer for Accurate Vertebral Landmark Localization in Lateral Spine DXA Images
A dual-resolution self- and cross-attention hybrid model localizes T12-L5 vertebral landmarks in multi-scanner DXA images with normalized mean error 4.92 pixels and median 2.35 pixels, outperforming baselines.
-
TrOCR for Medieval HTR: A Systematic Ablation Study with Cross-Dataset Validation
Systematic ablation of TrOCR fine-tuning for medieval HTR finds that freezing up to three encoder or six decoder layers does not significantly harm accuracy and that removing CLAHE contrast normalization yields comparable 7.84% CER on the Cortonese manuscript.
-
Rethinking the Good Enough Embedding for Easy Few-Shot Learning
Frozen DINOv2-L features with k-NN classification and PCA/ICA refinement achieve state-of-the-art few-shot performance on four benchmarks without any backpropagation or fine-tuning.
-
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
AlphaEarth Foundations produces general geospatial embedding fields that consistently outperform other featurization methods on diverse mapping tasks from sparse labels and releases annual global layers for 2017-2024.
-
Surgical Visual Understanding (SurgVU) Dataset
Releases the SurgVU dataset of surgical videos and labels to enable machine learning research in surgical data science.
-
Representation Paradigms in AI-based 3D Radiological Image Reconstruction: A Systematic Review
A systematic review that categorizes AI-based 3D radiological image reconstruction algorithms into four representation paradigms, summarizes evaluation metrics and datasets, and outlines challenges and future directions.