Mixed citations

The American Statistician 36(3a):153–157 Charoenphakdee N, Cui Z, Zhang Y, et al (2021) Classification with rejection based on cost- sensitive classification

Deng, J · 2009 · arXiv 2009.520684

Mixed citation behavior. Most common role is background (53%).

69 Pith papers citing it

Background 53% of classified citations

read on arXiv browse 69 citing papers

citation-role summary

background 9 dataset 4 method 2

citation-polarity summary

background 8 use dataset 4 use method 2 unclear 1

representative citing papers

Disentanglement Beyond Generative Models with Riemannian ICA

cs.LG · 2026-05-21 · unverdicted · novelty 8.0

RICA replaces ICA's global generative model with local Riemannian geometry, introducing a disentanglement tensor based on the Hessian of the log-likelihood and Ricci curvature to measure pointwise disentanglement, which recovers sources across manifolds in controlled tests.

STRABLE: Benchmarking Tabular Machine Learning with Strings

cs.LG · 2026-05-12 · unverdicted · novelty 8.0

A new corpus of 108 mixed string-numeric tables shows that advanced tabular learners with basic string embeddings perform well on most real-world data, while large LLM encoders help on free-text heavy tables.

Building Normalizing Flows with Stochastic Interpolants

cs.LG · 2022-09-30 · conditional · novelty 8.0

Normalizing flows are constructed by learning the velocity of a stochastic interpolant via a quadratic loss derived from its probability current, yielding an efficient ODE-based alternative to diffusion models.

Oracle Supervision Transfers for Hyperparameter Prediction in Model-Based Image Denoising

cs.CV · 2026-05-19 · conditional · novelty 7.0

HyperDn is a configuration-conditioned predictor that transfers oracle supervision across denoising paradigms to achieve near-oracle hyperparameter prediction with few or zero target labels.

SDM: A Powerful Tool for Evaluating Model Robustness

cs.CV · 2026-05-19 · unverdicted · novelty 7.0

SDM is a new staged gradient attack that reconstructs the adversarial objective around probability differences and reports stronger performance than prior methods like APGD.

Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

RAT reformulates regularized natural policy gradients as vanilla gradients with a transformed advantage, computed efficiently via randomized block Kaczmarz iterations on on-policy data.

Reasoning Portability: Guiding Continual Learning for MLLMs in the RLVR Era

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

Formalizes Reasoning Portability (RP) and proposes RDB-CL to modulate per-sample KL regularization in RLVR for MLLM continual learning, achieving +12.0% Last accuracy over vanilla RLVR baseline by preserving reusable reasoning on high-RP samples.

PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media

cs.CL · 2026-05-16 · unverdicted · novelty 7.0

PluRule is a new multimodal multilingual benchmark showing that state-of-the-art vision-language models perform only marginally better than a trivial baseline at detecting specific rule violations in pluralistic online communities.

Navigating Potholes with Geometry-Aware Sharpness Minimization

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

LLQR+SAM pairs a slow learned geometry preconditioner with fast SAM perturbations to amplify escape from locally sharp 'potholes' while stabilizing flat basins, producing consistent gains over SAM and LLQR alone.

Human face perception reflects inverse-generative and naturalistic discriminative objectives

q-bio.NC · 2026-05-12 · unverdicted · novelty 7.0

Human face perception aligns with neural networks trained on inverse-generative and naturalistic discriminative tasks, as these best predict human dissimilarity judgments on controversial and random face pairs.

Multi-domain Multi-modal Document Classification Benchmark with a Multi-level Taxonomy

cs.CL · 2026-05-11 · unverdicted · novelty 7.0 · 2 refs

MMM-Bench supplies 5,990 multi-modal documents from 12 commercial domains annotated along a 5-level taxonomy to test document classification under realistic business conditions.

Urban-ImageNet: A Large-Scale Multi-Modal Dataset and Evaluation Framework for Urban Space Perception

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

Urban-ImageNet is a 2-million-image multi-modal dataset with HUSIC 10-class taxonomy enabling benchmarks for urban scene classification, cross-modal retrieval, and instance segmentation.

Hyperbolic Concept Bottleneck Models

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

HypCBM reformulates concept activations as geometric containment in hyperbolic space to produce sparse, hierarchy-aware signals that match Euclidean models trained on 20 times more data.

Physics-informed, Generative Adversarial Design of Funicular Shells

cs.CE · 2026-04-17 · unverdicted · novelty 7.0

A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.

HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement

cs.CV · 2026-04-12 · unverdicted · novelty 7.0

A diffusion-based pipeline creates a 27M-annotation dataset of object placements that outperforms human annotations and baselines on image editing tasks, then distills it into a fast model.

Towards Near-Real-Time Telemetry-Aware Routing with Neural Routing Algorithms

cs.LG · 2026-04-03 · unverdicted · novelty 7.0

LOGGIA is a delay-aware graph neural routing algorithm using pre-training and RL that outperforms shortest-path and other neural methods in realistic network simulations.

Vibe Coding XR: Accelerating AI + XR Prototyping with XR Blocks and Gemini

cs.HC · 2026-03-25 · unverdicted · novelty 7.0

XR Blocks supplies an LLM-optimized Reality Model and Vibe Coding XR workflow that converts high-level prompts into working physics-aware XR applications with high one-shot success.

Setting-Matched and Semantics-Scaled Benchmarking of One-Step Generative Models Against Multistep Diffusion and Flow Models

cs.CV · 2026-03-15 · unverdicted · novelty 7.0

Matched benchmarking reveals FID misleads in few-step regimes under CFG, prompting CLIP-scaled and PickScore-scaled FID and IS variants for better semantic evaluation of one-step image generators.

Unifying Contrastive and Generative Objectives for Visual Understanding and Text-to-Image Generation

cs.CV · 2026-03-03 · unverdicted · novelty 7.0

DREAM introduces Masking Warmup and Semantically Aligned Decoding to let a single encoder handle both contrastive alignment and masked generation, yielding gains over CLIP and FLUID on understanding and generation benchmarks.

MobileMold: A Smartphone-Based Microscopy Dataset for Food Mold Detection

cs.CV · 2026-03-02 · unverdicted · novelty 7.0

MobileMold provides 4941 smartphone microscopy images and shows deep learning models reach 99.5% accuracy on mold detection and food classification tasks.

On the Convergence Rate of LoRA Gradient Descent

cs.LG · 2025-12-20 · unverdicted · novelty 7.0

LoRA gradient descent converges to a stationary point at rate O(1/log T).

Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models

cs.LG · 2025-12-02 · unverdicted · novelty 7.0

F2D2 jointly distills sampling and likelihood computation in flow-based models by adding a divergence head to a few-step flow map, achieving accurate log-likelihoods at 2-10 NFEs while preserving sample quality.

Representational Alignment Across Model Layers and Brain Regions with Multi-Level Optimal Transport

cs.LG · 2025-10-02 · accept · novelty 7.0

Multi-Level Optimal Transport (MOT) jointly infers soft layer couplings and neuron transport plans to produce global alignment scores and structured hierarchical correspondences between networks of varying depths.

Exemplar-Free Continual Learning for State Space Models

cs.LG · 2025-05-24 · unverdicted · novelty 7.0

Inf-SSM constrains the infinite-horizon evolution of SSMs via Grassmannian geometry and an efficient O(n^2) Sylvester solver to enable exemplar-free continual learning with reduced forgetting.

citing papers explorer

Showing 50 of 69 citing papers.

Disentanglement Beyond Generative Models with Riemannian ICA cs.LG · 2026-05-21 · unverdicted · none · ref 23
RICA replaces ICA's global generative model with local Riemannian geometry, introducing a disentanglement tensor based on the Hessian of the log-likelihood and Ricci curvature to measure pointwise disentanglement, which recovers sources across manifolds in controlled tests.
STRABLE: Benchmarking Tabular Machine Learning with Strings cs.LG · 2026-05-12 · unverdicted · none · ref 9
A new corpus of 108 mixed string-numeric tables shows that advanced tabular learners with basic string embeddings perform well on most real-world data, while large LLM encoders help on free-text heavy tables.
Building Normalizing Flows with Stochastic Interpolants cs.LG · 2022-09-30 · conditional · none · ref 10
Normalizing flows are constructed by learning the velocity of a stochastic interpolant via a quadratic loss derived from its probability current, yielding an efficient ODE-based alternative to diffusion models.
Oracle Supervision Transfers for Hyperparameter Prediction in Model-Based Image Denoising cs.CV · 2026-05-19 · conditional · none · ref 5
HyperDn is a configuration-conditioned predictor that transfers oracle supervision across denoising paradigms to achieve near-oracle hyperparameter prediction with few or zero target labels.
SDM: A Powerful Tool for Evaluating Model Robustness cs.CV · 2026-05-19 · unverdicted · none · ref 26
SDM is a new staged gradient attack that reconstructs the adversarial objective around probability differences and reports stronger performance than prior methods like APGD.
Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation cs.LG · 2026-05-18 · unverdicted · none · ref 67
RAT reformulates regularized natural policy gradients as vanilla gradients with a transformed advantage, computed efficiently via randomized block Kaczmarz iterations on on-policy data.
Reasoning Portability: Guiding Continual Learning for MLLMs in the RLVR Era cs.LG · 2026-05-17 · unverdicted · none · ref 9
Formalizes Reasoning Portability (RP) and proposes RDB-CL to modulate per-sample KL regularization in RLVR for MLLM continual learning, achieving +12.0% Last accuracy over vanilla RLVR baseline by preserving reusable reasoning on high-RP samples.
PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media cs.CL · 2026-05-16 · unverdicted · none · ref 157
PluRule is a new multimodal multilingual benchmark showing that state-of-the-art vision-language models perform only marginally better than a trivial baseline at detecting specific rule violations in pluralistic online communities.
Navigating Potholes with Geometry-Aware Sharpness Minimization cs.LG · 2026-05-15 · unverdicted · none · ref 28
LLQR+SAM pairs a slow learned geometry preconditioner with fast SAM perturbations to amplify escape from locally sharp 'potholes' while stabilizing flat basins, producing consistent gains over SAM and LLQR alone.
Human face perception reflects inverse-generative and naturalistic discriminative objectives q-bio.NC · 2026-05-12 · unverdicted · none · ref 46
Human face perception aligns with neural networks trained on inverse-generative and naturalistic discriminative tasks, as these best predict human dissimilarity judgments on controversial and random face pairs.
Multi-domain Multi-modal Document Classification Benchmark with a Multi-level Taxonomy cs.CL · 2026-05-11 · unverdicted · none · ref 34 · 2 links
MMM-Bench supplies 5,990 multi-modal documents from 12 commercial domains annotated along a 5-level taxonomy to test document classification under realistic business conditions.
Urban-ImageNet: A Large-Scale Multi-Modal Dataset and Evaluation Framework for Urban Space Perception cs.CV · 2026-05-11 · unverdicted · none · ref 7
Urban-ImageNet is a 2-million-image multi-modal dataset with HUSIC 10-class taxonomy enabling benchmarks for urban scene classification, cross-modal retrieval, and instance segmentation.
Hyperbolic Concept Bottleneck Models cs.LG · 2026-05-07 · unverdicted · none · ref 6
HypCBM reformulates concept activations as geometric containment in hyperbolic space to produce sparse, hierarchy-aware signals that match Euclidean models trained on 20 times more data.
Physics-informed, Generative Adversarial Design of Funicular Shells cs.CE · 2026-04-17 · unverdicted · none · ref 43
A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.
HiddenObjects: Scalable Diffusion-Distilled Spatial Priors for Object Placement cs.CV · 2026-04-12 · unverdicted · none · ref 11
A diffusion-based pipeline creates a 27M-annotation dataset of object placements that outperforms human annotations and baselines on image editing tasks, then distills it into a fast model.
Towards Near-Real-Time Telemetry-Aware Routing with Neural Routing Algorithms cs.LG · 2026-04-03 · unverdicted · none · ref 2
LOGGIA is a delay-aware graph neural routing algorithm using pre-training and RL that outperforms shortest-path and other neural methods in realistic network simulations.
Vibe Coding XR: Accelerating AI + XR Prototyping with XR Blocks and Gemini cs.HC · 2026-03-25 · unverdicted · none · ref 14
XR Blocks supplies an LLM-optimized Reality Model and Vibe Coding XR workflow that converts high-level prompts into working physics-aware XR applications with high one-shot success.
Setting-Matched and Semantics-Scaled Benchmarking of One-Step Generative Models Against Multistep Diffusion and Flow Models cs.CV · 2026-03-15 · unverdicted · none · ref 3
Matched benchmarking reveals FID misleads in few-step regimes under CFG, prompting CLIP-scaled and PickScore-scaled FID and IS variants for better semantic evaluation of one-step image generators.
Unifying Contrastive and Generative Objectives for Visual Understanding and Text-to-Image Generation cs.CV · 2026-03-03 · unverdicted · none · ref 5
DREAM introduces Masking Warmup and Semantically Aligned Decoding to let a single encoder handle both contrastive alignment and masked generation, yielding gains over CLIP and FLUID on understanding and generation benchmarks.
MobileMold: A Smartphone-Based Microscopy Dataset for Food Mold Detection cs.CV · 2026-03-02 · unverdicted · none · ref 9
MobileMold provides 4941 smartphone microscopy images and shows deep learning models reach 99.5% accuracy on mold detection and food classification tasks.
On the Convergence Rate of LoRA Gradient Descent cs.LG · 2025-12-20 · unverdicted · none · ref 2
LoRA gradient descent converges to a stationary point at rate O(1/log T).
Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models cs.LG · 2025-12-02 · unverdicted · none · ref 4
F2D2 jointly distills sampling and likelihood computation in flow-based models by adding a divergence head to a few-step flow map, achieving accurate log-likelihoods at 2-10 NFEs while preserving sample quality.
Representational Alignment Across Model Layers and Brain Regions with Multi-Level Optimal Transport cs.LG · 2025-10-02 · accept · none · ref 1
Multi-Level Optimal Transport (MOT) jointly infers soft layer couplings and neuron transport plans to produce global alignment scores and structured hierarchical correspondences between networks of varying depths.
Exemplar-Free Continual Learning for State Space Models cs.LG · 2025-05-24 · unverdicted · none · ref 10
Inf-SSM constrains the infinite-horizon evolution of SSMs via Grassmannian geometry and an efficient O(n^2) Sylvester solver to enable exemplar-free continual learning with reduced forgetting.
TextTeacher: What Can Language Teach About Images? cs.CV · 2026-05-21 · unverdicted · none · ref 12
TextTeacher uses frozen text embeddings from captions as semantic anchors to guide vision model training, improving ImageNet accuracy by up to 2.7 p.p. and transfer performance by 1.0 p.p. on average.
Activation-Free Backbones for Image Recognition: Polynomial Alternatives within MetaFormer-Style Vision Models cs.CV · 2026-05-20 · unverdicted · none · ref 3
Polynomial replacements for activations in MLPs, convolutions, and attention within MetaFormer yield PolyNeXt models that match or exceed standard performance on ImageNet, ADE20K, and robustness benchmarks while beating prior polynomial networks.
Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models cs.LG · 2026-05-18 · unverdicted · none · ref 87
COCOCO is a conformal framework for NeSy-CBMs that jointly conformalizes concepts and labels, reconciles them via deduction-abduction revision, and satisfies consistency, coverage, and conciseness while retaining distribution-free guarantees.
Venus-DeFakerOne: Unified Fake Image Detection & Localization cs.CV · 2026-05-13 · unverdicted · none · ref 59
DeFakerOne integrates InternVL2 and SAM2 into a single model that achieves state-of-the-art results on 39 detection and 9 localization benchmarks for unified fake image detection and localization.
The Diffusion Encoder cs.LG · 2026-05-13 · unverdicted · none · ref 34
A diffusion model serves as the encoder in an autoencoder when trained alternately with the decoder to resolve opposing update directions while retaining the standard diffusion training objective.
MC$^2$: Monte Carlo Correction for Fast Elliptic PDE Solving cs.LG · 2026-05-10 · unverdicted · none · ref 8
MC² corrects low-budget Monte Carlo solutions for elliptic PDEs with a single-pass neural network to match the accuracy of 1000× more Monte Carlo samples while outperforming classical and learned baselines.
Removing the Watermark Is Not Enough: Forensic Stealth in Generative-AI Watermark Removal cs.CR · 2026-05-09 · unverdicted · none · ref 8
Current AI image watermark removal attacks replace the watermark with a different forensic signal, allowing independent detectors to distinguish processed outputs from clean images at over 98% true-positive rate under a 1% false-positive budget.
Compared to What? Baselines and Metrics for Counterfactual Prompting cs.CL · 2026-05-01 · conditional · none · ref 107
Counterfactual prompting effects on LLMs are often indistinguishable from those caused by meaning-preserving paraphrases, causing most previously reported demographic sensitivities to disappear under proper statistical comparison.
Preventing Latent Rehearsal Decay in Online Continual SSL with SOLAR cs.LG · 2026-04-12 · unverdicted · none · ref 15
SOLAR prevents latent rehearsal decay in online continual SSL by adaptively managing replay buffers with deviation proxies and an explicit overlap loss, delivering both fast convergence and state-of-the-art final accuracy on vision benchmarks.
Zero-shot World Models Are Developmentally Efficient Learners cs.AI · 2026-04-11 · unverdicted · none · ref 17
A zero-shot visual world model trained on one child's experience achieves broad competence on physical understanding benchmarks while matching developmental behavioral patterns.
Drifting Fields are not Conservative cs.LG · 2026-04-07 · unverdicted · none · ref 1 · 2 links
Drift fields are not conservative except for Gaussian kernels; sharp normalization makes them conservative for any radial kernel by equating them to score differences of kernel density estimates.
CLAMP: Contrastive Learning for 3D Multi-View Action-Conditioned Robotic Manipulation Pretraining cs.RO · 2026-01-31 · unverdicted · none · ref 7
CLAMP pretrains 3D multi-view encoders with contrastive learning on point clouds and actions, then initializes diffusion policies for more sample-efficient fine-tuning on robotic tasks.
Fundamental Limitations of Favorable Privacy-Utility Guarantees for DP-SGD cs.LG · 2026-01-15 · unverdicted · none · ref 19
Shuffled DP-SGD requires σ ≥ 1/√(2 ln M) or κ ≥ (1/√8)(1 - 1/√(4π ln M)) to limit adversarial advantage, preventing strong privacy and high utility simultaneously.
NEO: No-Optimization Test-Time Adaptation through Latent Re-Centering cs.LG · 2025-10-07 · unverdicted · none · ref 2
NEO performs test-time adaptation by re-centering target latent embeddings at the origin, boosting accuracy on distribution-shifted datasets like ImageNet-C with no optimization or hyperparameters and minimal extra compute.
Curriculum-guided multimodal representation learning enables generalizable prediction of nanomaterial-protein interactions cs.LG · 2025-07-18 · conditional · none · ref 45
CuMMI applies curriculum learning across progressively complex biofluids to a multimodal model integrating protein sequence, structure, and 37 experimental features, achieving mean classification metrics above 0.75 on temporal, nanomaterial-held-out, and protein-held-out tests.
Perceptual implications of automatic anonymization in pathological speech eess.AS · 2025-05-01 · conditional · none · ref 63
Listeners detect automatic anonymization in pathological speech at 91-93% accuracy with a 30-point perceived quality drop, yet clinical severity ratings stay nearly unchanged for dysarthria, dysglossia, and dysphonia.
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets cs.RO · 2025-04-03 · unverdicted · none · ref 17
Unified World Models couple video and action diffusion inside one transformer with independent timesteps, enabling pretraining on heterogeneous robot datasets that include action-free video and producing more generalizable policies than imitation learning alone.
Low Rank Based Subspace Inference for the Laplace Approximation of Bayesian Neural Networks cs.LG · 2025-02-04 · unverdicted · none · ref 54
Derives optimal low-rank subspace for Laplace approx in BNNs, provides scalable outperforming version, and new comparison metric.
Excretion Detection in Pigsties Using Convolutional and Transformerbased Deep Neural Networks cs.CV · 2024-11-29 · unverdicted · none · ref 5
Four object detection models achieve over 90% average precision detecting excretions in pigsties from thermal images and remain reasonably robust on out-of-distribution data from different barns.
LaMI: Augmenting Large Language Models via Late Multi-Image Fusion cs.CL · 2024-06-19 · unverdicted · none · ref 7
LaMI augments LLMs with visual commonsense via late fusion of predictions from multiple text-generated images, outperforming prior augmented LLMs on visual tasks while matching VLMs and preserving or improving NLP performance.
Near OOD Detection for Vision-Language Prompt Learning with Contrastive Logit Score cs.CV · 2024-05-25 · unverdicted · none · ref 1
Contrastive Logit Score (CLS) improves near OOD detection AUROC by up to 11.67% for pre-trained vision-language prompt learning methods as a plug-and-play post-hoc function.
When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing cs.CV · 2026-05-15 · unverdicted · none · ref 32
Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.
Prediction of Rectal Cancer Regrowth from Longitudinal Endoscopy cs.CV · 2026-05-13 · unverdicted · none · ref 51
TREX detects rectal cancer local regrowth from longitudinal endoscopy image pairs with 97% sensitivity and enables early prediction 3-12 months before clinical confirmation, outperforming baselines.
Refresh-Scaling the Memory of Balanced Adam cs.LG · 2026-05-11 · unverdicted · none · ref 1
Setting β in balanced Adam to achieve a refresh count R_β ≈1000 based on effective learning horizon T_ES improves validation robustness over fixed-β baselines across 11 vision and language experiments.
StomaD2: An All-in-One System for Intelligent Stomatal Phenotype Analysis via Diffusion-Based Restoration Detection Network cs.CV · 2026-04-18 · unverdicted · none · ref 26
StomaD2 integrates diffusion-based image restoration with a specialized rotated detection network to achieve high-accuracy stomatal phenotyping across more than 130 plant species.
Weak-to-Strong Knowledge Distillation Accelerates Visual Learning cs.CV · 2026-04-16 · unverdicted · none · ref 7
Weak-to-strong knowledge distillation applied early and then turned off accelerates convergence to target performance in visual learning tasks by factors of 1.7-4.8x.

The American Statistician 36(3a):153–157 Charoenphakdee N, Cui Z, Zhang Y, et al (2021) Classification with rejection based on cost- sensitive classification

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer