super hub Mixed citations

Gradient-based learning applied to document recognition

L. Bottou, P. Haffner, Y. Bengio, Y. Lecun · 1998 · Proceedings of the IEEE · DOI 10.1109/5.726791

Mixed citation behavior. Most common role is background (43%).

56 Pith papers citing it

44.7k external citations · Crossref

Background 43% of classified citations

open at publisher browse 56 citing papers more from L. Bottou

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 7 dataset 4 method 3

citation-polarity summary

background 6 use dataset 4 use method 3 support 1

authors

L. Bottou P. Haffner Y. Bengio Y. Lecun

co-cited works

representative citing papers

STRABLE: Benchmarking Tabular Machine Learning with Strings

cs.LG · 2026-05-12 · unverdicted · novelty 8.0

A new corpus of 108 mixed string-numeric tables shows that advanced tabular learners with basic string embeddings perform well on most real-world data, while large LLM encoders help on free-text heavy tables.

Adaptive multi-line fitting for stable line-core intensity and Doppler velocity

astro-ph.SR · 2026-05-20 · conditional · novelty 7.0

LineFit delivers more stable line-core intensity and Doppler velocity time series from complex multi-line solar spectra by combining adaptive windowing, asymmetric Voigt options, and split-core handling, outperforming standard fast estimators on synthetic benchmarks.

Stress-Testing Neural Network Verifiers with Provably Robust Instances

cs.LG · 2026-05-16 · conditional · novelty 7.0

A reusable framework generates verification instances with provably known robustness labels, revealing numeric tolerance issues and bugs in five verifiers while introducing difficulty profiles to diagnose failure modes.

Quantitative Linear Logic for Neuro-Symbolic Learning and Verification

cs.LO · 2026-05-13 · unverdicted · novelty 7.0 · 2 refs

QLL is a novel logic for neuro-symbolic learning that uses ML-native operations (sum, log-sum-exp) on logits to embed constraints, satisfying most linear logic properties and showing stronger correlation between empirical robustness and formal verification than prior approaches.

Estimating Implicit Regularization in Deep Learning

stat.ML · 2026-05-06 · unverdicted · novelty 7.0

Gradient matching empirically recovers implicit regularization effects such as l2 penalties from early stopping and dropout in neural networks.

On the Architectural Complexity of Neural Networks

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

A framework quantifies DNN complexity via tensor operations, links 40 years of breakthroughs to complexity increases, and releases a dataset of 3000+ unexplored high-complexity architectures.

Beyond ECE: Calibrated Size Ratio, Risk Assessment, and Confidence-Weighted Metrics

cs.LG · 2026-05-03 · unverdicted · novelty 7.0

Introduces Calibrated Size Ratio (CSR) and confidence-weighted metrics to better detect overconfidence risk and calibration issues beyond the limitations of ECE.

BRIDGE and TCH-Net: Heterogeneous Benchmark and Multi-Branch Baseline for Cross-Domain IoT Botnet Detection

cs.CR · 2026-04-13 · unverdicted · novelty 7.0

BRIDGE creates the first formal heterogeneous multi-dataset benchmark for IoT botnet detection with LODO evaluation, and TCH-Net achieves mean LODO F1 of 0.5577 while reaching F1 0.8296 on standard tests, outperforming twelve baselines.

Lightweight True In-Pixel Encryption with FeFET Enabled Pixel Design for Secure Imaging

cs.CV · 2026-04-06 · unverdicted · novelty 7.0

SecurePix uses FeFET multidomain polarization states for in-pixel symmetric-key encryption, dropping ResNet-18 accuracy to 9.58% on MNIST and 6.98% on CIFAR-10 while supporting key-based decryption via lookup table.

Dynamic Free-Rider Detection in Federated Learning via Simulated Attack Patterns

cs.LG · 2026-04-06 · unverdicted · novelty 7.0

S2-WEF detects dynamic free-riders in federated learning by simulating attack WEF patterns from prior global models, combining them with mutual deviation scores, and using two-dimensional clustering without proxy data or pre-training.

Multi-Mode Quantum Annealing for Generative Representation Learning with Boltzmann Priors

quant-ph · 2026-04-01 · unverdicted · novelty 7.0

A multi-mode quantum annealing approach enables VAEs with Boltzmann priors, showing faster training and better generation than Gaussian-prior VAEs on MNIST, Fashion-MNIST, and CelebA plus improved out-of-distribution detection.

Selectivity and Shape in the Design of Forward-Forward Goodness Functions

cs.LG · 2026-03-28 · unverdicted · novelty 7.0

Shape- and peak-sensitive goodness functions for Forward-Forward deliver up to 72pp gains over sum-of-squares, reaching 98.2% on MNIST and 89% on Fashion-MNIST.

Stochastic Attention via Langevin Dynamics on the Modern Hopfield Energy

cs.LG · 2026-03-06 · unverdicted · novelty 7.0

Langevin sampling on the modern Hopfield energy produces training-free stochastic attention that transitions from exact retrieval to generation as temperature rises, with an entropy inflection condition marking the shift.

Programmable superconducting neuron with intrinsic in-memory computation and dual-timescale plasticity for ultra-efficient neuromorphic computing

cs.ET · 2026-03-05 · unverdicted · novelty 7.0

A programmable superconducting LIF neuron with intrinsic static memory and dual-timescale plasticity achieves 45 GHz operation and femtojoule energy per spike.

Task complexity shapes internal representations and robustness in neural networks

cs.LG · 2025-08-07 · unverdicted · novelty 7.0

Harder classification tasks produce neural representations whose accuracy collapses under binarization and shuffling while easier tasks remain robust, defining task complexity via the performance gap between full-precision and perturbed networks.

Encrypted Neural Networks without Overflows

cs.CR · 2026-05-21 · unverdicted · novelty 6.0

Introduces formal verification to compute certified neuron range bounds for CKKS-encrypted neural networks, eliminating overflow failures that previously reached 47%.

Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

Derives expectation consistency condition as necessary and sufficient for calibration under covariate shift and proposes ECL loss with matching sample complexity to ECE.

Generative Recursive Reasoning

cs.AI · 2026-05-19 · unverdicted · novelty 6.0 · 2 refs

GRAM is a latent-variable generative model that performs recursive reasoning via stochastic trajectories, trained with amortized variational inference to support multi-hypothesis reasoning and unconditional generation.

The Diffusion Encoder

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

A diffusion model serves as the encoder in an autoencoder when trained alternately with the decoder to resolve opposing update directions while retaining the standard diffusion training objective.

From Clever Hans to Scientific Discovery: Interpreting EEG Foundational Transformers with LRP

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

LRP on EEG transformers reveals Clever Hans artifacts in motor imagery tasks and a recurring central electrode cluster as a candidate sensorimotor signature of arousal.

Instructions Shape Production of Language, not Processing

cs.CL · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

Instructions trigger a production-centered mechanism in language models, with task-specific information stable in input tokens but varying strongly in output tokens and correlating with behavior.

Inducing Spatial Locality in Vision Transformers through the Training Protocol

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

CutMix augmentation during training induces spatial locality in early layers of Vision Transformers trained from scratch, as measured by reduced Mean Attention Distance.

What If We Let Forecasting Forget? A Sparse Bottleneck for Cross-Variable Dependencies

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

MS-FLOW uses a capacity-limited sparse routing mechanism to model only critical inter-variable dependencies in time series data, achieving state-of-the-art accuracy on 12 benchmarks with fewer but more reliable connections.

Flow Matching with Arbitrary Auxiliary Paths

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

AuxPath-FM extends flow matching to arbitrary auxiliary distributions while preserving the continuity equation and marginal training objective.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Adaptive multi-line fitting for stable line-core intensity and Doppler velocity astro-ph.SR · 2026-05-20 · conditional · none · ref 39
LineFit delivers more stable line-core intensity and Doppler velocity time series from complex multi-line solar spectra by combining adaptive windowing, asymmetric Voigt options, and split-core handling, outperforming standard fast estimators on synthetic benchmarks.
Daily Predictions of F10.7 and F30 Solar Indices with Deep Learning astro-ph.SR · 2026-04-11 · unverdicted · none · ref 16
SINet outperforms five prior statistical and deep learning methods on F10.7 predictions and provides the first deep learning forecasts for the F30 solar index.

Gradient-based learning applied to document recognition

hub tools

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer