pith. sign in

hub

An image is worth 16x16 words: Transformers for image recognition at scale

14 Pith papers cite this work. Polarity classification is still indexing.

14 Pith papers citing it

hub tools

citation-role summary

background 1

citation-polarity summary

years

2026 11 2025 3

roles

background 1

polarities

background 1

clear filters

representative citing papers

The Indra Representation Hypothesis for Multimodal Alignment

cs.CV · 2026-04-06 · unverdicted · novelty 7.0

Unimodal model representations converge to a relational structure captured by the Indra representation via V-enriched Yoneda embedding, which is unique and structure-preserving and improves cross-model and cross-modal robustness when instantiated with angular distance.

Winfree Oscillatory Neural Network

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

WONN is a new oscillatory neural network based on generalized Winfree dynamics that scales competitively to ImageNet-1K and reaches 80.1% accuracy on Maze-hard with 1% of prior model parameters.

Uncertainty-Aware Foundation Models for Clinical Data

cs.LG · 2026-04-05 · unverdicted · novelty 6.0

The work introduces uncertainty-aware foundation models for clinical data by learning set-valued patient representations that enforce consistency across partial observations and integrate multimodal self-supervised objectives.

Vision Transformers Need Better Token Interaction

cs.CV · 2026-05-22 · unverdicted · novelty 5.0

Replacing softmax attention with entmax-1.5 in DINOv1 ViT-S/16 improves semantic segmentation mIoU on three benchmarks while keeping ImageNet linear-probing accuracy unchanged.

Sharpness-Aware Minimization with Z-Score Gradient Filtering

cs.LG · 2025-05-05 · unverdicted · novelty 4.0

Z-Score Filtered SAM retains only high absolute Z-score gradient components per layer during the ascent step and reports higher test accuracy than standard SAM on CIFAR and Tiny-ImageNet benchmarks.

citing papers explorer

Showing 1 of 1 citing paper after filters.