pith. sign in

Title resolution pending

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

fields

cs.LG 4 cs.CV 1

representative citing papers

Dataset Distillation

cs.LG · 2018-11-27 · unverdicted · novelty 8.0

Dataset distillation creates a tiny synthetic training set that, when used with a fixed network initialization, produces models whose performance approximates that of models trained on the full original dataset.

Hyperspherical Forward-Forward with Prototypical Representations

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

HFF replaces binary goodness-of-fit in Forward-Forward with hyperspherical prototypes for direct multi-class decisions, enabling single-forward-pass inference and training that scales to ImageNet while closing much of the gap to backpropagation.

citing papers explorer

Showing 5 of 5 citing papers.

  • Dataset Distillation cs.LG · 2018-11-27 · unverdicted · none · ref 54

    Dataset distillation creates a tiny synthetic training set that, when used with a fixed network initialization, produces models whose performance approximates that of models trained on the full original dataset.

  • Hyperspherical Forward-Forward with Prototypical Representations cs.LG · 2026-04-30 · unverdicted · none · ref 35

    HFF replaces binary goodness-of-fit in Forward-Forward with hyperspherical prototypes for direct multi-class decisions, enabling single-forward-pass inference and training that scales to ImageNet while closing much of the gap to backpropagation.

  • Prototype-Based Test-Time Adaptation of Vision-Language Models cs.CV · 2026-04-23 · unverdicted · none · ref 28

    PTA adapts VLMs at test time by maintaining and updating class-specific knowledge prototypes from test samples, achieving higher accuracy than cache-based methods with far less speed loss.

  • Layer by Layer: Uncovering Hidden Representations in Language Models cs.LG · 2025-02-04 · unverdicted · none · ref 139

    Intermediate layers in LLMs consistently provide stronger features than final layers across tasks and architectures, as quantified by a new framework of information-theoretic, geometric, and invariance metrics.

  • SURGE: Surrogate Gradient Adaptation in Binary Neural Networks cs.LG · 2026-05-09 · unreviewed · ref 65 · 2 links