Deep Learning

Ian Goodfellow, Yoshua Bengio, Aaron Courville · 2016

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

PaLI: A Jointly-Scaled Multilingual Language-Image Model

cs.CV · 2022-09-14 · conditional · novelty 7.0

PaLI jointly scales a 4B-parameter vision transformer with language models on a new 10B multilingual image-text dataset to reach state-of-the-art results on vision-language tasks while keeping a simple modular design.

SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning

cs.LG · 2025-08-07 · unverdicted · novelty 6.0

SPaCe uses semantic clustering to shrink training sets and a multi-armed bandit to adaptively select samples, matching or beating baselines on reasoning benchmarks with up to 100x fewer examples.

Bayesian Surrogate Training on Multiple Data Sources: A Hybrid Modeling Strategy

stat.ML · 2024-12-16 · unverdicted · novelty 6.0

Two hybrid Bayesian surrogate training approaches integrate simulation and real-world data via a weighting strategy independent of surrogate family, shown in synthetic and real case studies to improve accuracy and diagnose simulation issues.

Hierarchical Reasoning Model

cs.AI · 2025-06-26 · unverdicted · novelty 5.0

HRM is a recurrent architecture with high-level planning and low-level execution modules that reaches near-perfect accuracy on complex Sudoku, maze navigation, and ARC benchmarks using 27M parameters and 1000 samples without pre-training or CoT supervision.

citing papers explorer

Showing 4 of 4 citing papers.

PaLI: A Jointly-Scaled Multilingual Language-Image Model cs.CV · 2022-09-14 · conditional · none · ref 28
PaLI jointly scales a 4B-parameter vision transformer with language models on a new 10B multilingual image-text dataset to reach state-of-the-art results on vision-language tasks while keeping a simple modular design.
SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning cs.LG · 2025-08-07 · unverdicted · none · ref 7
SPaCe uses semantic clustering to shrink training sets and a multi-armed bandit to adaptively select samples, matching or beating baselines on reasoning benchmarks with up to 100x fewer examples.
Bayesian Surrogate Training on Multiple Data Sources: A Hybrid Modeling Strategy stat.ML · 2024-12-16 · unverdicted · none · ref 11
Two hybrid Bayesian surrogate training approaches integrate simulation and real-world data via a weighting strategy independent of surrogate family, shown in synthetic and real case studies to improve accuracy and diagnose simulation issues.
Hierarchical Reasoning Model cs.AI · 2025-06-26 · unverdicted · none · ref 1
HRM is a recurrent architecture with high-level planning and low-level execution modules that reaches near-perfect accuracy on complex Sudoku, maze navigation, and ARC benchmarks using 27M parameters and 1000 samples without pre-training or CoT supervision.

Deep Learning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer