Simulation study shows cold TLB misses in reverse address translation dominate latency for small collectives in multi-GPU pods, causing up to 1.4x degradation, while larger ones see diminishing returns.
ImageNet classific ation with deep convolutional neural networks
9 Pith papers cite this work, alongside 31,060 external citations. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
representative citing papers
Reachability for neural networks is NP-hard for single-hidden-layer networks with output dimension 1 and weights restricted to {-1,0,1}.
AI use in science has grown exponentially since 2015 but stays confined to computer science and statistics topics, shows higher retraction rates and citations, and follows distinct global adoption patterns.
AI peer review systems are vulnerable to prompt injections, prestige biases, assertion strength effects, and contextual poisoning, as demonstrated by a new attack taxonomy and causal experiments on real conference submissions.
AIBuildAI uses a manager agent and three LLM sub-agents to fully automate AI model development and achieves a 63.1% medal rate on MLE-Bench, matching experienced human engineers.
A zero-shot visual world model trained on one child's experience achieves broad competence on physical understanding benchmarks while matching developmental behavioral patterns.
Strategic insertion of Global Average Pooling layers in VGG-16 reduces trainable parameters by 98%, maintains 66.4% ImageNet Top-1 accuracy, doubles translation robustness, and yields superior Spearman correlations in perceptual IQA tasks.
Convolutional neural networks are shown to perform inverse design of thin-film metamaterial stacks by learning the mapping from structure to ellipsometric and reflectance/transmittance spectra, with efficiency gains over traditional optimization as layer count increases.
citing papers explorer
-
Analyzing Reverse Address Translation Overheads in Multi-GPU Scale-Up Pods
Simulation study shows cold TLB misses in reverse address translation dominate latency for small collectives in multi-GPU pods, causing up to 1.4x degradation, while larger ones see diminishing returns.
-
Reachability In Simple Neural Networks
Reachability for neural networks is NP-hard for single-hidden-layer networks with output dimension 1 and weights restricted to {-1,0,1}.
-
When AI Meets Science: Research Diversity, Interdisciplinarity, Visibility, and Retractions across Disciplines in a Global Surge
AI use in science has grown exponentially since 2015 but stays confined to computer science and statistics topics, shows higher retraction rates and citations, and follows distinct global adoption patterns.
-
When AI reviews science: Can we trust the referee?
AI peer review systems are vulnerable to prompt injections, prestige biases, assertion strength effects, and contextual poisoning, as demonstrated by a new attack taxonomy and causal experiments on real conference submissions.
-
AIBuildAI: An AI Agent for Automatically Building AI Models
AIBuildAI uses a manager agent and three LLM sub-agents to fully automate AI model development and achieves a 63.1% medal rate on MLE-Bench, matching experienced human engineers.
-
Zero-shot World Models Are Developmentally Efficient Learners
A zero-shot visual world model trained on one child's experience achieves broad competence on physical understanding benchmarks while matching developmental behavioral patterns.
-
Parameter-Efficient Architectural Modifications for Translation-Invariant CNNs
Strategic insertion of Global Average Pooling layers in VGG-16 reduces trainable parameters by 98%, maintains 66.4% ImageNet Top-1 accuracy, doubles translation robustness, and yields superior Spearman correlations in perceptual IQA tasks.
-
General Inverse Design of Thin-Film Metamaterials With Convolutional Neural Networks
Convolutional neural networks are shown to perform inverse design of thin-film metamaterial stacks by learning the mapping from structure to ellipsometric and reflectance/transmittance spectra, with efficiency gains over traditional optimization as layer count increases.
- Venus-DeFakerOne: Unified Fake Image Detection & Localization