Chip placement with deep reinforcement learning

20 Weijian Luo, Tianyang Hu, Shifeng Zhang, Jiacheng Sun, Zhenguo Li, Zhihua Zhang · 2023 · arXiv 2004.10746

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

"Noisier" Noise Contrastive Estimation is (Almost) Maximum Likelihood

cs.LG · 2024-05-27 · unverdicted · novelty 6.0

Scaling noise magnitude in NCE aligns gradients with MLE, enabling a practical approximation that improves performance on CIFAR-10 and ImageNet image modeling with fewer training steps.

From LLM to Silicon: RL-Driven ASIC Architecture Exploration for On-Device AI Inference

cs.AR · 2026-04-08 · unverdicted · novelty 5.0

An RL agent using Soft Actor-Critic with Mixture-of-Experts jointly optimizes ASIC architecture, memory hierarchy, and partitioning for AI inference, achieving 29809 tokens/s for Llama 3.1 at 3nm and under 13mW for SmolVLM across 3-28nm nodes without manual retuning.

citing papers explorer

Showing 2 of 2 citing papers.

"Noisier" Noise Contrastive Estimation is (Almost) Maximum Likelihood · cs.LG · 2024-05-27 · unverdicted · none · ref 7
Scaling noise magnitude in NCE aligns gradients with MLE, enabling a practical approximation that improves performance on CIFAR-10 and ImageNet image modeling with fewer training steps.
From LLM to Silicon: RL-Driven ASIC Architecture Exploration for On-Device AI Inference · cs.AR · 2026-04-08 · unverdicted · none · ref 17
An RL agent using Soft Actor-Critic with Mixture-of-Experts jointly optimizes ASIC architecture, memory hierarchy, and partitioning for AI inference, achieving 29809 tokens/s for Llama 3.1 at 3nm and under 13mW for SmolVLM across 3-28nm nodes without manual retuning.