Attention is all you need,

· 2017

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Hybrid JIT-CUDA Graph Optimization for Low-Latency Large Language Model Inference

cs.LG · 2026-04-25 · unverdicted · novelty 5.0

A hybrid JIT-CUDA Graph framework reduces TTFT by up to 66% and P99 latency versus TensorRT-LLM for single-GPU LLaMA-2 7B inference on short prompts.

Drift-Aware Online Dynamic Learning for Nonstationary Multivariate Time Series: Application to Sintering Quality Prediction

cs.LG · 2026-04-10 · unverdicted · novelty 4.0

DA-MSDL maintains predictive performance on drifting multivariate time series by detecting distribution shifts without labels and adapting via prioritized replay and hierarchical fine-tuning.

citing papers explorer

Showing 2 of 2 citing papers.

Hybrid JIT-CUDA Graph Optimization for Low-Latency Large Language Model Inference cs.LG · 2026-04-25 · unverdicted · none · ref 25
A hybrid JIT-CUDA Graph framework reduces TTFT by up to 66% and P99 latency versus TensorRT-LLM for single-GPU LLaMA-2 7B inference on short prompts.
Drift-Aware Online Dynamic Learning for Nonstationary Multivariate Time Series: Application to Sintering Quality Prediction cs.LG · 2026-04-10 · unverdicted · none · ref 39
DA-MSDL maintains predictive performance on drifting multivariate time series by detecting distribution shifts without labels and adapting via prioritized replay and hierarchical fine-tuning.

Attention is all you need,

fields

years

verdicts

representative citing papers

citing papers explorer