Title resolution pending

LLM · 2022

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

ZAYA1-8B Technical Report

cs.AI · 2026-05-06 · unverdicted · novelty 6.0

ZAYA1-8B is a reasoning MoE model with 700M active parameters that matches larger models on math and coding benchmarks and reaches 91.9% on AIME'25 via Markovian RSA test-time compute.

Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

cs.CL · 2024-06-16 · unverdicted · novelty 6.0

Quest speeds up long-context LLM self-attention by up to 2.23x via query-dependent selection of top-K critical KV cache pages, cutting overall latency by 7.03x with negligible accuracy loss.

Orbax: Distributed Checkpointing with JAX

cs.DC · 2026-05-21

Stability Implies Redundancy: Delta Attention Selective Halting for Efficient Long-Context Prefilling

cs.AI · 2026-04-20

citing papers explorer

Showing 4 of 4 citing papers.

ZAYA1-8B Technical Report cs.AI · 2026-05-06 · unverdicted · none · ref 130
ZAYA1-8B is a reasoning MoE model with 700M active parameters that matches larger models on math and coding benchmarks and reaches 91.9% on AIME'25 via Markovian RSA test-time compute.
Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference cs.CL · 2024-06-16 · unverdicted · none · ref 46
Quest speeds up long-context LLM self-attention by up to 2.23x via query-dependent selection of top-K critical KV cache pages, cutting overall latency by 7.03x with negligible accuracy loss.
Orbax: Distributed Checkpointing with JAX cs.DC · 2026-05-21 · unreviewed · ref 15
Stability Implies Redundancy: Delta Attention Selective Halting for Efficient Long-Context Prefilling cs.AI · 2026-04-20 · unreviewed · ref 24

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer