Native LLM and MLLM Inference at Scale on Apple Silicon

Apple Inc · 2023 · arXiv 2601.19139

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Cross-Family Speculative Decoding for Polish Language Models on Apple Silicon: An Empirical Evaluation of Bielik 11B with UAG-Extended MLX-LM

cs.CL · 2026-03-22 · unverdicted · novelty 6.0

Context-aware cross-family speculative decoding reaches 1.7x speedup on structured Polish text but fails to deliver gains on varied instructions because both models are memory-bandwidth bound on unified memory.

citing papers explorer

Showing 1 of 1 citing paper.

Cross-Family Speculative Decoding for Polish Language Models on Apple Silicon: An Empirical Evaluation of Bielik 11B with UAG-Extended MLX-LM

cs.CL · 2026-03-22 · unverdicted · none · ref 1
