Ollama: Get up and running with large language models locally, 2024

Ollama · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Local-Splitter: A Measurement Study of Seven Tactics for Reducing Cloud LLM Token Usage on Coding-Agent Workloads

cs.DC · 2026-04-14 · unverdicted · novelty 6.0

Combining local routing with prompt compression saves 45-79% cloud tokens on edit and explanation workloads, while a fuller set including draft-review saves 51% on RAG-heavy tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Local-Splitter: A Measurement Study of Seven Tactics for Reducing Cloud LLM Token Usage on Coding-Agent Workloads cs.DC · 2026-04-14 · unverdicted · none · ref 17
Combining local routing with prompt compression saves 45-79% cloud tokens on edit and explanation workloads, while a fuller set including draft-review saves 51% on RAG-heavy tasks.

Ollama: Get up and running with large language models locally, 2024

fields

years

verdicts

representative citing papers

citing papers explorer