Title resolution pending

2024 · arXiv 2411.07447

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation

cs.DC · 2026-06-10 · unverdicted · novelty 6.0

Effective LLM inference cost per million output tokens varies 2.5-36x with the offered request rate because of utilization. The paper addresses this with a concurrency-aware measurement methodology and an open-source vLLM tool validated across model types.

Large Databases Need Small, Open-Weight Language Models

cs.AI · 2026-06-30 · unverdicted · novelty 4.0

Quantized open-weight LMs on consumer hardware match closed-source API accuracy for LM-enhanced relational operators while delivering 390x lower cost and 3.8x lower latency in the BlendSQL framework.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation

cs.DC · 2026-06-10 · unverdicted · none · ref 8

Effective LLM inference cost per million output tokens varies 2.5-36x with the offered request rate because of utilization. The paper addresses this with a concurrency-aware measurement methodology and an open-source vLLM tool validated across model types.

Large Databases Need Small, Open-Weight Language Models

cs.AI · 2026-06-30 · unverdicted · none · ref 27

Quantized open-weight LMs on consumer hardware match closed-source API accuracy for LM-enhanced relational operators while delivering 390x lower cost and 3.8x lower latency in the BlendSQL framework.
