← back to paper
arxiv: 2606.29629 · 2 revisions
Energy-Efficient Multimodal Inference Serving with Tri-serve