Empowering edge intelligence: A comprehensive survey on on-device AI models

Wang, X · 2025 · DOI 10.1145/3724420

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Protecting On-Device AI Inference: A Systematic Review of Attacks and Defence Mechanisms

cs.CR · 2026-05-28 · unverdicted · novelty 6.0

A systematic review of on-device AI inference security finds defenses are imbalanced, with roughly half focused on IP theft while one-third of attacks (adversarial examples) lack any associated defenses.

ReMoE: Boosting Expert Reuse through Router Fine-Tuning in Memory-Constrained MoE LLM Inference

cs.LG · 2026-05-26 · unverdicted · novelty 6.0

Router fine-tuning that biases MoE models toward short-term expert reuse improves cache locality, delivering 26% higher reuse and a 1.77x to 1.99x decode speedup under memory constraints without inference-time overhead.

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

Unsupervised single-generation confidence calibration for reasoning LLMs via offline self-consistency proxy distillation outperforms baselines on math and QA tasks and improves selective prediction.

Position Paper: From Edge AI to Adaptive Edge AI

cs.AR · 2026-03-31 · unverdicted · novelty 5.0

Edge AI systems require ongoing adaptation to evolving data and constraints to avoid violating budgets or losing reliability, formalized via an Agent-System-Environment lens that defines ten future research challenges.

A Comparative Study of CNN Optimization Methods for Edge AI: Exploring the Role of Early Exits

cs.AI · 2026-04-16 · unverdicted · novelty 4.0

Combining pruning, quantization, and early exits in CNNs reduces inference latency and memory on real edge devices with minimal accuracy loss.

Little Brains, Big Feats: Exploring Compact Language Models

cs.CL · 2026-06-29 · unverdicted · novelty 3.0

Small language models can run RAG generation on-device without GPUs in reasonable time.

Real-Time Cellist Postural Evaluation With On-Device Computer Vision

cs.HC · 2026-04-19 · unverdicted · novelty 3.0

Cello Evaluator is a real-time postural feedback system for cellists running on current Android phones via on-device computer vision, validated as user-friendly by experts.
