VisionAId is an offline-first Android application that combines six on-device models for depth, segmentation, embeddings, face detection and banknote recognition with a few-shot pipeline that lets users teach the system their personal objects and then guides them to those objects via AR, audio and h
arXiv preprint arXiv:2601.12882
2 Pith papers cite this work. Polarity classification is still indexing.
Fields: cs.CV (2)
Years: 2026 (2)
Citing papers
- YOLO26 vs. YOLOv8: A Comprehensive Architectural Benchmark of Next-Generation Real-Time Object Detection Models
Empirical benchmark finds YOLO26 superior on Pascal VOC accuracy and efficiency but YOLOv8 faster on GPU, with both models struggling similarly on VisDrone small-object detection.