& Daoudi, A. Real-Time Flying Object Detection with YOLOv8. 2023.
10 Pith papers cite this work.
citing papers explorer
- An Empirical Study of Validating Synthetic Data for Text-Based Person Retrieval
  Empirical study of a fully synthetic data generation pipeline for text-based person retrieval that tests its use as a replacement or augmentation for real data across scenarios.
- XAMI -- A Benchmark Dataset for Artefact Detection in XMM-Newton Optical Images
  Introduces the XAMI benchmark dataset of 1000 annotated XMM-Newton images for artefact detection together with a hybrid CNN-transformer instance segmentation demonstration.
- Low-Cost Stereo Vision for Robust 3D Positioning of Thin Radiata Pine Branches in Autonomous Drone Pruning
  A drone-mounted stereo camera pipeline with YOLO segmentation, deep stereo depth, centroid triangulation, and MAD outlier rejection achieves robust 3D positioning of thin pine branches at 1-2 m distances.
- AnatomicalNets: A Multi-Structure Segmentation and Contour-Based Distance Estimation Pipeline for Clinically Grounded Lung Cancer T-Staging
  AnatomicalNets segments lung structures and computes tumor size and proximity via contours to reach 91.36% T-staging accuracy on Lung-PET-CT-Dx following clinical guidelines.
- WildfireVLM: AI-powered Analysis for Early Wildfire Detection and Risk Assessment Using Satellite Imagery
  WildfireVLM integrates YOLOv12 object detection on satellite imagery with multimodal LLMs to detect wildfires and produce contextual risk assessments and response recommendations.
- AI-assisted radiographic analysis in detecting alveolar bone-loss severity and patterns
  A deep learning pipeline with YOLOv8 and Keypoint R-CNN achieves ICC up to 0.80 for bone loss severity and 87% accuracy for horizontal vs. angular pattern classification on 1000 annotated IOPA radiographs.
- Large Language Model-Brained GUI Agents: A Survey
  A survey consolidating frameworks, data practices, large action models, benchmarks, applications, and research gaps in LLM-brained GUI agents.
- HyDRA Scorpion: A Cost-effective and Modular ROV for Real-Time Underwater Inspection, Intervention, and Object Detection
  HyDRA Scorpion is a low-cost, 4-DoF ROV with dual manipulators and AI perception that achieves 0.89 mAP object detection and stable operation at simulated depths up to 304.8 m.
- Positioning radiata pine branches requiring pruning by drone stereo vision
  Drone stereo vision pipeline segments pine branches with YOLO variants and estimates depth with deep stereo networks, yielding more coherent maps than SGBM at 1-2 m distances.
- Multimodal Contextualized Support for Enhancing Video Retrieval System
  Proposes a multimodal pipeline for video retrieval that incorporates information from multiple frames to enable higher-level abstraction beyond single-image object detection.