EO-Gym supplies an executable multimodal environment and 9k-trajectory benchmark that turns Earth Observation into a tool-using, multi-step reasoning task, revealing that current VLMs struggle on temporal and cross-sensor workflows while fine-tuning lifts Pass@3 from 0.49 to 0.74.
EarthGPT: A universal multi -modal large language model for multi-sensor image comprehension in remote sensing domain
4 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 4verdicts
UNVERDICTED 4representative citing papers
CoastlineVLM-7B, a 7B VLM fine-tuned from LLaVA/GeoChat, jointly detects coastline presence, classifies proxies, and outputs polylines, reducing Hausdorff distance to 31.84 m and EMD to 17.32 m versus segmentation baselines on NZCCD.
Introduces the SMART-HC-VQA dataset with 65k single-image and 2.3M temporal VQA examples plus an adapted LLaVA-NeXT MLLM framework for geospatial-temporal sensemaking of remote sensing construction activity.
A scoping review of physics-informed machine learning for seismic wave propagation finds applications in forward and inverse problems with often comparable accuracy at lower cost, while identifying gaps in benchmarking, training cost, and 3D/experimental validation.
citing papers explorer
-
EO-Gym: A Multimodal, Interactive Environment for Earth Observation Agents
EO-Gym supplies an executable multimodal environment and 9k-trajectory benchmark that turns Earth Observation into a tool-using, multi-step reasoning task, revealing that current VLMs struggle on temporal and cross-sensor workflows while fine-tuning lifts Pass@3 from 0.49 to 0.74.
-
Geometric Coastline Localization using Vision-Language Models
CoastlineVLM-7B, a 7B VLM fine-tuned from LLaVA/GeoChat, jointly detects coastline presence, classifies proxies, and outputs polylines, reducing Hausdorff distance to 31.84 m and EMD to 17.32 m versus segmentation baselines on NZCCD.
-
Geospatial-Temporal Sensemaking of Remote Sensing Activity Detections with Multimodal Large Language Model
Introduces the SMART-HC-VQA dataset with 65k single-image and 2.3M temporal VQA examples plus an adapted LLaVA-NeXT MLLM framework for geospatial-temporal sensemaking of remote sensing construction activity.
-
A Scoping Review of Physics Informed Machine Learning for Wave Propagation Modeling in Seismology
A scoping review of physics-informed machine learning for seismic wave propagation finds applications in forward and inverse problems with often comparable accuracy at lower cost, while identifying gaps in benchmarking, training cost, and 3D/experimental validation.