Ringmo-agent: A unified remote sensing foundation model for multi-platform and multi-modal reasoning

Huiyang Hu, Peijin Wang, Yingchao Feng, Kaiwen Wei, Wenxin Yin, Wenhui Diao, Mengyu Wang, Hanbo Bi, Kaiyue Kang, Tong Ling, et al · 2025 · arXiv 2507.20776

5 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

GeoMeld: Toward Semantically Grounded Foundation Models for Remote Sensing

cs.CV · 2026-04-12 · unverdicted · novelty 7.0

GeoMeld provides a large-scale aligned multimodal remote sensing dataset with verified semantic captions and a joint pretraining method that improves downstream transfer and cross-sensor robustness in foundation models.

Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems

cs.CV · 2026-01-05 · unverdicted · novelty 7.0

The paper delivers the first comprehensive review and unified taxonomy of agentic AI in remote sensing, covering single-agent copilots, multi-agent systems, planning mechanisms, benchmarks, and a roadmap while noting limitations in grounding and safety.

No One Knows the State of the Art in Geospatial Foundation Models

cs.CV · 2026-05-12 · accept · novelty 6.0

An audit of 152 papers reveals that geospatial foundation models lack standardized evaluations, training controls, and weight releases, so no one knows the state of the art.

MPerS: Dynamic MLLM MixExperts Perception-Guided Remote Sensing Scene Segmentation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

MPerS dynamically mixes semantic guidance from MLLM-generated RS captions with DINOv3 features via MixExperts and Linguistic Query Guided Attention to achieve superior semantic segmentation on three public remote sensing datasets.

Bridging Perception and Action: A Lightweight Multimodal Meta-Planner Framework for Robust Earth Observation Agents

cs.MA · 2026-05-06 · unverdicted · novelty 4.0

The LMMP framework improves tool-calling accuracy and task success rates for Earth observation agents by grounding plans in multimodal features and remote sensing expert knowledge via a two-stage training process.

citing papers explorer


GeoMeld: Toward Semantically Grounded Foundation Models for Remote Sensing
cs.CV · 2026-04-12 · unverdicted · none · ref 8

Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems
cs.CV · 2026-01-05 · unverdicted · none · ref 55

No One Knows the State of the Art in Geospatial Foundation Models
cs.CV · 2026-05-12 · accept · none · ref 32

MPerS: Dynamic MLLM MixExperts Perception-Guided Remote Sensing Scene Segmentation
cs.CV · 2026-05-11 · unverdicted · none · ref 6

Bridging Perception and Action: A Lightweight Multimodal Meta-Planner Framework for Robust Earth Observation Agents
cs.MA · 2026-05-06 · unverdicted · none · ref 29
