Graph is a Substrate Across Data Modalities

Chuxu Zhang; Jiazheng Li; Jinhe Bi; Xiaoming Wu; Yanfang Ye; Yijun Tian; Yunpu Ma; Zehong Wang; Ziming Li

arxiv: 2601.22384 · v2 · pith:7URYAVRGnew · submitted 2026-01-29 · 💻 cs.LG · cs.AI

Graph is a Substrate Across Data Modalities

Ziming Li , Xiaoming Wu , Zehong Wang , Jiazheng Li , Yijun Tian , Jinhe Bi , Yunpu Ma , Yanfang Ye

show 1 more author

Chuxu Zhang

This is my paper

classification 💻 cs.LG cs.AI

keywords graphacrossmodalitiesstructureg-substratelearningtasksrepresentations

0 comments

read the original abstract

Graphs provide a natural representation of relational structure that arises across diverse domains. Despite this ubiquity, graph structure is typically learned in a modality- and task-isolated manner, where graph representations are constructed within individual task contexts and discarded thereafter. As a result, structural regularities across modalities and tasks are repeatedly reconstructed rather than accumulated at the level of intermediate graph representations. This motivates a representation-learning question: how should graph structure be organized so that it can persist and accumulate across heterogeneous modalities and tasks? We adopt a representation-centric perspective in which graph structure is treated as a structural substrate that persists across learning contexts. To instantiate this perspective, we propose G-Substrate, a graph substrate framework that organizes learning around shared graph structures. G-Substrate comprises two complementary mechanisms: a unified structural schema that ensures compatibility among graph representations across heterogeneous modalities and tasks, and an interleaved role-based training strategy that exposes the same graph structure to multiple functional roles during learning. Experiments across multiple domains, modalities, and tasks show that G-Substrate outperforms task-isolated and naive multi-task learning methods. The codebase, model, and datasets are available at https://github.com/zmli6/G-Substrate.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

On the Safety of Graph Representation Learning
cs.LG 2026-05 unverdicted novelty 7.0

GRL-Safety benchmark shows that safety in graph representation learning depends on interactions between method design and specific graph stresses rather than broad method families.
Why Semantic Entropy Fails: Geometry-Aware and Calibrated Uncertainty for Policy Optimization
cs.LG 2026-05 unverdicted novelty 5.0

Identifies two gaps in entropy-based uncertainty for LLM post-training and proposes GCPO to align geometry-aware disagreement measures with reward-based calibration for better gradient regulation.