node2vec: Scalable Feature Learning for Networks
Prediction tasks over nodes and edges in networks require careful effort in engineering features used by learning algorithms. Recent research in the broader field of representation learning has led to significant progress in automating prediction by learning the features themselves. However, present feature learning approaches are not expressive enough to capture the diversity of connectivity patterns observed in networks. Here we propose node2vec, an algorithmic framework for learning continuous feature representations for nodes in networks. In node2vec, we learn a mapping of nodes to a low-dimensional space of features that maximizes the likelihood of preserving network neighborhoods of nodes. We define a flexible notion of a node's network neighborhood and design a biased random walk procedure, which efficiently explores diverse neighborhoods. Our algorithm generalizes prior work which is based on rigid notions of network neighborhoods, and we argue that the added flexibility in exploring neighborhoods is the key to learning richer representations. We demonstrate the efficacy of node2vec over existing state-of-the-art techniques on multi-label classification and link prediction in several real-world networks from diverse domains. Taken together, our work represents a new way for efficiently learning state-of-the-art task-independent representations in complex networks.
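The abstract mentions a biased random walk procedure without spelling it out. In the full paper the walk is second-order and governed by a return parameter p and an in-out parameter q. The following is a minimal sketch of that idea, assuming an unweighted, undirected graph stored as a dictionary of neighbor sets. The function name `biased_walk`, the toy graph, and the parameter values are illustrative assumptions, and this is not the authors' reference implementation, which precomputes transition probabilities for efficient sampling.

```python
import random


def biased_walk(adj, start, length, p=1.0, q=1.0, rng=None):
    """Sample one second-order biased walk of at most `length` nodes.

    adj: dict mapping each node to the set of its neighbors
         (unweighted graph; an assumption made for this sketch).
    p:   return parameter; a small p makes stepping back to the
         previous node more likely.
    q:   in-out parameter; a small q favors moving away from the
         previous node, a large q keeps the walk close to it.
    """
    rng = rng or random.Random()
    walk = [start]
    while len(walk) < length:
        cur = walk[-1]
        nbrs = sorted(adj[cur])  # sorted so a seeded run is reproducible
        if not nbrs:
            break  # dead end: the walk stops early
        if len(walk) == 1:
            # First step has no previous node, so choose uniformly.
            walk.append(rng.choice(nbrs))
            continue
        prev = walk[-2]
        weights = []
        for nxt in nbrs:
            if nxt == prev:            # distance 0 from prev: return
                weights.append(1.0 / p)
            elif nxt in adj[prev]:     # distance 1 from prev: stay local
                weights.append(1.0)
            else:                      # distance 2 from prev: move outward
                weights.append(1.0 / q)
        walk.append(rng.choices(nbrs, weights=weights, k=1)[0])
    return walk


# Hypothetical toy graph used only for illustration.
adj = {
    0: {1, 2},
    1: {0, 2, 3},
    2: {0, 1},
    3: {1, 4},
    4: {3},
}

walk = biased_walk(adj, start=0, length=10, p=0.5, q=2.0, rng=random.Random(0))
print(walk)
```

Walks sampled this way would then be fed to a skip-gram style objective to learn the node embeddings. Setting p = q = 1 reduces the sketch to a uniform random walk.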
Forward citations
Cited by 6 Pith papers
- MediaGraph: A Network Theoretic Framework to Analyze Reporting Preferences in Indian News Media
  MediaGraph uses co-occurrence networks from Indian news on farmer protests and a new link predictability metric to reveal source-specific reporting preferences and under-representation of farmer leaders.
- Fast and Featureless Node Representation Learning with Partial Pairwise Supervision
  Contrastive FUSE learns node embeddings from partial pairwise supervision and structural signals alone by optimizing a spectral contrastive objective with a lightweight modularity approximation, yielding competitive p...
- Real-World Challenges in Fake News Detection: Dealing with Posts by Cold Users
  Cold users dominate fake news datasets, and the User Evidence Network approximates their absent behavior data from existing user interactions to enable robust misinformation detection.
- From Node2Vec to GPT-based GraphRAG: scientific impact prediction across graph and language models
  Directed citation graphs plus textual embeddings reach 0.84-0.85 AUC for top-P% impact classification while GPT-5.5/5.4 Nano prompts hit 0.87 but show no consistent gain from retrieved graph neighborhoods over target-...
- DeepTrax: Embedding Graphs of Financial Transactions
  DeepTrax learns embeddings for accounts and merchants in financial transaction graphs via methods inspired by standard graph embedding techniques, reporting strong link prediction performance and utility in fraud dete...
- Graph Embeddings at Scale
  Presents a distributed infrastructure for scaling skip-gram graph embeddings to 68M-vertex networks by avoiding partitioning, using dynamic size-constrained graphs, and efficient indexing for updates.