Title resolution pending

A Vaswani · 2017

8 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1 · method 1

citation-polarity summary

background 1 · use method 1

representative citing papers

Next-Scale Generative Reranking: A Tree-based Generative Rerank Method at Meituan

cs.IR · 2026-04-07 · unverdicted · novelty 7.0

NSGR is a tree-structured generative reranker that progressively generates optimal lists via next-scale expansion and multi-scale neighbor loss to balance perspectives and align training signals.

Graph Retention Networks for Dynamic Graphs

cs.LG · 2024-11-18 · unverdicted · novelty 7.0

Graph Retention Networks extend retention to dynamic graphs to enable parallelizable training, O(1) inference, and chunkwise long-term training while delivering competitive performance with major efficiency gains.

Towards Localizing Conversation Partners using Head Motion

cs.HC · 2026-04-27 · unverdicted · novelty 6.0 · 2 refs

HALo uses smartglasses IMU head orientation to localize conversation partners' acoustic zones, achieving 21% better performance with known partner count, while CoCo classifies partner numbers at 0.74 accuracy using only IMU data.

FLAME: Condensing Ensemble Diversity into a Single Network for Efficient Sequential Recommendation

cs.IR · 2026-04-05 · conditional · novelty 6.0

FLAME condenses ensemble diversity into a single network via modular ensemble simulation and guided mutual learning during training, delivering ensemble-level performance with single-network inference speed on sequential recommendation tasks.

Boosting Team Modeling through Tempo-Relational Representation Learning

cs.LG · 2025-07-17 · unverdicted · novelty 6.0

A tempo-relational neural architecture jointly models temporal and relational aspects of team interactions to outperform prior approaches on team performance prediction and enable efficient multi-task prediction of team constructs.

AlignedServe: Orchestrating Prefix-aware Batching to Build a High-throughput and Computing-efficient LLM Serving System

cs.DC · 2026-05-22 · unverdicted · novelty 5.0

AlignedServe uses prefix-aware batching, large CPU in-flight request pools, batch scheduling, and GPU-to-GPU KV prefetching to raise decoding throughput up to 1.98x and cut latency up to 7.4x versus prior serving systems.

Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification

cs.MM · 2025-05-16 · unverdicted · novelty 5.0

CDGLT achieves SOTA on MET-Meme for multimodal metaphor identification by using SLERP-based concept drift and prompt-adapted LayerNorm tuning with reduced compute.

CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training

cs.DC · 2026-05-06 · unverdicted · novelty 4.0

CCL-D detects slow/hang anomalies in CCL for distributed training via lightweight tracing probes and an intelligent analyzer, achieving near-complete coverage and 6-minute rank localization on a 4000-GPU cluster over one year.

citing papers explorer

Showing 8 of 8 citing papers.

Next-Scale Generative Reranking: A Tree-based Generative Rerank Method at Meituan
cs.IR · 2026-04-07 · unverdicted · none · ref 35

Graph Retention Networks for Dynamic Graphs
cs.LG · 2024-11-18 · unverdicted · none · ref 36

Towards Localizing Conversation Partners using Head Motion
cs.HC · 2026-04-27 · unverdicted · none · ref 97 · 2 links

FLAME: Condensing Ensemble Diversity into a Single Network for Efficient Sequential Recommendation
cs.IR · 2026-04-05 · conditional · none · ref 49

Boosting Team Modeling through Tempo-Relational Representation Learning
cs.LG · 2025-07-17 · unverdicted · none · ref 137

AlignedServe: Orchestrating Prefix-aware Batching to Build a High-throughput and Computing-efficient LLM Serving System
cs.DC · 2026-05-22 · unverdicted · none · ref 35

Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification
cs.MM · 2025-05-16 · unverdicted · none · ref 35

CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training
cs.DC · 2026-05-06 · unverdicted · none · ref 55
