Attention is all you need · 2017

Canonical reference. All classified citations from Pith papers cite this work as background.

20 Pith papers citing it

Background 100% of classified citations

citation-role summary

background 5

citation-polarity summary

background 5

representative citing papers

Efficient and Adaptive Human Activity Recognition via LLM Backbones

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Pretrained LLMs adapted via convolutional projections and LoRA act as efficient frozen backbones for sensor-based human activity recognition, delivering strong data efficiency and cross-dataset transfer.

Joint Fullband-Subband Modeling for High-Resolution SingFake Detection

cs.SD · 2026-04-06 · unverdicted · novelty 7.0

A joint fullband-subband model using high-resolution 44.1 kHz audio outperforms standard 16 kHz detectors for singing voice deepfake detection by exploiting spectrum-specific synthesis artifacts.

MDS-DETR: DETR with Masked Duplicate Suppressor

cs.CV · 2026-05-22 · unverdicted · novelty 6.0

MDS-DETR introduces a masked duplicate suppressor in self-attention to enable one-to-many supervision inside a single decoder, yielding +2.8 mAP over Deformable-DETR on COCO with 5% more training time and outperforming MR.DETR by 0.3 mAP while training 20% faster.

MSACT: Multistage Spatial Alignment for Stable Low-Latency Fine Manipulation

cs.RO · 2026-05-01 · unverdicted · novelty 6.0

MSACT improves localization stability and task success rates in limited-data bimanual manipulation by extracting stable 2D attention points and aligning predicted attention sequences across frames without keypoint labels.

Stereo Multistage Spatial Attention for Real-Time Mobile Manipulation Under Visual Scale Variation and Disturbances

cs.RO · 2026-05-01 · unverdicted · novelty 6.0

A stereo multistage spatial attention deep predictive learning system improves robustness and success rates for real-time mobile manipulation under visual scale variation and disturbances.

Diffusion Sequence Models for Generative In-Context Meta-Learning of Robot Dynamics

cs.LG · 2026-04-15 · unverdicted · novelty 6.0

Diffusion models for in-context meta-learning of robot dynamics outperform deterministic Transformers in robustness to distribution shifts while enabling real-time operation via warm-started sampling.

Unsupervised Equivalent Contrastive Learning for Radio Signal Recognition

eess.SP · 2026-04-13 · unverdicted · novelty 6.0

Unsupervised contrastive learning with multi-domain equivalent transformations produces robust radio signal embeddings that outperform baselines in few-shot and cross-domain settings.

Contrastive Feedback Mechanism for Simultaneous Speech Translation

cs.CL · 2024-07-30 · unverdicted · novelty 6.0

CFM uses unstable predictions via contrastive learning to improve SST quality on 3 decision policies and 8 languages in MuST-C v1.0.

MTA-RL: Robust Urban Driving via Multi-modal Transformer-based 3D Affordances and Reinforcement Learning

cs.CV · 2026-05-11 · unverdicted · novelty 5.0

MTA-RL predicts 3D driving affordances from multi-modal sensors with a transformer and uses them as the observation space for an RL policy, yielding better route completion and generalization than baselines in CARLA urban scenarios.

Progressive Semantic Communication for Efficient Edge-Cloud Vision-Language Models

cs.LG · 2026-04-29 · unverdicted · novelty 5.0

A Meta AutoEncoder framework enables adaptive, progressive compression of visual features for low-latency edge-cloud VLM inference without model fine-tuning.

Regularized Entropy Information Adaptation with Temporal-Awareness Networks for Simultaneous Speech Translation

cs.LG · 2026-04-10 · unverdicted · novelty 5.0

REINA-SAN and REINA-TAN add temporal context to information-based read/write policies, improving the quality-latency tradeoff in simultaneous speech translation by up to 7.1% on Normalized Streaming Efficiency.

Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping

cs.RO · 2025-11-03 · unverdicted · novelty 5.0

A rectified flow model trained on 30 actuation-space demonstrations produces control sequences that yield 97.5% grasp success across the workspace, with generalization to object size changes of ±33% and execution speed scaling from 20% to 200%.

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving

cs.CV · 2025-07-05 · unverdicted · novelty 5.0

DIVER uses RL-guided diffusion to produce diverse feasible trajectories from one ground-truth path, addressing mode collapse in imitation learning for autonomous driving.

MSDformer: Multi-scale Discrete Transformer For Time Series Generation

cs.LG · 2025-05-20 · unverdicted · novelty 5.0

MSDformer introduces a multi-scale discrete transformer that tokenizes time series at multiple scales and models them autoregressively in discrete space, claiming superior performance over prior DTM methods with rate-distortion theoretical support.

Conditional Flow-VAE for Safety-Critical Traffic Scenario Generation

cs.RO · 2026-05-06 · unverdicted · novelty 4.0

A conditional flow matching model generates realistic safety-critical traffic scenarios by turning nominal scenes into dangerous rollouts using combined simulation and real data.

From Prompts to Pavement: LMMs-based Agentic Behavior-Tree Generation Framework for Autonomous Vehicles

cs.CV · 2026-01-18 · unverdicted · novelty 4.0

An agentic LLM/LVM framework generates adaptive behavior trees on-the-fly for AV navigation in CARLA+Nav2 simulation, succeeding in obstacle avoidance where static BTs fail.

A Comprehensive Survey on Network Traffic Synthesis: From Statistical Models to Deep Learning

cs.NI · 2025-06-23 · unverdicted · novelty 4.0

A survey reviewing statistical and deep learning approaches to synthetic network traffic generation, with comparisons, an AI comparison tool, open challenges, and future directions.

Sustainable Code Generation Using Large Language Models: A Systematic Literature Review

cs.SE · 2026-03-01 · unverdicted · novelty 3.0

A systematic review finds research on the sustainability of LLM-generated code to be limited, fragmented, and without accepted frameworks for measurement or benchmarking.

Redefining End-of-Life: Intelligent Automation for Electronics Remanufacturing Systems

eess.SY · 2026-04-03 · unverdicted · novelty 2.0

A literature review of intelligent automation approaches using robotics, AI, and control for disassembly, inspection, sorting, and reprocessing of end-of-life electronics.

Adaptive Head Budgeting for Efficient Multi-Head Attention

cs.LG · 2026-04-24

citing papers explorer

Showing 20 of 20 citing papers.

Efficient and Adaptive Human Activity Recognition via LLM Backbones
cs.LG · 2026-05-12 · unverdicted · none · ref 7

Joint Fullband-Subband Modeling for High-Resolution SingFake Detection
cs.SD · 2026-04-06 · unverdicted · none · ref 37

MDS-DETR: DETR with Masked Duplicate Suppressor
cs.CV · 2026-05-22 · unverdicted · none · ref 36

MSACT: Multistage Spatial Alignment for Stable Low-Latency Fine Manipulation
cs.RO · 2026-05-01 · unverdicted · none · ref 23

Stereo Multistage Spatial Attention for Real-Time Mobile Manipulation Under Visual Scale Variation and Disturbances
cs.RO · 2026-05-01 · unverdicted · none · ref 29

Diffusion Sequence Models for Generative In-Context Meta-Learning of Robot Dynamics
cs.LG · 2026-04-15 · unverdicted · none · ref 11

Unsupervised Equivalent Contrastive Learning for Radio Signal Recognition
eess.SP · 2026-04-13 · unverdicted · none · ref 27

Contrastive Feedback Mechanism for Simultaneous Speech Translation
cs.CL · 2024-07-30 · unverdicted · none · ref 25

MTA-RL: Robust Urban Driving via Multi-modal Transformer-based 3D Affordances and Reinforcement Learning
cs.CV · 2026-05-11 · unverdicted · none · ref 20

Progressive Semantic Communication for Efficient Edge-Cloud Vision-Language Models
cs.LG · 2026-04-29 · unverdicted · none · ref 16

Regularized Entropy Information Adaptation with Temporal-Awareness Networks for Simultaneous Speech Translation
cs.LG · 2026-04-10 · unverdicted · none · ref 28

Lightweight Learning from Actuation-Space Demonstrations via Flow Matching for Whole-Body Soft Robotic Grasping
cs.RO · 2025-11-03 · unverdicted · none · ref 38

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving
cs.CV · 2025-07-05 · unverdicted · none · ref 22

MSDformer: Multi-scale Discrete Transformer For Time Series Generation
cs.LG · 2025-05-20 · unverdicted · none · ref 61

Conditional Flow-VAE for Safety-Critical Traffic Scenario Generation
cs.RO · 2026-05-06 · unverdicted · none · ref 35

From Prompts to Pavement: LMMs-based Agentic Behavior-Tree Generation Framework for Autonomous Vehicles
cs.CV · 2026-01-18 · unverdicted · none · ref 8

A Comprehensive Survey on Network Traffic Synthesis: From Statistical Models to Deep Learning
cs.NI · 2025-06-23 · unverdicted · none · ref 156

Sustainable Code Generation Using Large Language Models: A Systematic Literature Review
cs.SE · 2026-03-01 · unverdicted · none · ref 1

Redefining End-of-Life: Intelligent Automation for Electronics Remanufacturing Systems
eess.SY · 2026-04-03 · unverdicted · none · ref 215

Adaptive Head Budgeting for Efficient Multi-Head Attention
cs.LG · 2026-04-24 · unreviewed · ref 1
