Tong Yu — Pith Author Registry

Identifiers

name variant Tong Yu 0.60 · backfill

Papers (17)

F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking cs.LG · 2026 · author #8
OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents cs.AI · 2026 · author #6
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization cs.LG · 2026 · author #6
FERA: Uncertainty-Aware Federated Reasoning for Large Language Models cs.CL · 2026 · author #6
Skill-R1: Agent Skill Evolution via Reinforcement Learning cs.LG · 2026 · author #7
Skill-CMIB: Multimodal Agent Skill for Consistent Action via Conditional Multimodal Information Bottleneck cs.LG · 2026 · author #3
A Survey on LLM-based Conversational User Simulation cs.CL · 2026 · author #17
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning cs.LG · 2026 · author #17
CachePrune: Teaching LLMs What Not to Follow via KV-Cache Editing cs.CR · 2025 · author #4
Federated Large Language Models: Current Progress and Future Directions cs.LG · 2024 · author #6
Figure Captioning with Reasoning and Sequence-Level Training cs.CV · 2019 · author #6
Privacy Partitioning: Protecting User Data During the Deep Learning Inference Phase cs.CR · 2018 · author #4
Superconductivity in Li6P electride cond-mat.supr-con · 2018 · author #3
Understanding and Improving Recurrent Networks for Human Activity Recognition by Continuous Attention cs.LG · 2018 · author #3
Semi-Supervised Convolutional Neural Networks for Human Activity Recognition cs.LG · 2018 · author #2
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models cs.CL · 2017 · author #2
SpectralLeader: Online Spectral Learning for Single Topic Models cs.LG · 2017 · author #1

Mentions

2409.15723 #6 · arxiv_oai · confidence 0.70 Tong Yu

Frequent Coauthors

Junda Wu 10 shared papers
Julian McAuley 9 shared papers
Jingbo Shang 5 shared papers
Rohan Surana 5 shared papers
Xintong Li 5 shared papers
Lina Yao 4 shared papers
Ole J. Mengshoel 4 shared papers
Ruiyi Zhang 4 shared papers
Ryan Rossi 4 shared papers
Ian Lane 3 shared papers
Jiawei Han 3 shared papers
Sheldon Yu 3 shared papers
Sizhe Zhou 3 shared papers
Sungchul Kim 3 shared papers
Yu Xia 3 shared papers
Zihan Huang 3 shared papers
Bowen Jin 2 shared papers
Branislav Kveton 2 shared papers
Chengkai Huang 2 shared papers
Chuhan Wang 2 shared papers