Wenxuan Wang — Pith Author Registry

Identifiers

name variant Wenxuan Wang 0.60 · backfill

Papers (22)

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL cs.CV · 2026 · author #9
ComPASS: Towards Personalized Agentic Social Support via Tool-Augmented Companionship cs.CL · 2026 · author #5
NVBench: A Benchmark for Speech Synthesis with Non-Verbal Vocalizations cs.SD · 2026 · author #4
RefereeBench: Are Video MLLMs Ready to be Multi-Sport Referees cs.CV · 2026 · author #7
EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports cs.CV · 2026 · author #5
AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security cs.AI · 2026 · author #27
Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification cs.AI · 2026 · author #5
Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation cs.CL · 2026 · author #4
AutoMonitor-Bench: Evaluating the Reliability of LLM-Based Misbehavior Monitor cs.CL · 2026 · author #5
Emu3.5: Native Multimodal Models are World Learners cs.CV · 2025 · author #10
Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards cs.CL · 2025 · author #8
The PIMMUR Principles: Ensuring Validity in Collective Behavior of LLM Societies cs.CL · 2025 · author #7
Beyond the Leaderboard: Rethinking Medical Benchmarks for Large Language Models cs.CL · 2025 · author #7
A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron? cs.CL · 2025 · author #8
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning cs.CL · 2025 · author #10
Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding cs.CV · 2025 · author #14
Human Cognitive Benchmarks Reveal Foundational Visual Gaps in MLLMs cs.CV · 2025 · author #6
Learning to Ask: When LLM Agents Meet Unclear Instruction cs.CL · 2024 · author #1
Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models cs.SE · 2024 · author #1
A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition cs.CV · 2019 · author #1
UFANS: U-shaped Fully-Parallel Acoustic Neural Structure For Statistical Parametric Speech Synthesis With 20X Faster cs.SD · 2018 · author #4
Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #4

Mentions

2601.06600 #4 · arxiv_oai · confidence 0.70 Wenxuan Wang
2510.26583 #10 · arxiv_oai · confidence 0.70 Wenxuan Wang
2503.13377 #14 · arxiv_oai · confidence 0.70 Wenxuan Wang
2504.11456 #10 · arxiv_oai · confidence 0.70 Wenxuan Wang

Frequent Coauthors

Jen-tse Huang 7 shared papers
Qin Jin 4 shared papers
Wenxiang Jiao 4 shared papers
Youliang Yuan 4 shared papers
Michael R. Lyu 3 shared papers
Zhaopeng Tu 3 shared papers
Dong Yu 2 shared papers
Haitao Mi 2 shared papers
Jianzhe Ma 2 shared papers
Juluan Shi 2 shared papers
Pinjia He 2 shared papers
Shuai Wang 2 shared papers
Shu Yang 2 shared papers
Xiaoyuan Liu 2 shared papers
Yanwei Fu 2 shared papers
Yichen Xu 2 shared papers
Yuk-Kit Chan 2 shared papers
Zixuan Ling 2 shared papers
Ada Chen 1 shared paper
Beier Zhu 1 shared paper