pith. sign in

Wenxuan Wang

Identifiers

  • name variant Wenxuan Wang 0.60 · backfill

Papers (22)

  1. Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL cs.CV · 2026 · author #9
  2. ComPASS: Towards Personalized Agentic Social Support via Tool-Augmented Companionship cs.CL · 2026 · author #5
  3. NVBench: A Benchmark for Speech Synthesis with Non-Verbal Vocalizations cs.SD · 2026 · author #4
  4. RefereeBench: Are Video MLLMs Ready to be Multi-Sport Referees cs.CV · 2026 · author #7
  5. EgoEsportsQA: An Egocentric Video Benchmark for Perception and Reasoning in Esports cs.CV · 2026 · author #5
  6. AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security cs.AI · 2026 · author #27
  7. Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification cs.AI · 2026 · author #5
  8. Probing Multimodal Large Language Models on Cognitive Biases in Chinese Short-Video Misinformation cs.CL · 2026 · author #4
  9. AutoMonitor-Bench: Evaluating the Reliability of LLM-Based Misbehavior Monitor cs.CL · 2026 · author #5
  10. Emu3.5: Native Multimodal Models are World Learners cs.CV · 2025 · author #10
  11. Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards cs.CL · 2025 · author #8
  12. The PIMMUR Principles: Ensuring Validity in Collective Behavior of LLM Societies cs.CL · 2025 · author #7
  13. Beyond the Leaderboard: Rethinking Medical Benchmarks for Large Language Models cs.CL · 2025 · author #7
  14. A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron? cs.CL · 2025 · author #8
  15. DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning cs.CL · 2025 · author #10
  16. Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding cs.CV · 2025 · author #14
  17. Human Cognitive Benchmarks Reveal Foundational Visual Gaps in MLLMs cs.CV · 2025 · author #6
  18. Learning to Ask: When LLM Agents Meet Unclear Instruction cs.CL · 2024 · author #1
  19. Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models cs.SE · 2024 · author #1
  20. A Fine-Grained Facial Expression Database for End-to-End Multi-Pose Facial Expression Recognition cs.CV · 2019 · author #1
  21. UFANS: U-shaped Fully-Parallel Acoustic Neural Structure For Statistical Parametric Speech Synthesis With 20X Faster cs.SD · 2018 · author #4
  22. Pose-Normalized Image Generation for Person Re-identification cs.CV · 2017 · author #4

Mentions

  • 2601.06600 #4 · arxiv_oai · confidence 0.70 Wenxuan Wang
  • 2510.26583 #10 · arxiv_oai · confidence 0.70 Wenxuan Wang
  • 2503.13377 #14 · arxiv_oai · confidence 0.70 Wenxuan Wang
  • 2504.11456 #10 · arxiv_oai · confidence 0.70 Wenxuan Wang

Frequent Coauthors