Mu Cai — Pith Author Registry

Identifiers

name variant Mu Cai 0.60 · backfill

Papers (25)

MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents cs.CV · 2026 · author #4
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models cs.CV · 2026 · author #2
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025 · author #1232
Decomposing Complex Visual Comprehension into Atomic Visual Skills for Vision Language Models cs.CV · 2025 · author #6
Magma: A Foundation Model for Multimodal AI Agents cs.CV · 2025 · author #8
Humanity's Last Exam cs.LG · 2025 · author #837
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models cs.CV · 2024 · author #1
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos cs.CV · 2024 · author #2
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment cs.CV · 2024 · author #3
Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner cs.CV · 2024 · author #4
Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds cs.CV · 2024 · author #1
CHARTOM: A Visual Theory-of-Mind Benchmark for LLMs on Misleading Charts cs.AI · 2024 · author #5
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation cs.CV · 2024 · author #2
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy cs.RO · 2024 · author #9
Yo'LLaVA: Your Personalized Language and Vision Assistant cs.CV · 2024 · author #4
Matryoshka Multimodal Models cs.CV · 2024 · author #1
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples cs.CV · 2024 · author #2
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts cs.CV · 2023 · author #1
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance cs.CV · 2023 · author #4
Investigating the Catastrophic Forgetting in Multimodal Large Language Models cs.CL · 2023 · author #4
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding cs.CV · 2023 · author #1
Out-of-distribution Detection via Frequency-regularized Generative Models cs.LG · 2022 · author #1
Masked Discrimination for Self-Supervised Learning on Point Clouds cs.CV · 2022 · author #2
VOS: Learning What You Don't Know by Virtual Outlier Synthesis cs.LG · 2022 · author #3
Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving cs.CV · 2020 · author #1

Mentions

2408.14419 #5 · arxiv_oai · confidence 0.70 Mu Cai
2505.20021 #6 · arxiv_oai · confidence 0.70 Mu Cai
2502.13130 #8 · arxiv_oai · confidence 0.70 Mu Cai
2406.20095 #9 · arxiv_oai · confidence 0.70 Mu Cai
2406.09400 #4 · arxiv_oai · confidence 0.70 Mu Cai
2410.10818 #1 · arxiv_oai · confidence 0.70 Mu Cai
2410.02763 #2 · arxiv_oai · confidence 0.70 Mu Cai
2409.12963 #4 · arxiv_oai · confidence 0.70 Mu Cai
2410.00905 #3 · arxiv_oai · confidence 0.70 Mu Cai
2409.06827 #1 · arxiv_oai · confidence 0.70 Mu Cai
2407.10972 #2 · arxiv_oai · confidence 0.70 Mu Cai
2405.17430 #1 · arxiv_oai · confidence 0.70 Mu Cai
2306.06094 #1 · arxiv_oai · confidence 0.70 Mu Cai
2402.13254 #2 · arxiv_oai · confidence 0.70 Mu Cai
2312.00784 #1 · arxiv_oai · confidence 0.70 Mu Cai
2309.10313 #4 · arxiv_oai · confidence 0.70 Mu Cai
2309.12530 #4 · arxiv_oai · confidence 0.70 Mu Cai
2208.09083 #1 · arxiv_oai · confidence 0.70 Mu Cai
2203.11183 #2 · arxiv_oai · confidence 0.70 Mu Cai
2202.01197 #3 · arxiv_oai · confidence 0.70 Mu Cai
2011.13611 #1 · arxiv_oai · confidence 0.70 Mu Cai
2605.18652 #4 · arxiv_oai · confidence 0.70 Mu Cai

Frequent Coauthors

Yong Jae Lee 18 shared papers
Jianrui Zhang 5 shared papers
Bocheng Zou 4 shared papers
Haotian Liu 4 shared papers
Yuheng Li 4 shared papers
Jianfeng Gao 3 shared papers
Jianwei Yang 3 shared papers
Yixuan Li 3 shared papers
Eric Chu 2 shared papers
Haohan Wang 2 shared papers
Himanshu Gupta 2 shared papers
Jaden Park 2 shared papers
Johan Ferret 2 shared papers
Julian Salazar 2 shared papers
Long Le 2 shared papers
Reuben Tan 2 shared papers
Robert Geirhos 2 shared papers
Samuel Albanie 2 shared papers
Subhashini Venugopalan 2 shared papers
Summer Yue 2 shared papers