Saining Xie
Identifiers
- name variant Saining Xie 0.60 · backfill
Papers (26)
- Benchmarking Visual State Tracking in Multimodal Video Understanding cs.CV · 2026 · author #11
- PaintBench: Deterministic Evaluation of Precise Visual Editing cs.GR · 2026 · author #6
- Cambrian-P: Pose-Grounded Video Understanding cs.CV · 2026 · author #8
- Improved Baselines with Representation Autoencoders cs.CV · 2026 · author #6
- Image Generators are Generalist Vision Learners cs.CV · 2026 · author #20
- Self-Refining Video Sampling cs.CV · 2026 · author #4
- Cambrian-S: Towards Spatial Supersensing in Video cs.CV · 2025 · author #15
- Diffusion Transformers with Representation Autoencoders cs.CV · 2025 · author #4
- BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset cs.CV · 2025 · author #9
- Transfer between Modalities with MetaQueries cs.CV · 2025 · author #12
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training cs.AI · 2025 · author #5
- Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps cs.CV · 2025 · author #11
- Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces cs.CV · 2024 · author #6
- MetaMorph: Multimodal Understanding and Generation via Instruction Tuning cs.CV · 2024 · author #9
- Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think cs.CV · 2024 · author #7
- Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs cs.CV · 2024 · author #14
- Demystifying CLIP Data cs.CV · 2023 · author #2
- Scalable Diffusion Models with Transformers cs.CV · 2022 · author #2
- Masked Autoencoders Are Scalable Vision Learners cs.CV · 2021 · author #3
- On Network Design Spaces for Visual Recognition cs.CV · 2019 · author #3
- Exploring Randomly Wired Neural Networks for Image Recognition cs.CV · 2019 · author #1
- Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification cs.CV · 2017 · author #1
- Aggregated Residual Transformations for Deep Neural Networks cs.CV · 2016 · author #1
- Top-Down Learning for Structured Labeling with Convolutional Pseudoprior cs.CV · 2015 · author #1
- Holistically-Nested Edge Detection cs.CV · 2015 · author #1
- Deeply-Supervised Nets stat.ML · 2014 · author #2
Mentions
- 1511.07409 #1 · backfill · confidence 0.70 Saining Xie
- 2604.20329 #20 · arxiv_oai · confidence 0.70 Saining Xie
- 1504.06375 #1 · backfill · confidence 0.70 Saining Xie
- 2606.03920 #11 · arxiv_oai · confidence 0.70 Saining Xie
- 2606.00188 #6 · arxiv_oai · confidence 0.70 Saining Xie
- 1409.5185 #2 · backfill · confidence 0.70 Saining Xie
- 2412.14171 #6 · arxiv_oai · confidence 0.70 Saining Xie
- 2605.22819 #8 · arxiv_oai · confidence 0.70 Saining Xie
- 2601.18577 #4 · arxiv_oai · confidence 0.70 Saining Xie
- 2501.09732 #11 · arxiv_oai · confidence 0.70 Saining Xie
- 2605.18324 #6 · arxiv_oai · confidence 0.70 Saining Xie
- 2511.04670 #15 · arxiv_oai · confidence 0.70 Saining Xie
- 2412.14164 #9 · arxiv_oai · confidence 0.70 Saining Xie
- 2406.16860 #14 · arxiv_oai · confidence 0.70 Saining Xie
- 2309.16671 #2 · arxiv_oai · confidence 0.70 Saining Xie
- 2111.06377 #3 · arxiv_oai · confidence 0.70 Saining Xie
Frequent Coauthors
- Jihan Yang 5 shared papers
- Shengbang Tong 5 shared papers
- Shusheng Yang 5 shared papers
- Zhuowen Tu 5 shared papers
- Ellis Brown 4 shared papers
- Kaiming He 4 shared papers
- Xichen Pan 4 shared papers
- Boyang Zheng 3 shared papers
- Nanye Ma 3 shared papers
- Piotr Doll\'ar 3 shared papers
- Rob Fergus 3 shared papers
- Ross Girshick 3 shared papers
- Yann LeCun 3 shared papers
- Hu Xu 2 shared papers
- Jinwoo Shin 2 shared papers
- Jiuhai Chen 2 shared papers
- Jonathan Huang 2 shared papers
- Li Fei-Fei 2 shared papers
- Pinzhi Huang 2 shared papers
- Sihyun Yu 2 shared papers