Zhe Gan

Identifiers

  • name variant Zhe Gan 0.60 · backfill

Papers (36)

  1. LensVLM: Selective Context Expansion for Compressed Visual Representation of Text cs.CV · 2026 · author #8
  2. Taming Outlier Tokens in Diffusion Transformers cs.CV · 2026 · author #5
  3. Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling cs.CL · 2026 · author #13
  4. UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action cs.CV · 2025 · author #13
  5. MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training cs.CV · 2024 · author #2
  6. Ferret: Refer and Ground Anything Anywhere at Any Granularity cs.CV · 2023 · author #3
  7. GIT: A Generative Image-to-text Transformer for Vision and Language cs.CV · 2022 · author #6
  8. Topic-Guided Variational Autoencoders for Text Generation cs.CL · 2019 · author #2
  9. Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation cs.CL · 2019 · author #5
  10. Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog cs.CV · 2019 · author #1
  11. Improving Sequence-to-Sequence Learning via Optimal Transport cs.CL · 2019 · author #5
  12. StoryGAN: A Sequential Conditional GAN for Story Visualization cs.CV · 2018 · author #2
  13. Sequence Generation with Guider Network cs.CL · 2018 · author #3
  14. Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization cs.CL · 2018 · author #4
  15. JointGAN: Multi-Domain Joint Distribution Learning with Generative Adversarial Nets cs.LG · 2018 · author #3
  16. Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation cs.CV · 2018 · author #2
  17. Multi-Label Learning from Medical Plain Text with Convolutional Residual Models stat.ML · 2018 · author #3
  18. Topic Compositional Neural Language Model cs.LG · 2017 · author #2
  19. AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks cs.CV · 2017 · author #5
  20. Adversarial Symmetric Variational Autoencoder cs.LG · 2017 · author #5
  21. Triangle Generative Adversarial Networks cs.LG · 2017 · author #1
  22. Deconvolutional Paragraph Representation Learning cs.CL · 2017 · author #4
  23. Adversarial Feature Matching for Text Generation stat.ML · 2017 · author #2
  24. Stochastic Gradient Monomial Gamma Sampler stat.ML · 2017 · author #3
  25. VAE Learning via Stein Variational Gradient Descent cs.LG · 2017 · author #2
  26. Character-level Deep Conflation for Business Data Analytics cs.CL · 2017 · author #1
  27. Adaptive DCTNet for Audio Signal Classification cs.SD · 2016 · author #3
  28. Scalable Bayesian Learning of Recurrent Neural Networks for Language Modeling cs.CL · 2016 · author #1
  29. Semantic Compositional Networks for Visual Captioning cs.CV · 2016 · author #1
  30. Learning Generic Sentence Representations Using Convolutional Neural Networks cs.CL · 2016 · author #1
  31. Adaptive Feature Abstraction for Translating Video to Text cs.CV · 2016 · author #3
  32. Unsupervised Learning with Truncated Gaussian Graphical Models stat.ML · 2016 · author #4
  33. Variational Autoencoder for Deep Learning of Images, Labels and Captions stat.ML · 2016 · author #2
  34. Factored Temporal Sigmoid Belief Networks for Sequence Learning stat.ML · 2016 · author #2
  35. Bridging the Gap between Stochastic Gradient MCMC and Stochastic Optimization stat.ML · 2015 · author #3
  36. Deep Temporal Sigmoid Belief Networks for Sequence Modeling stat.ML · 2015 · author #1

Mentions

  • 2510.17790 #13 · arxiv_oai · confidence 0.70 Zhe Gan
  • 2205.14100 #6 · arxiv_oai · confidence 0.70 Zhe Gan
  • 2403.09611 #2 · arxiv_oai · confidence 0.70 Zhe Gan

Frequent Coauthors