Bang Yang

Identifiers

  • name variant Bang Yang 0.60 · backfill

Papers (17)

  1. VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding cs.CV · 2024 · author #6
  2. VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework cs.CV · 2024 · author #3
  3. WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs cs.CV · 2024 · author #6
  4. Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning cs.CV · 2024 · author #1
  5. Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models cs.CV · 2023 · author #2
  6. UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework cs.CV · 2023 · author #6
  7. Multi-dimensional vibration sensing and simultaneous self-homodyne optical transmission of single wavelength net 5.36 Tb/s signal using telecom 7-core fiber physics.optics · 2023 · author #3
  8. MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning cs.CV · 2023 · author #1
  9. Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels cs.CV · 2023 · author #1
  10. Customizing General-Purpose Foundation Models for Medical Report Generation cs.CV · 2023 · author #1
  11. Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation cs.CV · 2023 · author #2
  12. ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation cs.CL · 2023 · author #1
  13. Retrieval-Augmented and Knowledge-Grounded Language Models for Faithful Clinical Medicine cs.CL · 2022 · author #2
  14. Graph-in-Graph Network for Automatic Gene Ontology Description Generation cs.AI · 2022 · author #2
  15. CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter cs.CV · 2021 · author #1
  16. O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning cs.CL · 2021 · author #4
  17. Non-Autoregressive Coarse-to-Fine Video Captioning cs.CV · 2019 · author #1

Mentions

  • 2210.12777 #2 · arxiv_oai · confidence 0.70 Bang Yang
  • 2303.06458 #1 · arxiv_oai · confidence 0.70 Bang Yang
  • 2403.09530 #6 · arxiv_oai · confidence 0.70 Bang Yang
  • 2403.09027 #3 · arxiv_oai · confidence 0.70 Bang Yang
  • 2403.07944 #6 · arxiv_oai · confidence 0.70 Bang Yang
  • 2401.17186 #1 · arxiv_oai · confidence 0.70 Bang Yang
  • 2312.03970 #2 · arxiv_oai · confidence 0.70 Bang Yang
  • 2311.10125 #6 · arxiv_oai · confidence 0.70 Bang Yang
  • 2311.06019 #3 · arxiv_oai · confidence 0.70 Bang Yang
  • 2308.13218 #1 · arxiv_oai · confidence 0.70 Bang Yang
  • 2303.15932 #2 · arxiv_oai · confidence 0.70 Bang Yang
  • 2307.01969 #1 · arxiv_oai · confidence 0.70 Bang Yang
  • 2306.05642 #1 · arxiv_oai · confidence 0.70 Bang Yang
  • 2111.15162 #1 · arxiv_oai · confidence 0.70 Bang Yang
  • 2206.05311 #2 · arxiv_oai · confidence 0.70 Bang Yang
  • 2108.02359 #4 · arxiv_oai · confidence 0.70 Bang Yang
  • 1911.12018 #1 · arxiv_oai · confidence 0.70 Bang Yang

Frequent Coauthors