pith. sign in

Sreyan Ghosh

Identifiers

  • name variant Sreyan Ghosh 0.60 · backfill

Papers (9)

  1. FIGMA: Towards FIne-Grained Music retrievAl cs.SD · 2026 · author #3
  2. Cosmos 3: Omnimodal World Models for Physical AI cs.CV · 2026 · author #80
  3. Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence cs.LG · 2026 · author #61
  4. Video-Robin: Autoregressive Diffusion Planning for Intent-Grounded Video-to-Music Generation cs.SD · 2026 · author #7
  5. Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music cs.SD · 2026 · author #1
  6. Do Audio-Visual Large Language Models Really See and Hear? cs.AI · 2026 · author #4
  7. Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception cs.SD · 2026 · author #11
  8. Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models cs.SD · 2025 · author #2
  9. MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark eess.AS · 2024 · author #8

Mentions

  • 2606.06615 #3 · arxiv_oai · confidence 0.70 Sreyan Ghosh
  • 2606.02800 #80 · arxiv_oai · confidence 0.70 Sreyan Ghosh
  • 2601.09413 #11 · arxiv_oai · confidence 0.70 Sreyan Ghosh

Frequent Coauthors