Gramian multimodal representation learning and alignment

1 Pith paper cites this work. Polarity classification is still being indexed.

1 Pith paper citing it

fields

cs.LG 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

Training-Free Multimodal Guidance for Video to Audio Generation

cs.LG · 2025-09-29 · unverdicted · novelty 4.0

Proposes a plug-and-play multimodal diffusion guidance mechanism that improves video-to-audio generation quality and alignment by enforcing unified multimodal coherence on pretrained audio diffusion models.

citing papers explorer

Showing 1 of 1 citing paper.

  • Training-Free Multimodal Guidance for Video to Audio Generation cs.LG · 2025-09-29 · unverdicted · none · ref 24
