pith. sign in

Wanlong Fang

Identifiers

  • name variant Wanlong Fang 0.60 · backfill

Papers (16)

  1. Turing Patterns for Multimedia: Reaction-Diffusion Multi-Modal Fusion for Language-Guided Video Moment Retrieval cs.CV · 2026 · author #2
  2. Hierarchical Semantic-Augmented Navigation: Optimal Transport and Graph-Driven Reasoning for Vision-Language Navigation cs.RO · 2026 · author #2
  3. Towards Understanding Modality Interaction in Multimodal Language Models via Partial Information Decomposition cs.AI · 2026 · author #1
  4. SLAP: The Semantic Least Action Principle for Variational Video-Language Modeling cs.CV · 2026 · author #2
  5. Immuno-VLM: Immunizing Large Vision-Language Models via Generative Semantic Antibodies for Open-World Trustworthiness cs.CV · 2026 · author #2
  6. Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding cs.CV · 2026 · author #3
  7. Not All Inputs Are Valid: Towards Open-Set Video Moment Retrieval Using Language cs.CV · 2026 · author #2
  8. Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language cs.CV · 2026 · author #3
  9. CogniVerse: Revolutionizing Multi-Modal Retrieval-Augmented Generation with Cognitive Reflection and Geometric Reasoning cs.CV · 2026 · author #2
  10. Rethinking Video-Language Model from the Language Input Perspective cs.CV · 2026 · author #2
  11. Towards Unified Vision-Language Models with Incomplete Multi-Modal Inputs cs.CV · 2026 · author #2
  12. Disentangling Adversarial Prompts: A Semantic-Graph Defense for Robust LLM Security cs.CR · 2026 · author #2
  13. Unveiling the Fragility of Vision-Language Models: Multi-Modal Adversarial Synergy via Texture-Constrained Perturbations and Cross-Modal Optimization cs.CV · 2026 · author #2
  14. Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective cs.CV · 2026 · author #3
  15. How Creative Are Large Language Models in Generating Molecules? cs.CL · 2026 · author #5
  16. Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network cs.CV · 2024 · author #2

Mentions

  • 2606.01615 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2606.01565 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2606.00959 #1 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.30750 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.30745 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.30742 #3 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.29812 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.29793 #3 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.29602 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.27920 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.27894 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.27823 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.26501 #2 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2605.26441 #3 · arxiv_oai · confidence 0.70 Wanlong Fang
  • 2412.15678 #2 · arxiv_oai · confidence 0.70 Wanlong Fang

Frequent Coauthors