Rongrong Ji

Identifiers

  • name variant Rongrong Ji 0.60 · backfill

Papers (43)

  1. Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs cs.CV · 2026 · author #5
  2. ForensicConcept: Transferable Forensic Concepts for AIGI Detection cs.CV · 2026 · author #7
  3. Look on Demand: A Cognitive Scheduling Framework for Visual Evidence Acquisition in Multimodal Reasoning cs.AI · 2026 · author #8
  4. A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation cs.AI · 2026 · author #6
  5. HASTE: Training-Free Video Diffusion Acceleration via Head-Wise Adaptive Sparse Attention cs.CV · 2026 · author #5
  6. ALGOGEN: Tool-Generated Verifiable Traces for Reliable Algorithm Visualization cs.AI · 2026 · author #6
  7. Motion-Aware Caching for Efficient Autoregressive Video Generation cs.CV · 2026 · author #8
  8. Prototype-Based Test-Time Adaptation of Vision-Language Models cs.CV · 2026 · author #5
  9. Q-DeepSight: Incentivizing Thinking with Images for Image Quality Assessment and Refinement cs.CV · 2026 · author #9
  10. PixDLM: A Dual-Path Multimodal Language Model for UAV Reasoning Segmentation cs.CV · 2026 · author #7
  11. ID-Selection: Importance-Diversity Based Visual Token Selection for Efficient LVLM Inference cs.CV · 2026 · author #5
  12. ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling cs.CV · 2026 · author #9
  13. GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant cs.CL · 2026 · author #8
  14. Towards Effective Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval cs.CV · 2025 · author #9
  15. Training-Free Multimodal Large Language Model Orchestration cs.CL · 2025 · author #8
  16. UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding cs.AI · 2025 · author #9
  17. VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction cs.CV · 2025 · author #13
  18. MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models cs.CV · 2023 · author #12
  19. Supervised Online Hashing via Similarity Distribution Learning cs.CV · 2019 · author #2
  20. Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning cs.CV · 2019 · author #7
  21. Supervised Online Hashing via Hadamard Codebook Learning cs.CV · 2019 · author #2
  22. Towards Optimal Structured CNN Pruning via Generative Adversarial Learning cs.CV · 2019 · author #2
  23. Aurora Guard: Real-Time Face Anti-Spoofing via Light Reflection cs.CV · 2019 · author #9
  24. Towards Optimal Discrete Online Hashing with Balanced Similarity cs.IR · 2019 · author #2
  25. Towards Compact ConvNets via Structure-Sparsity Regularized Filter Pruning cs.CV · 2019 · author #2
  26. Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression cs.CV · 2018 · author #8
  27. Towards Visual Feature Translation cs.CV · 2018 · author #2
  28. PVRNet: Point-View Relation Neural Network for 3D Shape Recognition cs.CV · 2018 · author #5
  29. Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training cs.CV · 2018 · author #8
  30. Hypergraph Neural Networks cs.LG · 2018 · author #4
  31. PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition cs.CV · 2018 · author #3
  32. CerfGAN: A Compact, Effective, Robust, and Fast Model for Unsupervised Multi-Domain Image-to-Image Translation cs.CV · 2018 · author #6
  33. Face Sketch Synthesis Style Similarity: A New Structure Co-occurrence Texture Measure cs.CV · 2018 · author #6
  34. Asynchronous Bidirectional Decoding for Neural Machine Translation cs.CL · 2018 · author #5
  35. Action-Attending Graphic Neural Network cs.CV · 2017 · author #5
  36. Deep Spatio-temporal Manifold Network for Action Recognition cs.CV · 2017 · author #6
  37. Output Constraint Transfer for Kernelized Correlation Filter in Tracking cs.CV · 2016 · author #8
  38. Ordinal Constrained Binary Code Learning for Nearest Neighbor Search cs.CV · 2016 · author #2
  39. Lattice-Based Recurrent Neural Network Encoders for Neural Machine Translation cs.CL · 2016 · author #4
  40. Supervised Matrix Factorization for Cross-Modality Hashing cs.IR · 2016 · author #2
  41. Variational Neural Discourse Relation Recognizer cs.CL · 2016 · author #5
  42. Video (GIF) Sentiment Analysis using Large-Scale Mid-Level Ontology cs.MM · 2015 · author #3
  43. Robust Nonnegative Matrix Factorization via $L_1$ Norm Regularization cs.LG · 2012 · author #3

Mentions

  • 2606.08511 #5 · arxiv_oai · confidence 0.70 Rongrong Ji
  • 2606.07034 #7 · arxiv_oai · confidence 0.70 Rongrong Ji
  • 1506.00765 #3 · backfill · confidence 0.70 Rongrong Ji
  • 2605.28160 #8 · arxiv_oai · confidence 0.70 Rongrong Ji
  • 2508.10016 #8 · arxiv_oai · confidence 0.70 Rongrong Ji
  • 1204.2311 #3 · backfill · confidence 0.70 Rongrong Ji
  • 2605.17278 #6 · arxiv_oai · confidence 0.70 Rongrong Ji
  • 2501.01957 #13 · arxiv_oai · confidence 0.70 Rongrong Ji

Frequent Coauthors