Content-rich AIGC video quality assessment via intricate text alignment and motion-aware consistency

Content-rich AIGC video quality assessment via intricate text alignment and motion-aware consistency · 2023 · arXiv 2502.04076

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

PhyGround: Benchmarking Physical Reasoning in Generative World Models

cs.CV · 2026-05-11 · accept · novelty 7.0

PhyGround is a new benchmark with curated prompts, a 13-law taxonomy, large-scale human annotations, and an open physics-specialized VLM judge for evaluating physical reasoning in generative video models.

Comparison Drives Preference: Reference-Aware Modeling for AI-Generated Video Quality Assessment

cs.CV · 2026-04-18 · unverdicted · novelty 7.0

RefVQA uses a query-centered reference graph and graph-guided difference aggregation to improve AI-generated video quality assessment by incorporating inter-video comparisons.

Navigating User Behavior toward Personalized Multimodal Generation

cs.AI · 2026-06-23 · unverdicted · novelty 6.0

NaviGen encodes user behavior via dual collaborative-textual identifiers and applies SFT+RL to produce personalized multimodal outputs and better instructions from interaction history.

TailorMind: Towards Preference-Aligned Multimodal Content Generation

cs.AI · 2026-06-22 · unverdicted · novelty 5.0

TailorMind links hypergraph collaborative filtering and textual gradient descent with multimodal generation to produce user-tailored content, showing gains in novelty, aesthetics, and reranking recall on a new benchmark from three platforms.

MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models

cs.CV · 2025-11-23 · unverdicted · novelty 5.0

MASS adds spatiotemporal motion signals and 3D grounding to VLMs and releases MASS-Bench, yielding physics-reasoning performance within 2% of Gemini-2.5-Flash after reinforcement fine-tuning.

citing papers explorer

Showing 5 of 5 citing papers.

PhyGround: Benchmarking Physical Reasoning in Generative World Models · cs.CV · 2026-05-11 · accept · none · ref 37
Comparison Drives Preference: Reference-Aware Modeling for AI-Generated Video Quality Assessment · cs.CV · 2026-04-18 · unverdicted · none · ref 36
Navigating User Behavior toward Personalized Multimodal Generation · cs.AI · 2026-06-23 · unverdicted · none · ref 3
TailorMind: Towards Preference-Aligned Multimodal Content Generation · cs.AI · 2026-06-22 · unverdicted · none · ref 3
MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models · cs.CV · 2025-11-23 · unverdicted · none · ref 44
