arXiv preprint arXiv:2401.15688 (2024) 5

Zhenyu Wang, Enze Xie, Aoxue Li, Zhongdao Wang, Xihui Liu, Zhenguo Li · 2024 · arXiv 2401.15688

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

MetaPoint: Unlocking Precise Spatial Control in Agentic Visual Generation

cs.CV · 2026-06-03 · unverdicted · novelty 7.0

MetaPoint represents 2D coordinates as special tokens in visual generative models to enable precise spatial control using existing positional encodings without architectural modifications.

Divide-and-Conquer Approach to Holistic Cognition in High-Similarity Contexts with Limited Data

cs.CV · 2026-04-21 · unverdicted · novelty 7.0

DHCNet improves ultra-fine-grained visual categorization by progressively building holistic cognition from local discrepancies using self-shuffling and refinement on limited data.

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

cs.CV · 2024-03-08 · unverdicted · novelty 7.0

ELLA introduces a timestep-aware semantic connector to link LLMs with diffusion models for improved dense prompt following, validated on a new 1K-prompt benchmark.

ReGRPO: Reflection-Augmented Policy Optimization for Tool-Using Agents

cs.AI · 2026-06-30 · unverdicted · novelty 6.0

ReGRPO augments group-relative policy optimization with a reflective data engine that generates ErrorType-Evidence-FixPlan triplets from near-miss tool actions to improve recovery in multimodal agents.

GenED-SC: Generative Editing Semantic Communication with Integrated Multi-Modal LLMs

eess.SP · 2026-05-31 · unverdicted · novelty 4.0

A two-stage framework uses JSCC for discriminative transmission of important image regions followed by MLLM-driven generative editing to improve semantic fidelity and perceptual quality under bandwidth limits and varying channel conditions.

citing papers explorer

Showing 5 of 5 citing papers after filters.

MetaPoint: Unlocking Precise Spatial Control in Agentic Visual Generation cs.CV · 2026-06-03 · unverdicted · none · ref 55
MetaPoint represents 2D coordinates as special tokens in visual generative models to enable precise spatial control using existing positional encodings without architectural modifications.
Divide-and-Conquer Approach to Holistic Cognition in High-Similarity Contexts with Limited Data cs.CV · 2026-04-21 · unverdicted · none · ref 54
DHCNet improves ultra-fine-grained visual categorization by progressively building holistic cognition from local discrepancies using self-shuffling and refinement on limited data.
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment cs.CV · 2024-03-08 · unverdicted · none · ref 54
ELLA introduces a timestep-aware semantic connector to link LLMs with diffusion models for improved dense prompt following, validated on a new 1K-prompt benchmark.
ReGRPO: Reflection-Augmented Policy Optimization for Tool-Using Agents cs.AI · 2026-06-30 · unverdicted · none · ref 27
ReGRPO augments group-relative policy optimization with a reflective data engine that generates ErrorType-Evidence-FixPlan triplets from near-miss tool actions to improve recovery in multimodal agents.
GenED-SC: Generative Editing Semantic Communication with Integrated Multi-Modal LLMs eess.SP · 2026-05-31 · unverdicted · none · ref 9
A two-stage framework uses JSCC for discriminative transmission of important image regions followed by MLLM-driven generative editing to improve semantic fidelity and perceptual quality under bandwidth limits and varying channel conditions.

arXiv preprint arXiv:2401.15688 (2024) 5

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer