ASTRA disentangles subject identity from pose structure in diffusion transformers via retrieval-augmented pose guidance, asymmetric EURoPE embeddings, and a DSM adapter to improve multi-subject generation.
Imagerag: Dynamic image retrieval for reference-guided image generation
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
SAR-RAG augments an MLLM baseline with semantic retrieval of similar known SAR target images, yielding measurable gains in classification accuracy and dimension regression.
citing papers explorer
-
ASTRA: Enhancing Multi-Subject Generation with Retrieval-Augmented Pose Guidance and Disentangled Position Embedding
ASTRA disentangles subject identity from pose structure in diffusion transformers via retrieval-augmented pose guidance, asymmetric EURoPE embeddings, and a DSM adapter to improve multi-subject generation.
-
SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation
SAR-RAG augments an MLLM baseline with semantic retrieval of similar known SAR target images, yielding measurable gains in classification accuracy and dimension regression.