Denoising diffusion probabilistic models, 2020
3 Pith papers cite this work.
citing papers explorer
- OmniAlpha: Aligning Transparency-Aware Generation via Multi-Task Unified Reinforcement Learning
  OmniAlpha is a unified multi-task RL framework that uses an alpha-aware VAE, sequence-to-sequence Diffusion Transformer, and layer-aware rewards to improve transparency-aware generation across five task categories.
- OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection
  OVOD-Agent models visual reasoning as a weakly Markovian decision process with bandit-driven exploration to create a self-evolving open-vocabulary detector that improves on rare categories in COCO and LVIS.
- BlendFusion -- Scalable Synthetic Data Generation for Diffusion Model Training
  BlendFusion uses path tracing on 3D scenes with targeted camera placement to produce higher-quality synthetic image-caption data for diffusion model training than direct generation methods.