Generating com- positional scenes via text-to-image rgba instance generation

Alessandro Fontanella, Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Sarah Parisot · 2024 · arXiv 2411.10913

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale

cs.CV · 2026-05-26 · unverdicted · novelty 6.0

Presents MRT, a 20B-parameter masked region diffusion model unifying text-to-layers, image-to-layers, and layers-to-layers tasks with an overflow-aware canvas layer for complete editable outputs.

citing papers explorer

Showing 1 of 1 citing paper.

MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale cs.CV · 2026-05-26 · unverdicted · none · ref 11
Presents MRT, a 20B-parameter masked region diffusion model unifying text-to-layers, image-to-layers, and layers-to-layers tasks with an overflow-aware canvas layer for complete editable outputs.

Generating com- positional scenes via text-to-image rgba instance generation

fields

years

verdicts

representative citing papers

citing papers explorer