Title resolution pending

Segment anything , author=

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

cs.CV · 2026-05-20 · unverdicted · novelty 7.0

OcclusionFormer adds explicit Z-order modeling via a new SA-Z dataset and volume-rendering compositing in a diffusion transformer to resolve occlusion ambiguities in layout-grounded image synthesis.

MIND: Decoupling Model-Induced Label Noise via Latent Manifold Disentanglement

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

MIND decouples high-dimensional model-induced label noise into subspace components via latent manifold disentanglement and a Latent Decoupling Estimator.

PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

cs.CV · 2023-09-30 · accept · novelty 6.0

PixArt-α matches commercial text-to-image quality with a diffusion transformer trained in 675 A100 GPU days through decomposed training stages, cross-attention text injection, and vision-language model dense captions.

citing papers explorer

Showing 3 of 3 citing papers.

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation cs.CV · 2026-05-20 · unverdicted · none · ref 11
OcclusionFormer adds explicit Z-order modeling via a new SA-Z dataset and volume-rendering compositing in a diffusion transformer to resolve occlusion ambiguities in layout-grounded image synthesis.
MIND: Decoupling Model-Induced Label Noise via Latent Manifold Disentanglement cs.LG · 2026-05-15 · unverdicted · none · ref 16
MIND decouples high-dimensional model-induced label noise into subspace components via latent manifold disentanglement and a Latent Decoupling Estimator.
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis cs.CV · 2023-09-30 · accept · none · ref 89
PixArt-α matches commercial text-to-image quality with a diffusion transformer trained in 675 A100 GPU days through decomposed training stages, cross-attention text injection, and vision-language model dense captions.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer