Mvdif- fusion: Enabling holistic multi-view image generation with correspondence-aware diffusion.arXiv preprint arXiv:2307.01097

Tang, S · 2023 · arXiv 2307.01097

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

representative citing papers

PixGS: Pixel-Space Diffusion for Direct 3D Gaussian Splat Generation

cs.CV · 2026-07-02 · unverdicted · novelty 6.0

A single-stage pixel-space diffusion model for direct 3D Gaussian Splat generation that bypasses latent compression and adds geometric supervisions to outperform prior multi-stage methods.

TextHOI-3D: Text-to-3D Hand-Object Interaction via Discrete Multi-View Generation and Joint Mesh Optimization

cs.CV · 2026-06-10 · unverdicted · novelty 6.0

TextHOI-3D generates text-conditioned 3D hand-object meshes using a VQ token space and CLIP-conditioned autoregressive multi-view prediction followed by joint mesh optimization, reporting large reductions in object CD and penetration volume versus single-view baselines on HO3D-derived data.

Camera Control for Text-to-Image Generation via Learning Viewpoint Tokens

cs.CV · 2026-04-21 · unverdicted · novelty 6.0

Viewpoint tokens learned on a mixed 3D-rendered and photorealistic dataset enable precise camera control in text-to-image generation while factorizing geometry from appearance and transferring to unseen object categories.

BoostDream: Efficient Refining for High-Quality Text-to-3D Generation from Multi-View Diffusion

cs.CV · 2024-01-30 · unverdicted · novelty 6.0

BoostDream refines coarse feed-forward text-to-3D assets via 3D distillation, multi-view SDS loss from a 2D diffusion model, and prompt-consistent normal maps to produce higher-quality results more efficiently than standard SDS.

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

cs.CV · 2023-09-07 · unverdicted · novelty 6.0

SyncDreamer produces multiview-consistent images from a single input image by jointly modeling their distribution and synchronizing intermediate diffusion states via 3D-aware attention.

MVDream: Multi-view Diffusion for 3D Generation

cs.CV · 2023-08-31 · conditional · novelty 6.0

MVDream is a multi-view diffusion model that functions as a generalizable 3D prior, enabling more consistent text-to-3D generation and few-shot 3D concept learning from 2D examples.

Restore3D: Breathing Life into Broken Objects with Shape and Texture Restoration

cs.CV · 2026-07-01 · unverdicted · novelty 5.0

Restore3D restores shape and texture of broken 3D objects via multi-view image refinement with a Mask Self-Perceiver and coarse-to-fine mesh reconstruction, outperforming baselines on synthetic and real benchmarks.

Native3D: End-to-End 3D Scene Generation via Unified Mesh-Texture Modeling and Semantic Alignment

cs.CV · 2026-06-05 · unverdicted · novelty 5.0

Native3D introduces a direct 3D scene generation method using unified mesh-texture representation and 3D REPA Loss for semantic alignment, claimed to outperform prior 2D-dependent approaches.

DecoRec: Decomposed 3D Scene Reconstruction from Single-View Images via Object-Level Diffusion

cs.CV · 2026-05-16 · unverdicted · novelty 5.0

DecoRec decomposes single-view 3D scene reconstruction into per-object diffusion reconstructions followed by a differentiable rendering and diffusion-guided merging pipeline.

citing papers explorer

Showing 1 of 1 citing paper after filters.

MVDream: Multi-view Diffusion for 3D Generation cs.CV · 2023-08-31 · conditional · none · ref 155
MVDream is a multi-view diffusion model that functions as a generalizable 3D prior, enabling more consistent text-to-3D generation and few-shot 3D concept learning from 2D examples.

Mvdif- fusion: Enabling holistic multi-view image generation with correspondence-aware diffusion.arXiv preprint arXiv:2307.01097

fields

years

verdicts

representative citing papers

citing papers explorer