ShapeNet: An Information-Rich 3D Model Repository

Angel X. Chang, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Thomas Funkhouser, Zimo Li · 2015 · cs.GR · arXiv 1512.03012

Mixed citation behavior. Most common role is background (57%).

110 Pith papers citing it

abstract

We present ShapeNet: a richly-annotated, large-scale repository of shapes represented by 3D CAD models of objects. ShapeNet contains 3D models from a multitude of semantic categories and organizes them under the WordNet taxonomy. It is a collection of datasets providing many semantic annotations for each 3D model such as consistent rigid alignments, parts and bilateral symmetry planes, physical sizes, keywords, as well as other planned annotations. Annotations are made available through a public web-based interface to enable data visualization of object attributes, promote data-driven geometric analysis, and provide a large-scale quantitative benchmark for research in computer graphics and vision. At the time of this technical report, ShapeNet has indexed more than 3,000,000 models, 220,000 models out of which are classified into 3,135 categories (WordNet synsets). In this report we describe the ShapeNet effort as a whole, provide details for all currently available datasets, and summarize future plans.

citation-role summary

dataset 12 · background 8 · method 1

citation-polarity summary

background 12 · use dataset 7 · unclear 1 · use method 1
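
The 57% background share reported for this paper appears to match the polarity counts above (12 of 21 classified citations). The sketch below reproduces that arithmetic. It assumes the percentage is computed as a label's count divided by the total of all classified citations, which the page does not state; the function name `polarity_shares` is hypothetical.

```python
# Counts copied from the citation-polarity summary above.
polarity_counts = {
    "background": 12,
    "use dataset": 7,
    "unclear": 1,
    "use method": 1,
}


def polarity_shares(counts):
    """Return each label's share of all classified citations, in percent.

    Assumption: the share is count / total over every listed label.
    """
    total = sum(counts.values())
    return {label: 100.0 * n / total for label, n in counts.items()}


shares = polarity_shares(polarity_counts)
for label, pct in shares.items():
    print(f"{label}: {pct:.0f}%")
```

Under this assumption, background comes to 12/21, which rounds to 57%. The role summary gives a different split (dataset 12, background 8, method 1), so the headline figure does not follow from those counts.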

representative citing papers

Towards Realistic 3D Emission Materials: Dataset, Baseline, and Evaluation for Emission Texture Generation

cs.CV · 2026-04-13 · unverdicted · novelty 8.0

The work creates the first dataset and baseline for generating emission textures on 3D objects to reproduce glowing materials from input images.

ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data

cs.CV · 2021-11-17 · accept · novelty 8.0

ARKitScenes is the largest real-world indoor RGB-D dataset captured with mobile LiDAR, including high-resolution depth maps and 3D furniture bounding box annotations for advancing object detection and depth upsampling.

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

cs.CV · 2026-05-28 · conditional · novelty 7.0

VLMs exhibit consistent vertical-distance entanglement in embeddings from perspective bias in natural images, producing accuracy gaps that a new synthetic benchmark SpatialTunnel exposes as model-intrinsic.

Category-Level 3D Correspondence in Camera Space via Morphable Object Priors

cs.CV · 2026-05-27 · unverdicted · novelty 7.0

Morpheus learns morphable category-level shape priors to produce implicit 3D correspondences in camera space without explicit supervision and releases the HouseCorr3D benchmark with amodal and symmetry annotations.

Metric–Phase Fields: Decoupling Distance and Sign for Thin-Structure Reconstruction from Unoriented Point Clouds

cs.CV · 2026-05-25 · unverdicted · novelty 7.0

Metric-Phase Fields decouple unsigned metric proximity from a smooth phase field with learnable sharpness to enable faithful reconstruction of thin and open structures from point clouds.

ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views

cs.CV · 2026-05-23 · unverdicted · novelty 7.0

ArtSplat is the first feed-forward framework for articulated 3D Gaussian Splatting that reconstructs geometry and joints from sparse multi-state uncalibrated views in one pass.

MAPS: A Synthetic Dataset for Probing Vision Models in a Controlled 3D Scene Space

cs.CV · 2026-05-19 · unverdicted · novelty 7.0

MAPS provides 2618 validated 3D meshes and a controllable rendering pipeline to attribute vision model recognition failures to specific scene parameters, finding camera distance and elevation as the dominant failure factors across 20 tested models.

OffsetAxis: UDF Mesh Reconstruction via Offset-Volume Medial Axis Extraction

cs.GR · 2026-05-14 · unverdicted · novelty 7.0

OffsetAxis reconstructs meshes from unsigned distance fields by extracting the medial axis of the alpha-offset volume using ray casting and variational medial ball optimization.

Min Generalized Sliced Gromov Wasserstein: A Scalable Path to Gromov Wasserstein

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

min-GSGW learns coupled nonlinear slicers to produce a rigid-motion-invariant, scalable approximation to the Gromov-Wasserstein distance and its transport plans.

Img2CADSeq: Image-to-CAD Generation via Sequence-Based Diffusion

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

Img2CADSeq generates standard CAD sequences from images via a multi-stage pipeline with three-level hierarchical codebook encoding, importance-guided compression, and contrastive point-cloud conditioning of a VQ-Diffusion model, outperforming prior methods on new CAD-220K and PrintCAD datasets.

Count Anything at Any Granularity

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

Multi-grained counting is introduced with five granularity levels, supported by the new KubriCount dataset generated via 3D synthesis and editing, and HieraCount model that combines text and visual exemplars for improved accuracy.

The Wittgensteinian Representation Hypothesis: Is Language the Attractor of Multimodal Convergence?

cs.AI · 2026-05-10 · unverdicted · novelty 7.0

Language representations serve as the asymptotic attractor for convergence in independently trained multimodal neural networks due to feature density asymmetry.

MeshFIM: Local Low-Poly Mesh Editing via Fill-in-the-Middle Autoregressive Generation

cs.GR · 2026-05-09 · unverdicted · novelty 7.0

MeshFIM enables local low-poly mesh editing by autoregressively filling target regions conditioned on context, using boundary markers, positional embeddings, and a gated geometry encoder to enforce attachment, topology, and region limits.

Rollback-Free Stable Brick Structures Generation

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

Reinforcement learning internalizes physical stability rules for brick structures, enabling the first rollback-free generation with orders-of-magnitude faster inference.

Two Steps Are All You Need: Efficient 3D Point Cloud Anomaly Detection with Consistency Models

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

Consistency learning reformulates 3D point cloud anomaly detection to predict clean geometry directly in one or two steps, yielding up to 80 times faster inference while matching state-of-the-art accuracy.

ADS: Random Sampling of Occupancy Functions using Adaptive Delaunay Scaffolding

cs.GR · 2026-05-05 · unverdicted · novelty 7.0

ADS adaptively refines a Delaunay scaffold to produce unbiased random samples on occupancy function surfaces together with a connecting mesh, using far fewer evaluations than existing approaches.

Generative Modeling with Orbit-Space Particle Flow Matching

cs.GR · 2026-05-04 · unverdicted · novelty 7.0

OGPP is a particle flow-matching method using orbit-space canonicalization and geometric paths that achieves lower error and fewer steps than prior approaches on 3D benchmarks.

Topo-ADV: Generating Topology-Driven Imperceptible Adversarial Point Clouds

cs.CV · 2026-04-10 · unverdicted · novelty 7.0

Topo-ADV uses differentiable persistent homology to create topology-altering perturbations that achieve up to 100% attack success on point cloud classifiers like PointNet while remaining geometrically imperceptible.

Training-free Spatially Grounded Geometric Shape Encoding (Technical Report)

cs.CV · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

XShapeEnc encodes arbitrary 2D spatially grounded shapes into compact invertible representations by decomposing them into unit-disk geometry and harmonic pose fields then applying Zernike bases with frequency propagation.

3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image

cs.CV · 2026-04-06 · unverdicted · novelty 7.0

3D-Fixer performs in-place 3D asset completion from single-view partial point clouds via coarse-to-fine generation with ORFA conditioning, plus a new ARSG-110K dataset, to achieve higher geometric accuracy than MIDI and Gen3DSR while keeping diffusion efficiency.

Deformation-based In-Context Learning for Point Cloud Understanding

cs.CV · 2026-04-03 · unverdicted · novelty 7.0

DeformPIC deforms query point clouds under prompt guidance for in-context learning, outperforming prior methods with lower Chamfer Distance on reconstruction, denoising, and registration tasks.

Align then Adapt: Rethinking Parameter-Efficient Transfer Learning in 4D Perception

cs.CV · 2026-02-26 · unverdicted · novelty 7.0

PointATA is a parameter-efficient transfer learning method that aligns 3D-4D modality gaps via optimal transport before adapting a frozen 3D model with video-specific modules to achieve strong 4D perception results.

CLIPoint3D: Language-Grounded Few-Shot Unsupervised 3D Point Cloud Domain Adaptation

cs.CV · 2026-02-23 · unverdicted · novelty 7.0

CLIPoint3D is the first CLIP-based framework for few-shot unsupervised 3D point cloud domain adaptation that reports 3-16% accuracy gains on PointDA-10 and GraspNetPC-10.

Physically Guided Visual Mass Estimation from a Single RGB Image

cs.CV · 2026-01-28 · unverdicted · novelty 7.0

A method estimates mass from single RGB images by fusing depth-based volume cues with vision-language model density semantics via adaptive gating and separate regression heads trained on mass labels only.

citing papers explorer

Streaming Sliced Optimal Transport

cs.LG · 2025-05-11 · unverdicted · none · ref 11 · internal anchor

A low-memory streaming estimator for sliced Wasserstein distance using quantile approximations on random projections with theoretical error guarantees.

DM3D: Deformable Mamba via Offset-Guided Differentiable Scanning for Point Cloud Understanding

cs.CV · 2025-12-03 · unverdicted · none · ref 2 · internal anchor

DM3D introduces offset-guided differentiable scanning and continuity-aware state updates in a Mamba-based model to achieve state-of-the-art or competitive performance on point cloud classification, few-shot learning, and part segmentation.

SAM 3D: 3Dfy Anything in Images

cs.CV · 2025-11-20 · unverdicted · none · ref 4 · internal anchor

SAM 3D reconstructs 3D objects from single images with geometry, texture, and pose using human-model annotated data at scale and synthetic-to-real training, achieving 5:1 human preference wins.

A solution to generalized learning from small training sets found in infant repeated visual experiences of individual objects

cs.CV · 2025-10-16 · unverdicted · none · ref 43 · internal anchor

Infant daily visual experiences of objects are dominated by repeated instances of few exemplars in lumpy similarity clusters, enabling category generalization from small training sets in computational models.

InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts

cs.CV · 2025-09-13 · unverdicted · none · ref 2 · internal anchor

InternScenes is a new dataset of approximately 40,000 simulatable indoor scenes that combines real scans, procedural, and designer sources, preserves small objects for realistic layouts, and includes processing for simulation and interaction.

The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with Minimal 3D Knowledge

cs.CV · 2025-06-11 · unverdicted · none · ref 4 · internal anchor

Data-centric novel view synthesis models with minimal 3D knowledge and no pose annotations scale better with data volume and outperform traditional bias-driven methods.

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

cs.CV · 2025-02-10 · unverdicted · none · ref 105 · internal anchor

TripoSG generates high-fidelity 3D meshes from input images via a large-scale rectified flow transformer and hybrid-trained 3D VAE on a custom 2-million-sample dataset, claiming state-of-the-art fidelity and generalization.

Hierarchical Feature Learning for Medical Point Clouds via State Space Model

cs.CV · 2025-04-17 · unverdicted · none · ref 1 · internal anchor

Presents an SSM-based hierarchical feature learning method for medical point clouds that reports superior performance on classification, completion, and segmentation using a new dataset MedPointS.

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

cs.CV · 2025-01-21 · unverdicted · none · ref 8 · internal anchor

Hunyuan3D 2.0 scales flow-based diffusion transformers and texture synthesis models to generate high-resolution textured 3D assets that outperform prior state-of-the-art in geometry, alignment, and texture quality.

A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation

cs.CV · 2025-08-13 · unverdicted · none · ref 284 · internal anchor

A survey that categorizes and summarizes methods applying 3D Gaussian Splatting to segmentation, editing, generation, and related tasks, including datasets and evaluation protocols.

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

cs.CV · 2025-06-18 · unverdicted · none · ref 28 · internal anchor

Hunyuan3D 2.1 is a two-part system with DiT for shape generation and Paint for texture synthesis that produces high-fidelity 3D assets with PBR materials.

Efficient Transferable Optimal Transport via Min-Sliced Transport Plans

cs.CV · 2025-11-24 · unreviewed · ref 8 · internal anchor
