archive
Every paper Pith has read.
456 papers in cs.GR · page 9
-
Video model matches leaders after $200k training
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
-
Gaussians render plausible human-object interactions from sparse views
Physically Plausible Human-Object Rendering from Sparse Views via 3D Gaussian Splatting
-
Relational DSL programs match human object placement distributions
Learning to Place Objects with Programs and Iterative Self Training
-
Neural net modulates joint torques to synthesize fatigued human motion
Fatigue-PINN: Physics-Informed Fatigue-Driven Motion Modulation and Synthesis
-
Text prompts steer 3D dance generation to match music genres
GCDance: Genre-Controlled Music-Driven 3D Full Body Dance Generation
-
3D Pareto method ranks best LLM pairs for XR devices
AIvaluateXR: An Evaluation Framework for on-Device AI in XR with Benchmarking Results
-
Human preferences train better flow-based video generators
Improving Video Generation with Human Feedback
-
Neural model forecasts volume render times from data structure
ENTIRE: Learning-based Volume Rendering Time Prediction
-
Video AI models fail most physics tests
Do generative video models understand physical principles?
-
Neural init plus physics optimization enables single-image material edits
Materialist: Physically Based Editing Using Single-Image Inverse Rendering
-
Cloud gaming system serves twice as many users at higher quality
Stimpack: An Adaptive Rendering Optimization System for Scalable Cloud Gaming
-
2D videos train 3D motion generators from text
Motion-2-To-3: Leveraging 2D Motion Data for 3D Motion Generations
-
Fenwick tree computes 3D model volumes during reconstruction
Novel 3D Binary Indexed Tree for Volume Computation of 3D Reconstructed Models from Volumetric Data
-
Fusion VAE plus diffusion produces editable SVGs from text
SVGFusion: A VAE-Diffusion Transformer for Vector Graphic Generation
-
Spatially varying 2D Gaussians beat single-color 3D ones on view synthesis
SVGS: Enhancing Gaussian Splatting Using Primitives with Spatially Varying Colors
-
Ellipsoidal maps cut distortion in genus-0 surface parameterization
Ellipsoidal Density-Equalizing Map for Genus-0 Closed Surfaces
-
Edge-preserving noise boosts diffusion model structure
Edge-preserving noise for diffusion models
-
Iterative edits keep full Morse-Smale complexes intact after lossy compression
Preserving Discrete Morse-Smale Complexes in Error-Bounded Lossy Compression
-
Visual method compares causal graphs across multiple outcomes
Visual Analysis of Multi-outcome Causal Graphs
-
Relation-aware tokens unify single and two-hand 4D mesh recovery
OmniHands: Towards Robust 4D Hand Mesh Recovery via A Versatile Transformer
-
Diffusion training accelerates with combinatorial stochasticity
ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models
-
Mamba model completes point clouds without pooling losses
3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion
-
Agents turn biology paper text into 3D models
Chat Modeling: Interaction-Enhanced Agent Framework for Visualizing Literature-Grounded Biological Structures
-
Squidgets turn sketched strokes into scene parameter controls
Squidgets: Sketch-based Widget Design for Scene Manipulation
-
Model finds skyline communities in edge-attributed bipartite graphs
Skyline Community Search over Edge-Attributed Bipartite Graphs
-
3D Gaussian splatting delivers real-time explicit rendering
A Survey on 3D Gaussian Splatting
-
Global optimization synchronizes elastic grid deployment paths
Geometric Guidance for Globally Synchronized Deployment of Elastic Geodesic Grids
-
Transformer turns single photo into 3D model in five seconds
LRM: Large Reconstruction Model for Single Image to 3D
-
Finetuned diffusion model yields consistent multi-views from one image
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
-
Single image yields consistent multi-view outputs via synchronized diffusion
SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
-
One motion module animates any personalized image model
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
-
Diffusion model adds consistent cast shadows to relit faces
DiFaReli++: Diffusion Face Relighting with Consistent Cast Shadows
-
Voice commands drive real-time 3D molecular visualizations
VOICE: Visual Oracle for Interaction, Conversation, and Explanation
-
ControlNet adds edge, pose and depth controls to diffusion image models
Adding Conditional Control to Text-to-Image Diffusion Models
-
Frequency loss yields consistent cloth parameters from wrinkles
Unphased Wrinkles: Estimating cloth elasticity parameters using a frequency-based loss
-
Motion diffusion model predicts clean samples for SOTA results
Human Motion Diffusion Model
-
Cross-attention maps let text changes edit images
Prompt-to-Prompt Image Editing with Cross Attention Control
-
One word embedding captures any visual concept from a few photos
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
-
Phasor volume shrinks MLP for frequency-based neural representations
PREF: Phasorial Embedding Fields for Compact Neural Representations
-
Three-tier vectors turn portraits into editable layers
Hierarchical Vectorization for Portrait Images
-
Diffusion model with classifier-free guidance tops DALL-E in human tests
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
-
Bias-free volume rendering produces accurate neural surfaces from images
NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction
-
Skin tones balanced in portraits without perfect guide alignment
Guided Facial Skin Color Correction
-
PyTorch3D speeds 3D deep learning with modular renderer
Accelerating 3D Deep Learning with PyTorch3D
-
Uniform tangent steps define stroked path interiors rigorously
Polar Stroking: New Theory and Methods for Stroking Paths
-
Pointwise measures pick data points to keep multi-variable links
Multivariate Pointwise Information-Driven Data Sampling and Visualization
-
Attention-weighted messages on scene graphs predict missing 3D objects
SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation
-
Particle system deforms soft tissue internals in 0.3s
Real-time Deformation of Soft Tissue Internal Structure with Surface Profile Variations using Particle System
-
Few poses recover face rest shape and stiffness for simulation
Data-Driven Physical Face Inversion
-
Adapted Gaussian weights sharpen visualizations with one parameter
Spectral Visualization Sharpening