Jan Kautz

Identifiers

name variant Jan Kautz 0.60 · backfill

Papers (86)

GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors cs.RO · 2026 · author #15
Cosmos 3: Omnimodal World Models for Physical AI cs.CV · 2026 · author #120
Scaling Parallel Sequence Models to Foundation-Scale Vision Encoders cs.CV · 2026 · author #17
Grounded 3D-Aware Spatial Vision-Language Modeling cs.CV · 2026 · author #12
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding cs.CV · 2026 · author #11
Polar: Agentic RL on Any Harness at Scale cs.DC · 2026 · author #11
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention cs.AI · 2026 · author #3
D-Rex : Diffusion Rendering for Relightable Expressive Avatars cs.GR · 2026 · author #4
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence cs.LG · 2026 · author #209
SpaCeFormer: Fast Proposal-Free Open-Vocabulary 3D Instance Segmentation cs.CV · 2026 · author #5
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning cs.LG · 2026 · author #212
World Action Models are Zero-shot Policies cs.RO · 2026 · author #33
Learning to Discover at Test Time cs.LG · 2026 · author #7
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization cs.CL · 2026 · author #12
NVIDIA Nemotron 3: Efficient and Open Intelligence cs.CL · 2025 · author #135
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed cs.CL · 2025 · author #12
SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control cs.RO · 2025 · author #24
World Simulation with Video Foundation Models for Physical AI cs.CV · 2025 · author #36
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge cs.CL · 2025 · author #9
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model cs.CL · 2025 · author #77
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models cs.CL · 2025 · author #7
FLARE: Robot Learning with Implicit World Modeling cs.RO · 2025 · author #18
DreamGen: Unlocking Generalization in Robot Learning through Video World Models cs.RO · 2025 · author #25
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots cs.RO · 2025 · author #13
Gated Delta Networks: Improving Mamba2 with Delta Rule cs.CL · 2024 · author #2
NVILA: Efficient Frontier Visual Language Models cs.CV · 2024 · author #24
LongVILA: Scaling Long-Context Visual Language Models for Long Videos cs.CV · 2024 · author #14
An Empirical Study of Mamba-based Language Models cs.LG · 2024 · author #14
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #9
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation cs.CV · 2024 · author #5
Importance Estimation for Neural Network Pruning cs.LG · 2019 · author #5
SCOPS: Self-Supervised Co-Part Segmentation cs.CV · 2019 · author #6
STEP: Spatio-Temporal Progressive Learning for Video Action Detection cs.CV · 2019 · author #6
Pixel-Adaptive Convolutional Neural Networks cs.CV · 2019 · author #6
Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments cs.CV · 2019 · author #6
NRMVS: Non-Rigid Multi-View Stereo cs.CV · 2019 · author #7
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera cs.CV · 2019 · author #5
PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image cs.CV · 2018 · author #5
Context-Aware Synthesis and Placement of Object Instances cs.CV · 2018 · author #6
A Fusion Approach for Multi-Frame Optical Flow Estimation cs.CV · 2018 · author #6
Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation cs.CV · 2018 · author #4
Video-to-Video Synthesis cs.CV · 2018 · author #6
Learning Linear Transformations for Fast Arbitrary Style Transfer cs.CV · 2018 · author #3
EOE: Expected Overlap Estimation over Unstructured Point Cloud Data cs.CV · 2018 · author #3
Simultaneous Edge Alignment and Learning cs.CV · 2018 · author #7
Tackling 3D ToF Artifacts Through Learning and the FLAT Dataset cs.CV · 2018 · author #5
Superpixel Sampling Networks cs.CV · 2018 · author #5
Domain Stylization: A Strong, Simple Baseline for Synthetic to Real Image Domain Adaptation cs.CV · 2018 · author #5
Fast and Accurate Point Cloud Registration using Trees of Gaussian Mixtures cs.CV · 2018 · author #3
Synthetically Trained Neural Networks for Learning Human-Readable Plans from Real-World Demonstrations cs.RO · 2018 · author #5
IamNN: Iterative and Adaptive Mobile Neural Network for Efficient Image Classification cs.CV · 2018 · author #6
Hand Pose Estimation via Latent 2.5D Heatmap Regression cs.CV · 2018 · author #5
Switchable Temporal Propagation Network cs.CV · 2018 · author #7
Light-weight Head Pose Invariant Gaze Tracking cs.CV · 2018 · author #3
Multimodal Unsupervised Image-to-Image Translation cs.CV · 2018 · author #4
Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation cs.CV · 2018 · author #6
Deep Semantic Face Deblurring cs.CV · 2018 · author #4
SPLATNet: Sparse Lattice Networks for Point Cloud Processing cs.CV · 2018 · author #7
A Closed-form Solution to Photorealistic Image Stylization cs.CV · 2018 · author #5
Reblur2Deblur: Deblurring Videos via Self-Supervised Learning cs.CV · 2018 · author #6
Learning Binary Residual Representations for Domain-specific Video Streaming cs.CV · 2017 · author #5
Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals cs.CV · 2017 · author #8
Geometry-Aware Learning of Maps for Camera Localization cs.CV · 2017 · author #5
Sim-to-Real Transfer of Accurate Grasping with Eye-In-Hand Observations and Continuous Control cs.RO · 2017 · author #4
Separating Reflection and Transmission Images in the Wild cs.CV · 2017 · author #4
Budget-Aware Activity Detection with A Recurrent Policy Network cs.CV · 2017 · author #4
Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation cs.CV · 2017 · author #6
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs cs.CV · 2017 · author #5
On Nearest Neighbors in Non Local Means Denoising cs.CV · 2017 · author #2
Multiframe Scene Flow with Piecewise Rigid Motion cs.CV · 2017 · author #6
Learning Affinity via Spatial Propagation Networks cs.CV · 2017 · author #6
Learning to Segment Instances in Videos with Spatial Propagation Network cs.CV · 2017 · author #7
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume cs.CV · 2017 · author #4
Improving Landmark Localization with Semi-Supervised Learning cs.CV · 2017 · author #6
Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting cs.CV · 2017 · author #4
Cascaded Scene Flow Prediction using Semantic Segmentation cs.CV · 2017 · author #3
MoCoGAN: Decomposing Motion and Content for Video Generation cs.CV · 2017 · author #4
A Lightweight Approach for On-the-Fly Reflectance Estimation cs.CV · 2017 · author #6
Unsupervised Image-to-Image Translation Networks cs.CV · 2017 · author #3
Deep Learning with Energy-efficient Binary Gradient Cameras cs.CV · 2016 · author #4
Pruning Convolutional Neural Networks for Resource Efficient Inference cs.LG · 2016 · author #5
Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU cs.LG · 2016 · author #5
Learning Adaptive Parameter Tuning for Image Processing cs.CV · 2016 · author #3
Loss Functions for Neural Networks for Image Processing cs.CV · 2015 · author #4
Hierarchical Subquery Evaluation for Active Learning on a Graph cs.CV · 2015 · author #3
Speaker-following Video Subtitles cs.HC · 2014 · author #2

Mentions

1511.08861 #4 · backfill · confidence 0.70 Jan Kautz
2606.05160 #15 · arxiv_oai · confidence 0.70 Jan Kautz
1504.08219 #3 · backfill · confidence 0.70 Jan Kautz
2606.02800 #120 · arxiv_oai · confidence 0.70 Jan Kautz
2606.00746 #17 · arxiv_oai · confidence 0.70 Jan Kautz
2604.20395 #5 · arxiv_oai · confidence 0.70 Jan Kautz
1407.5145 #2 · backfill · confidence 0.70 Jan Kautz
2605.30307 #12 · arxiv_oai · confidence 0.70 Jan Kautz
2605.27365 #11 · arxiv_oai · confidence 0.70 Jan Kautz
2605.24220 #11 · arxiv_oai · confidence 0.70 Jan Kautz
2605.22791 #3 · arxiv_oai · confidence 0.70 Jan Kautz
2511.07820 #24 · arxiv_oai · confidence 0.70 Jan Kautz
2510.18941 #9 · arxiv_oai · confidence 0.70 Jan Kautz
2505.24864 #7 · arxiv_oai · confidence 0.70 Jan Kautz
2508.14444 #77 · arxiv_oai · confidence 0.70 Jan Kautz
2406.07887 #14 · arxiv_oai · confidence 0.70 Jan Kautz
2512.20856 #135 · arxiv_oai · confidence 0.70 Jan Kautz
2505.15659 #18 · arxiv_oai · confidence 0.70 Jan Kautz
2408.10188 #14 · arxiv_oai · confidence 0.70 Jan Kautz
2406.02509 #5 · arxiv_oai · confidence 0.70 Jan Kautz
2601.16175 #7 · arxiv_oai · confidence 0.70 Jan Kautz
2505.12705 #25 · arxiv_oai · confidence 0.70 Jan Kautz

Frequent Coauthors

Pavlo Molchanov 20 shared papers
Ming-Yu Liu 17 shared papers
Jinwei Gu 15 shared papers
Ming-Hsuan Yang 14 shared papers
Sifei Liu 12 shared papers
Kihwan Kim 11 shared papers
Deqing Sun 10 shared papers
Yuke Zhu 9 shared papers
Yejin Choi 8 shared papers
Andrew Tao 7 shared papers
Bryan Catanzaro 7 shared papers
Iuri Frosio 7 shared papers
Orazio Gallo 7 shared papers
Stephen Tyree 7 shared papers
Xiaodong Yang 7 shared papers
Hongxu Yin 6 shared papers
Joel Jang 6 shared papers
Mohammad Shoeybi 6 shared papers
Roger Waleffe 6 shared papers
Shizhe Diao 6 shared papers