Jan Kautz
Identifiers
- name variant Jan Kautz 0.60 · backfill
Papers (86)
- GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors cs.RO · 2026 · author #15
- Cosmos 3: Omnimodal World Models for Physical AI cs.CV · 2026 · author #120
- Scaling Parallel Sequence Models to Foundation-Scale Vision Encoders cs.CV · 2026 · author #17
- Grounded 3D-Aware Spatial Vision-Language Modeling cs.CV · 2026 · author #12
- LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding cs.CV · 2026 · author #11
- Polar: Agentic RL on Any Harness at Scale cs.DC · 2026 · author #11
- Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention cs.AI · 2026 · author #3
- D-Rex : Diffusion Rendering for Relightable Expressive Avatars cs.GR · 2026 · author #4
- Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence cs.LG · 2026 · author #209
- SpaCeFormer: Fast Proposal-Free Open-Vocabulary 3D Instance Segmentation cs.CV · 2026 · author #5
- Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning cs.LG · 2026 · author #212
- World Action Models are Zero-shot Policies cs.RO · 2026 · author #33
- Learning to Discover at Test Time cs.LG · 2026 · author #7
- GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization cs.CL · 2026 · author #12
- NVIDIA Nemotron 3: Efficient and Open Intelligence cs.CL · 2025 · author #135
- Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed cs.CL · 2025 · author #12
- SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control cs.RO · 2025 · author #24
- World Simulation with Video Foundation Models for Physical AI cs.CV · 2025 · author #36
- ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge cs.CL · 2025 · author #9
- NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model cs.CL · 2025 · author #77
- ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models cs.CL · 2025 · author #7
- FLARE: Robot Learning with Implicit World Modeling cs.RO · 2025 · author #18
- DreamGen: Unlocking Generalization in Robot Learning through Video World Models cs.RO · 2025 · author #25
- GR00T N1: An Open Foundation Model for Generalist Humanoid Robots cs.RO · 2025 · author #13
- Gated Delta Networks: Improving Mamba2 with Delta Rule cs.CL · 2024 · author #2
- NVILA: Efficient Frontier Visual Language Models cs.CV · 2024 · author #24
- LongVILA: Scaling Long-Context Visual Language Models for Long Videos cs.CV · 2024 · author #14
- An Empirical Study of Mamba-based Language Models cs.LG · 2024 · author #14
- Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation cs.CV · 2024 · author #9
- CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation cs.CV · 2024 · author #5
- Importance Estimation for Neural Network Pruning cs.LG · 2019 · author #5
- SCOPS: Self-Supervised Co-Part Segmentation cs.CV · 2019 · author #6
- STEP: Spatio-Temporal Progressive Learning for Video Action Detection cs.CV · 2019 · author #6
- Pixel-Adaptive Convolutional Neural Networks cs.CV · 2019 · author #6
- Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments cs.CV · 2019 · author #6
- NRMVS: Non-Rigid Multi-View Stereo cs.CV · 2019 · author #7
- Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera cs.CV · 2019 · author #5
- PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image cs.CV · 2018 · author #5
- Context-Aware Synthesis and Placement of Object Instances cs.CV · 2018 · author #6
- A Fusion Approach for Multi-Frame Optical Flow Estimation cs.CV · 2018 · author #6
- Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation cs.CV · 2018 · author #4
- Video-to-Video Synthesis cs.CV · 2018 · author #6
- Learning Linear Transformations for Fast Arbitrary Style Transfer cs.CV · 2018 · author #3
- EOE: Expected Overlap Estimation over Unstructured Point Cloud Data cs.CV · 2018 · author #3
- Simultaneous Edge Alignment and Learning cs.CV · 2018 · author #7
- Tackling 3D ToF Artifacts Through Learning and the FLAT Dataset cs.CV · 2018 · author #5
- Superpixel Sampling Networks cs.CV · 2018 · author #5
- Domain Stylization: A Strong, Simple Baseline for Synthetic to Real Image Domain Adaptation cs.CV · 2018 · author #5
- Fast and Accurate Point Cloud Registration using Trees of Gaussian Mixtures cs.CV · 2018 · author #3
- Synthetically Trained Neural Networks for Learning Human-Readable Plans from Real-World Demonstrations cs.RO · 2018 · author #5
- IamNN: Iterative and Adaptive Mobile Neural Network for Efficient Image Classification cs.CV · 2018 · author #6
- Hand Pose Estimation via Latent 2.5D Heatmap Regression cs.CV · 2018 · author #5
- Switchable Temporal Propagation Network cs.CV · 2018 · author #7
- Light-weight Head Pose Invariant Gaze Tracking cs.CV · 2018 · author #3
- Multimodal Unsupervised Image-to-Image Translation cs.CV · 2018 · author #4
- Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation cs.CV · 2018 · author #6
- Deep Semantic Face Deblurring cs.CV · 2018 · author #4
- SPLATNet: Sparse Lattice Networks for Point Cloud Processing cs.CV · 2018 · author #7
- A Closed-form Solution to Photorealistic Image Stylization cs.CV · 2018 · author #5
- Reblur2Deblur: Deblurring Videos via Self-Supervised Learning cs.CV · 2018 · author #6
- Learning Binary Residual Representations for Domain-specific Video Streaming cs.CV · 2017 · author #5
- Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals cs.CV · 2017 · author #8
- Geometry-Aware Learning of Maps for Camera Localization cs.CV · 2017 · author #5
- Sim-to-Real Transfer of Accurate Grasping with Eye-In-Hand Observations and Continuous Control cs.RO · 2017 · author #4
- Separating Reflection and Transmission Images in the Wild cs.CV · 2017 · author #4
- Budget-Aware Activity Detection with A Recurrent Policy Network cs.CV · 2017 · author #4
- Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation cs.CV · 2017 · author #6
- High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs cs.CV · 2017 · author #5
- On Nearest Neighbors in Non Local Means Denoising cs.CV · 2017 · author #2
- Multiframe Scene Flow with Piecewise Rigid Motion cs.CV · 2017 · author #6
- Learning Affinity via Spatial Propagation Networks cs.CV · 2017 · author #6
- Learning to Segment Instances in Videos with Spatial Propagation Network cs.CV · 2017 · author #7
- PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume cs.CV · 2017 · author #4
- Improving Landmark Localization with Semi-Supervised Learning cs.CV · 2017 · author #6
- Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization with Spatially-Varying Lighting cs.CV · 2017 · author #4
- Cascaded Scene Flow Prediction using Semantic Segmentation cs.CV · 2017 · author #3
- MoCoGAN: Decomposing Motion and Content for Video Generation cs.CV · 2017 · author #4
- A Lightweight Approach for On-the-Fly Reflectance Estimation cs.CV · 2017 · author #6
- Unsupervised Image-to-Image Translation Networks cs.CV · 2017 · author #3
- Deep Learning with Energy-efficient Binary Gradient Cameras cs.CV · 2016 · author #4
- Pruning Convolutional Neural Networks for Resource Efficient Inference cs.LG · 2016 · author #5
- Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU cs.LG · 2016 · author #5
- Learning Adaptive Parameter Tuning for Image Processing cs.CV · 2016 · author #3
- Loss Functions for Neural Networks for Image Processing cs.CV · 2015 · author #4
- Hierarchical Subquery Evaluation for Active Learning on a Graph cs.CV · 2015 · author #3
- Speaker-following Video Subtitles cs.HC · 2014 · author #2
Mentions
- 1511.08861 #4 · backfill · confidence 0.70 Jan Kautz
- 2606.05160 #15 · arxiv_oai · confidence 0.70 Jan Kautz
- 1504.08219 #3 · backfill · confidence 0.70 Jan Kautz
- 2606.02800 #120 · arxiv_oai · confidence 0.70 Jan Kautz
- 2606.00746 #17 · arxiv_oai · confidence 0.70 Jan Kautz
- 2604.20395 #5 · arxiv_oai · confidence 0.70 Jan Kautz
- 1407.5145 #2 · backfill · confidence 0.70 Jan Kautz
- 2605.30307 #12 · arxiv_oai · confidence 0.70 Jan Kautz
- 2605.27365 #11 · arxiv_oai · confidence 0.70 Jan Kautz
- 2605.24220 #11 · arxiv_oai · confidence 0.70 Jan Kautz
- 2605.22791 #3 · arxiv_oai · confidence 0.70 Jan Kautz
- 2511.07820 #24 · arxiv_oai · confidence 0.70 Jan Kautz
- 2510.18941 #9 · arxiv_oai · confidence 0.70 Jan Kautz
- 2505.24864 #7 · arxiv_oai · confidence 0.70 Jan Kautz
- 2508.14444 #77 · arxiv_oai · confidence 0.70 Jan Kautz
- 2406.07887 #14 · arxiv_oai · confidence 0.70 Jan Kautz
- 2512.20856 #135 · arxiv_oai · confidence 0.70 Jan Kautz
- 2505.15659 #18 · arxiv_oai · confidence 0.70 Jan Kautz
- 2408.10188 #14 · arxiv_oai · confidence 0.70 Jan Kautz
- 2406.02509 #5 · arxiv_oai · confidence 0.70 Jan Kautz
- 2601.16175 #7 · arxiv_oai · confidence 0.70 Jan Kautz
- 2505.12705 #25 · arxiv_oai · confidence 0.70 Jan Kautz
Frequent Coauthors
- Pavlo Molchanov 20 shared papers
- Ming-Yu Liu 17 shared papers
- Jinwei Gu 15 shared papers
- Ming-Hsuan Yang 14 shared papers
- Sifei Liu 12 shared papers
- Kihwan Kim 11 shared papers
- Deqing Sun 10 shared papers
- Yuke Zhu 9 shared papers
- Yejin Choi 8 shared papers
- Andrew Tao 7 shared papers
- Bryan Catanzaro 7 shared papers
- Iuri Frosio 7 shared papers
- Orazio Gallo 7 shared papers
- Stephen Tyree 7 shared papers
- Xiaodong Yang 7 shared papers
- Hongxu Yin 6 shared papers
- Joel Jang 6 shared papers
- Mohammad Shoeybi 6 shared papers
- Roger Waleffe 6 shared papers
- Shizhe Diao 6 shared papers