pith. sign in

Nan Jiang

Identifiers

  • name variant Nan Jiang 0.60 · backfill

Papers (55)

  1. Offline Two-Player Zero-Sum Markov Games with KL Regularization cs.LG · 2026 · author #6
  2. A New WZ Sagittae-type Dwarf Nova KSP-OT-202104a Near the Period Minimum from the KMTNet Supernova Program astro-ph.SR · 2026 · author #6
  3. No Action Without a NOD: A Heterogeneous Multi-Agent Architecture for Reliable Service Agents cs.AI · 2026 · author #3
  4. Thinking with Novel Views: A Systematic Analysis of Generative-Augmented Spatial Intelligence cs.CV · 2026 · author #4
  5. Rethinking Importance Sampling in LLM Policy Optimization: A Cumulative Token Perspective cs.LG · 2026 · author #7
  6. Q-MMR: Off-Policy Evaluation via Recursive Reweighting and Moment Matching cs.LG · 2026 · author #2
  7. JoyAI-Image: Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation cs.GR · 2026 · author #14
  8. Soft Graph Diffusion Transformer for MIMO Detection cs.IT · 2026 · author #1
  9. Fiber-integrated Quantum Frequency Conversion for Long-distance Quantum Networking quant-ph · 2026 · author #4
  10. Towards Fine-grained Temporal Perception: Post-Training Large Audio-Language Models with Audio-Side Time Prompt cs.SD · 2026 · author #5
  11. DIVERSED: Relaxed Speculative Decoding via Dynamic Ensemble Verification cs.CL · 2026 · author #8
  12. OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence cs.CL · 2026 · author #9
  13. Beyond Pessimism: Offline Learning in KL-regularized Games cs.GT · 2026 · author #3
  14. SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing cs.CV · 2026 · author #6
  15. Beyond Semantic Manipulation: Token-Space Attacks on Reward Models cs.LG · 2026 · author #5
  16. Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parametric Policies cs.LG · 2026 · author #3
  17. RLHF Workflow: From Reward Modeling to Online RLHF cs.LG · 2024 · author #7
  18. RouterBench: A Benchmark for Multi-LLM Routing System cs.LG · 2024 · author #4
  19. Online Supervised Learning for Traffic Load Prediction in Framed-ALOHA Networks cs.NI · 2019 · author #1
  20. Information-Theoretic Considerations in Batch Reinforcement Learning cs.LG · 2019 · author #2
  21. The development and evaluation of the SmartAbility Android Application to detect users' abilities cs.HC · 2019 · author #3
  22. Deep Reinforcement Learning for Real-Time Optimization in NB-IoT Networks cs.NI · 2018 · author #1
  23. Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches cs.LG · 2018 · author #2
  24. A note on Graphical Notation Reveals Topological Stability Criteria for Collective Dynamics in Complex Network math.GM · 2018 · author #4
  25. Cooperative Deep Reinforcement Learning for Multiple-Group NB-IoT Networks Optimization cs.NI · 2018 · author #1
  26. Taylor dispersion in two-dimensional bacterial turbulence physics.flu-dyn · 2018 · author #5
  27. Extremal-point density of scaling processes from fractal Brownian motion to turbulence in one dimension cond-mat.stat-mech · 2018 · author #5
  28. The mechanism of nanoparticle precipitation induced by electron irradiation in transmission electron microscopy cond-mat.mes-hall · 2018 · author #1
  29. Manipulation of Au nanoparticles using an electron probe: electron golf cond-mat.mes-hall · 2018 · author #1
  30. On Oracle-Efficient PAC RL with Rich Observations cs.LG · 2018 · author #2
  31. Hierarchical Imitation and Reinforcement Learning cs.LG · 2018 · author #2
  32. A higher-order ensemble/proper orthogonal decomposition method for the nonstationary Navier-Stokes equations math.NA · 2017 · author #2
  33. An efficient, partitioned ensemble algorithm for simulating ensembles of evolutionary MHD flows at low magnetic Reynolds number math.NA · 2017 · author #1
  34. A second-order time-stepping scheme for simulating ensembles of parameterized flow problems math.NA · 2017 · author #2
  35. An efficient algorithm for simulating ensembles of parameterized flow problems math.NA · 2017 · author #2
  36. Repeated Inverse Reinforcement Learning cs.AI · 2017 · author #2
  37. Random Access Analysis for Massive IoT Networks under A New Spatio-Temporal Model: A Stochastic Geometry Approach cs.IT · 2017 · author #1
  38. Predicting Atomic Decay Rates Using an Informational-Entropic Approach physics.atom-ph · 2017 · author #2
  39. A Novel Quantum Image Compression Method Based on JPEG quant-ph · 2017 · author #2
  40. Analysis and approximation of a fractional Laplacian-based closure model for turbulent flows and its connection to Richardson pair dispersion math.NA · 2016 · author #2
  41. Contextual Decision Processes with Low Bellman Rank are PAC-Learnable cs.LG · 2016 · author #1
  42. Neural Network Architecture Optimization through Submodularity and Supermodularity stat.ML · 2016 · author #4
  43. Optimizing Recurrent Neural Networks Architectures under Time Constraints stat.ML · 2016 · author #4
  44. Quantum Image Matching quant-ph · 2016 · author #1
  45. An Ensemble-Proper Orthogonal Decomposition Method for the Nonstationary Navier-Stokes Equations math.NA · 2016 · author #2
  46. Hand Segmentation for Hand-Object Interaction from Depth map cs.CV · 2016 · author #3
  47. Word Embedding based Correlation Model for Question/Answer Matching cs.CL · 2015 · author #3
  48. Doubly Robust Off-policy Value Evaluation for Reinforcement Learning cs.LG · 2015 · author #1
  49. Stability Bounds on Compact Astrophysical Objects from Information-Entropic Measure gr-qc · 2015 · author #2
  50. The electromagnetic decays of the charmed and bottom baryons in chiral perturbation theory hep-ph · 2015 · author #1
  51. Algorithms and Models for Turbulence Not at Statistical Equilibrium math.NA · 2015 · author #1
  52. Mass and axial charge of heavy baryons hep-ph · 2014 · author #1
  53. On the Minimum Energy of Sending Gaussian Multiterminal Sources over the Gaussian MAC cs.IT · 2013 · author #1
  54. Chaos control in random Boolean networks by reducing mean damage percolation rate nlin.CG · 2010 · author #1
  55. Formation Mechanism of Atmospheric Pressure Plasma Jet physics.plasm-ph · 2008 · author #1

Mentions

  • 1301.1061 #1 · backfill · confidence 0.70 Nan Jiang
  • 2605.04128 #14 · arxiv_oai · confidence 0.70 Nan Jiang
  • 1009.5010 #1 · backfill · confidence 0.70 Nan Jiang
  • 2405.07863 #7 · arxiv_oai · confidence 0.70 Nan Jiang
  • 2403.12031 #4 · arxiv_oai · confidence 0.70 Nan Jiang
  • 0811.0130 #1 · backfill · confidence 0.70 Nan Jiang

Frequent Coauthors