Paul Pu Liang

Identifiers

  • name variant Paul Pu Liang 0.60 · backfill

Papers (39)

  1. Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals cs.LG · 2026 · author #6
  2. FrontierOR: Benchmarking LLMs' Capacity for Efficient Algorithm Design in Large-Scale Optimization cs.AI · 2026 · author #25
  3. GEM-4D: Geometry-Enhanced Video World Models for Robot Manipulation cs.CV · 2026 · author #10
  4. PaintCopilot: Modeling Painting as Autonomous Artistic Continuation cs.CV · 2026 · author #3
  5. NeuroAtlas: Benchmarking Foundation Models for Clinical EEG and Brain-Computer Interfaces cs.LG · 2026 · author #14
  6. On the Invariance and Generality of Neural Scaling Laws cs.LG · 2026 · author #4
  7. Continuous First, Discrete Later: VQ-VAEs Without Dimensional Collapse cs.LG · 2026 · author #5
  8. Self-Captioning Multimodal Interaction Tuning: Amplifying Exploitable Redundancies for Robust Vision Language Models cs.CV · 2026 · author #4
  9. Video Active Perception: Effective Inference-Time Long-Form Video Understanding with Vision-Language Models cs.CV · 2026 · author #5
  10. Act2See: Emergent Active Visual Perception for Video Reasoning cs.CV · 2026 · author #5
  11. CTM-AI: A Blueprint for General AI Inspired by a Model of Consciousness q-bio.NC · 2026 · author #5
  12. DENALI: A Dataset Enabling Non-Line-of-Sight Spatial Reasoning with Low-Cost LiDARs cs.RO · 2026 · author #5
  13. SCATR: Simple Calibrated Test-Time Ranking cs.LG · 2026 · author #6
  14. Breaking Negative Cycles: A Reflection-To-Action System For Adaptive Change cs.HC · 2026 · author #10
  15. CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery cs.AI · 2026 · author #17
  16. AromaGen: Interactive Generation of Rich Olfactory Experiences with Multimodal Language Models cs.HC · 2026 · author #6
  17. OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization cs.AI · 2026 · author #12
  18. Towards a Science of Scaling Agent Systems cs.AI · 2025 · author #12
  19. Abstract 3D Perception for Spatial Intelligence in Vision-Language Models cs.CV · 2025 · author #5
  20. PAGE-4D: VGGT-4D Perception via Disentangled Pose and Geometry Estimation cs.CV · 2025 · author #7
  21. Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine cs.CL · 2025 · author #6
  22. MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents cs.CL · 2025 · author #9
  23. PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts cs.CL · 2025 · author #12
  24. SmellNet: A Large-scale Dataset for Real-world Smell Recognition cs.AI · 2025 · author #6
  25. Beyond Cross-Modal Alignment: Measuring and Leveraging Modality Gap in Vision-Language Models cs.CV · 2025 · author #5
  26. OS-ATLAS: A Foundation Action Model for Generalist GUI Agents cs.CL · 2024 · author #10
  27. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models cs.CL · 2022 · author #305
  28. Learning Representations from Imperfect Time Series Data via Tensor Rank Regularization cs.LG · 2019 · author #1
  29. Multimodal Transformer for Unaligned Multimodal Language Sequences cs.CL · 2019 · author #3
  30. An Empirical Evaluation of Sketched SVD and its Application to Leverage Score Ordering cs.LG · 2018 · author #2
  31. Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors cs.CL · 2018 · author #4
  32. Multimodal Language Analysis with Recurrent Multistage Fusion cs.LG · 2018 · author #1
  33. Multimodal Local-Global Ranking Fusion for Emotion Recognition cs.HC · 2018 · author #1
  34. Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis cs.CL · 2018 · author #3
  35. Learning Factorized Multimodal Representations cs.LG · 2018 · author #2
  36. Efficient Low-rank Multimodal Fusion with Modality-Specific Factors cs.AI · 2018 · author #4
  37. Memory Fusion Network for Multi-view Sequential Learning cs.LG · 2018 · author #2
  38. Multimodal Sentiment Analysis with Word-Level Fusion and Reinforcement Learning cs.LG · 2018 · author #3
  39. Multi-attention Recurrent Network for Human Communication Comprehension cs.AI · 2018 · author #2

Mentions

  • 2605.27583 #6 · arxiv_oai · confidence 0.70 Paul Pu Liang
  • 2605.25246 #25 · arxiv_oai · confidence 0.70 Paul Pu Liang
  • 2602.10635 #12 · arxiv_oai · confidence 0.70 Paul Pu Liang
  • 2605.22882 #10 · arxiv_oai · confidence 0.70 Paul Pu Liang
  • 2605.20941 #3 · arxiv_oai · confidence 0.70 Paul Pu Liang
  • 2604.01658 #17 · arxiv_oai · confidence 0.70 Paul Pu Liang

Frequent Coauthors