pith. sign in

Kai Yu

Identifiers

  • name variant Kai Yu 0.60 · backfill

Papers (47)

  1. Multi-Paradigm Agent Interaction in Practice:A Systematic Analysis of Generator-Evaluator, ReAct Loop,and Adversarial Evaluation in the buddyMe Framework cs.AI · 2026 · author #3
  2. Artificial Intelligence-Assistant Cardiotocography: Unified Model for Signal Reconstruction, Fetal Heart Rate Analysis, and Variability Assessment cs.LG · 2026 · author #2
  3. Good to Go: The LOOP Skill Engine That Hits 99% Success and Slashes Token Usage by 99% via One-Shot Recording and Deterministic Replay cs.AI · 2026 · author #2
  4. No Action Without a NOD: A Heterogeneous Multi-Agent Architecture for Reliable Service Agents cs.AI · 2026 · author #8
  5. HiDream-O1-Image: A Natively Unified Image Generative Foundation Model with Pixel-level Unified Transformer cs.CV · 2026 · author #9
  6. Uniqueness for an inverse coefficient problem of a weakly coupled parabolic system math.AP · 2026 · author #2
  7. Implicit Preference Alignment for Human Image Animation cs.CV · 2026 · author #5
  8. X-Voice: Enabling Everyone to Speak 30 Languages via Zero-Shot Cross-Lingual Voice Cloning cs.SD · 2026 · author #12
  9. FaithfulFaces: Pose-Faithful Facial Identity Preservation for Text-to-Video Generation cs.CV · 2026 · author #5
  10. Dual-LoRA: Parameter-Efficient Adversarial Disentanglement for Cross-Lingual Speaker Verification eess.AS · 2026 · author #7
  11. RAS: a Reliability Oriented Metric for Automatic Speech Recognition cs.SD · 2026 · author #8
  12. Using Importance Sampling to Estimate $p$-values in All-Subset Meta-Analysis, with Applications to Single-Cell eQTL Mapping stat.ME · 2026 · author #5
  13. Diagnosing CFG Interpretation in LLMs cs.AI · 2026 · author #3
  14. Anonymization, Not Elimination: Utility-Preserved Speech Anonymization eess.AS · 2026 · author #5
  15. X-VC: Zero-shot Streaming Voice Conversion in Codec Space eess.AS · 2026 · author #9
  16. Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition cs.CL · 2026 · author #10
  17. TASU2: Controllable CTC Simulation for Alignment and Low-Resource Adaptation of Speech LLMs eess.AS · 2026 · author #8
  18. Does Pass Rate Tell the Whole Story? Evaluating Design Constraint Compliance in LLM-based Issue Resolution cs.SE · 2026 · author #1
  19. PRIME: Prototype-Driven Multimodal Pretraining for Cancer Prognosis with Missing Modalities cs.LG · 2026 · author #1
  20. CharTool: Tool-Integrated Visual Reasoning for Chart Understanding cs.AI · 2026 · author #9
  21. An Underexplored Frontier: Large Language Models for Rare Disease Patient Education and Communication -- A scoping review cs.CL · 2026 · author #3
  22. ChemDFM-R: A Chemical Reasoning LLM Enhanced with Atomized Chemical Knowledge cs.CE · 2025 · author #15
  23. HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer cs.CV · 2025 · author #12
  24. F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching eess.AS · 2024 · author #7
  25. Semantic Parsing with Dual Learning cs.CL · 2019 · author #5
  26. Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition eess.AS · 2019 · author #5
  27. AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning cs.CL · 2019 · author #6
  28. A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data cs.CL · 2019 · author #3
  29. End-to-End Monaural Multi-speaker ASR System without Pretraining cs.CL · 2018 · author #3
  30. Towards Universal Dialogue State Tracking cs.CL · 2018 · author #4
  31. Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting cs.CL · 2018 · author #3
  32. Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition cs.SD · 2018 · author #4
  33. On Modular Training of Neural Acoustics-to-Word Model for LVCSR cs.CL · 2018 · author #4
  34. Concept Transfer Learning for Adaptive Language Understanding cs.CL · 2017 · author #2
  35. A Large-scale Distributed Video Parsing and Evaluation Platform cs.CV · 2016 · author #1
  36. Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization cs.CV · 2016 · author #1
  37. Encoder-decoder with Focus-mechanism for Sequence Labelling Based Spoken Language Understanding cs.CL · 2016 · author #2
  38. Text Flow: A Unified Text Detection System in Natural Scene Images cs.CV · 2016 · author #5
  39. On Training Bi-directional Neural Network Language Model with Noise Contrastive Estimation cs.CL · 2016 · author #4
  40. Bidirectional LSTM-CRF Models for Sequence Tagging cs.CL · 2015 · author #3
  41. Recurrent Polynomial Network for Dialogue State Tracking cs.CL · 2015 · author #3
  42. High-dimensional Joint Sparsity Random Effects Model for Multi-task Learning cs.LG · 2013 · author #2
  43. Large Scale Strongly Supervised Ensemble Metric Learning, with Applications to Face Verification and Retrieval cs.CV · 2012 · author #3
  44. Collaborative Ensemble Learning: Combining Collaborative and Content-Based Information Filtering via Hierarchical Bayes cs.LG · 2012 · author #1
  45. Smooth Sparse Coding via Marginal Regression for Learning Sparse Representations stat.ML · 2012 · author #2
  46. Infinite Hidden Relational Models cs.AI · 2012 · author #3
  47. High Dimensional Nonlinear Learning using Local Coordinate Coding stat.ML · 2009 · author #1

Mentions

  • 1212.6094 #3 · backfill · confidence 0.70 Kai Yu
  • 1212.2508 #1 · backfill · confidence 0.70 Kai Yu
  • 1210.1121 #2 · backfill · confidence 0.70 Kai Yu
  • 1206.6864 #3 · backfill · confidence 0.70 Kai Yu
  • 2605.16821 #3 · arxiv_oai · confidence 0.70 Kai Yu
  • 2505.22705 #12 · arxiv_oai · confidence 0.70 Kai Yu
  • 0906.5190 #1 · backfill · confidence 0.70 Kai Yu
  • 2410.06885 #7 · arxiv_oai · confidence 0.70 Kai Yu

Frequent Coauthors