pith. sign in

Maosong Sun

Identifiers

  • name variant Maosong Sun 0.60 · backfill

Papers (80)

  1. Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs cs.CV · 2026 · author #8
  2. Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning cs.CV · 2026 · author #6
  3. Test-Time Deep Thinking to Explore Implicit Rules cs.AI · 2026 · author #11
  4. AutoVecCoder: Teaching LLMs to Generate Explicitly Vectorized Code cs.CL · 2026 · author #11
  5. DiffScore: Text Evaluation Beyond Autoregressive Likelihood cs.CL · 2026 · author #6
  6. Khala: Scaling Acoustic Token Language Models Toward High-Fidelity Music Generation cs.SD · 2026 · author #11
  7. From Context to Skills: Can Language Models Learn from Context Skillfully? cs.AI · 2026 · author #13
  8. MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction cs.CL · 2026 · author #34
  9. KARL: Mitigating Hallucinations in LLMs via Knowledge-Boundary-Aware Reinforcement Learning cs.LG · 2026 · author #8
  10. UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents cs.CV · 2026 · author #10
  11. FactNet: A Billion-Scale Knowledge Graph for Multilingual Factual Grounding cs.CL · 2026 · author #10
  12. CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning cs.CL · 2026 · author #11
  13. MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization cs.CL · 2026 · author #9
  14. Finding What Matters: Anchoring Context Knowledge with Evolving Indices for Iterative Retrieval cs.CL · 2026 · author #10
  15. MEIC-DT: Memory-Efficient Incremental Clustering for Long-Text Coreference Resolution with Dual-Threshold Constraints cs.IR · 2025 · author #11
  16. FaithLens: Detecting and Explaining Faithfulness Hallucination cs.CL · 2025 · author #11
  17. Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores cs.SD · 2025 · author #15
  18. ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement cs.CL · 2025 · author #12
  19. A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks cs.CL · 2025 · author #8
  20. StateX: Enhancing RNN Recall via Post-training State Expansion cs.CL · 2025 · author #6
  21. MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe cs.LG · 2025 · author #34
  22. Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization cs.CL · 2025 · author #10
  23. Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation cs.CL · 2025 · author #10
  24. AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage cs.AI · 2025 · author #10
  25. AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization cs.CL · 2025 · author #11
  26. Process Reinforcement through Implicit Rewards cs.LG · 2025 · author #23
  27. Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance cs.CV · 2024 · author #5
  28. VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents cs.IR · 2024 · author #11
  29. MiniCPM-V: A GPT-4V Level MLLM on Your Phone cs.CV · 2024 · author #23
  30. MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies cs.CL · 2024 · author #25
  31. OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems cs.CL · 2024 · author #14
  32. UltraFeedback: Boosting Language Models with Scaled AI Feedback cs.CL · 2023 · author #12
  33. AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors cs.CL · 2023 · author #15
  34. ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs cs.AI · 2023 · author #19
  35. ChatDev: Communicative Agents for Software Development cs.SE · 2023 · author #14
  36. Enhancing Chat Language Models by Scaling High-quality Instructional Conversations cs.CL · 2023 · author #8
  37. Tool Learning with Foundation Models cs.CL · 2023 · author #41
  38. Quantifying Similarity between Relations with Fact Distribution cs.AI · 2019 · author #5
  39. Modeling Semantic Compositionality with Sememe Knowledge cs.CL · 2019 · author #7
  40. COS960: A Chinese Word Similarity Dataset of 960 Word Pairs cs.CL · 2019 · author #5
  41. ERNIE: Enhanced Language Representation with Informative Entities cs.CL · 2019 · author #5
  42. Graph Neural Networks with Generated Parameters for Relation Extraction cs.CL · 2019 · author #6
  43. OpenHowNet: An Open Sememe-based Lexical Knowledge Base cs.CL · 2019 · author #5
  44. Knowledge Representation Learning: A Quantitative Review cs.CL · 2018 · author #5
  45. COSINE: Compressive Network Embedding on Large-scale Information Networks cs.SI · 2018 · author #4
  46. Neural Diffusion Model for Microscopic Cascade Prediction cs.SI · 2018 · author #2
  47. CED: Credible Early Detection of Social Media Rumors cs.SI · 2018 · author #5
  48. Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization cs.CL · 2018 · author #5
  49. Language Modeling with Sparse Product of Sememe Experts cs.CL · 2018 · author #6
  50. FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation cs.LG · 2018 · author #7
  51. Overview of CAIL2018: Legal Judgment Prediction Competition cs.AI · 2018 · author #6
  52. Enhancing Stock Movement Prediction with Adversarial Training q-fin.TR · 2018 · author #5
  53. Improving the Transformer Translation Model with Document-Level Context cs.CL · 2018 · author #3
  54. Automatic Judgment Prediction via Legal Reading Comprehension cs.AI · 2018 · author #4
  55. Chinese Poetry Generation with a Salient-Clue Mechanism cs.AI · 2018 · author #3
  56. Chinese Poetry Generation with a Working Memory Model cs.AI · 2018 · author #2
  57. CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction cs.CL · 2018 · author #6
  58. Incorporating Chinese Characters of Words for Lexical Sememe Prediction cs.CL · 2018 · author #5
  59. Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training cs.CL · 2018 · author #3
  60. Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval cs.IR · 2018 · author #3
  61. THUMT: An Open Source Toolkit for Neural Machine Translation cs.CL · 2017 · author #5
  62. Joint POS Tagging and Dependency Parsing with Transition-based Neural Networks cs.CL · 2017 · author #5
  63. Neural Emoji Recommendation in Dialogue Systems cs.CL · 2016 · author #4
  64. A Unified Framework for Community Detection and Network Representation Learning cs.SI · 2016 · author #6
  65. Neural Machine Translation with Pivot Languages cs.CL · 2016 · author #4
  66. Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion cs.CL · 2016 · author #3
  67. Incorporating Relation Paths in Neural Relation Extraction cs.CL · 2016 · author #4
  68. Knowledge Representation via Joint Learning of Sequential Text and Knowledge Graphs cs.CL · 2016 · author #4
  69. Topic Sensitive Neural Headline Generation cs.CL · 2016 · author #5
  70. A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories cs.SI · 2016 · author #2
  71. Agreement-based Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora cs.CL · 2016 · author #4
  72. Semi-Supervised Learning for Neural Machine Translation cs.CL · 2016 · author #6
  73. Neural Headline Generation with Sentence-wise Optimization cs.CL · 2016 · author #5
  74. Generating Chinese Classical Poems with RNN Encoder-Decoder cs.CL · 2016 · author #3
  75. Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation cs.CL · 2015 · author #6
  76. Minimum Risk Training for Neural Machine Translation cs.CL · 2015 · author #6
  77. Modeling Relation Paths for Representation Learning of Knowledge Bases cs.CL · 2015 · author #4
  78. Contrastive Unsupervised Word Alignment with Non-Local Features cs.CL · 2014 · author #2
  79. Reduce Meaningless Words for Joint Chinese Word Segmentation and Part-of-speech Tagging cs.CL · 2013 · author #2
  80. Binary Tree based Chinese Word Segmentation cs.CL · 2013 · author #3

Mentions

  • 2604.27660 #13 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2601.16462 #10 · arxiv_oai · confidence 0.70 Maosong Sun
  • 1410.2082 #2 · backfill · confidence 0.70 Maosong Sun
  • 2605.30611 #8 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2411.14279 #5 · arxiv_oai · confidence 0.70 Maosong Sun
  • 1305.5918 #2 · backfill · confidence 0.70 Maosong Sun
  • 1305.3981 #3 · backfill · confidence 0.70 Maosong Sun
  • 2605.25437 #6 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2602.02979 #11 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2605.24828 #11 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2605.17978 #11 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2310.01377 #12 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2410.10594 #11 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2304.08354 #41 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2308.10848 #15 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2305.14233 #8 · arxiv_oai · confidence 0.70 Maosong Sun
  • 2509.18154 #34 · arxiv_oai · confidence 0.70 Maosong Sun

Frequent Coauthors