Maosong Sun
Identifiers
- name variant Maosong Sun 0.60 · backfill
Papers (80)
- Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs cs.CV · 2026 · author #8
- Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning cs.CV · 2026 · author #6
- Test-Time Deep Thinking to Explore Implicit Rules cs.AI · 2026 · author #11
- AutoVecCoder: Teaching LLMs to Generate Explicitly Vectorized Code cs.CL · 2026 · author #11
- DiffScore: Text Evaluation Beyond Autoregressive Likelihood cs.CL · 2026 · author #6
- Khala: Scaling Acoustic Token Language Models Toward High-Fidelity Music Generation cs.SD · 2026 · author #11
- From Context to Skills: Can Language Models Learn from Context Skillfully? cs.AI · 2026 · author #13
- MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction cs.CL · 2026 · author #34
- KARL: Mitigating Hallucinations in LLMs via Knowledge-Boundary-Aware Reinforcement Learning cs.LG · 2026 · author #8
- UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents cs.CV · 2026 · author #10
- FactNet: A Billion-Scale Knowledge Graph for Multilingual Factual Grounding cs.CL · 2026 · author #10
- CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning cs.CL · 2026 · author #11
- MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization cs.CL · 2026 · author #9
- Finding What Matters: Anchoring Context Knowledge with Evolving Indices for Iterative Retrieval cs.CL · 2026 · author #10
- MEIC-DT: Memory-Efficient Incremental Clustering for Long-Text Coreference Resolution with Dual-Threshold Constraints cs.IR · 2025 · author #11
- FaithLens: Detecting and Explaining Faithfulness Hallucination cs.CL · 2025 · author #11
- Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores cs.SD · 2025 · author #15
- ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement cs.CL · 2025 · author #12
- A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks cs.CL · 2025 · author #8
- StateX: Enhancing RNN Recall via Post-training State Expansion cs.CL · 2025 · author #6
- MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe cs.LG · 2025 · author #34
- Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization cs.CL · 2025 · author #10
- Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation cs.CL · 2025 · author #10
- AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage cs.AI · 2025 · author #10
- AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization cs.CL · 2025 · author #11
- Process Reinforcement through Implicit Rewards cs.LG · 2025 · author #23
- Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance cs.CV · 2024 · author #5
- VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents cs.IR · 2024 · author #11
- MiniCPM-V: A GPT-4V Level MLLM on Your Phone cs.CV · 2024 · author #23
- MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies cs.CL · 2024 · author #25
- OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems cs.CL · 2024 · author #14
- UltraFeedback: Boosting Language Models with Scaled AI Feedback cs.CL · 2023 · author #12
- AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors cs.CL · 2023 · author #15
- ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs cs.AI · 2023 · author #19
- ChatDev: Communicative Agents for Software Development cs.SE · 2023 · author #14
- Enhancing Chat Language Models by Scaling High-quality Instructional Conversations cs.CL · 2023 · author #8
- Tool Learning with Foundation Models cs.CL · 2023 · author #41
- Quantifying Similarity between Relations with Fact Distribution cs.AI · 2019 · author #5
- Modeling Semantic Compositionality with Sememe Knowledge cs.CL · 2019 · author #7
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairs cs.CL · 2019 · author #5
- ERNIE: Enhanced Language Representation with Informative Entities cs.CL · 2019 · author #5
- Graph Neural Networks with Generated Parameters for Relation Extraction cs.CL · 2019 · author #6
- OpenHowNet: An Open Sememe-based Lexical Knowledge Base cs.CL · 2019 · author #5
- Knowledge Representation Learning: A Quantitative Review cs.CL · 2018 · author #5
- COSINE: Compressive Network Embedding on Large-scale Information Networks cs.SI · 2018 · author #4
- Neural Diffusion Model for Microscopic Cascade Prediction cs.SI · 2018 · author #2
- CED: Credible Early Detection of Social Media Rumors cs.SI · 2018 · author #5
- Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization cs.CL · 2018 · author #5
- Language Modeling with Sparse Product of Sememe Experts cs.CL · 2018 · author #6
- FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation cs.LG · 2018 · author #7
- Overview of CAIL2018: Legal Judgment Prediction Competition cs.AI · 2018 · author #6
- Enhancing Stock Movement Prediction with Adversarial Training q-fin.TR · 2018 · author #5
- Improving the Transformer Translation Model with Document-Level Context cs.CL · 2018 · author #3
- Automatic Judgment Prediction via Legal Reading Comprehension cs.AI · 2018 · author #4
- Chinese Poetry Generation with a Salient-Clue Mechanism cs.AI · 2018 · author #3
- Chinese Poetry Generation with a Working Memory Model cs.AI · 2018 · author #2
- CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction cs.CL · 2018 · author #6
- Incorporating Chinese Characters of Words for Lexical Sememe Prediction cs.CL · 2018 · author #5
- Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training cs.CL · 2018 · author #3
- Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval cs.IR · 2018 · author #3
- THUMT: An Open Source Toolkit for Neural Machine Translation cs.CL · 2017 · author #5
- Joint POS Tagging and Dependency Parsing with Transition-based Neural Networks cs.CL · 2017 · author #5
- Neural Emoji Recommendation in Dialogue Systems cs.CL · 2016 · author #4
- A Unified Framework for Community Detection and Network Representation Learning cs.SI · 2016 · author #6
- Neural Machine Translation with Pivot Languages cs.CL · 2016 · author #4
- Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion cs.CL · 2016 · author #3
- Incorporating Relation Paths in Neural Relation Extraction cs.CL · 2016 · author #4
- Knowledge Representation via Joint Learning of Sequential Text and Knowledge Graphs cs.CL · 2016 · author #4
- Topic Sensitive Neural Headline Generation cs.CL · 2016 · author #5
- A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories cs.SI · 2016 · author #2
- Agreement-based Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora cs.CL · 2016 · author #4
- Semi-Supervised Learning for Neural Machine Translation cs.CL · 2016 · author #6
- Neural Headline Generation with Sentence-wise Optimization cs.CL · 2016 · author #5
- Generating Chinese Classical Poems with RNN Encoder-Decoder cs.CL · 2016 · author #3
- Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation cs.CL · 2015 · author #6
- Minimum Risk Training for Neural Machine Translation cs.CL · 2015 · author #6
- Modeling Relation Paths for Representation Learning of Knowledge Bases cs.CL · 2015 · author #4
- Contrastive Unsupervised Word Alignment with Non-Local Features cs.CL · 2014 · author #2
- Reduce Meaningless Words for Joint Chinese Word Segmentation and Part-of-speech Tagging cs.CL · 2013 · author #2
- Binary Tree based Chinese Word Segmentation cs.CL · 2013 · author #3
Mentions
- 2604.27660 #13 · arxiv_oai · confidence 0.70 Maosong Sun
- 2601.16462 #10 · arxiv_oai · confidence 0.70 Maosong Sun
- 1410.2082 #2 · backfill · confidence 0.70 Maosong Sun
- 2605.30611 #8 · arxiv_oai · confidence 0.70 Maosong Sun
- 2411.14279 #5 · arxiv_oai · confidence 0.70 Maosong Sun
- 1305.5918 #2 · backfill · confidence 0.70 Maosong Sun
- 1305.3981 #3 · backfill · confidence 0.70 Maosong Sun
- 2605.25437 #6 · arxiv_oai · confidence 0.70 Maosong Sun
- 2602.02979 #11 · arxiv_oai · confidence 0.70 Maosong Sun
- 2605.24828 #11 · arxiv_oai · confidence 0.70 Maosong Sun
- 2605.17978 #11 · arxiv_oai · confidence 0.70 Maosong Sun
- 2310.01377 #12 · arxiv_oai · confidence 0.70 Maosong Sun
- 2410.10594 #11 · arxiv_oai · confidence 0.70 Maosong Sun
- 2304.08354 #41 · arxiv_oai · confidence 0.70 Maosong Sun
- 2308.10848 #15 · arxiv_oai · confidence 0.70 Maosong Sun
- 2305.14233 #8 · arxiv_oai · confidence 0.70 Maosong Sun
- 2509.18154 #34 · arxiv_oai · confidence 0.70 Maosong Sun
Frequent Coauthors
- Zhiyuan Liu 44 shared papers
- Xu Han 17 shared papers
- Yang Liu 11 shared papers
- Cheng Yang 9 shared papers
- Yankai Lin 9 shared papers
- Ruobing Xie 8 shared papers
- Shuo Wang 8 shared papers
- Shuzheng Si 8 shared papers
- Jie Zhou 7 shared papers
- Kangyang Luo 7 shared papers
- Weize Chen 7 shared papers
- Zhenghao Liu 7 shared papers
- Fanchao Qi 6 shared papers
- Huanbo Luan 6 shared papers
- Ning Ding 6 shared papers
- Yuan Yao 6 shared papers
- Bokai Xu 5 shared papers
- Chaojun Xiao 5 shared papers
- Cunchao Tu 5 shared papers
- Dahai Li 5 shared papers