Maosong Sun

Identifiers

name variant Maosong Sun 0.60 · backfill

Papers (80)

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs cs.CV · 2026 · author #8
Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning cs.CV · 2026 · author #6
Test-Time Deep Thinking to Explore Implicit Rules cs.AI · 2026 · author #11
AutoVecCoder: Teaching LLMs to Generate Explicitly Vectorized Code cs.CL · 2026 · author #11
DiffScore: Text Evaluation Beyond Autoregressive Likelihood cs.CL · 2026 · author #6
Khala: Scaling Acoustic Token Language Models Toward High-Fidelity Music Generation cs.SD · 2026 · author #11
From Context to Skills: Can Language Models Learn from Context Skillfully? cs.AI · 2026 · author #13
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction cs.CL · 2026 · author #34
KARL: Mitigating Hallucinations in LLMs via Knowledge-Boundary-Aware Reinforcement Learning cs.LG · 2026 · author #8
UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents cs.CV · 2026 · author #10
FactNet: A Billion-Scale Knowledge Graph for Multilingual Factual Grounding cs.CL · 2026 · author #10
CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning cs.CL · 2026 · author #11
MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization cs.CL · 2026 · author #9
Finding What Matters: Anchoring Context Knowledge with Evolving Indices for Iterative Retrieval cs.CL · 2026 · author #10
MEIC-DT: Memory-Efficient Incremental Clustering for Long-Text Coreference Resolution with Dual-Threshold Constraints cs.IR · 2025 · author #11
FaithLens: Detecting and Explaining Faithfulness Hallucination cs.CL · 2025 · author #11
Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores cs.SD · 2025 · author #15
ImCoref-CeS: An Improved Lightweight Pipeline for Coreference Resolution with LLM-based Checker-Splitter Refinement cs.CL · 2025 · author #12
A Goal Without a Plan Is Just a Wish: Efficient and Effective Global Planner Training for Long-Horizon Agent Tasks cs.CL · 2025 · author #8
StateX: Enhancing RNN Recall via Post-training State Expansion cs.CL · 2025 · author #6
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe cs.LG · 2025 · author #34
Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization cs.CL · 2025 · author #10
Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation cs.CL · 2025 · author #10
AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage cs.AI · 2025 · author #10
AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization cs.CL · 2025 · author #11
Process Reinforcement through Implicit Rewards cs.LG · 2025 · author #23
Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance cs.CV · 2024 · author #5
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents cs.IR · 2024 · author #11
MiniCPM-V: A GPT-4V Level MLLM on Your Phone cs.CV · 2024 · author #23
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies cs.CL · 2024 · author #25
OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems cs.CL · 2024 · author #14
UltraFeedback: Boosting Language Models with Scaled AI Feedback cs.CL · 2023 · author #12
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors cs.CL · 2023 · author #15
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs cs.AI · 2023 · author #19
ChatDev: Communicative Agents for Software Development cs.SE · 2023 · author #14
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations cs.CL · 2023 · author #8
Tool Learning with Foundation Models cs.CL · 2023 · author #41
Quantifying Similarity between Relations with Fact Distribution cs.AI · 2019 · author #5
Modeling Semantic Compositionality with Sememe Knowledge cs.CL · 2019 · author #7
COS960: A Chinese Word Similarity Dataset of 960 Word Pairs cs.CL · 2019 · author #5
ERNIE: Enhanced Language Representation with Informative Entities cs.CL · 2019 · author #5
Graph Neural Networks with Generated Parameters for Relation Extraction cs.CL · 2019 · author #6
OpenHowNet: An Open Sememe-based Lexical Knowledge Base cs.CL · 2019 · author #5
Knowledge Representation Learning: A Quantitative Review cs.CL · 2018 · author #5
COSINE: Compressive Network Embedding on Large-scale Information Networks cs.SI · 2018 · author #4
Neural Diffusion Model for Microscopic Cascade Prediction cs.SI · 2018 · author #2
CED: Credible Early Detection of Social Media Rumors cs.SI · 2018 · author #5
Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization cs.CL · 2018 · author #5
Language Modeling with Sparse Product of Sememe Experts cs.CL · 2018 · author #6
FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation cs.LG · 2018 · author #7
Overview of CAIL2018: Legal Judgment Prediction Competition cs.AI · 2018 · author #6
Enhancing Stock Movement Prediction with Adversarial Training q-fin.TR · 2018 · author #5
Improving the Transformer Translation Model with Document-Level Context cs.CL · 2018 · author #3
Automatic Judgment Prediction via Legal Reading Comprehension cs.AI · 2018 · author #4
Chinese Poetry Generation with a Salient-Clue Mechanism cs.AI · 2018 · author #3
Chinese Poetry Generation with a Working Memory Model cs.AI · 2018 · author #2
CAIL2018: A Large-Scale Legal Dataset for Judgment Prediction cs.CL · 2018 · author #6
Incorporating Chinese Characters of Words for Lexical Sememe Prediction cs.CL · 2018 · author #5
Denoising Distant Supervision for Relation Extraction via Instance-Level Adversarial Training cs.CL · 2018 · author #3
Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval cs.IR · 2018 · author #3
THUMT: An Open Source Toolkit for Neural Machine Translation cs.CL · 2017 · author #5
Joint POS Tagging and Dependency Parsing with Transition-based Neural Networks cs.CL · 2017 · author #5
Neural Emoji Recommendation in Dialogue Systems cs.CL · 2016 · author #4
A Unified Framework for Community Detection and Network Representation Learning cs.SI · 2016 · author #6
Neural Machine Translation with Pivot Languages cs.CL · 2016 · author #4
Joint Representation Learning of Text and Knowledge for Knowledge Graph Completion cs.CL · 2016 · author #3
Incorporating Relation Paths in Neural Relation Extraction cs.CL · 2016 · author #4
Knowledge Representation via Joint Learning of Sequential Text and Knowledge Graphs cs.CL · 2016 · author #4
Topic Sensitive Neural Headline Generation cs.CL · 2016 · author #5
A Neural Network Approach to Joint Modeling Social Networks and Mobile Trajectories cs.SI · 2016 · author #2
Agreement-based Learning of Parallel Lexicons and Phrases from Non-Parallel Corpora cs.CL · 2016 · author #4
Semi-Supervised Learning for Neural Machine Translation cs.CL · 2016 · author #6
Neural Headline Generation with Sentence-wise Optimization cs.CL · 2016 · author #5
Generating Chinese Classical Poems with RNN Encoder-Decoder cs.CL · 2016 · author #3
Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation cs.CL · 2015 · author #6
Minimum Risk Training for Neural Machine Translation cs.CL · 2015 · author #6
Modeling Relation Paths for Representation Learning of Knowledge Bases cs.CL · 2015 · author #4
Contrastive Unsupervised Word Alignment with Non-Local Features cs.CL · 2014 · author #2
Reduce Meaningless Words for Joint Chinese Word Segmentation and Part-of-speech Tagging cs.CL · 2013 · author #2
Binary Tree based Chinese Word Segmentation cs.CL · 2013 · author #3

Mentions

2604.27660 #13 · arxiv_oai · confidence 0.70 Maosong Sun
2601.16462 #10 · arxiv_oai · confidence 0.70 Maosong Sun
1410.2082 #2 · backfill · confidence 0.70 Maosong Sun
2605.30611 #8 · arxiv_oai · confidence 0.70 Maosong Sun
2411.14279 #5 · arxiv_oai · confidence 0.70 Maosong Sun
1305.5918 #2 · backfill · confidence 0.70 Maosong Sun
1305.3981 #3 · backfill · confidence 0.70 Maosong Sun
2605.25437 #6 · arxiv_oai · confidence 0.70 Maosong Sun
2602.02979 #11 · arxiv_oai · confidence 0.70 Maosong Sun
2605.24828 #11 · arxiv_oai · confidence 0.70 Maosong Sun
2605.17978 #11 · arxiv_oai · confidence 0.70 Maosong Sun
2310.01377 #12 · arxiv_oai · confidence 0.70 Maosong Sun
2410.10594 #11 · arxiv_oai · confidence 0.70 Maosong Sun
2304.08354 #41 · arxiv_oai · confidence 0.70 Maosong Sun
2308.10848 #15 · arxiv_oai · confidence 0.70 Maosong Sun
2305.14233 #8 · arxiv_oai · confidence 0.70 Maosong Sun
2509.18154 #34 · arxiv_oai · confidence 0.70 Maosong Sun

Frequent Coauthors

Zhiyuan Liu 44 shared papers
Xu Han 17 shared papers
Yang Liu 11 shared papers
Cheng Yang 9 shared papers
Yankai Lin 9 shared papers
Ruobing Xie 8 shared papers
Shuo Wang 8 shared papers
Shuzheng Si 8 shared papers
Jie Zhou 7 shared papers
Kangyang Luo 7 shared papers
Weize Chen 7 shared papers
Zhenghao Liu 7 shared papers
Fanchao Qi 6 shared papers
Huanbo Luan 6 shared papers
Ning Ding 6 shared papers
Yuan Yao 6 shared papers
Bokai Xu 5 shared papers
Chaojun Xiao 5 shared papers
Cunchao Tu 5 shared papers
Dahai Li 5 shared papers