Identifiers
-
name variant
Hao Peng
0.60 · backfill
Papers (64)
-
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning
cs.LG · 2026 · author #4
-
FinHarness: An Inline Lifecycle Safety Harness for Finance LLM Agents
cs.CL · 2026 · author #12
-
ExTax: Explainable Disinformation Detection via Persuasion, Emotion, and Narrative Role Taxonomies
cs.CL · 2026 · author #10
-
MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning
cs.AI · 2026 · author #6
-
CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists
cs.AI · 2026 · author #10
-
Towards a Universal Causal Reasoner
cs.CL · 2026 · author #5
-
RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably
cs.CL · 2026 · author #8
-
Useful Memories Become Faulty When Continuously Updated by LLMs
cs.AI · 2026 · author #7
-
SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety
cs.CR · 2026 · author #8
-
Unintended Negative Impacts of Promotional Language in Patent Evaluation
cs.CL · 2026 · author #3
-
StoryAlign: Evaluating and Training Reward Models for Story Generation
cs.CL · 2026 · author #2
-
TriAlignGR: Triangular Multitask Alignment with Multimodal Deep Interest Mining for Generative Recommendation
cs.IR · 2026 · author #2
-
Kwai Summary Attention Technical Report
cs.CL · 2026 · author #5
-
FedRio: Personalized Federated Social Bot Detection via Cooperative Reinforced Contrastive Adversarial Distillation
cs.AI · 2026 · author #7
-
Structural Diversity Drives Disruptive Scientific Innovation
cs.SI · 2026 · author #9
-
Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning
cs.CL · 2026 · author #7
-
Countdown-Code: A Testbed for Studying The Emergence and Generalization of Reward Hacking in RLVR
cs.LG · 2026 · author #4
-
GLM-5: from Vibe Coding to Agentic Engineering
cs.LG · 2026 · author #79
-
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
cs.LG · 2026 · author #5
-
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning
cs.CV · 2026 · author #15
-
WisPaper: Your AI Scholar Search Engine
cs.IR · 2025 · author #8
-
SeSE: Black-Box Uncertainty Quantification for Large Language Models Based on Structural Information Theory
cs.CL · 2025 · author #2
-
Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark
cs.AI · 2025 · author #65
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
cs.CL · 2025 · author #56
-
Activation-Guided Local Editing for Jailbreaking Attacks
cs.CR · 2025 · author #3
-
The endoscopic character identity for even special orthogonal groups
math.RT · 2025 · author #1
-
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
cs.LG · 2025 · author #12
-
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
cs.LG · 2025 · author #5
-
RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning
cs.CL · 2025 · author #5
-
Process Reinforcement through Implicit Rewards
cs.LG · 2025 · author #20
-
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
cs.CL · 2024 · author #4
-
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
cs.CL · 2024 · author #11
-
CoreGuard: Safeguarding Foundational Capabilities of LLMs Against Model Stealing in Edge Deployment
cs.CR · 2024 · author #8
-
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
cs.SE · 2024 · author #22
-
Dynamic Network Embedding via Incremental Skip-gram with Negative Sampling
cs.LG · 2019 · author #1
-
Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification
cs.IR · 2019 · author #1
-
Fine-grained Event Categorization with Heterogeneous Graph Convolutional Networks
cs.SI · 2019 · author #1
-
Social Influence and Unfollowing Accelerate the Emergence of Echo Chambers
cs.CY · 2019 · author #3
-
Text Generation with Exemplar-based Adaptive Decoding
cs.CL · 2019 · author #1
-
Understanding Beauty via Deep Facial Features
cs.CV · 2019 · author #3
-
Multi-materials beam hardening artifacts correction for computed tomography (CT) based on X-ray spectrum estimation
physics.med-ph · 2018 · author #5
-
Graph Convolutional Neural Networks via Motif-based Attention
cs.LG · 2018 · author #1
-
Rational Recurrences
cs.CL · 2018 · author #1
-
Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention
cs.CV · 2018 · author #4
-
Backpropagating through Structured Argmax using a SPIGOT
cs.CL · 2018 · author #1
-
Joint Analysis of Individual-level and Summary-level GWAS Data by Leveraging Pleiotropy
q-bio.GN · 2018 · author #3
-
Learning Joint Semantic Parsers from Disjoint Data
cs.CL · 2018 · author #1
-
A unified image reconstruction framework for quantitative dual- and triple-energy CT imaging of material compositions
physics.med-ph · 2018 · author #5
-
"You are no Jack Kennedy": On Media Selection of Highlights from Presidential Debates
cs.SI · 2018 · author #2
-
Performance Dynamics and Success in Online Games
cs.SI · 2018 · author #2
-
Improving Orbit Prediction Accuracy through Supervised Machine Learning
astro-ph.EP · 2018 · author #1
-
Deep Multitask Learning for Semantic Dependency Parsing
cs.CL · 2017 · author #1
-
Asynchronous Distributed Variational Gaussian Processes for Regression
stat.ML · 2017 · author #1
-
A Gb/s Parallel Block-based Viterbi Decoder for Convolutional Codes on GPU
cs.DC · 2016 · author #1
-
A Convolutional Attention Network for Extreme Summarization of Source Code
cs.LG · 2016 · author #2
-
A Comparative Study on Regularization Strategies for Embedding-based Neural Networks
cs.CL · 2015 · author #1
-
Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Path
cs.CL · 2015 · author #5
-
Discriminative Neural Sentence Modeling by Tree-Based Convolution
cs.CL · 2015 · author #2
-
Building Program Vector Representations for Deep Learning
cs.SE · 2014 · author #4
-
EigenGP: Gaussian Process Models with Adaptive Eigenfunctions
cs.LG · 2014 · author #1
-
An extension of Motzkin-Straus Theorem to non-uniform hypergraphs and its applications
math.CO · 2013 · author #2
-
On Frankl and Füredi's conjecture for 3-uniform hypergraphs
math.CO · 2012 · author #2
-
Nonperturbative tuning of an improved relativistic heavy-quark action with application to bottom spectroscopy
hep-lat · 2012 · author #7
-
An Anti-attack Model Based on Complex Network Theory in P2P networks
cs.NI · 2011 · author #1
Mentions
-
2606.04923
#4 · arxiv_oai · confidence 0.70
Hao Peng
-
1504.01106
#2 · backfill · confidence 0.70
Hao Peng
-
2511.16275
#2 · arxiv_oai · confidence 0.70
Hao Peng
-
1409.3358
#4 · backfill · confidence 0.70
Hao Peng
-
1401.0362
#1 · backfill · confidence 0.70
Hao Peng
-
1312.4135
#2 · backfill · confidence 0.70
Hao Peng
-
2602.01058
#5 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.27333
#12 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.27045
#10 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.26567
#6 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.26029
#10 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.24873
#5 · arxiv_oai · confidence 0.70
Hao Peng
-
1211.7056
#2 · backfill · confidence 0.70
Hao Peng
-
2605.05704
#8 · arxiv_oai · confidence 0.70
Hao Peng
-
1206.2554
#7 · backfill · confidence 0.70
Hao Peng
-
1108.5530
#1 · backfill · confidence 0.70
Hao Peng
-
2410.17891
#11 · arxiv_oai · confidence 0.70
Hao Peng
-
2601.06943
#15 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.05249
#2 · arxiv_oai · confidence 0.70
Hao Peng
-
1905.03919
#3 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.15514
#8 · arxiv_oai · confidence 0.70
Hao Peng
-
2505.15134
#5 · arxiv_oai · confidence 0.70
Hao Peng
-
2412.15204
#4 · arxiv_oai · confidence 0.70
Hao Peng
-
2605.04831
#2 · backfill · confidence 0.70
Hao Peng
-
2604.10678
#7 · backfill · confidence 0.70
Hao Peng
-
1602.03001
#2 · backfill · confidence 0.70
Hao Peng