Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics , publisher =

author Zhang, W · 2019 · DOI 10.18653/v1/p19-1426

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Combining On-Policy Optimization and Distillation for Long-Context Reasoning in Large Language Models

cs.CL · 2026-05-12 · unverdicted · novelty 7.0

dGRPO merges outcome-based policy optimization with dense teacher guidance from on-policy distillation, yielding more stable long-context reasoning on the new LongBlocks synthetic dataset.

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

cs.CL · 2026-05-21 · unverdicted · novelty 5.0

LANG combines language-adaptive hint guidance, progressive decay, and difficulty-tailored learning horizons in RL to boost non-English reasoning performance while preserving language consistency.

Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation

cs.LG · 2024-11-08 · unverdicted · novelty 4.0

DiffGap introduces adaptive alignment of denoising steps and temperature annealing in diffusion models for 3D molecule generation, reporting better docking scores and binding affinity than prior methods on CrossDocked2020.

citing papers explorer

Showing 3 of 3 citing papers.

Combining On-Policy Optimization and Distillation for Long-Context Reasoning in Large Language Models cs.CL · 2026-05-12 · unverdicted · none · ref 84
dGRPO merges outcome-based policy optimization with dense teacher guidance from on-policy distillation, yielding more stable long-context reasoning on the new LongBlocks synthetic dataset.
LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance cs.CL · 2026-05-21 · unverdicted · none · ref 4
LANG combines language-adaptive hint guidance, progressive decay, and difficulty-tailored learning horizons in RL to boost non-English reasoning performance while preserving language consistency.
Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation cs.LG · 2024-11-08 · unverdicted · none · ref 48
DiffGap introduces adaptive alignment of denoising steps and temperature annealing in diffusion models for 3D molecule generation, reporting better docking scores and binding affinity than prior methods on CrossDocked2020.

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics , publisher =

fields

years

verdicts

representative citing papers

citing papers explorer