Path-Lock Expert uses two mode-specific MLP experts per decoder layer with a deterministic router to cleanly separate think and no-think modes, improving no-think accuracy and conciseness on math benchmarks while keeping think performance intact.
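The routing idea in this summary can be illustrated with a minimal sketch in plain Python. It is not the paper's implementation: the class name `PathLockedMLP`, the mode labels, the ReLU activation, the random initialisation, and the assumption that the caller supplies the mode explicitly are all hypothetical choices made for illustration. It only shows that a deterministic router picks exactly one of two mode-specific MLPs, so the two modes never share MLP weights.

```python
import random

MODES = ("think", "no_think")  # hypothetical mode labels


def make_mlp(d_model, d_hidden, rng):
    """Random weights for a two-layer MLP (illustrative initialisation only)."""
    w_in = [[rng.gauss(0.0, 0.1) for _ in range(d_model)] for _ in range(d_hidden)]
    w_out = [[rng.gauss(0.0, 0.1) for _ in range(d_hidden)] for _ in range(d_model)]
    return w_in, w_out


def mlp_forward(params, x):
    """Apply linear -> ReLU -> linear to a single token vector x."""
    w_in, w_out = params
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w_in]
    return [sum(w * h for w, h in zip(row, hidden)) for row in w_out]


class PathLockedMLP:
    """One decoder layer's MLP block with a separate expert per mode (assumed layout)."""

    def __init__(self, d_model, d_hidden, seed=0):
        rng = random.Random(seed)
        # Each mode owns its own weights; nothing is shared between experts.
        self.experts = {mode: make_mlp(d_model, d_hidden, rng) for mode in MODES}

    def route(self, mode):
        """Deterministic router: the mode alone selects the expert, with no learned gate."""
        if mode not in self.experts:
            raise ValueError(f"unknown mode: {mode!r}")
        return mode

    def forward(self, x, mode):
        return mlp_forward(self.experts[self.route(mode)], x)


layer = PathLockedMLP(d_model=4, d_hidden=8)
token = [1.0, -0.5, 0.25, 2.0]
think_out = layer.forward(token, "think")
no_think_out = layer.forward(token, "no_think")
```

Because the router is a lookup rather than a learned gate, the same input and mode always follow the same path, which is the property the summary attributes to the separation of think and no-think behaviour.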
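Substitute both expansions into (19):

$$L_{\mathrm{dense}}(\beta) \approx C + \tfrac{1}{2}\,\beta^\top(\pi_0 H_0 + \pi_1 H_1)\,\beta - \beta^\top(\pi_0 H_0 \beta_0^\star + \pi_1 H_1 \beta_1^\star), \tag{20}$$

where $C = \pi_0 L_0(\beta^\star \dots$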
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Path-Lock Expert: Separating Reasoning Mode in Hybrid Thinking via Architecture-Level Separation