NRGPT unifies GPT with energy-based modeling by treating inference as dynamical exploration on an energy landscape that reduces to gradient descent under specific conditions.
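… $\frac{\partial E_A}{\partial g_B}\,\Gamma P_B \eta^{T} \left(\frac{\partial E_B}{\partial g_B}\right)^{T}\Big]$ (32) Note that when $B > A$, $\partial E_A/\partial g_B = 0$. Hence, equation 34 can be separated into the cases $A = B$ and $B < A$: $\dot{E}_A = \sum_{B<A} \mathrm{Tr}\left[\frac{\partial E_A}{\partial g_B}\frac{\partial g_B}{\partial x_B}\dot{x}_B\right] - \frac{1}{r_A}\,\mathrm{Tr}\,\dots$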
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.LG (1)
Years: 2025 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
NRGPT: An Energy-based Alternative for GPT