A two step algorithm for learning from unspecific reinforcement

Ion-Olimpiu Stamatescu; Reimer Kuehn

arxiv: cond-mat/9902354 · v3 · submitted 1999-02-26 · ❄️ cond-mat.stat-mech · cond-mat.dis-nn

A two step algorithm for learning from unspecific reinforcement

Reimer Kuehn , Ion-Olimpiu Stamatescu This is my paper

classification ❄️ cond-mat.stat-mech cond-mat.dis-nn

keywords learninggeneralizationunspecificasymptoticallyconvergenceperfectreinforcementalgorithm

0 comments

read the original abstract

We study a simple learning model based on the Hebb rule to cope with "delayed", unspecific reinforcement. In spite of the unspecific nature of the information-feedback, convergence to asymptotically perfect generalization is observed, with a rate depending, however, in a non- universal way on learning parameters. Asymptotic convergence can be as fast as that of Hebbian learning, but may be slower. Moreover, for a certain range of parameter settings, it depends on initial conditions whether the system can reach the regime of asymptotically perfect generalization, or rather approaches a stationary state of poor generalization.

This paper has not been read by Pith yet.

A two step algorithm for learning from unspecific reinforcement

discussion (0)