arxiv: 1803.09425 · v1 · pith:3SXO43IVnew · submitted 2018-03-26 · 💻 cs.ET · cs.AI· physics.data-an· physics.optics

Scalable photonic reinforcement learning by time-division multiplexing of laser chaos

Makoto Naruse , Takatomo Mihana , Hirokazu Hori , Hayato Saigo , Kazuya Okamura , Mikio Hasegawa , Atsushi Uchida This is my paper

classification 💻 cs.ET cs.AIphysics.data-anphysics.optics

keywords learningreinforcementbanditultrafastchaosdecisiondemonstratedlaser

0 comments p. Extension

Add this Pith Number to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{3SXO43IV}

Prints a linked pith:3SXO43IV badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Reinforcement learning involves decision making in dynamic and uncertain environments and constitutes a crucial element of artificial intelligence. In our previous work, we experimentally demonstrated that the ultrafast chaotic oscillatory dynamics of lasers can be used to solve the two-armed bandit problem efficiently, which requires decision making concerning a class of difficult trade-offs called the exploration-exploitation dilemma. However, only two selections were employed in that research; thus, the scalability of the laser-chaos-based reinforcement learning should be clarified. In this study, we demonstrated a scalable, pipelined principle of resolving the multi-armed bandit problem by introducing time-division multiplexing of chaotically oscillated ultrafast time-series. The experimental demonstrations in which bandit problems with up to 64 arms were successfully solved are presented in this report. Detailed analyses are also provided that include performance comparisons among laser chaos signals generated in different physical conditions, which coincide with the diffusivity inherent in the time series. This study paves the way for ultrafast reinforcement learning by taking advantage of the ultrahigh bandwidths of light wave and practical enabling technologies.

This paper has not been read by Pith yet.

Scalable photonic reinforcement learning by time-division multiplexing of laser chaos

discussion (0)