Soichiro Nishimori
Identifiers
- name variant Soichiro Nishimori 0.60 · backfill
Papers (3)
- Finite-Time Regret Analysis of Retry-Aware Bandits · cs.LG · 2026 · author #3
- Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX · cs.AI · 2026 · author #1
- Mitigating Reward Hacking in RLHF via Advantage Sign Robustness · cs.LG · 2026 · author #3
Mentions
- 2605.20854 #3 · arxiv_oai · confidence 0.70 Soichiro Nishimori
- 2605.20577 #1 · arxiv_oai · confidence 0.70 Soichiro Nishimori
Frequent Coauthors
- Masashi Sugiyama 2 shared papers
- Bingkui Tong 1 shared paper
- Eason Yu 1 shared paper
- Johannes Ackermann 1 shared paper
- Junpei Komiyama 1 shared paper
- Keigo Habara 1 shared paper
- Paavo Parmas 1 shared paper
- Shinnosuke Ono 1 shared paper
- Shinri Okano 1 shared paper
- Sotetsu Koyamada 1 shared paper
- Takashi Ishida 1 shared paper