Title resolution pending

Playing Atari with Deep Reinforcement Learning · 2013

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Insider Attacks in Multi-Agent LLM Consensus Systems

cs.MA · 2026-05-08 · unverdicted · novelty 5.0

A malicious agent in multi-agent LLM consensus systems can be trained via a surrogate world model and RL to reduce consensus rates and prolong disagreement more effectively than direct prompt attacks.

Interpretable experiential learning based on state history and global feedback

cs.LG · 2026-05-01 · unverdicted · novelty 4.0

A transition graph model with utility and evidence counts learns behaviors from state history and feedback, showing performance comparable to neural networks on Atari Breakout.

citing papers explorer

Showing 2 of 2 citing papers.

Insider Attacks in Multi-Agent LLM Consensus Systems · cs.MA · 2026-05-08 · unverdicted · none · ref 63
Interpretable experiential learning based on state history and global feedback · cs.LG · 2026-05-01 · unverdicted · none · ref 22
