Concrete Problems in AI Safety

Dario Amodei, Christopher Olah, Jacob Steinhardt, Paul Christiano, John Schulman, Dandelion Mané · 2016

1 Pith paper cites this work. Polarity classification is still indexing.

citation-role summary

extension 1

citation-polarity summary

extend 1

representative citing papers

Unsolved Problems in ML Safety

cs.LG · 2021-09-28 · accept · novelty 6.0

The paper presents a roadmap that identifies four unsolved problems in ML safety: robustness against hazards, monitoring for hazards, alignment of model goals with human intent, and systemic safety.

citing papers explorer

Showing 1 of 1 citing paper.

Unsolved Problems in ML Safety cs.LG · 2021-09-28 · accept · none · ref 5
