pith. sign in

Concrete Problems in AI Safety

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

citation-role summary

extension 1

citation-polarity summary

fields

cs.LG 1

years

2021 1

verdicts

ACCEPT 1

roles

extension 1

polarities

extend 1

representative citing papers

Unsolved Problems in ML Safety

cs.LG · 2021-09-28 · accept · novelty 6.0

The paper presents a roadmap that identifies four unsolved problems in ML safety: robustness against hazards, monitoring for hazards, alignment of model goals with human intent, and systemic safety.

citing papers explorer

Showing 1 of 1 citing paper.

  • Unsolved Problems in ML Safety cs.LG · 2021-09-28 · accept · none · ref 5

    The paper presents a roadmap that identifies four unsolved problems in ML safety: robustness against hazards, monitoring for hazards, alignment of model goals with human intent, and systemic safety.