What can AI do for me: Evaluating Machine Learning Interpretations in Cooperative Play

Jordan Boyd-Graber; Shi Feng

arxiv: 1810.09648 · v3 · pith:4HGMX5GXnew · submitted 2018-10-23 · 💻 cs.AI

What can AI do for me: Evaluating Machine Learning Interpretations in Cooperative Play

Shi Feng , Jordan Boyd-Graber This is my paper

classification 💻 cs.AI

keywords cooperativedesignhumaninterpretationinterpretationslanguagelearningmachine

0 comments

read the original abstract

Machine learning is an important tool for decision making, but its ethical and responsible application requires rigorous vetting of its interpretability and utility: an understudied problem, particularly for natural language processing models. We propose an evaluation of interpretation on a real task with real human users, where the effectiveness of interpretation is measured by how much it improves human performance. We design a grounded, realistic human-computer cooperative setting using a question answering task, Quizbowl. We recruit both trivia experts and novices to play this game with computer as their teammate, who communicates its prediction via three different interpretations. We also provide design guidance for natural language processing human-in-the-loop settings.

This paper has not been read by Pith yet.

What can AI do for me: Evaluating Machine Learning Interpretations in Cooperative Play

discussion (0)