arxiv: 1902.04187 · v1 · pith:BGVHRCI2new · submitted 2019-02-11 · 💻 cs.LG · cs.AI · cs.CL · stat.ML

LS-Tree: Model Interpretation When the Data Are Linguistic

classification 💻 cs.LG · cs.AI · cs.CL · stat.ML
keywords importance · scores · data · linguistic · method · models · assign · axiomatic

We study the problem of interpreting trained classification models in the setting of linguistic data sets. Leveraging a parse tree, we propose to assign least-squares based importance scores to each word of an instance by exploiting syntactic constituency structure. We establish an axiomatic characterization of these importance scores by relating them to the Banzhaf value in coalitional game theory. Based on these importance scores, we develop a principled method for detecting and quantifying interactions between words in a sentence. We demonstrate that the proposed method can aid in interpretability and diagnostics for several widely-used language models.
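
For readers unfamiliar with the Banzhaf value mentioned in the abstract, the following is a minimal sketch of its textbook definition: a player's value is its marginal contribution averaged over all subsets of the other players. This is not the paper's LS-Tree procedure. It enumerates every subset and ignores the parse tree, and the names `banzhaf_values` and `toy_score`, along with the toy weights, are hypothetical illustrations.

```python
from itertools import combinations


def banzhaf_values(n, v):
    """Return the Banzhaf value of each of n players.

    v is a set function mapping a frozenset of player indices to a float.
    The value of player i is the average of v(S | {i}) - v(S) over all
    2**(n-1) subsets S of the remaining players.
    """
    players = range(n)
    values = []
    for i in players:
        others = [j for j in players if j != i]
        total = 0.0
        for size in range(len(others) + 1):
            for subset in combinations(others, size):
                s = frozenset(subset)
                total += v(s | {i}) - v(s)
        values.append(total / 2 ** (n - 1))
    return values


# Hypothetical toy "model score" over three words: additive weights
# plus a bonus when words 0 and 1 appear together (an interaction).
weights = [1.0, 2.0, 0.5]


def toy_score(s):
    base = sum(weights[j] for j in s)
    bonus = 3.0 if {0, 1} <= s else 0.0
    return base + bonus


scores = banzhaf_values(3, toy_score)
print(scores)  # [2.5, 3.5, 0.5]
```

In this toy game the interaction bonus is split evenly between words 0 and 1, while word 2, which contributes only additively, receives exactly its own weight.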
