Assessing the Stability of Interpretable Models

You are here

TitleAssessing the Stability of Interpretable Models
Publication TypeJournal Article
Year of Publication2018
AuthorsGuidotti, R, Ruggieri, S
JournalarXiv preprint arXiv:1810.09352
AbstractInterpretable classification models are built with the purpose of providing a comprehensible description of the decision logic to an external oversight agent. When considered in isolation, a decision tree, a set of classification rules, or a linear model, are widely recognized as human-interpretable. However, such models are generated as part of a larger analytical process, which, in particular, comprises data collection and filtering. Selection bias in data collection or in data pre-processing may affect the model learned. Although model induction algorithms are designed to learn to generalize, they pursue optimization of predictive accuracy. It remains unclear how interpretability is instead impacted. We conduct an experimental analysis to investigate whether interpretable models are able to cope with data selection bias as far as interpretability is concerned.
PDF icon 1810.09352.pdf1.2 MB
Research Project: