TY - GEN
T1 - CEC
T2 - 27th ACM International Conference on Information and Knowledge Management, CIKM 2018
AU - Deutch, Daniel
AU - Frost, Nave
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/10/17
Y1 - 2018/10/17
N2 - Explaining the results of data-intensive computation via provenance has been extensively studied in the literature. We focus here on explaining the output of Machine Learning Classifiers, which are main components of many contemporary Data Science applications. We have developed a simple generic approach for explaining classification results, by looking for constrained perturbations to parts of the input that would have the most significant effect on the classification. Our solution requires white-box access to the model internals and a specification of constraints that define which perturbations are reasonable" in the application domain; both are typically available to the data scientist. We propose to demonstrate CEC, a system prototype that is based on these foundations to provide generic explanations for Neural Networks and Random Forests. We will demonstrate the system usefulness in the context of two application domains: bank marketing campaigns, and visually clear explanations for image classifications. We will highlight the benefit that such explanations could yield to the data scientist and interactively engage the audience in computing and viewing explanations for different cases and different sets of constraints.
AB - Explaining the results of data-intensive computation via provenance has been extensively studied in the literature. We focus here on explaining the output of Machine Learning Classifiers, which are main components of many contemporary Data Science applications. We have developed a simple generic approach for explaining classification results, by looking for constrained perturbations to parts of the input that would have the most significant effect on the classification. Our solution requires white-box access to the model internals and a specification of constraints that define which perturbations are reasonable" in the application domain; both are typically available to the data scientist. We propose to demonstrate CEC, a system prototype that is based on these foundations to provide generic explanations for Neural Networks and Random Forests. We will demonstrate the system usefulness in the context of two application domains: bank marketing campaigns, and visually clear explanations for image classifications. We will highlight the benefit that such explanations could yield to the data scientist and interactively engage the audience in computing and viewing explanations for different cases and different sets of constraints.
KW - Data provenance
KW - Database constraints theory
KW - Supervised learning by classification
UR - http://www.scopus.com/inward/record.url?scp=85058026527&partnerID=8YFLogxK
U2 - 10.1145/3269206.3269214
DO - 10.1145/3269206.3269214
M3 - Conference contribution
AN - SCOPUS:85058026527
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 1879
EP - 1882
BT - CIKM 2018 - Proceedings of the 27th ACM International Conference on Information and Knowledge Management
A2 - Paton, Norman
A2 - Candan, Selcuk
A2 - Wang, Haixun
A2 - Allan, James
A2 - Agrawal, Rakesh
A2 - Labrinidis, Alexandros
A2 - Cuzzocrea, Alfredo
A2 - Zaki, Mohammed
A2 - Srivastava, Divesh
A2 - Broder, Andrei
A2 - Schuster, Assaf
PB - Association for Computing Machinery
Y2 - 22 October 2018 through 26 October 2018
ER -