TY - JOUR
T1 - SIGN
T2 - Statistical Inference Graphs Based on Probabilistic Network Activity Interpretation
AU - Konforti, Yael
AU - Shpigler, Alon
AU - Lerner, Boaz
AU - Bar-Hillel, Aharon
N1 - Funding Information:
This work was supported in part by the Israeli Ministry of Science and Technology and Israel Innovation Authority through the Phenomics consortium. Yael Konforti and Alon Shpigler contributed equally to this work. Preliminary version was published in ECCV 2020 [1].
Publisher Copyright:
© 1979-2012 IEEE.
PY - 2023/3/1
Y1 - 2023/3/1
N2 - Convolutional neural networks (CNNs) have achieved superior accuracy in many visual-related tasks. However, the inference process through a CNN's intermediate layers is opaque, making it difficult to interpret such networks or develop trust in their operation. In this article, we introduce SIGN method for modeling the network's hidden layer activity using probabilistic models. The activity patterns in layers of interest are modeled as Gaussian mixture models, and transition probabilities between clusters in consecutive modeled layers are estimated to identify paths of inference. For fully connected networks, the entire layer activity is clustered, and the resulting model is a hidden Markov model. For convolutional layers, spatial columns of activity are clustered, and a maximum likelihood model is developed for mining an explanatory inference graph. The graph describes the hierarchy of activity clusters most relevant for network prediction. We show that such inference graphs are useful for understanding the general inference process of a class, as well as explaining the (correct or incorrect) decisions the network makes about specific images. In addition, SIGN provide interesting observations regarding hidden layer activity in general, including the concentration of memorization in a single middle layer in fully connected networks, and a highly local nature of column activities in the top CNN layers.
AB - Convolutional neural networks (CNNs) have achieved superior accuracy in many visual-related tasks. However, the inference process through a CNN's intermediate layers is opaque, making it difficult to interpret such networks or develop trust in their operation. In this article, we introduce SIGN method for modeling the network's hidden layer activity using probabilistic models. The activity patterns in layers of interest are modeled as Gaussian mixture models, and transition probabilities between clusters in consecutive modeled layers are estimated to identify paths of inference. For fully connected networks, the entire layer activity is clustered, and the resulting model is a hidden Markov model. For convolutional layers, spatial columns of activity are clustered, and a maximum likelihood model is developed for mining an explanatory inference graph. The graph describes the hierarchy of activity clusters most relevant for network prediction. We show that such inference graphs are useful for understanding the general inference process of a class, as well as explaining the (correct or incorrect) decisions the network makes about specific images. In addition, SIGN provide interesting observations regarding hidden layer activity in general, including the concentration of memorization in a single middle layer in fully connected networks, and a highly local nature of column activities in the top CNN layers.
KW - Deep neural networks
KW - XAI
KW - convolutional neural networks
KW - explainable AI
KW - interpretable AI
KW - statistical inference
KW - visualization
UR - http://www.scopus.com/inward/record.url?scp=85141345031&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2022.3181472
DO - 10.1109/TPAMI.2022.3181472
M3 - Article
C2 - 35696462
AN - SCOPUS:85141345031
SN - 0162-8828
VL - 45
SP - 3783
EP - 3797
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 3
ER -