TY - JOUR
T1 - Layerwise Change of Knowledge in Neural Networks
AU - Cheng, Xu
AU - Cheng, Lei
AU - Peng, Zhaoran
AU - Xu, Yang
AU - Han, Tian
AU - Zhang, Quanshi
N1 - Publisher Copyright:
Copyright 2024 by the author(s)
PY - 2024
Y1 - 2024
N2 - This paper aims to explain how a deep neural network (DNN) gradually extracts new knowledge and forgets noisy features through layers in forward propagation. Up to now, although the definition of knowledge encoded by the DNN has not reached a consensus, Li & Zhang (2023); Ren et al. (2023a; 2024) have derived a series of mathematical evidence to take interactions as symbolic primitive inference patterns encoded by a DNN. We extend the definition of interactions and, for the first time, extract interactions encoded by intermediate layers. We quantify and track the newly emerged interactions and the forgotten interactions in each layer during the forward propagation, which shed new light on the learning behavior of DNNs. The layer-wise change of interactions also reveals the change of the generalization capacity and instability of feature representations of a DNN.
AB - This paper aims to explain how a deep neural network (DNN) gradually extracts new knowledge and forgets noisy features through layers in forward propagation. Up to now, although the definition of knowledge encoded by the DNN has not reached a consensus, Li & Zhang (2023); Ren et al. (2023a; 2024) have derived a series of mathematical evidence to take interactions as symbolic primitive inference patterns encoded by a DNN. We extend the definition of interactions and, for the first time, extract interactions encoded by intermediate layers. We quantify and track the newly emerged interactions and the forgotten interactions in each layer during the forward propagation, which shed new light on the learning behavior of DNNs. The layer-wise change of interactions also reveals the change of the generalization capacity and instability of feature representations of a DNN.
UR - http://www.scopus.com/inward/record.url?scp=85203822489&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203822489&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85203822489
VL - 235
SP - 8038
EP - 8059
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 41st International Conference on Machine Learning, ICML 2024
Y2 - 21 July 2024 through 27 July 2024
ER -