Abstract
A suitable similarity index for comparing learnt neural networks plays animportant role in understanding the behaviour of the highly-nonlinearfunctions, and can provide insights on further theoretical analysis andempirical studies. We define two key steps when comparing models: firstly, therepresentation abstracted from the learnt model, where we propose to leverageboth feature vectors and gradient ones (which are largely ignored in priorwork) into designing the representation of a neural network. Secondly, wedefine the employed similarity index which gives desired invariance properties,and we facilitate the chosen ones with sketching techniques for comparingvarious datasets efficiently. Empirically, we show that the proposed approachprovides a state-of-the-art method for computing similarity of neural networksthat are trained independently on different datasets and the tasks defined bythe datasets.