One-class Support Vector Machine (OC-SVM) for a long time has been one of themost effective anomaly detection methods and widely adopted in both research aswell as industrial applications. The biggest issue for OC-SVM is, however, thecapability to operate with large and high-dimensional datasets due toinefficient features and optimization complexity. Those problems might bemitigated via dimensionality reduction techniques such as manifold learning orauto-encoder. However, previous work often treats representation learning andanomaly prediction separately. In this paper, we propose autoencoder basedone-class SVM (AE-1SVM) that brings OC-SVM, with the aid of random Fourierfeatures to approximate the radial basis kernel, into deep learning context bycombining it with a representation learning architecture and jointly exploitstochastic gradient descend to obtain end-to-end training. Interestingly, thisalso opens up the possible use of gradient-based attribution methods to explainthe decision making for anomaly detection, which has ever been challenging as aresult of the implicit mappings between the input space and the kernel space.To the best of our knowledge, this is the first work to study theinterpretability of deep learning in anomaly detection. We evaluate our methodon a wide range of unsupervised anomaly detection tasks in which our end-to-endtraining architecture achieves a performance significantly better than theprevious work using separate training.