Abstract
Deep learning has become very popular for tasks such as predictive modelingand pattern recognition in handling big data. Deep learning is a powerfulmachine learning method that extracts lower level features and feeds themforward for the next layer to identify higher level features that improveperformance. However, deep neural networks have drawbacks, which include manyhyper-parameters and infinite architectures, opaqueness into results, andrelatively slower convergence on smaller datasets. While traditional machinelearning algorithms can address these drawbacks, they are not typically capableof the performance levels achieved by deep neural networks. To improveperformance, ensemble methods are used to combine multiple base learners. Superlearning is an ensemble that finds the optimal combination of diverse learningalgorithms. This paper proposes deep super learning as an approach whichachieves log loss and accuracy results competitive to deep neural networkswhile employing traditional machine learning algorithms in a hierarchicalstructure. The deep super learner is flexible, adaptable, and easy to trainwith good performance across different tasks using identical hyper-parametervalues. Using traditional machine learning requires fewer hyper-parameters,allows transparency into results, and has relatively fast convergence onsmaller datasets. Experimental results show that the deep super learner hassuperior performance compared to the individual base learners, single-layerensembles, and in some cases deep neural networks. Performance of the deepsuper learner may further be improved with task-specific tuning.