Automated Machine Learning: State-of-The-Art and Open Challenges

Abstract

With the continuous and vast increase in the amount of data in our digital world, it has been acknowledged that the number of knowledgeable data scientists cannot scale to address these challenges. Thus, there is a crucial need for automating the process of building good machine learning models. In the last few years, several techniques and frameworks have been introduced to tackle the challenge of automating the process of Combined Algorithm Selection and Hyper-parameter tuning (CASH) in the machine learning domain. The main aim of these techniques is to reduce the role of the human in the loop and fill the gap for non-expert machine learning users by playing the role of the domain expert. In this paper, we present a comprehensive survey of the state-of-the-art efforts in tackling the CASH problem. In addition, we highlight the research work on automating the other steps of the full complex machine learning pipeline (AutoML), from data understanding to model deployment. Furthermore, we provide comprehensive coverage of the various tools and frameworks that have been introduced in this domain. Finally, we discuss some of the research directions and open challenges that need to be addressed in order to achieve the vision and goals of the AutoML process.
