Building a good predictive model requires an array of activities such as data imputation, feature transformation, estimator selection, hyper-parameter search and ensemble construction. Given the large, complex and heterogeneous space of options, off-the-shelf optimization methods are infeasible within realistic response times. In practice, much of the predictive modeling process is conducted by experienced data scientists, who selectively make use of available tools. Over time, they develop an understanding of the behavior of operators and perform serial decision making under uncertainty, colloquially referred to as educated guesswork. With an unprecedented demand for the application of supervised machine learning, there is a call for solutions that automatically search for a good combination of parameters across these tasks to minimize the modeling error. We introduce a novel system called APRL (Autonomous Predictive modeler via Reinforcement Learning), which uses past experience through reinforcement learning to optimize such sequential decision making from within a set of diverse actions, under a time constraint, on a previously unseen predictive learning problem. APRL actions are taken to optimize the performance of a final ensemble. This is in contrast to other systems, which maximize individual model accuracy first and create ensembles as a disconnected post-processing step. As a result, APRL reduces classification error by up to 71\% on average over a wide variety of problems.