AutoML-GPT: Automatic Machine Learning with GPT

Abstract

AI tasks encompass a wide range of domains and fields. While numerous AImodels have been designed for specific tasks and applications, they oftenrequire considerable human efforts in finding the right model architecture,optimization algorithm, and hyperparameters. Recent advances in large languagemodels (LLMs) like ChatGPT show remarkable capabilities in various aspects ofreasoning, comprehension, and interaction. Consequently, we propose developingtask-oriented prompts and automatically utilizing LLMs to automate the trainingpipeline. To implement this concept, we present the AutoML-GPT, which employsGPT as the bridge to diverse AI models and dynamically trains models withoptimized hyperparameters. AutoML-GPT dynamically takes user requests from themodel and data cards and composes the corresponding prompt paragraph.Ultimately, with this prompt paragraph, AutoML-GPT will automatically conductthe experiments from data processing to model architecture, hyperparametertuning, and predicted training log. By leveraging {\ours}'s robust languagecapabilities and the available AI models, AutoML-GPT can tackle numerousintricate AI tasks across various tasks and datasets. This approach achievesremarkable results in computer vision, natural language processing, and otherchallenging areas. Extensive experiments and ablation studies demonstrate thatour method can be general, effective, and beneficial for many AI tasks.

Quick Read (beta)

loading the full paper ...