DLHub: Model and Data Serving for Science

Abstract

While the Machine Learning (ML) landscape is evolving rapidly, there has beena relative lag in the development of the "learning systems" needed to enablebroad adoption. Furthermore, few such systems are designed to support thespecialized requirements of scientific ML. Here we present the Data andLearning Hub for science (DLHub), a multi-tenant system that provides bothmodel repository and serving capabilities with a focus on science applications.DLHub addresses two significant shortcomings in current systems. First, itsselfservice model repository allows users to share, publish, verify, reproduce,and reuse models, and addresses concerns related to model reproducibility bypackaging and distributing models and all constituent components. Second, itimplements scalable and low-latency serving capabilities that can leverageparallel and distributed computing resources to democratize access to publishedmodels through a simple web interface. Unlike other model serving frameworks,DLHub can store and serve any Python 3-compatible model or processing function,plus multiple-function pipelines. We show that relative to other model servingsystems including TensorFlow Serving, SageMaker, and Clipper, DLHub providesgreater capabilities, comparable performance without memoization and batching,and significantly better performance when the latter two techniques can beemployed. We also describe early uses of DLHub for scientific applications.

Quick Read (beta)

loading the full paper ...