Adopting Large Language Models to Automated System Integration

Abstract

Modern enterprise computing systems integrate numerous subsystems to resolvea common task by yielding emergent behavior. A widespread approach is usingservices implemented with Web technologies like REST or OpenAPI, which offer aninteraction mechanism and service documentation standard, respectively. Eachservice represents a specific business functionality, allowing encapsulationand easier maintenance. Despite the reduced maintenance costs on an individualservice level, increased integration complexity arises. Consequently, automatedservice composition approaches have arisen to mitigate this issue.Nevertheless, these approaches have not achieved high acceptance in practicedue to their reliance on complex formal modeling. Within this Ph.D. thesis, weanalyze the application of Large Language Models (LLMs) to automaticallyintegrate the services based on a natural language input. The result is areusable service composition, e.g., as program code. While not alwaysgenerating entirely correct results, the result can still be helpful byproviding integration engineers with a close approximation of a suitablesolution, which requires little effort to become operational. Our researchinvolves (i) introducing a software architecture for automated servicecomposition using LLMs, (ii) analyzing Retrieval Augmented Generation (RAG) forservice discovery, (iii) proposing a novel natural language query-basedbenchmark for service discovery, and (iv) extending the benchmark to completeservice composition scenarios. We have presented our software architecture asCompositio Prompto, the analysis of RAG for service discovery, and submitted aproposal for the service discovery benchmark. Open topics are primarily theextension of the service discovery benchmark to service composition scenariosand the improvements of the service composition generation, e.g., usingfine-tuning or LLM agents.

Quick Read (beta)

loading the full paper ...