Abstract
The advent of Generative Artificial Intelligence (GAI) has heralded an inflection point that changed how society thinks about knowledge acquisition. While GAI cannot be fully trusted for decision-making, it may still provide valuable information that can be integrated into a decision pipeline. Rather than seeing the lack of certitude and inherent randomness of GAI as a problem, we view it as an opportunity. Indeed, variable answers to given prompts can be leveraged to construct a prior distribution which reflects the assuredness of AI predictions. This prior distribution may be combined with tailored datasets for a fully Bayesian analysis with an AI-driven prior. In this paper, we explore such a possibility within a non-parametric Bayesian framework. The basic idea consists of assigning a Dirichlet process prior distribution on the data-generating distribution with an AI generative model as its baseline. Hyper-parameters of the prior can be tuned out-of-sample to assess the informativeness of the AI prior. Posterior simulation is achieved by computing a suitably randomized functional on an augmented dataset that consists of observed (labeled) data as well as fake data whose labels have been imputed using AI. This strategy can be parallelized and rapidly produces iid samples from the posterior by optimization as opposed to sampling from conditionals. Our method enables (predictive) inference and uncertainty quantification leveraging AI predictions in a coherent probabilistic manner.
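The posterior simulation strategy described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a standard weighted-bootstrap approximation to a Dirichlet process posterior: each observed point receives Dirichlet concentration 1, each of the m AI-labeled points receives alpha/m, and the functional of interest is a weighted least-squares fit. The function `ai_label` is a hypothetical stand-in for a generative model that imputes labels; `alpha`, the sample sizes, and the linear data-generating model are likewise illustrative assumptions.

```python
import numpy as np


def ai_label(x, rng):
    # Hypothetical stand-in for an AI model imputing labels (assumption):
    # a noisy linear guess, so that repeated queries give variable answers.
    return 1.0 + 2.0 * x + rng.normal(scale=1.0, size=x.shape)


def posterior_bootstrap_draws(x_obs, y_obs, x_fake, y_fake, alpha, n_draws, rng):
    """Approximate posterior draws of (intercept, slope) under a DP prior.

    alpha > 0 is the prior concentration: larger values give the
    AI-labeled points more total weight relative to the observed data.
    """
    n, m = len(y_obs), len(y_fake)
    # Augmented dataset: observed points followed by AI-labeled points.
    X = np.column_stack([np.ones(n + m), np.concatenate([x_obs, x_fake])])
    y = np.concatenate([y_obs, y_fake])
    # Dirichlet concentrations: 1 per observed point, alpha/m per fake point.
    conc = np.concatenate([np.ones(n), np.full(m, alpha / m)])
    draws = np.empty((n_draws, 2))
    for b in range(n_draws):
        w = rng.dirichlet(conc)  # random weights over the augmented data
        # Weighted least squares: argmin_beta sum_i w_i (y_i - x_i' beta)^2.
        Xw = X * w[:, None]
        draws[b] = np.linalg.solve(X.T @ Xw, Xw.T @ y)
    return draws


rng = np.random.default_rng(0)
x_obs = rng.uniform(-1.0, 1.0, size=30)
y_obs = 1.0 + 2.0 * x_obs + rng.normal(scale=0.5, size=30)
x_fake = rng.uniform(-1.0, 1.0, size=100)
y_fake = ai_label(x_fake, rng)

draws = posterior_bootstrap_draws(x_obs, y_obs, x_fake, y_fake,
                                  alpha=5.0, n_draws=200, rng=rng)
slope_interval = np.quantile(draws[:, 1], [0.025, 0.975])
```

Each pass through the loop solves an independent weighted optimization problem, so the draws could be computed in parallel, and `slope_interval` gives one possible uncertainty summary for the slope under these assumptions.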