Adapting Probabilistic Risk Assessment for AI

  • 2025-04-25 18:59:14
  • Anna Katariina Wisakanto, Joe Rogero, Avyay M. Casheekar, Richard Mallah

Abstract

Modern general-purpose artificial intelligence (AI) systems present an urgent risk management challenge, as their rapidly evolving capabilities and potential for catastrophic harm outpace our ability to reliably assess their risks. Current methods often rely on selective testing and undocumented assumptions about risk priorities, frequently failing to make a serious attempt at assessing the set of pathways through which AI systems pose direct or indirect risks to society and the biosphere. This paper introduces the probabilistic risk assessment (PRA) for AI framework, adapting established PRA techniques from high-reliability industries (e.g., nuclear power, aerospace) for the new challenges of advanced AI. The framework guides assessors in identifying potential risks, estimating likelihood and severity, and explicitly documenting evidence, underlying assumptions, and analyses at appropriate granularities. The framework's implementation tool synthesizes the results into a risk report card with aggregated risk estimates from all assessed risks. This systematic approach integrates three advances: (1) Aspect-oriented hazard analysis provides systematic hazard coverage guided by a first-principles taxonomy of AI system aspects (e.g., capabilities, domain knowledge, affordances); (2) Risk pathway modeling analyzes causal chains from system aspects to societal impacts using bidirectional analysis and incorporating prospective techniques; and (3) Uncertainty management employs scenario decomposition, reference scales, and explicit tracing protocols to structure credible projections with novelty or limited data. Additionally, the framework harmonizes diverse assessment methods by integrating evidence into comparable, quantified absolute risk estimates for critical decisions. We have implemented this as a workbook tool for AI developers, evaluators, and regulators, available on the project website.

 
