Article
Software Health Management with Bayesian Networks
Innovations in Systems and Software Engineering (2013)
  • Johann Schumann
  • Timmy Mbaya
  • Ole J Mengshoel, Carnegie Mellon University
  • Knot Pipatsrisawat
  • Ashok Srivastava
  • Arthur Choi
  • Adnan Darwiche
Abstract
Software Health Management (SWHM) is an emerging field that addresses the critical need to detect, diagnose, predict, and mitigate adverse events due to software faults and failures. These faults could arise for numerous reasons, including coding errors, unanticipated faults or failures in hardware, or problematic interactions with the external environment. This paper demonstrates a novel approach to software health management based on a rigorous Bayesian formulation that monitors the behavior of the software and operating system, performs probabilistic diagnosis, and provides information about the most likely root causes of a failure or software problem. Translation of the Bayesian network model into an efficient data structure, an arithmetic circuit, makes it possible to perform SWHM on resource-restricted embedded computing platforms such as those found in aircraft, unmanned aircraft, or satellites. SWHM is especially important for safety-critical systems such as aircraft control systems. In this paper, we demonstrate our Bayesian SWHM system on three realistic scenarios from an aircraft control system: (1) aircraft file-system-based faults, (2) signal-handling faults, and (3) navigation faults due to IMU (inertial measurement unit) failure or compromised GPS (Global Positioning System) integrity. We show that the method successfully detects and diagnoses faults in these scenarios. We also discuss the importance of verification and validation of SWHM systems.
Keywords
  • software health management
  • fault detection
  • diagnosis
  • aircraft control system
  • Bayesian networks
  • probabilistic diagnosis
Publication Date
June, 2013
Publisher Statement
@article{schumann13software,
 author = {Schumann, J. and Mbaya, T. and Mengshoel, O. J. and Pipatsrisawat, K. and Srivastava, A. and Choi, A. and Darwiche, A.},
 title = {Software Health Management with Bayesian Networks},
 journal = {Innov. Syst. Softw. Eng.},
 volume = {9},
 number = {4},
 month = dec,
 year = {2013},
 pages = {271--292}
}

Citation Information
Johann Schumann, Timmy Mbaya, Ole J Mengshoel, Knot Pipatsrisawat, et al. "Software Health Management with Bayesian Networks" Innovations in Systems and Software Engineering (2013)
Available at: http://works.bepress.com/ole_mengshoel/47/