Data-Driven Prioritization of Grid Interruptions: A Lean Six Sigma Study Using Public Outage Logs

Authors

  • Rayhanul Islam Sony Trine University, 1 University Ave, Angola, IN 46703, USA
  • Md Ariful Islam Bhuiyan California State University Northridge, 18111 Nordhoff St, Northridge, CA 91330, USA
  • Dipta Roy California State University Northridge, 18111 Nordhoff St, Northridge, CA 91330, USA

DOI:

https://doi.org/10.32996/jcsts.2026.8.7.3

Keywords:

Power outage analytics, Lean Six Sigma, Random Forest, XGBoost, Deep ANN, hybrid meta-learning, SMOTE augmentation, data-driven prioritization, grid reliability, energy resilience

Abstract

The supply of power that is reliable and uninterrupted is one of the pillars of the contemporary energy infrastructure. Nonetheless, the issue of grid interruptions which are frequent and long-lasting is still compromising the resilience of the system and customer satisfaction. Traditional approaches to outage-management and prioritization tend to be reactive and based on manual evaluation and missing records of events. To overcome such shortcomings, this paper suggests a hybrid meta-learning model that would combine Random Forest (RF), Extreme Gradient Boosting (XGBoost), and a Deep Artificial Neural Network (Deep ANN) to come up with the data-driven prioritization of grid interruptions. The data were processed extensively with the Missing-value treatment, derivation of impact-score, feature scaling using robust scale and SMOTE and controlled Gaussian perturbation using publicly available data 15 Years of Power Outages (2000 - 2014) dataset. The proposed hybrid model integrates RF and XGBOOST in an ensemble layer while capturing the variance decomposition and residual optimization for variance decomposable nonlinearities and the Deep ANN meta-learner captures complex non-linear dependencies. Instead of maintenance, proactive and timely, the system prioritizes the outage events on the basis of the predicted severity of impact in the framework of Lean Six Sigma DMAIC. Comparative analysis with four baseline models which included RF, XGBoost, Deep ANN, and RF-XGB models showed that the hybrid model was better as it had 99.43% accuracy, 98.92% precision, 99.15% recall, and 99.03% F1-score. The obtained results support that the suggested meta-learning technique offers a significant enhancement in the robustness, scalability, and interpretability of prediction of outage impact. This study creates a solid information-driven approach to the implementation of the Six Sigma principles into the contemporary energy-analytics processes that opens the way to increased intelligence and agile power-grid operations.

Downloads

Published

2026-03-16

Issue

Section

Research Article

How to Cite

Rayhanul Islam Sony, Md Ariful Islam Bhuiyan, & Dipta Roy. (2026). Data-Driven Prioritization of Grid Interruptions: A Lean Six Sigma Study Using Public Outage Logs. Journal of Computer Science and Technology Studies, 8(7), 35-61. https://doi.org/10.32996/jcsts.2026.8.7.3