Sepsis prediction in ICU patients using deep learning

Loading...
Thumbnail Image

Supplementary material

Other Title

Authors

Rajan, Deepa

Author ORCID Profiles (clickable)

Degree

Master of Applied Technologies (Computing)

Grantor

Unitec, Te Pūkenga – New Zealand Institute of Skills and Technology

Date

2025

Supervisors

Ramirez-Prado, Guillermo
Sharifzadeh, Hamid

Type

Masters Thesis

Ngā Upoko Tukutuku (Māori subject headings)

Keyword

sepsis
intensive care units (ICU)
electronic health records
medical informatics
neural networks
deep learning

Citation

Rajan, D. (2025). Sepsis prediction in ICU patients using deep learning (Unpublished document submitted in partial fulfilment of the requirements for the degree of Master of Applied Technologies (Computing)). Unitec, Te Pūkenga - New Zealand Institute of Skills and Technology https://hdl.handle.net/10652/6945

Abstract

RESEARCH QUESTIONS • How does the performance of the CNN-LSTM model compare to traditional machine learning models (e.g., Decision Tree, Random Forest, SVM) in terms of accuracy, precision, recall, and F1-score for early sepsis detection? • What is the minimum subset of clinical features (e.g., heart rate, temperature, lactate levels) required to maintain a high-performing prediction model with reduced computational cost? • How do SHAP (SHapley Additive exPlanations) values contribute to the interpretability of the CNN-LSTM model and support clinical decision-making in ICU environments? • What are the limitations of using synthetic oversampling techniques such as SMOTE in the context of highly imbalanced clinical datasets, and how do they affect the model’s generalizability? ABSTRACT Sepsis, the most lethal cause of intensive care unit (ICU) admissions, is a life-threatening condition. Early and correct diagnosis is key in improving survival. The conventional tools, like SOFA and SIRS scores, are static and retrospective, thereby resulting into a delayed identifying of the condition, and also to false positive detection. This is a pilot study aimed to solve this set of challenging questions by a novel approach of deep learning based early sepsis prediction using multi-variate time-series lab test, vital sign, and electronic health record (EHR) data. The proposed approach leverages Convolutional Neural Networks (CNNs) for automatic data feature extraction, combined with Long Short-Term Memory (LSTM) networks to identify temporal progressions in patients’ data. Database preprocessing by normalisation, imputation, and the Synthetic Minority Over-sampling Technique (SMOTE) is utilized to correctly handle common issues, e.g., noise, missing values, and class imbalance. Interpretability is strengthened by explanations generated via Shapley Additive explanations (SHAP), thus promoting clinical use and enabling clear interpretation of model predictions. Our CNN-LSTM model also performed well in 22,824 test samples, with the training data consisting of almost 58,000 hourly CCU data: accuracy 95.63%, precision 97.6%, recall 93.5%, F1-score 95.5% for sepsis classification. The high NPV for excluding non-sepsis was similarly established. The attention mechanism allowed the model to be significantly more interpretable by focusing on key prediction characteristics. Together, this work shows that deep learning can predict sepsis with high accuracy well in advance of the manifestation of clinical symptoms and may be used as a practical way to trigger early intervention and improve outcomes in the ICU.

Publisher

Link to ePress publication

DOI

Copyright holder

Author

Copyright notice

All rights reserved

Copyright license

Available online at