SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1245
Sepsis Prediction Using Machine Learning
1Mohammad Ateeq, 2 Vineet K Joshi, 3D Naga Praneeth, 4Gugulothu Ravi
4Assistant Professor, Department of Computer Science and Engineering, SNIST, Hyderabad-501301, India
1,2,3B. Tech Scholars, Department of Computer Science and Engineering, SNIST, Hyderabad-501301, India
-----------------------------------------------------------------------***-----------------------------------------------------------------------
Abstract:- Sepsis is a blood poisoning condition that
can increase the mortality risk in ICU patients when
the body exhibits a dysregulated host response to an
infection and results in organ failure or tissue
damage.The expense of treating sepsis in hospitals is
rising yearly as it develops into a serious health issue.
Different techniques have been developed to monitor
sepsis electronically, however in order to reduce the risk
of death, it is crucial to forecast sepsis as soon as feasible
before clinical reports or conventional techniques. The
primary characteristics influencing the classifier's
predictions have been outlined, making the model easier
for medical professionals to understand. MLP Classifier
has been used for the early diagnosis of sepsis,
particularly in ICU patients been applied.This study
demonstrates how machine learning algorithms,
employing six vital signs taken from patient records over
the age of 18, can reliably predict sepsis at the time of
a patient's admittance into the intensive care unit.
Sepsis may be predicted early, which can assist
doctors administer supportive care and save
mortality and medical costs. Unprecedented
assessment measures have been obtained, and they
can be very helpful in accurately and promptly
predicting sepsis.
I. INTRODUCTION
Sepsis is a life-threatening condition that occurs when the
body's response to an infection leads to inflammation
andtissue damage throughout the body. It is a leading
causeof death in hospitals and can progress rapidly if
not properly diagnosed and treated. Early detection of
sepsis is crucial for improving patient outcomes, and
machine learning techniques have the potential to
significantly improve the accuracy and speed of sepsis
diagnosis. One approach to sepsis detection using
machine learning is touse patient data, such as vital signs
and laboratory results, to train a model to predict the
likelihood of sepsis.
This data can be collected from electronic health
recordsor other sources and may include demographic
information, previous medical history, and current
symptoms. The model can then be used to identify
patients who are at high risk for sepsis, allowing
healthcare providers to initiate early treatment and
potentially prevent the progression of the condition.
Utilizing machine learning as a different strategy for
analysis. patterns in patient data and identify early
warning signs of sepsis. This can be done by analysing
trends in vital signs over time or by looking for changes
in biomarkers that are indicative of sepsis. By identifying
these early warning signs, healthcare providers can
intervene before the condition becomes severe and
potentially save lives. Overall, the use of machine
learning in sepsis detection has the potential to
significantly improve patient outcomes by enabling early
diagnosis and treatment. But it's crucial to pay close
attention to the ethical implications of using machine
learning in healthcare and ensure that the technology is
used responsibly and in a way that benefits patients.
The development of artificial intelligence technologies
has made it possible to diagnose sepsis early on. These
techniques have been created to study and anticipate
the health of the human body and acquire accurate
prescription information to support doctors in making
rapid and effective decisions. They integrate electronic
medical records, medical imaging, pathophysiology, and
other data.
In several medical specialties, an AI-based diagnostic
system has been proven to be efficient. In the domain of
sepsis diagnosis, prognosis, and therapy machine learning
algorithms used include supervised learning and
reinforcement learning. For example, Beck et al. develop
the C-Path (Computational Pathologist) system to
automatically diagnose breast cancer and determine the
likelihood that patients will survive by looking at breast
tissue imaging.
The primary two difficulties in the current study involve
the use of different physiological indicators and
modelling efficient machine learning algorithms for the
II. LITERATURE SURVEY
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1246
diagnosis, prognosis, and treatment of sepsis. Similarly,
Additionally important for predicting sepsis in advance is
choose appropriate variables and design valuable
algorithms in the clinical setting. The model's input
variables are physiological indications, and the model's
output parameter is the patient's condition would
develop sepsis many hours later. In particular, input
data include vital indicators such as heart rate, oxygen
saturation, and body temperature; biomarkers such as
procalcitonin andinterleukin-6; laboratory values such
as bicarbonate andcreatinine; and demographic factors
such as gender and age. In Most of the categories, a large
number of missing values, such as those in MIMIC III
(Intensive CareMedical Information Market Database),
which has been utilised in several research. Most
studies omit variable with a large number of missing
values from predictors, resulting in the loss of useful
information. To fill in missing information, some
research utilize imputation and mean filling methods,
although this might lead to selection bias or
confounding factor mixes. The data preparation
approach must be examined in light of the features of
various data sets.
The machine learning algorithms generally include
support vector machines, gradient boosting trees,
random forests, Logistic regression, and neural
networks. Amongthem, MLP Classifier have shown good
performance. The A model with improved prediction
capabilities will be examined further rand improved
results for clinical service. so, that Early sepsis choices
can be better madeby physician diagnosis.
The research have performed well in the area of sepsis
prediction. The quantity of data utilised in these studies
is, however, reduced because the majority of the
missing values are handled by direct deletion or
forward filling, and the model's explanatory power is
therefore constrained. The following arguments in
detail explain why it is difficult to implement these
techniques inclinical settings. A comprehensive data set
is lacking. Researchers make use of information from
various patient groups, such as the MIMIC public
database or other unbiased sources of hospital data.
They choose different clinical factors to create their
models, and the size of thedata also varies considerably.
Different clinical criteria for sepsis and assessment
indicators are used as prediction settings' premise and
indicators.
1.DATA SET:-Dataset contains data of 36 thousand
patients. Each patient is represented by 41 features.
Fig 1. Data Set
2.FEATURE SELECTION:- In the mean processing
method, 41 variables were determined to participate in
the training model, including (a) vital signs indicators
(HR, O2Sat, Temp, SBP, MAP, DBP, Resp), (b) laboratory
variables (HCO3, pH, PaCO2, AST, BUN, AlkalinePhos,
Chloride, Creatinine, Lactate, Magnesium, Potassium,
Bilirubin_total, PTT, WBC, Fibrinogen, Platelets), and
(c) demographic indicators (Age, Gender).
III. METHODOLOGY
Fig 2. Vital Signs(a)
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1247
Fig 3. Laboratory Variables(b)
The variables that had missing proportions of greater than
98% were eliminated. The demographic metrics
HospAdmTime (the interval between hospitalisation and
ICU) and ICULOS (the interval between ICU
hospitalizations) have been removed. HospAdmTime
displays various numerical levels depending on the health
of various patients, which may be connected to sepsis's
extended incubation period. Since the primary goal of this
study is to develop guidelines for predicting early sepsis
from changes in certain physiological data, these
variables are omitted. According to the statistics entered,
patients with sepsis have a significant fatality rate. They
frequently require prolonged ICU care, and the ICULOS
value is typically excessively high. Contrarily, patients
without sepsis typically get treatment in the ICU for a
brief period of time before being discharged after their
health has improved, resulting in a low ICULOS rating.
The variable ICULOS is eliminated because the variation
in ICULOS value is caused by the different nature of the
sickness situation, which is against the causal sequence of
early sepsis anticipated from physiological data.
Fig 4: Demographics Indicator(c)
3.METHOD TO PREDICT SEPSIS:- MLPs are neural
network models that work as universal approximators,
i.e., they can approximate any continuous function, MLPs
are composed of neurons called perceptions. a perceptron
receives n features as input (x = x1, x2,…, xn), and each of
these features is associated to a weight. Input features
must be numeric. So, nonnumeric input features have
to be converted to numeric ones in order to use a
perceptron.
The result of this computation is then passed onto
an activation function f, which will produce the output of
the perceptron. In the original perceptron, the
activationfunction is a step function:
Thus, we can see that the perceptron determines
whether w1x1 + w2x2 + ⋯ + wnxn − θ > 0 is true or
false. The equation w1x1 + w2x2 + ⋯ + wnxn − θ = 0 is
the equation of a hyperplane.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1248
3.1 IMPROVEMENT OF PREPROCESSING
WARNING PERIODS:- The 6-hour warning period for
each patient is immediately integrated to produce a single
observation in the mean processing approach discussed
above, however the model's performance in terms of
prediction may not be sufficient. Further research is being
done to determine whether or not higher performance
results from segmentation time windows that are finer
ordenser. In order to calculate the mean vector, the 6-
hour warning period is split into 2- or 3-hour time
windows, and the mean processing procedure for the safe
period and illness period is left alone. Figure 5displays
further information. The improvement is compared with
the original models of their generalisation capabilities
based on MLP’s Classifier and new datasets for training
models are created in the same manner.
Fig 5. MLP Mean Calculator
4.
Fig 6: Feature Importance
IV. UML DIAGRAM
A sequence diagram (UML) is a visual representation of
the flow of messages between objects during an
interaction. A group of objects connected by lifelines and
the messages they exchange throughout the course of
an interaction make up a sequence diagram.
Fig 7. Sequential Diagram
1.MODEL PERFORMANCE:- In the mean processing
method (method1), the MLP Classifier algorithms differ
in performance. The MLP’s algorithm has a Log loss rate
of 0.15, with better distinction performance between 0-
1 categories.
V. RESULTS
FEATURE IMPORTANCE:- For the feature
importance score, we take the MLP Classifier algorithm
in the mean processing method as an example; the top
10variables with feature importance scores are Temp,
O2Sat, Resp, HR, Age, SBP, MAP, PTT, PaCO2, and
Potassium as shown in Figure 6. This means that these
variables play an important role in predicting the risk
of sepsis.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1252
The log-loss shows how accurate the forecast was the
likelihood corresponds to the matching real or true
value
The Accuracy score is calculated by dividing the number
of correct predictions by the total prediction number.
2.MLP ALGORITHM:-This model optimizes the log- loss
function using LBFGS or stochastic gradient descent.
(0 or 1 in case of binary classification). The more
the predicted the further the probability deviates from
the actual value the log-loss value.
class sklearn.neural_network.MLPClassifi
er(hidden_layer_sizes=(100,), activation='relu', *, sol
ver='adam', alpha=0.0001, batch_size='auto', learni
ng_rate='constant', learning_rate_init=0.001, power_t=0.5,
max_iter=200, shuffle=True, random_state=N one,
tol=0.0001, verbose=False, warm_start=False,
momentum=0.9, nesterovs_momentum=True, early_
stopping=False, validation_fraction=0.1, beta_1=0.9,
beta_2=0.999, epsilon=1e- 08, n_iter_no_change=10,
max_fun=15000)
Training of variables in algorithm for classification.
Fig 8. Accuracy
Fig 9. Log loss
VI.CONCLUSION
It is possible to use machine learning techniques for the
detection of sepsis, a serious and potentially life
threatening condition that can arise as a complication
of infection. Sepsis is a complex and dynamic process
thatcan be difficult to diagnose, and early identification
and treatment are critical for improving patient
outcomes. Several machine learning techniques exist
that have been explored for the detection of sepsis,
including supervisedlearning methods such as decision
tree algorithms and support vector machines, as well as
unsupervised learning methods such as clustering and
anomaly detection.
One potential advantage of using machine learning for
sepsis detection is the ability to analyse and interpret
large amounts of patient data, including electronic
health records, laboratory results, and vital signs, in
order to identify patterns and correlations that may be
indicativeof sepsis.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1254
Overall, the use of machine learning for sepsis
detection has the potential to improve the accuracy
and timeliness of sepsis diagnosis, which can help to
improve patient outcomes and reduce healthcare
costs. However, more research is needed to fully
understand the effectiveness and limitations of these
approaches and to optimize theirperformance in real-
world settings.
VII.REFERENCES
[1] K. E. Rudd, S. C. Johnson, K. M. Agesa et al.,
“Global, regional, and national sepsis incidence
and mortality, 1990–2017: analysis for the global
burden of disease study,” The Lancet, vol. 395,
no. 10219, pp. 200–211, 2020.
[2] L. Su, Z. Xu, F. Chang et al., “Early prediction of
mortality, severity, and length of stay in the
intensive care unit of sepsis patients based on
sepsis 3.0 by machine learning models,”
Frontiers in Medicine, vol. 8, 883 pages, 2021.
[3] K. C. Yuan, L. W. Tsai, K. H. Lee et al., “The
development an artificial intelligence
algorithm for early sepsis diagnosis in the
intensive care unit,” International Journal of
Medical Informatics, vol. 141, Article ID 104176,
2020.
[4] J. E. García-Gallo, N. J. Fonseca-Ruiz, L. A. Celi,and
J. F. Duitama-Muñoz, “A machine learning-
based model for 1 year mortality prediction in
patients admitted to an intensive care unit with
adiagnosis of sepsis,” Medicina Intensiva, vol. 44,
no. 3, pp. 160–170, 2020.
[5] J. Kim, H. Chang, D. Kim, D. H. Jang, I. Park, and
K. Kim, “Machine learning for prediction of
septic shock at initial triage in emergency
department,” Journal of Critical Care, vol. 55,
pp. 163–170, 2020.
[6] A. H. Beck, A. R. Sangoi, S. Leung et al.,
“Systematic analysis of breast cancer
morphology uncovers stromal features
associated with survival,” Science Translational
Medicine,vol. 3, no. 108, 108ra113 pages, 2011.
[7] D. J. Stekhoven and P. Bühlmann, “MissForest—
non-parametric missing value imputation for
mixed-type data,” Bioinformatics, vol. 28, no. 1,
pp. 112–118, 2012.
[8] R Core Team, R: A Language and Environment
for Statistical Computing. Vienna: R Foundation
for Statistical Computing, R Core Team, Vienna,
Austria, 2014.
[9] J. C. Gower, “A general coefficient of similarity
and some of its properties,” Biometrics, vol. 27,
no. 4, pp. 857–871, 1971.

More Related Content

PDF
Multi Disease Detection using Deep Learning
PDF
A COMPREHENSIVE SURVEY ON CARDIAC ARREST RISK LEVEL PREDICTION SYSTEM
PDF
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
PDF
IRJET - Prediction and Analysis of Multiple Diseases using Machine Learni...
PDF
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
PDF
Cloud based Health Prediction System
PDF
ICU MORTALITY PREDICTION
PDF
IRJET- Data Mining Techniques to Predict Diabetes
Multi Disease Detection using Deep Learning
A COMPREHENSIVE SURVEY ON CARDIAC ARREST RISK LEVEL PREDICTION SYSTEM
IRJET- Chronic Kidney Disease Prediction based on Naive Bayes Technique
IRJET - Prediction and Analysis of Multiple Diseases using Machine Learni...
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
Cloud based Health Prediction System
ICU MORTALITY PREDICTION
IRJET- Data Mining Techniques to Predict Diabetes

Similar to Sepsis Prediction Using Machine Learning (20)

PDF
A Machine Learning Model for Diabetes
PDF
Predictions And Analytics In Healthcare: Advancements In Machine Learning
PDF
Application of artificial intelligence techniques in the intensive care unit
PDF
Precision medicine in hepatology: harnessing IoT and machine learning for per...
PDF
IRJET- Result on the Application for Multiple Disease Prediction from Symptom...
PDF
Sepsis Prediction Using Machine Learning
PDF
IRJET - Review on Classi?cation and Prediction of Dengue and Malaria Dise...
PDF
IRJET- Heart Disease Prediction System
PDF
Heart Disease Prediction using Data Mining
PDF
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
PDF
IRJET - Classification and Prediction for Hospital Admissions through Emergen...
PDF
IRJET - E-Health Chain and Anticipation of Future Disease
PDF
IRJET- Sepsis Severity Prediction using Machine Learning
PPTX
Design and Implementation of an expert system for breast cancer diagnosis and...
PDF
Multiple disease prediction using Machine Learning Algorithms
PDF
Care expert assistant for Medicare system using Machine learning
PDF
Structure and development of a clinical decision support system: application ...
PPTX
development of healthcare decision management system.pptx
PDF
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
PDF
IRJET- An Information Forwarder for Healthcare Service and analysis using Big...
A Machine Learning Model for Diabetes
Predictions And Analytics In Healthcare: Advancements In Machine Learning
Application of artificial intelligence techniques in the intensive care unit
Precision medicine in hepatology: harnessing IoT and machine learning for per...
IRJET- Result on the Application for Multiple Disease Prediction from Symptom...
Sepsis Prediction Using Machine Learning
IRJET - Review on Classi?cation and Prediction of Dengue and Malaria Dise...
IRJET- Heart Disease Prediction System
Heart Disease Prediction using Data Mining
IRJET- Predicting Diabetes Disease using Effective Classification Techniques
IRJET - Classification and Prediction for Hospital Admissions through Emergen...
IRJET - E-Health Chain and Anticipation of Future Disease
IRJET- Sepsis Severity Prediction using Machine Learning
Design and Implementation of an expert system for breast cancer diagnosis and...
Multiple disease prediction using Machine Learning Algorithms
Care expert assistant for Medicare system using Machine learning
Structure and development of a clinical decision support system: application ...
development of healthcare decision management system.pptx
IRJET- Develop Futuristic Prediction Regarding Details of Health System for H...
IRJET- An Information Forwarder for Healthcare Service and analysis using Big...
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Ad

Recently uploaded (20)

DOCX
573137875-Attendance-Management-System-original
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Structs to JSON How Go Powers REST APIs.pdf
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
Welding lecture in detail for understanding
PDF
composite construction of structures.pdf
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
web development for engineering and engineering
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
573137875-Attendance-Management-System-original
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Structs to JSON How Go Powers REST APIs.pdf
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Foundation to blockchain - A guide to Blockchain Tech
bas. eng. economics group 4 presentation 1.pptx
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
Welding lecture in detail for understanding
composite construction of structures.pdf
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
web development for engineering and engineering
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx

Sepsis Prediction Using Machine Learning

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1245 Sepsis Prediction Using Machine Learning 1Mohammad Ateeq, 2 Vineet K Joshi, 3D Naga Praneeth, 4Gugulothu Ravi 4Assistant Professor, Department of Computer Science and Engineering, SNIST, Hyderabad-501301, India 1,2,3B. Tech Scholars, Department of Computer Science and Engineering, SNIST, Hyderabad-501301, India -----------------------------------------------------------------------***----------------------------------------------------------------------- Abstract:- Sepsis is a blood poisoning condition that can increase the mortality risk in ICU patients when the body exhibits a dysregulated host response to an infection and results in organ failure or tissue damage.The expense of treating sepsis in hospitals is rising yearly as it develops into a serious health issue. Different techniques have been developed to monitor sepsis electronically, however in order to reduce the risk of death, it is crucial to forecast sepsis as soon as feasible before clinical reports or conventional techniques. The primary characteristics influencing the classifier's predictions have been outlined, making the model easier for medical professionals to understand. MLP Classifier has been used for the early diagnosis of sepsis, particularly in ICU patients been applied.This study demonstrates how machine learning algorithms, employing six vital signs taken from patient records over the age of 18, can reliably predict sepsis at the time of a patient's admittance into the intensive care unit. Sepsis may be predicted early, which can assist doctors administer supportive care and save mortality and medical costs. Unprecedented assessment measures have been obtained, and they can be very helpful in accurately and promptly predicting sepsis. I. INTRODUCTION Sepsis is a life-threatening condition that occurs when the body's response to an infection leads to inflammation andtissue damage throughout the body. It is a leading causeof death in hospitals and can progress rapidly if not properly diagnosed and treated. Early detection of sepsis is crucial for improving patient outcomes, and machine learning techniques have the potential to significantly improve the accuracy and speed of sepsis diagnosis. One approach to sepsis detection using machine learning is touse patient data, such as vital signs and laboratory results, to train a model to predict the likelihood of sepsis. This data can be collected from electronic health recordsor other sources and may include demographic information, previous medical history, and current symptoms. The model can then be used to identify patients who are at high risk for sepsis, allowing healthcare providers to initiate early treatment and potentially prevent the progression of the condition. Utilizing machine learning as a different strategy for analysis. patterns in patient data and identify early warning signs of sepsis. This can be done by analysing trends in vital signs over time or by looking for changes in biomarkers that are indicative of sepsis. By identifying these early warning signs, healthcare providers can intervene before the condition becomes severe and potentially save lives. Overall, the use of machine learning in sepsis detection has the potential to significantly improve patient outcomes by enabling early diagnosis and treatment. But it's crucial to pay close attention to the ethical implications of using machine learning in healthcare and ensure that the technology is used responsibly and in a way that benefits patients. The development of artificial intelligence technologies has made it possible to diagnose sepsis early on. These techniques have been created to study and anticipate the health of the human body and acquire accurate prescription information to support doctors in making rapid and effective decisions. They integrate electronic medical records, medical imaging, pathophysiology, and other data. In several medical specialties, an AI-based diagnostic system has been proven to be efficient. In the domain of sepsis diagnosis, prognosis, and therapy machine learning algorithms used include supervised learning and reinforcement learning. For example, Beck et al. develop the C-Path (Computational Pathologist) system to automatically diagnose breast cancer and determine the likelihood that patients will survive by looking at breast tissue imaging. The primary two difficulties in the current study involve the use of different physiological indicators and modelling efficient machine learning algorithms for the II. LITERATURE SURVEY
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1246 diagnosis, prognosis, and treatment of sepsis. Similarly, Additionally important for predicting sepsis in advance is choose appropriate variables and design valuable algorithms in the clinical setting. The model's input variables are physiological indications, and the model's output parameter is the patient's condition would develop sepsis many hours later. In particular, input data include vital indicators such as heart rate, oxygen saturation, and body temperature; biomarkers such as procalcitonin andinterleukin-6; laboratory values such as bicarbonate andcreatinine; and demographic factors such as gender and age. In Most of the categories, a large number of missing values, such as those in MIMIC III (Intensive CareMedical Information Market Database), which has been utilised in several research. Most studies omit variable with a large number of missing values from predictors, resulting in the loss of useful information. To fill in missing information, some research utilize imputation and mean filling methods, although this might lead to selection bias or confounding factor mixes. The data preparation approach must be examined in light of the features of various data sets. The machine learning algorithms generally include support vector machines, gradient boosting trees, random forests, Logistic regression, and neural networks. Amongthem, MLP Classifier have shown good performance. The A model with improved prediction capabilities will be examined further rand improved results for clinical service. so, that Early sepsis choices can be better madeby physician diagnosis. The research have performed well in the area of sepsis prediction. The quantity of data utilised in these studies is, however, reduced because the majority of the missing values are handled by direct deletion or forward filling, and the model's explanatory power is therefore constrained. The following arguments in detail explain why it is difficult to implement these techniques inclinical settings. A comprehensive data set is lacking. Researchers make use of information from various patient groups, such as the MIMIC public database or other unbiased sources of hospital data. They choose different clinical factors to create their models, and the size of thedata also varies considerably. Different clinical criteria for sepsis and assessment indicators are used as prediction settings' premise and indicators. 1.DATA SET:-Dataset contains data of 36 thousand patients. Each patient is represented by 41 features. Fig 1. Data Set 2.FEATURE SELECTION:- In the mean processing method, 41 variables were determined to participate in the training model, including (a) vital signs indicators (HR, O2Sat, Temp, SBP, MAP, DBP, Resp), (b) laboratory variables (HCO3, pH, PaCO2, AST, BUN, AlkalinePhos, Chloride, Creatinine, Lactate, Magnesium, Potassium, Bilirubin_total, PTT, WBC, Fibrinogen, Platelets), and (c) demographic indicators (Age, Gender). III. METHODOLOGY Fig 2. Vital Signs(a)
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1247 Fig 3. Laboratory Variables(b) The variables that had missing proportions of greater than 98% were eliminated. The demographic metrics HospAdmTime (the interval between hospitalisation and ICU) and ICULOS (the interval between ICU hospitalizations) have been removed. HospAdmTime displays various numerical levels depending on the health of various patients, which may be connected to sepsis's extended incubation period. Since the primary goal of this study is to develop guidelines for predicting early sepsis from changes in certain physiological data, these variables are omitted. According to the statistics entered, patients with sepsis have a significant fatality rate. They frequently require prolonged ICU care, and the ICULOS value is typically excessively high. Contrarily, patients without sepsis typically get treatment in the ICU for a brief period of time before being discharged after their health has improved, resulting in a low ICULOS rating. The variable ICULOS is eliminated because the variation in ICULOS value is caused by the different nature of the sickness situation, which is against the causal sequence of early sepsis anticipated from physiological data. Fig 4: Demographics Indicator(c) 3.METHOD TO PREDICT SEPSIS:- MLPs are neural network models that work as universal approximators, i.e., they can approximate any continuous function, MLPs are composed of neurons called perceptions. a perceptron receives n features as input (x = x1, x2,…, xn), and each of these features is associated to a weight. Input features must be numeric. So, nonnumeric input features have to be converted to numeric ones in order to use a perceptron. The result of this computation is then passed onto an activation function f, which will produce the output of the perceptron. In the original perceptron, the activationfunction is a step function: Thus, we can see that the perceptron determines whether w1x1 + w2x2 + ⋯ + wnxn − θ > 0 is true or false. The equation w1x1 + w2x2 + ⋯ + wnxn − θ = 0 is the equation of a hyperplane.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1248 3.1 IMPROVEMENT OF PREPROCESSING WARNING PERIODS:- The 6-hour warning period for each patient is immediately integrated to produce a single observation in the mean processing approach discussed above, however the model's performance in terms of prediction may not be sufficient. Further research is being done to determine whether or not higher performance results from segmentation time windows that are finer ordenser. In order to calculate the mean vector, the 6- hour warning period is split into 2- or 3-hour time windows, and the mean processing procedure for the safe period and illness period is left alone. Figure 5displays further information. The improvement is compared with the original models of their generalisation capabilities based on MLP’s Classifier and new datasets for training models are created in the same manner. Fig 5. MLP Mean Calculator 4. Fig 6: Feature Importance IV. UML DIAGRAM A sequence diagram (UML) is a visual representation of the flow of messages between objects during an interaction. A group of objects connected by lifelines and the messages they exchange throughout the course of an interaction make up a sequence diagram. Fig 7. Sequential Diagram 1.MODEL PERFORMANCE:- In the mean processing method (method1), the MLP Classifier algorithms differ in performance. The MLP’s algorithm has a Log loss rate of 0.15, with better distinction performance between 0- 1 categories. V. RESULTS FEATURE IMPORTANCE:- For the feature importance score, we take the MLP Classifier algorithm in the mean processing method as an example; the top 10variables with feature importance scores are Temp, O2Sat, Resp, HR, Age, SBP, MAP, PTT, PaCO2, and Potassium as shown in Figure 6. This means that these variables play an important role in predicting the risk of sepsis.
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1252 The log-loss shows how accurate the forecast was the likelihood corresponds to the matching real or true value The Accuracy score is calculated by dividing the number of correct predictions by the total prediction number. 2.MLP ALGORITHM:-This model optimizes the log- loss function using LBFGS or stochastic gradient descent. (0 or 1 in case of binary classification). The more the predicted the further the probability deviates from the actual value the log-loss value. class sklearn.neural_network.MLPClassifi er(hidden_layer_sizes=(100,), activation='relu', *, sol ver='adam', alpha=0.0001, batch_size='auto', learni ng_rate='constant', learning_rate_init=0.001, power_t=0.5, max_iter=200, shuffle=True, random_state=N one, tol=0.0001, verbose=False, warm_start=False, momentum=0.9, nesterovs_momentum=True, early_ stopping=False, validation_fraction=0.1, beta_1=0.9, beta_2=0.999, epsilon=1e- 08, n_iter_no_change=10, max_fun=15000) Training of variables in algorithm for classification. Fig 8. Accuracy Fig 9. Log loss VI.CONCLUSION It is possible to use machine learning techniques for the detection of sepsis, a serious and potentially life threatening condition that can arise as a complication of infection. Sepsis is a complex and dynamic process thatcan be difficult to diagnose, and early identification and treatment are critical for improving patient outcomes. Several machine learning techniques exist that have been explored for the detection of sepsis, including supervisedlearning methods such as decision tree algorithms and support vector machines, as well as unsupervised learning methods such as clustering and anomaly detection. One potential advantage of using machine learning for sepsis detection is the ability to analyse and interpret large amounts of patient data, including electronic health records, laboratory results, and vital signs, in order to identify patterns and correlations that may be indicativeof sepsis.
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 12 | Dec 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 1254 Overall, the use of machine learning for sepsis detection has the potential to improve the accuracy and timeliness of sepsis diagnosis, which can help to improve patient outcomes and reduce healthcare costs. However, more research is needed to fully understand the effectiveness and limitations of these approaches and to optimize theirperformance in real- world settings. VII.REFERENCES [1] K. E. Rudd, S. C. Johnson, K. M. Agesa et al., “Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the global burden of disease study,” The Lancet, vol. 395, no. 10219, pp. 200–211, 2020. [2] L. Su, Z. Xu, F. Chang et al., “Early prediction of mortality, severity, and length of stay in the intensive care unit of sepsis patients based on sepsis 3.0 by machine learning models,” Frontiers in Medicine, vol. 8, 883 pages, 2021. [3] K. C. Yuan, L. W. Tsai, K. H. Lee et al., “The development an artificial intelligence algorithm for early sepsis diagnosis in the intensive care unit,” International Journal of Medical Informatics, vol. 141, Article ID 104176, 2020. [4] J. E. García-Gallo, N. J. Fonseca-Ruiz, L. A. Celi,and J. F. Duitama-Muñoz, “A machine learning- based model for 1 year mortality prediction in patients admitted to an intensive care unit with adiagnosis of sepsis,” Medicina Intensiva, vol. 44, no. 3, pp. 160–170, 2020. [5] J. Kim, H. Chang, D. Kim, D. H. Jang, I. Park, and K. Kim, “Machine learning for prediction of septic shock at initial triage in emergency department,” Journal of Critical Care, vol. 55, pp. 163–170, 2020. [6] A. H. Beck, A. R. Sangoi, S. Leung et al., “Systematic analysis of breast cancer morphology uncovers stromal features associated with survival,” Science Translational Medicine,vol. 3, no. 108, 108ra113 pages, 2011. [7] D. J. Stekhoven and P. Bühlmann, “MissForest— non-parametric missing value imputation for mixed-type data,” Bioinformatics, vol. 28, no. 1, pp. 112–118, 2012. [8] R Core Team, R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing, R Core Team, Vienna, Austria, 2014. [9] J. C. Gower, “A general coefficient of similarity and some of its properties,” Biometrics, vol. 27, no. 4, pp. 857–871, 1971.