SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 341
Comparative Analysis of Various Algorithms for Fetal Risk Prediction
Jay Mistry1
1 Fourth Year Student, Information Technology Department, Thadomal Shahani Engineering College, Bandra(W),
Mumbai - 400050
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Fetal problems emerge while your unborn child
grows within the womb. Congenital refers to the fact that a
child is born with these conditions. Some fetal disorders are
inherited genetically or from a parent. A machine learning
approach for predicting if a woman is having a high risk fetus
is needed. We have collected the data from an online
repository. The data is been balanced and pre-processed for
better prediction. This dataset is being used on machine
learning algorithms like Random Forest, Bagging,
AdaBoostM1, SMO, Kstar, Naïve Bayes, Hoeffding Tree and
Classification via Regression to build a model which will help
in predicting the fetal condition. All of these algorithms is
compared with specific performance metrics and the best
result is showcased.
Key Words: Fetal, pre-processed, prediction,
performance metrics
1. INTRODUCTION
As included in NCI dictionary of Cancer terms, fetus is
defined as the unborn baby that develops and grows inside
the uterus in humans. The fetal period begins 8 weeks after
fertilization of an egg by a sperm and ends at the time of
birth. There might be cases where a pregnant lady could
have pregnancy risks or distressing conditions for the fetus.
This distress is may be due to low levels of amniotic fluids,
high levels of amniotic fluids, placental abruption,
uncontrolled diabetes and pregnancy lasting for more than
40 weeks. Fetal surgery may be advantageous for babies
with specific birth abnormalities. Our specialists carry out
these extremely difficult treatmentswhileyourunbornchild
is still in the womb. There are tests that are used to assess
the fetal health such as fetal movement counts, biophysical
profile, contraction stress test and Dopplerultrasoundexam
of the umbilical artery. A method for keepingtrack ofuterine
contractions and fetal heart rate during pregnancy is called
Cardiotocography, or CTG. It is used to evaluate thehealthof
the fetus and to spot fetal distress early.
To create a model that represents the various data classes
and forecasts future data trends, classification and
prediction are used. With the aid of prediction models,
classification forecasts the category labels of data. We have
the clearest knowledge of the data at a broad scale thanks to
this analysis. Prediction models predict continuous-valued
functions, whereas classification models predict categorical
class labels. For instance, based on a person's income and
line of work, we can create a classification model to classify
bank loan applications as safe or risky, or a prediction model
to estimate how much money a potential consumer will
spend on computer equipment.
1.1 Motivation
Considering all the tests that used to detect fetal
abnormalities, very fewofthemareactuallyreliable.Prenatal
tests are not always perfect. The data for false-positives or
false-negatives varies from test to test. These procedures
might carry a real risk of miscarriage because an amount of
amniotic fluid or tissue from around the fetus is needed.
1.2 Problem Statement
An analysis of various machine learning algorithms to
select which algorithm can accurately predict the level of
risk in fetus based on few performance metrics.
2. Related Work
In [10] J. Li and X. Liu have implemented twelve machine
learning algorithms on CTG dataset. The proposedmodelhas
performed brilliantly in different classification model
evaluations. The four top modelsarethencombinedtocreate
the Blender Model using the soft voting integration method,
which is then contrasted with the stacking integration
method. The model described in this study outperformedthe
conventional machine learning models in a variety of
Classification Model evaluations, achieving accuracy rates of
0.959, AUCs of 0.988, recall rates of 0.916, precision rates of
0.959, F1s of 0.958, and MCCs of 0.886.
In [7] R. Chinnaiyan and S. Alex have used Machine Learning
algorithm to build a predictive classifier to forecast the fetal
health and growth state from a set of pre-classified patterns
knowledge. The majority of this evaluation of the literature
focuses on fetal anomalies that occur during the first
trimester of pregnancy. The major goal of this article review
is to investigate the various machine learning processes for
accurate diagnosis and prognosis of abdominal anomalies in
order to lower the incidence rate. Tested machine learning
algorithmssave timeandeffortwhiledeliveringmoreprecise
results. Segmentation, Image Enhancement, Feature
Extraction, and Image Classification are used to accomplish
this.
In [6] the results of the tests display a classifier model to be
83% and 84% accurate, before and after feature selection,
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 342
respectively. This researchproposesaclassificationbasedon
association (CBA) that is a rule-based method to the
Cardiotocographicanalysis. A useful indicator of fetushealth
confirmation is fetal improvement. One of the main causesof
foetal death is irregular fetal movement. Therefore, early
diagnosis is necessary to promotethe foetal health state.The
method under consideration aims to develop a model
employing the associative classification technique to study
fetal movement in order to improve the accuracy of
diagnoses for expectant women while minimising fetal
movement (DFM)
3. IMPLEMENTED WORK
Based on the above research, we have implemented an
analysis between different Machine Learning algorithms.
After downloading the data from Kaggle (a free online
repository), it was inserted on the data mining application
WEKA. The dataset looks is shown in the Figure 1.
Figure 1. Dataset
For better prediction, data was preprocessed. If you work
with large amounts of raw data or big data, you are aware of
how crucial data preprocessing is to enhancingthequality of
the data as the retrieved raw data is frequently unreliable,
imperfect, and noisy. We must ensure that our training data
is in the right format before using it to train an algorithm.
Data pretreatment is a crucial stage in data mining since it
helps to spot errors, outliers, noise, and missing important
variables. These data mistakes would persist without data
pretreatment in data science, lowering the calibre of data
mining. After the data cleaning is finished, a number of
processing steps are includedindata pre-processing,suchas
data integration, data conversion, and other processing
steps. The filters available in WEKA tool helped in
transforming numerical attributes to nominal attributes
which is showed in the Figure 2.
Figure 2. Nominal representation
Risk was divided into three categories namely Low, Medium
and High level risk. 1 (Blue color) being the lowest and 3
(Sky blue color) being the highest.
Class-balancing wasalsoperformedsimultaneously.Theaim
of Normalization is to scale down features to a similar scale.
This increases training stability. The nominal data now
consists of values ranging between 0 to 1. The screen in
WEKA provided description about the data in the dataset.
Missing values and redundant data were removed so that it
won’t hinder the accuracy of the model. The dataset consists
of 11 columns and 1 column for prediction. The attributes
are baseline value, fetal_movement, light_decelerations,
severe_decelerations and prolongued_decelerations,
uterine_contractions etc [4].Thegraphs betweenall ofthose
11 columns were visualized using WEKA.
Open the classify tab in WEKA and go on selecting different
algorithms for receiving a better accuracy. Each of those
algorithms also include hyper-parameter tuning. For this,
RandomForest, Bagging, AdaBoostM1, SMO, Kstar,
NaiveBayes, HoeffdingTree and ClassificationviaRegression
algorithms were used.
The selected algorithms are then analyzed and compared
with three performance metrics to evaluate the model. An
accurate model with good accuracy and performance
measure is then picked as the preferred algorithm. The flow
of the procedure is constructed using a block diagram in the
Figure 3.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 343
Figure 3. Block diagram
4. RESULTS AND ANALYSIS
In the classification process, all the algorithms were trained
with the training datasets as well as with k-fold cross
validation (k=10). A summary of each model was noted
which is displayed in Table 1.
Algorithms Trainin
g (%)
Precisi
on
Recall F-score Time
(secs)
NaiveBayes 89.5 0.96 0.92 0.94 0.01
AdaBoostM1-
DecisionStump
77.8 0.77 1 0.87 0.08
Bagging-J48 91.8 0.92 0.97 0.95 0.01
RandomForest 99.9 0.99 1 0.99 0.06
HoeffdingTree 89.5 0.96 0.92 0.94 0.01
Classificationvia
Regression
97.4 0.97 0.93 0.98 0.04
SMO 98.2 0.98 0.99 0.98 0.04
Kstar 99.9 0.99 1 0.99 5.49
Table 1. Analysis
According to the result displayed, we can see that
RandomForest and Kstar algorithm givesthesameaccuracy.
However when time complexityisincludedRandomForestis
preferred more. Precision, Recall and F-measure are super
standard way to evaluate a model. Precision helps us to
measure the ability to classify positive samples in a dataset.
Recall helps us to measure positive samples that are
correctly classified by the model. F-measure is the harmonic
mean of Precision and Recall. If the F-scoreishigherthanthe
model is considered better. Formula for each of the metric is
shown in Figure 4.
Figure 4. Formula
Parameters that assisted RandomForesttogethighaccuracy
are batchSize=100, numIterations=100, seed=1,
macDepth=0. These were set by the WEKA tool by default.
5. CONCLUSIONS
A healthy birth comes from a healthy pregnancy. Prenatal
care improves the chances of a healthy and risk-free
pregnancy and birth. This beginswithpre-pregnancycare. A
pre-pregnancy check and prenatal care can help in the
prevention of complications and help women in
understanding how they can keep the baby healthy while
taking care of themselves. Doctors can solve the issue by
predicting if the fetus is under distress or risk-free.
To do the prediction, first data is downloaded, pre-
processed. Various algorithm were trained on that model.
The accuracy we received was highest for RandomForest
algorithm. Precision, Recall and F-score also comments that
RandomForest would be the preferred algorithm amongthe
selected ones.
In future, we can create a system which predicts
abnormalities in fetus and which procedure needs to be
carried out even though it may require extensive dataset.
REFERENCES
[1] P. N. Tan, M. Steinbach, Vipin Kumar, “Introduction to
Data Mining”, Pearson Education.
[2] Han, Kamber, "Data Mining Concepts and Techniques",
Morgan Kaufmann 3nd Edition.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072
© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 344
[3] WEKA: https://www.cs.waikato.ac.nz/ml/weka/
[4] Dataset:-
https://www.kaggle.com/datasets/andrewmvd/fetal-
health-classification
[5] A. K. Pradhan, J. K. Rout, A. B. Maharana, B. K.
Balabantaray and N. K. Ray, "A Machine Learning
Approach for the Prediction of Fetal Health using CTG,"
2021 19th OITS International Conference on
Information Technology (OCIT), 2021, pp. 239-244,doi:
10.1109/OCIT53463.2021.00056.
[6] J. Piri and P. Mohapatra, "Exploring Fetal Health Status
Using an Association Based Classification Approach,"
2019 International Conference on Information
Technology (ICIT), 2019, pp. 166-171, doi:
10.1109/ICIT48102.2019.00036.
[7] R. Chinnaiyan and S. Alex, "Machine Learning
Approaches for Early Diagnosis and Prediction of Fetal
Abnormalities," 2021 International Conference on
Computer Communication and Informatics (ICCCI),
2021, pp. 1-3, doi: 10.1109/ICCCI50826.2021.9402317.
[8] A. Chowdhury, A. Chahar, R. Eswara, M. A. Raheem, S.
Ehetesham and B. K. Thulasidoss, "Fetal Health
Prediction using neural networks," 2022 8th
International Conference on Advanced Computing and
Communication Systems (ICACCS), 2022, pp. 256-260,
doi: 10.1109/ICACCS54159.2022.9784987.
[9] K. Agrawal and H. Mohan, "Cardiotocography Analysis
for Fetal State Classification Using Machine Learning
Algorithms," 2019 International Conference on
Computer Communication and Informatics (ICCCI),
2019, pp. 1-6, doi: 10.1109/ICCCI.2019.8822218.
[10] J. Li and X. Liu, "Fetal Health Classification Based on
Machine Learning," 2021 IEEE 2nd International
Conference on Big Data, Artificial Intelligence and
Internet of Things Engineering (ICBAIE), 2021, pp. 899-
902, doi: 10.1109/ICBAIE52039.2021.9389902.

More Related Content

PPTX
FETUS[1] [Read-Only].pptx First trimmers pregnancy
PPTX
ecvehcfejcvjehvdcbdjcb dbc jdcbjbcjdb cd.pptx
PPTX
Automating fetal heart monitor using machine learning
PDF
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
PDF
IRJET- Disease Prediction System
PPTX
Data Science Project: Advancements in Fetal Health Classification
PDF
IRJET- Prediction of Heart Disease using RNN Algorithm
PDF
IRJET- Heart Disease Prediction System
FETUS[1] [Read-Only].pptx First trimmers pregnancy
ecvehcfejcvjehvdcbdjcb dbc jdcbjbcjdb cd.pptx
Automating fetal heart monitor using machine learning
Impact of Classification Algorithms on Cardiotocography Dataset for Fetal Sta...
IRJET- Disease Prediction System
Data Science Project: Advancements in Fetal Health Classification
IRJET- Prediction of Heart Disease using RNN Algorithm
IRJET- Heart Disease Prediction System

Similar to Comparative Analysis of Various Algorithms for Fetal Risk Prediction (20)

PDF
Classification of Heart Diseases Patients using Data Mining Techniques
PDF
Classification of Health Care Data Using Machine Learning Technique
PDF
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
PDF
IRJET- Disease Prediction using Machine Learning
PDF
Heart Disease Prediction Using Random Forest Algorithm
PDF
Opposition Based Firefly Algorithm Optimized Feature Subset Selection Approac...
PDF
Heart_Disease_Prediction_and_Classificat.pdf
PDF
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
PDF
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
PDF
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
PPTX
the impact of Artificila intelligence in medicine
PPTX
Machine learning to solve bioinformatics problems
PDF
A comparative analysis of classification techniques on medical data sets
PDF
ICU Mortality Rate Estimation Using Machine Learning and Artificial Neural Ne...
PDF
Fundamentals of data science presentation
PDF
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
PDF
IRJET- Human Heart Disease Prediction using Ensemble Learning and Particle Sw...
PDF
Disease Prediction Using Machine Learning
PPT
Survey on data mining techniques in heart disease prediction
PDF
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
Classification of Heart Diseases Patients using Data Mining Techniques
Classification of Health Care Data Using Machine Learning Technique
IRJET- Machine Learning Classification Algorithms for Predictive Analysis in ...
IRJET- Disease Prediction using Machine Learning
Heart Disease Prediction Using Random Forest Algorithm
Opposition Based Firefly Algorithm Optimized Feature Subset Selection Approac...
Heart_Disease_Prediction_and_Classificat.pdf
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
AN ALGORITHM FOR PREDICTIVE DATA MINING APPROACH IN MEDICAL DIAGNOSIS
E-Healthcare monitoring System for diagnosis of Heart Disease using Machine L...
the impact of Artificila intelligence in medicine
Machine learning to solve bioinformatics problems
A comparative analysis of classification techniques on medical data sets
ICU Mortality Rate Estimation Using Machine Learning and Artificial Neural Ne...
Fundamentals of data science presentation
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Human Heart Disease Prediction using Ensemble Learning and Particle Sw...
Disease Prediction Using Machine Learning
Survey on data mining techniques in heart disease prediction
CARDIOVASCULAR DISEASE PREDICTION USING GENETIC ALGORITHM
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Ad

Recently uploaded (20)

PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
composite construction of structures.pdf
PPTX
Construction Project Organization Group 2.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
bas. eng. economics group 4 presentation 1.pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
Welding lecture in detail for understanding
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPT
Mechanical Engineering MATERIALS Selection
PPTX
Sustainable Sites - Green Building Construction
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
composite construction of structures.pdf
Construction Project Organization Group 2.pptx
OOP with Java - Java Introduction (Basics)
Internet of Things (IOT) - A guide to understanding
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
bas. eng. economics group 4 presentation 1.pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Welding lecture in detail for understanding
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
UNIT 4 Total Quality Management .pptx
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Mechanical Engineering MATERIALS Selection
Sustainable Sites - Green Building Construction
Foundation to blockchain - A guide to Blockchain Tech
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx

Comparative Analysis of Various Algorithms for Fetal Risk Prediction

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 341 Comparative Analysis of Various Algorithms for Fetal Risk Prediction Jay Mistry1 1 Fourth Year Student, Information Technology Department, Thadomal Shahani Engineering College, Bandra(W), Mumbai - 400050 ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Fetal problems emerge while your unborn child grows within the womb. Congenital refers to the fact that a child is born with these conditions. Some fetal disorders are inherited genetically or from a parent. A machine learning approach for predicting if a woman is having a high risk fetus is needed. We have collected the data from an online repository. The data is been balanced and pre-processed for better prediction. This dataset is being used on machine learning algorithms like Random Forest, Bagging, AdaBoostM1, SMO, Kstar, Naïve Bayes, Hoeffding Tree and Classification via Regression to build a model which will help in predicting the fetal condition. All of these algorithms is compared with specific performance metrics and the best result is showcased. Key Words: Fetal, pre-processed, prediction, performance metrics 1. INTRODUCTION As included in NCI dictionary of Cancer terms, fetus is defined as the unborn baby that develops and grows inside the uterus in humans. The fetal period begins 8 weeks after fertilization of an egg by a sperm and ends at the time of birth. There might be cases where a pregnant lady could have pregnancy risks or distressing conditions for the fetus. This distress is may be due to low levels of amniotic fluids, high levels of amniotic fluids, placental abruption, uncontrolled diabetes and pregnancy lasting for more than 40 weeks. Fetal surgery may be advantageous for babies with specific birth abnormalities. Our specialists carry out these extremely difficult treatmentswhileyourunbornchild is still in the womb. There are tests that are used to assess the fetal health such as fetal movement counts, biophysical profile, contraction stress test and Dopplerultrasoundexam of the umbilical artery. A method for keepingtrack ofuterine contractions and fetal heart rate during pregnancy is called Cardiotocography, or CTG. It is used to evaluate thehealthof the fetus and to spot fetal distress early. To create a model that represents the various data classes and forecasts future data trends, classification and prediction are used. With the aid of prediction models, classification forecasts the category labels of data. We have the clearest knowledge of the data at a broad scale thanks to this analysis. Prediction models predict continuous-valued functions, whereas classification models predict categorical class labels. For instance, based on a person's income and line of work, we can create a classification model to classify bank loan applications as safe or risky, or a prediction model to estimate how much money a potential consumer will spend on computer equipment. 1.1 Motivation Considering all the tests that used to detect fetal abnormalities, very fewofthemareactuallyreliable.Prenatal tests are not always perfect. The data for false-positives or false-negatives varies from test to test. These procedures might carry a real risk of miscarriage because an amount of amniotic fluid or tissue from around the fetus is needed. 1.2 Problem Statement An analysis of various machine learning algorithms to select which algorithm can accurately predict the level of risk in fetus based on few performance metrics. 2. Related Work In [10] J. Li and X. Liu have implemented twelve machine learning algorithms on CTG dataset. The proposedmodelhas performed brilliantly in different classification model evaluations. The four top modelsarethencombinedtocreate the Blender Model using the soft voting integration method, which is then contrasted with the stacking integration method. The model described in this study outperformedthe conventional machine learning models in a variety of Classification Model evaluations, achieving accuracy rates of 0.959, AUCs of 0.988, recall rates of 0.916, precision rates of 0.959, F1s of 0.958, and MCCs of 0.886. In [7] R. Chinnaiyan and S. Alex have used Machine Learning algorithm to build a predictive classifier to forecast the fetal health and growth state from a set of pre-classified patterns knowledge. The majority of this evaluation of the literature focuses on fetal anomalies that occur during the first trimester of pregnancy. The major goal of this article review is to investigate the various machine learning processes for accurate diagnosis and prognosis of abdominal anomalies in order to lower the incidence rate. Tested machine learning algorithmssave timeandeffortwhiledeliveringmoreprecise results. Segmentation, Image Enhancement, Feature Extraction, and Image Classification are used to accomplish this. In [6] the results of the tests display a classifier model to be 83% and 84% accurate, before and after feature selection,
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 342 respectively. This researchproposesaclassificationbasedon association (CBA) that is a rule-based method to the Cardiotocographicanalysis. A useful indicator of fetushealth confirmation is fetal improvement. One of the main causesof foetal death is irregular fetal movement. Therefore, early diagnosis is necessary to promotethe foetal health state.The method under consideration aims to develop a model employing the associative classification technique to study fetal movement in order to improve the accuracy of diagnoses for expectant women while minimising fetal movement (DFM) 3. IMPLEMENTED WORK Based on the above research, we have implemented an analysis between different Machine Learning algorithms. After downloading the data from Kaggle (a free online repository), it was inserted on the data mining application WEKA. The dataset looks is shown in the Figure 1. Figure 1. Dataset For better prediction, data was preprocessed. If you work with large amounts of raw data or big data, you are aware of how crucial data preprocessing is to enhancingthequality of the data as the retrieved raw data is frequently unreliable, imperfect, and noisy. We must ensure that our training data is in the right format before using it to train an algorithm. Data pretreatment is a crucial stage in data mining since it helps to spot errors, outliers, noise, and missing important variables. These data mistakes would persist without data pretreatment in data science, lowering the calibre of data mining. After the data cleaning is finished, a number of processing steps are includedindata pre-processing,suchas data integration, data conversion, and other processing steps. The filters available in WEKA tool helped in transforming numerical attributes to nominal attributes which is showed in the Figure 2. Figure 2. Nominal representation Risk was divided into three categories namely Low, Medium and High level risk. 1 (Blue color) being the lowest and 3 (Sky blue color) being the highest. Class-balancing wasalsoperformedsimultaneously.Theaim of Normalization is to scale down features to a similar scale. This increases training stability. The nominal data now consists of values ranging between 0 to 1. The screen in WEKA provided description about the data in the dataset. Missing values and redundant data were removed so that it won’t hinder the accuracy of the model. The dataset consists of 11 columns and 1 column for prediction. The attributes are baseline value, fetal_movement, light_decelerations, severe_decelerations and prolongued_decelerations, uterine_contractions etc [4].Thegraphs betweenall ofthose 11 columns were visualized using WEKA. Open the classify tab in WEKA and go on selecting different algorithms for receiving a better accuracy. Each of those algorithms also include hyper-parameter tuning. For this, RandomForest, Bagging, AdaBoostM1, SMO, Kstar, NaiveBayes, HoeffdingTree and ClassificationviaRegression algorithms were used. The selected algorithms are then analyzed and compared with three performance metrics to evaluate the model. An accurate model with good accuracy and performance measure is then picked as the preferred algorithm. The flow of the procedure is constructed using a block diagram in the Figure 3.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 343 Figure 3. Block diagram 4. RESULTS AND ANALYSIS In the classification process, all the algorithms were trained with the training datasets as well as with k-fold cross validation (k=10). A summary of each model was noted which is displayed in Table 1. Algorithms Trainin g (%) Precisi on Recall F-score Time (secs) NaiveBayes 89.5 0.96 0.92 0.94 0.01 AdaBoostM1- DecisionStump 77.8 0.77 1 0.87 0.08 Bagging-J48 91.8 0.92 0.97 0.95 0.01 RandomForest 99.9 0.99 1 0.99 0.06 HoeffdingTree 89.5 0.96 0.92 0.94 0.01 Classificationvia Regression 97.4 0.97 0.93 0.98 0.04 SMO 98.2 0.98 0.99 0.98 0.04 Kstar 99.9 0.99 1 0.99 5.49 Table 1. Analysis According to the result displayed, we can see that RandomForest and Kstar algorithm givesthesameaccuracy. However when time complexityisincludedRandomForestis preferred more. Precision, Recall and F-measure are super standard way to evaluate a model. Precision helps us to measure the ability to classify positive samples in a dataset. Recall helps us to measure positive samples that are correctly classified by the model. F-measure is the harmonic mean of Precision and Recall. If the F-scoreishigherthanthe model is considered better. Formula for each of the metric is shown in Figure 4. Figure 4. Formula Parameters that assisted RandomForesttogethighaccuracy are batchSize=100, numIterations=100, seed=1, macDepth=0. These were set by the WEKA tool by default. 5. CONCLUSIONS A healthy birth comes from a healthy pregnancy. Prenatal care improves the chances of a healthy and risk-free pregnancy and birth. This beginswithpre-pregnancycare. A pre-pregnancy check and prenatal care can help in the prevention of complications and help women in understanding how they can keep the baby healthy while taking care of themselves. Doctors can solve the issue by predicting if the fetus is under distress or risk-free. To do the prediction, first data is downloaded, pre- processed. Various algorithm were trained on that model. The accuracy we received was highest for RandomForest algorithm. Precision, Recall and F-score also comments that RandomForest would be the preferred algorithm amongthe selected ones. In future, we can create a system which predicts abnormalities in fetus and which procedure needs to be carried out even though it may require extensive dataset. REFERENCES [1] P. N. Tan, M. Steinbach, Vipin Kumar, “Introduction to Data Mining”, Pearson Education. [2] Han, Kamber, "Data Mining Concepts and Techniques", Morgan Kaufmann 3nd Edition.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 09 Issue: 10 | Oct 2022 www.irjet.net p-ISSN: 2395-0072 © 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 344 [3] WEKA: https://www.cs.waikato.ac.nz/ml/weka/ [4] Dataset:- https://www.kaggle.com/datasets/andrewmvd/fetal- health-classification [5] A. K. Pradhan, J. K. Rout, A. B. Maharana, B. K. Balabantaray and N. K. Ray, "A Machine Learning Approach for the Prediction of Fetal Health using CTG," 2021 19th OITS International Conference on Information Technology (OCIT), 2021, pp. 239-244,doi: 10.1109/OCIT53463.2021.00056. [6] J. Piri and P. Mohapatra, "Exploring Fetal Health Status Using an Association Based Classification Approach," 2019 International Conference on Information Technology (ICIT), 2019, pp. 166-171, doi: 10.1109/ICIT48102.2019.00036. [7] R. Chinnaiyan and S. Alex, "Machine Learning Approaches for Early Diagnosis and Prediction of Fetal Abnormalities," 2021 International Conference on Computer Communication and Informatics (ICCCI), 2021, pp. 1-3, doi: 10.1109/ICCCI50826.2021.9402317. [8] A. Chowdhury, A. Chahar, R. Eswara, M. A. Raheem, S. Ehetesham and B. K. Thulasidoss, "Fetal Health Prediction using neural networks," 2022 8th International Conference on Advanced Computing and Communication Systems (ICACCS), 2022, pp. 256-260, doi: 10.1109/ICACCS54159.2022.9784987. [9] K. Agrawal and H. Mohan, "Cardiotocography Analysis for Fetal State Classification Using Machine Learning Algorithms," 2019 International Conference on Computer Communication and Informatics (ICCCI), 2019, pp. 1-6, doi: 10.1109/ICCCI.2019.8822218. [10] J. Li and X. Liu, "Fetal Health Classification Based on Machine Learning," 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), 2021, pp. 899- 902, doi: 10.1109/ICBAIE52039.2021.9389902.