This document presents research on using the random forest machine learning algorithm to classify and predict heart disease.
The researchers used a dataset of 303 patient records combining data from 4 databases to train and test their random forest model. They achieved a classification accuracy of 86.9% and diagnosis rate of 93.3% when predicting heart disease risk.
Future work could include improving data quality, collecting a larger and more diverse dataset, further optimizing model hyperparameters, and developing classifiers to predict specific heart conditions rather than a general heart disease risk prediction. The goal of this research is to develop an accurate and efficient heart disease prediction tool to help doctors diagnose diseases early and improve patient outcomes.