Classifying familial hypercholesterolaemia: a tree-based machine learning approach

Classifying familial hypercholesterolaemia: a tree-based machine learning approach. International Journal of Advanced Computer Science and Applications, 12 (9). pp. 66-73. ISSN 2156-5570 (2021)



Abstract

Familial hypercholesterolaemia is the most common and serious form of inherited hyperlipidaemia. It has an autosomal dominant mode of inheritance, and is characterised by severely elevated low-density lipoprotein cholesterol levels. Familial hypercholesterolaemia is an important cause of premature coronary heart disease, but is potentially treatable. However, the majority of individuals with familial hypercholesterolaemia are under-diagnosed and under-treated, resulting in lost opportunities for premature coronary heart disease prevention. This study aims to assess the performance of machine learning algorithms for enhancing familial hypercholesterolaemia detection within the Malaysian population. We applied three machine learning algorithms (random forest, gradient boosting and decision tree) to classify familial hypercholesterolaemia among Malaysian patients and to identify relevant features from four well-known diagnostic instruments: Simon Broome, Dutch Lipid Clinic Criteria, US Make Early Diagnosis to Prevent Early Deaths and Japanese FH Management Criteria. The performance of these classifiers was compared in terms of accuracy, precision, sensitivity and specificity. Our results indicated that the decision tree classifier had the best performance, with an accuracy of 99.72%, followed by the gradient boosting and random forest classifiers, with accuracies of 99.54% and 99.52%, respectively. Combined with the Recursive Feature Elimination method, the three classifiers selected six common features of familial hypercholesterolaemia diagnostic criteria (family history of coronary heart disease, low-density lipoprotein cholesterol levels, presence of tendon xanthomata and/or corneal arcus, family hypercholesterolaemia, and family history of familial hypercholesterolaemia) that generate the highest accuracy in predicting familial hypercholesterolaemia. We anticipate that machine learning algorithms will enhance rapid diagnosis of familial hypercholesterolaemia by providing the tools to develop a virtual screening test for familial hypercholesterolaemia.
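
The four measures used to compare the classifiers (accuracy, precision, sensitivity and specificity) can all be derived from the counts of true and false positives and negatives. The following is a minimal Python sketch of those standard definitions, not the authors' code. The function names and the example labels are hypothetical and are not taken from the study's data; 1 is assumed to mean a patient classified as having familial hypercholesterolaemia and 0 a patient classified as not having it.

```python
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts of the four possible outcomes of a binary classifier."""
    tp: int  # positive cases predicted positive
    fp: int  # negative cases predicted positive
    tn: int  # negative cases predicted negative
    fn: int  # positive cases predicted negative


def count_outcomes(y_true: Sequence[int], y_pred: Sequence[int]) -> ConfusionCounts:
    """Tally outcomes for labels coded as 1 (positive) and 0 (negative)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    return ConfusionCounts(
        tp=sum(1 for t, p in pairs if t == 1 and p == 1),
        fp=sum(1 for t, p in pairs if t == 0 and p == 1),
        tn=sum(1 for t, p in pairs if t == 0 and p == 0),
        fn=sum(1 for t, p in pairs if t == 1 and p == 0),
    )


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def classification_metrics(c: ConfusionCounts) -> Dict[str, float]:
    """Compute the four measures from the confusion counts."""
    return {
        "accuracy": _ratio(c.tp + c.tn, c.tp + c.fp + c.tn + c.fn),
        "precision": _ratio(c.tp, c.tp + c.fp),
        "sensitivity": _ratio(c.tp, c.tp + c.fn),  # true positive rate
        "specificity": _ratio(c.tn, c.tn + c.fp),  # true negative rate
    }


# Hypothetical labels for eight patients (illustration only, not study data).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]

counts = count_outcomes(y_true, y_pred)
metrics = classification_metrics(counts)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In a screening context, sensitivity and specificity are usually reported alongside accuracy because accuracy alone can look high when one class is much more common than the other.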

Item Type: Article
Keywords: Familial Hypercholesterolaemia (FH), Predicting FH, Machine learning algorithms, Tree-based classifier
Taxonomy: By Niche > Genome > Genomes Data Processing
By Niche > Genome > Human Genome Research
Local Content Hub: Niche > Genome
Depositing User: Hazrul Amir Tomyang (Puncak Alam)
Date Deposited: 08 Sep 2023 03:53
Last Modified: 08 Sep 2023 03:53