TY - JOUR
T1 - Early Diagnosis of Thyroid Cancer Diseases Using Computational Intelligence Techniques: A Case Study of a Saudi Arabian Dataset
AU - Olatunji, Sunday O.
AU - Alotaibi, Sarah
AU - Almutairi, Ebtisam
AU - Alrabae, Zainab
AU - Almajid, Yasmeen
AU - Altabee, Rahaf
AU - Altassan, Mona
AU - Basheer Ahmed, Mohammed Imran
AU - Farooqui, Mehwash
AU - Alhiyafi, Jamal
N1 - Four computational intelligence models developed for predicting the existence of Thyroid Cancer disease preemptively. * The computational intelligence models were implemented on a Saudi Arabian dataset. * The models achieved promising results with Random Forest technique achieving the best result. * The best results were obtained using techniques' optimized parameters setting with recursive feature elimination approach.
PY - 2021/4
Y1 - 2021/4
N2 - In recent times, researchers have noticed that chronic diseases have become more common. In the Kingdom of Saudi Arabia, the number of patients with thyroid cancer (TC) has become a concern, necessitating a proactive system that can help cut down the incidence of this disease, where the system can assist in early interventions to prevent or cure the disease. In this paper, we introduce our work developing machine learning-based tools that can serve as early warning systems by detecting TC at very early stages (pre-symptomatic stage). In addition, we aimed at obtaining the greatest possible accuracy while using fewer features. It must be noted that while there have been past efforts to use machine learning in predicting TC, this is the first attempt using a Saudi Arabian dataset as well as targeting diagnosis in the pre-symptomatic stage (pre-emptive diagnosis). The techniques used in this work include random forest (RF), artificial neural network (ANN), support vector machine (SVM), and naïve Bayes (NB), each of which was selected for their unique capabilities. The highest accuracy rate obtained was 90.91% with the RF technique, while SVM, ANN, and NB achieved 84.09%, 88.64%, and 81.82% accuracy, respectively. These levels were obtained by using only seven features out of an available 15. Considering the pattern of the obtained results, it is clear that the RF technique is better and, hence, recommended for this specific problem.
AB - In recent times, researchers have noticed that chronic diseases have become more common. In the Kingdom of Saudi Arabia, the number of patients with thyroid cancer (TC) has become a concern, necessitating a proactive system that can help cut down the incidence of this disease, where the system can assist in early interventions to prevent or cure the disease. In this paper, we introduce our work developing machine learning-based tools that can serve as early warning systems by detecting TC at very early stages (pre-symptomatic stage). In addition, we aimed at obtaining the greatest possible accuracy while using fewer features. It must be noted that while there have been past efforts to use machine learning in predicting TC, this is the first attempt using a Saudi Arabian dataset as well as targeting diagnosis in the pre-symptomatic stage (pre-emptive diagnosis). The techniques used in this work include random forest (RF), artificial neural network (ANN), support vector machine (SVM), and naïve Bayes (NB), each of which was selected for their unique capabilities. The highest accuracy rate obtained was 90.91% with the RF technique, while SVM, ANN, and NB achieved 84.09%, 88.64%, and 81.82% accuracy, respectively. These levels were obtained by using only seven features out of an available 15. Considering the pattern of the obtained results, it is clear that the RF technique is better and, hence, recommended for this specific problem.
KW - Artificial neural network
KW - Diagnose
KW - Decision trees
KW - Ensemble methods
KW - Machine learning
KW - Naïve bayesian
KW - Random forest
KW - Support vector machine
KW - Thyroid cancer disease
UR - https://www.sciencedirect.com/science/article/pii/S0010482521000615?via%3Dihub
U2 - 10.1016/j.compbiomed.2021.104267
DO - 10.1016/j.compbiomed.2021.104267
M3 - Article
VL - 131
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
ER -