Skip to main content

2024 | OriginalPaper | Buchkapitel

An Ensemble Machine Learning Approach with Hybrid Feature Selection Technique to Detect Thyroid Disease

verfasst von : Priyanka Roy, Fahim Mohammad Sadique Srijon, Mahmudul Hasan, Pankaj Bhowmik, Adiba Mahjabin Nitu

Erschienen in: Proceedings of the 2nd International Conference on Big Data, IoT and Machine Learning

Verlag: Springer Nature Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Thyroid disease is a prevalent health problem that requires early detection for effective treatment. However, there is no universal model for detecting thyroid abnormalities efficiently. This study proposes a three-layer thyroid disease detection framework that utilizes different feature engineering techniques to explore various thyroid datasets, improving model performance. We propose a hybrid framework to identify the most relevant features utilizing feature selection techniques contributing significantly to the model’s performance. We evaluate the proposed bagging XGBoost ensemble model’s performance against K-nearest neighbors, extreme learning machines, and random forest classifiers. It surpassed all with 98.44% accuracy with only 53.57% of the dataset’s features. The proposed model outperformed other classifiers with 98.65% accuracy during cross-validation on the second dataset. Stress testing with train–test split ratios of 70–30 and 30–70 produced 94.92% and 94.68% accuracy rates, respectively, that also outperform the benchmark ML models. These findings have significant implications for improving thyroid disease diagnosis using machine learning techniques.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
4.
Zurück zum Zitat Keestra S, Högqvist Tabor V, Alvergne A (2020) Reinterpreting patterns of variation in human thyroid function. Evol Med Publ Health 9:93–112CrossRef Keestra S, Högqvist Tabor V, Alvergne A (2020) Reinterpreting patterns of variation in human thyroid function. Evol Med Publ Health 9:93–112CrossRef
5.
Zurück zum Zitat Vijayalakshmi K, Dheeraj S, Deepthi B (2018) Intelligent thyroid prediction system using big data. Int J Comput Sci Eng 6:326–331 Vijayalakshmi K, Dheeraj S, Deepthi B (2018) Intelligent thyroid prediction system using big data. Int J Comput Sci Eng 6:326–331
6.
Zurück zum Zitat Ippolito PP (2022) Hyperparameter tuning. Appl Data Sci Tour 231–251 Ippolito PP (2022) Hyperparameter tuning. Appl Data Sci Tour 231–251
7.
Zurück zum Zitat Almahshi HM, Almasri EA, Alquran H, Mustafa WA, Alkhayyat A (2022) Hypothyroidism prediction and detection using machine learning. In: 2022 5th international conference on engineering technology and its applications (IICETA) Almahshi HM, Almasri EA, Alquran H, Mustafa WA, Alkhayyat A (2022) Hypothyroidism prediction and detection using machine learning. In: 2022 5th international conference on engineering technology and its applications (IICETA)
8.
Zurück zum Zitat Shahajalal M, Rahman MM, Pranto SA, Ema RR, Islam T, Raihan M (2021) Utilization of machine learning algorithms for thyroid disease prediction. Adv Intell Syst Comput 57–69 Shahajalal M, Rahman MM, Pranto SA, Ema RR, Islam T, Raihan M (2021) Utilization of machine learning algorithms for thyroid disease prediction. Adv Intell Syst Comput 57–69
9.
Zurück zum Zitat Chaubey G, Bisen D, Arjaria S, Yadav V (2020) Thyroid disease prediction using machine learning approaches. Nat Acad Sci Lett 44:233–238CrossRef Chaubey G, Bisen D, Arjaria S, Yadav V (2020) Thyroid disease prediction using machine learning approaches. Nat Acad Sci Lett 44:233–238CrossRef
11.
Zurück zum Zitat Islam SS, Haque MS, Miah MSU, Sarwar TB, Nugraha R (2022) Application of machine learning algorithms to predict the thyroid disease risk: an experimental comparative study. PeerJ Comput Sci 8:e898CrossRef Islam SS, Haque MS, Miah MSU, Sarwar TB, Nugraha R (2022) Application of machine learning algorithms to predict the thyroid disease risk: an experimental comparative study. PeerJ Comput Sci 8:e898CrossRef
12.
Zurück zum Zitat Wu Y, Rao K, Liu J, Han C, Gong L, Chong Y, Liu Z, Xu X (2020) Machine learning algorithms for the prediction of central lymph node metastasis in patients with papillary thyroid cancer. Front Endocrinol 11 Wu Y, Rao K, Liu J, Han C, Gong L, Chong Y, Liu Z, Xu X (2020) Machine learning algorithms for the prediction of central lymph node metastasis in patients with papillary thyroid cancer. Front Endocrinol 11
13.
Zurück zum Zitat Rao A, Renuka B (2020) A machine learning approach to predict thyroid disease at early stages of diagnosis. In: 2020 IEEE international conference for innovation in technology (INOCON) Rao A, Renuka B (2020) A machine learning approach to predict thyroid disease at early stages of diagnosis. In: 2020 IEEE international conference for innovation in technology (INOCON)
14.
Zurück zum Zitat Zou Y, Shi Y, Sun F, Liu J, Guo Y, Zhang H, Lu X, Gong Y, Xia S (2021) Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: individual prediction using SHapley additive exPlanations. SSRN Electron J Zou Y, Shi Y, Sun F, Liu J, Guo Y, Zhang H, Lu X, Gong Y, Xia S (2021) Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: individual prediction using SHapley additive exPlanations. SSRN Electron J
15.
Zurück zum Zitat Shankar K, Lakshmanaprabu SK, Gupta D, Maseleno A, de Albuquerque VHC (2018) Retracted ARTICLE: optimal feature-based multi-kernel SVM approach for thyroid disease classification. J Supercomput 76:1128–1143CrossRef Shankar K, Lakshmanaprabu SK, Gupta D, Maseleno A, de Albuquerque VHC (2018) Retracted ARTICLE: optimal feature-based multi-kernel SVM approach for thyroid disease classification. J Supercomput 76:1128–1143CrossRef
16.
Zurück zum Zitat Zhao F, Zhang H, Cheng D, Wang W, Li Y, Wang Y, Lu D, Dong C, Ren D, Yang L (2022) Predicting the risk of nodular thyroid disease in coal miners based on different machine learning models. Front Med 9 Zhao F, Zhang H, Cheng D, Wang W, Li Y, Wang Y, Lu D, Dong C, Ren D, Yang L (2022) Predicting the risk of nodular thyroid disease in coal miners based on different machine learning models. Front Med 9
17.
Zurück zum Zitat Chaganti R, Rustam F, De La Torre Díez I, Mazón JLV, Rodríguez CL, Ashraf I (2022) Thyroid disease prediction using selective features and machine learning techniques. Cancers 14:3914CrossRef Chaganti R, Rustam F, De La Torre Díez I, Mazón JLV, Rodríguez CL, Ashraf I (2022) Thyroid disease prediction using selective features and machine learning techniques. Cancers 14:3914CrossRef
18.
Zurück zum Zitat Dharmarajan K, Balasree K, Arunachalam AS, Abirmai K (2020) Thyroid disease classification using decision tree and SVM. Indian J Publ Health Res Dev Dharmarajan K, Balasree K, Arunachalam AS, Abirmai K (2020) Thyroid disease classification using decision tree and SVM. Indian J Publ Health Res Dev
19.
Zurück zum Zitat Bhowmik P, Bhowmik PC (2022) A machine learning approach for phishing websites prediction with novel feature selection framework. In: Hossain S, Hossain MS, Kaiser MS, Majumder SP, Ray K (eds) Proceedings of international conference on fourth industrial revolution and beyond 2021. Springer, Singapore, pp 357–370 Bhowmik P, Bhowmik PC (2022) A machine learning approach for phishing websites prediction with novel feature selection framework. In: Hossain S, Hossain MS, Kaiser MS, Majumder SP, Ray K (eds) Proceedings of international conference on fourth industrial revolution and beyond 2021. Springer, Singapore, pp 357–370
20.
Zurück zum Zitat Cormen TH, Leiserson CE, Rivest RL, Stein C (2001) Introduction to algorithms Cormen TH, Leiserson CE, Rivest RL, Stein C (2001) Introduction to algorithms
21.
Zurück zum Zitat Han J, Kamber M, Pei J (2012) Data mining concepts and techniques, 3rd edn Han J, Kamber M, Pei J (2012) Data mining concepts and techniques, 3rd edn
22.
Zurück zum Zitat Raschka S, Mirjalili V (2017) Python machine learning: machine learning and deep learning with Python, scikit-learn, and TensorFlow, 2nd edn. Packt Publishing Raschka S, Mirjalili V (2017) Python machine learning: machine learning and deep learning with Python, scikit-learn, and TensorFlow, 2nd edn. Packt Publishing
23.
24.
Zurück zum Zitat Deng X, Ye A, Zhong J, Xu D, Yang W, Song Z, Zhang Z, Guo J, Wang T, Tian Y, Pan H, Zhang Z, Wang H, Wu C, Shao J, Chen X (2022) Bagging-XGBoost algorithm based extreme weather identification and short-term load forecasting model. Energy Rep 8:8661–8674CrossRef Deng X, Ye A, Zhong J, Xu D, Yang W, Song Z, Zhang Z, Guo J, Wang T, Tian Y, Pan H, Zhang Z, Wang H, Wu C, Shao J, Chen X (2022) Bagging-XGBoost algorithm based extreme weather identification and short-term load forecasting model. Energy Rep 8:8661–8674CrossRef
Metadaten
Titel
An Ensemble Machine Learning Approach with Hybrid Feature Selection Technique to Detect Thyroid Disease
verfasst von
Priyanka Roy
Fahim Mohammad Sadique Srijon
Mahmudul Hasan
Pankaj Bhowmik
Adiba Mahjabin Nitu
Copyright-Jahr
2024
Verlag
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-99-8937-9_26

Premium Partner