Towards Explainable Machine Learning in Early Diabetes Prediction

Ingrid Andersen

Full Article

Published: May 20, 2024

Ingrid Andersen

University of Copenhagen, Denmark

Abstract

This study explores the application of explainable machine learning techniques in the early prediction of diabetes, aiming to bridge the gap between complex model predictions and their interpretability in clinical settings. Utilizing the Pima Indian Diabetes Dataset, we preprocess data by addressing missing values, normalizing features, and selecting clinically relevant variables. Interpretable models such as decision trees, logistic regression, and rule-based classifiers are employed to provide transparent insights into diabetes risk factors. Techniques like SHAP values and LIME are integrated to further elucidate individual predictions, enhancing the clarity and trustworthiness of the models. The results indicate that high glucose levels and BMI are significant predictors of diabetes, aligning with clinical expectations. Through case studies, we demonstrate how these explainable models support informed clinical decisions, offering a robust framework for early diabetes detection and personalized patient care. This research underscores the importance of interpretability in machine learning, promoting its adoption in healthcare for better patient outcomes and more reliable diagnostic tools.

Downloads

Download data is not yet available.

Issue

Vol. 3 No. 1 (2024)

Section

Articles

This work is licensed under a Creative Commons Attribution 4.0 International License.

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details