Show simple item record

dc.contributor.advisorSagahyroon, Assim
dc.contributor.advisorAloul, Fadi
dc.contributor.advisorPasquier, Michel
dc.contributor.authorJian, Yazan Khaled
dc.date.accessioned2022-01-25T07:45:06Z
dc.date.available2022-01-25T07:45:06Z
dc.date.issued2021-12
dc.identifier.other35.232-2021.52
dc.identifier.urihttp://hdl.handle.net/11073/21599
dc.descriptionA Master of Science thesis in Computer Engineering by Yazan Khaled Jian entitled, “A Machine Learning Approach to Predicting Diabetes Complications”, submitted in December 2021. Thesis advisor is Dr. Assim Sagahyroon and thesis co-advisors are Dr. Fadi Aloul and Dr. Michel Pasquier. Soft copy is available (Thesis, Completion Certificate, Approval Signatures, and AUS Archives Consent Form).en_US
dc.description.abstractMachine learning and data mining techniques have been widely used over the years to extract knowledge from data. The goal of this thesis is to study several diabetes complications. Diabetes Mellitus (DM) is a chronic disease that is considered to be life threatening. It can affect any part of the body over time resulting in more serious complications such as impacts on eyesight, perception, motor control and more. To study diabetes complications, a dataset collected by the Rashid Centre for Diabetes and Research (RCDR) located in Ajman, UAE was utilized. The dataset consists of 884 records with 79 features and 8 complications. The complications’ set contains metabolic syndrome, dyslipidemia, neuropathy, nephropathy, diabetic foot, hypertension, obesity, and retinopathy. Some essential preprocessing steps were needed to handle the missing values and imbalanced data problems. Moreover, several techniques were used to study the problem in hand. The first part of this thesis focused on generating association rules from the dataset using unsupervised learning techniques. This step was essential to extract valuable knowledge and relations between several attributes in the dataset and helped to develop a better understanding of DM and its complications. For instance, we extracted several rules indicating some possible relations between metabolic syndrome, hypertension and dyslipidemia. Further preprocessing steps were needed such as data discretization. For the second part of the research, different supervised classification algorithms were utilized to build several models to predict and diagnose eight diabetes complications. Furthermore, feature selection was performed to select the top 5 and 10 features for each complication. Repeated stratified k-fold cross validation was employed for a better estimation of the performance with a k=10 and a total of 10 repetitions. Accuracy and F1-score were used to evaluate the models’ performance reaching a maximum of 97.8% and 97.7% for accuracy and F1-scores, respectively.en_US
dc.description.sponsorshipDepartment of Computer Science and Engineeringen_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesMaster of Science in Computer Engineering (MSCoE)en_US
dc.subjectDiabetes Predictionen_US
dc.subjectDiabetes Complicationsen_US
dc.subjectSupervised Learningen_US
dc.subjectAssociation Rule Miningen_US
dc.titleA Machine Learning Approach to Predicting Diabetes Complicationsen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record