IJCATR Volume 13 Issue 8

Role of Advanced Data Analytics in Higher Education: Using Machine Learning Models to Predict Student Success

Harold Tobias Adu-Twum, Emmanuel Adu Sarfo, Evans Nartey, Adesola Adetunji, Adebowale Olufemi Ayannusi, Thomas Andrew Walugembe
10.7753/IJCATR1308.1006
keywords : Predicting Student Dropout, Binary Classification, Machine Learning in Higher Education, Boosting Methods, Advance Data in Higher Education, Student Retention

PDF
This research article explores the pressing issue of college student dropout, employing a suite of predictive modeling techniques to forecast students' likelihood of discontinuing their studies. In an era where higher education plays a crucial role in individual career prospects and societal progress, understanding and mitigating dropout rates is of paramount importance. We leverage a comprehensive dataset, incorporating demographic, socioeconomic, academic, and financial factors, to train and test predictive models: Logistic Regression, Random Forest, Decision Tree Classifier, Support Vector Machine (SVM), and Gradient Boosting. Our analysis reveals that the gradient boosting model outperforms its counterparts in predicting student dropout, achieving the highest precision, recall, and F1 score among the evaluated models. Gradient boosting model correctly identified 94.4% of the student as dropouts. This superiority underscores the gradient boosting’s robustness in handling the complex, multidimensional nature of factors influencing college retention. Moreover, it was found that circular units approved in the first semester is the most important factor in determining student success in terms of graduating or dropping out. This is then followed by tuition fees up to date. The previous qualification of students has the lowest predictive power indicating that current circumstances of students – academic courses, and finances contributes to students’ success more. The findings underscore the potential of machine learning in crafting targeted interventions to support at-risk students, thereby enhancing retention rates and contributing to the broader objectives of educational equity and success.
@artical{h1382024ijcatr13081006,
Title = "Role of Advanced Data Analytics in Higher Education: Using Machine Learning Models to Predict Student Success ",
Journal ="International Journal of Computer Applications Technology and Research(IJCATR)",
Volume = "13",
Issue ="8",
Pages ="54 - 61",
Year = "2024",
Authors ="Harold Tobias Adu-Twum, Emmanuel Adu Sarfo, Evans Nartey, Adesola Adetunji, Adebowale Olufemi Ayannusi, Thomas Andrew Walugembe"}
  • Our findings demonstrate that the gradient boosting model outperforms other models in predicting student dropout, achieving high precision, recall, and F1 scores.
  • Through targeted interventions informed by our predictive modeling, educational institutions can enhance retention rates and contribute to broader objectives of educational equity and success.