Machine Learning Models for Predicting Olympic Medal Outcomes

Authors : Raiyan Sayeed, Mohammed Tanvir Hassan, Md. Naimur Rahman, Faiza Binte Zaman, Sabbir Ahmed, Md Saef Ullah Miah

DOI : 10.1109/iatmsi64286.2025.10984687

Volume : 3

Issue : 1

Year : 2025

Page No : 1-6

In this paper, the prediction of Olympic medal winners has been explored using various machine learning models, utilizing a dataset spanning 128 years of Olympic history from 1896 to 2024. Thirteen Machine Learning models - Logistic Regression, Polynomial Logistic Regression, XGBoost, Random Forest, KNN, Naive Bayes, LightGBM, AdaBoost, Decision Tree, Extra Trees, Gradient Boosting, Neural Network, and SVM were used and evaluated. The dataset was preprocessed, and models were trained and tested to predict future medal outcomes. Superior performance metrics were demonstrated by ensemble models such as XGBoost, LightGBM, and Gradient Boosting with accuracy rates of 83%, 84%, and 84% respectively, and notable AUC values. Conversely, lower performances were exhibited by simpler models like Logistic Regression, Polynomial Logistic Regression, and SVM. Discrepancies in prediction graphs suggested potential issues in model training or dataset encoding. Future research will focus on integrating additional features and more sophisticated techniques to enhance model performance and prediction accuracy. The findings are anticipated to contribute significantly to the field of sports analytics and assist national Olympic committees in strategic planning and resource allocation for future Olympic Games.


Citation Data