Research Article Open Access

Gradient Boosting for Heart Stroke Prediction: Investigating Unexpected Risk Factors

Aniket Kailas Shahade1 and Priyanka V. Deshmukh1
  • 1 Department of AI & ML, Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune, India

Abstract

Heart stroke prediction is a critical area in healthcare, aiming to identify individuals at risk and provide timely intervention. This research leverages machine learning algorithms, including Decision Tree, Random Forest, AdaBoost, and Gradient Boost, to predict the likelihood of stroke, with Gradient Boosting delivering the most accurate results. Our analysis uncovers intriguing and unexpected relationships between stroke risk and various factors such as heart disease, hypertension, and smoking habits. Contrary to conventional wisdom, our findings suggest that individuals with lower incidences of hypertension and heart disease exhibit increased stroke risk. Additionally, non-smokers appear to have a higher likelihood of experiencing a stroke compared to smokers. Furthermore, Body Mass Index (BMI), marital status, residence type, and work type also significantly influence stroke risk. These anomalous findings necessitate further investigation to understand the underlying causes and implications. This study highlights the importance of using advanced machine learning techniques to uncover complex patterns in health data, which can lead to more effective prevention strategies.

Journal of Computer Science
Volume 21 No. 1, 2025, 124-133

DOI: https://doi.org/10.3844/jcssp.2025.124.133

Submitted On: 29 August 2024 Published On: 16 December 2024

How to Cite: Shahade, A. K. & Deshmukh, P. V. (2025). Gradient Boosting for Heart Stroke Prediction: Investigating Unexpected Risk Factors. Journal of Computer Science, 21(1), 124-133. https://doi.org/10.3844/jcssp.2025.124.133

  • 169 Views
  • 82 Downloads
  • 0 Citations

Download

Keywords

  • Heart Stroke Prediction
  • Gradient Boosting
  • Machine Learning
  • Hypertension
  • Heart Disease
  • Smoking
  • Body Mass Index
  • Demographic Factors
  • Health Data Analysis
  • Risk Factors