Interdisciplinary

AI Reveals Key Factors Driving Maternal Healthcare Access in Bangladesh

AI Insight

This study used machine learning models to identify key factors affecting postnatal care utilization in Bangladesh, analyzing data from the 2022 Bangladesh Demographic and Health Survey. Random Forest, XGBoost, and CatBoost models substantially outperformed traditional logistic regression, achieving AUC scores above 0.90 compared to 0.85. SHAP analysis revealed that delivery location, husband's occupation, rural residence, wealth index, and media exposure were the most important predictors of whether mothers accessed postnatal care services.


The findings provide evidence-based guidance for improving maternal healthcare in Bangladesh, suggesting targeted interventions such as promoting facility-based deliveries, enhancing maternal education, addressing economic disparities, and expanding health-related media coverage in rural areas. The demonstrated superiority of machine learning approaches over conventional statistical methods indicates these techniques could improve health policy planning in resource-limited settings.


by Amartay Kumar Dhar, Sharmin Akther, Farhana Akter Bina

Postnatal care (PNC) plays a crucial role in minimizing maternal and neonatal morbidity and mortality, but the uptake of services in Bangladesh remains below the recommended level. Although logistic regression has been widely used, it may miss complex nonlinear interactions among social, economic, and healthcare factors. This study contributes to the body of knowledge by using machine learning (ML) to identify the most significant determinants of PNC and to enhance prediction accuracy. We compared logistic regression to several ML models, including Random Forest, XGBoost, CatBoost, Support Vector Machine, AdaBoost, and Gradient Boosting, using nationally representative data from the 2022 Bangladesh Demographic and Health Survey (BDHS) with ADASYN oversampling to correct class imbalance. Among all models, Random Forest achieved the highest AUC (0.9050), closely followed by XGBoost (0.9036) and CatBoost (0.9028), all of which substantially outperformed logistic regression (AUC = 0.8470). SHAP analysis of the Random Forest model indicated that delivery place, husband’s occupation, rural residence, wealth index, and media exposure were the most influential predictors of PNC utilization, alongside maternal education, women’s occupation, and age-related factors. The results indicate that ML is more effective than classical procedures for revealing latent patterns and making accurate predictions. Policy implications include encouraging facility-based deliveries, improving maternal education, reducing wealth disparities, and enhancing media coverage of health, particularly among rural and low-income groups. This paper not only identifies key drivers of PNC in Bangladesh but also demonstrates how ML can supplement traditional methods to reinforce maternal health policy and interventions.

Source: Determinants of maternal postnatal care utilization in Bangladesh: A machine learning and SHAP-based analysis of BDHS 2022 data