Bias Mitigation in Clinical AI: Auditing Race/Gender Disparities in Sepsis Prediction Models
Keywords:
Clinical AI fairness, Sepsis prediction, Algorithmic bias, Healthcare equity, Bias mitigation strategiesAbstract
This paper exposes persistent race and gender biases in AI-based sepsis prediction models, arguing that these inequities undermine patient outcomes and demanding prioritization of fairness as a core clinical metric. An audit of multiple AI tools in urban hospitals revealed consistent accuracy gaps, notably more false negatives for Black, Hispanic, female, and non-binary patients, which delayed care and worsened clinical results. These disparities stem from imbalanced training data, distance miscalibration, and structural inequities embedded in clinical practice. The manuscript surveys algorithmic bias types, presents audit frameworks (Fairlearn, Aequitas), and evaluates mitigation strategies such as data rebalancing, fair regularization, threshold adjustment, and explainable tools like SHAP and LIME. It further argues that implementing AI in healthcare must be grounded in the ethical imperatives of beneficence, non-maleficence, and justice. Future research will focus on intersectional bias analysis and prospective audits integrated into electronic health records. The findings attribute an immediate need for institutional responsibility towards facilitating clinical AI systems that promote health equity among all populations.
References
Brender, N., Yzeiraj, B., & Fragniere, E. (2015). The management audit as a tool to foster corporate governance: an inquiry in Switzerland. Managerial Auditing Journal, 30(8/9), 785-811. https://doi.org/10.1108/MAJ-03-2014-1013
Buolamwini, J. A. (2017). Gender shades: intersectional phenotypic and demographic evaluation of face datasets and gender classifiers (Doctoral dissertation, Massachusetts Institute of Technology).
Chan, C. W., Farias, V. F., & Escobar, G. J. (2017). The impact of delays on service times in the intensive care unit. Management Science, 63(7), 2049-2072. https://doi.org/10.1287/mnsc.2016.2441
Chavan, A. (2021). Exploring event-driven architecture in microservices: Patterns, pitfalls, and best practices. International Journal of Software and Research Analysis. https://ijsra.net/content/exploring-event-driven-architecture-microservices-patterns-pitfalls-and-best-practices
Chavan, A. (2022). Importance of identifying and establishing context boundaries while migrating from monolith to microservices. Journal of Engineering and Applied Sciences Technology, 4, E168. http://doi.org/10.47363/JEAST/2022(4)E168
Chimakonam, J. O., & Ofana, D. E. (2022). How intercultural philosophy can contribute to social integration. Journal of Intercultural Studies, 43(5), 606-620. https://doi.org/10.1080/07256868.2022.2063824
Elias, A., & Paradies, Y. (2021). The costs of institutional racism and its ethical implications for healthcare. Journal of bioethical inquiry, 18(1), 45-58. https://link.springer.com/article/10.1007/s11673-020-10073-0
Grant, M., Wilford, A., Haskins, L., Phakathi, S., Mntambo, N., & Horwood, C. M. (2017). Trust of community health workers influences the acceptance of community-based maternal and child health services. African Journal of Primary Health Care and Family Medicine, 9(1), 1-8. https://hdl.handle.net/10520/EJC-96ce469f4
Gumede, W., Bob, U., de Beer, D., Lues, R., & Anelich, L. (2020). Position paper: priority setting for interventions in pre-and post-pandemic management: the case of covid-19. https://www.anelichconsulting.co.za/wp-content/uploads/2020/06/SATN-COVID-19-Position-Paper.pdf
Konneru, N. M. K. (2021). Integrating security into CI/CD pipelines: A DevSecOps approach with SAST, DAST, and SCA tools. International Journal of Science and Research Archive. Retrieved from https://ijsra.net/content/role-notification-scheduling-improving-patient
Kumar, A. (2019). The convergence of predictive analytics in driving business intelligence and enhancing DevOps efficiency. International Journal of Computational Engineering and Management, 6(6), 118-142. Retrieved from https://ijcem.in/wp-content/uploads/THE-CONVERGENCE-OF-PREDICTIVE-ANALYTICS-IN-DRIVING-BUSINESS-INTELLIGENCE-AND-ENHANCING-DEVOPS-EFFICIENCY.pdf
Lesko, C. R., Buchanan, A. L., Westreich, D., Edwards, J. K., Hudgens, M. G., & Cole, S. R. (2017). Generalizing study results: a potential outcomes perspective. Epidemiology, 28(4), 553-561. https://journals.lww.com/epidem/toc/2017/07000
Lionetti, F., Aron, A., Aron, E. N., Burns, G. L., Jagiellowicz, J., & Pluess, M. (2018). Dandelions, tulips and orchids: Evidence for the existence of low-sensitive, medium-sensitive and high-sensitive individuals. Translational psychiatry, 8(1), 24. https://www.nature.com/articles/s41398-017-0090-6
Lu, J., Sattler, A., Wang, S., Khaki, A. R., Callahan, A., Fleming, S., ... & Shah, N. H. (2022). Considerations in the reliability and fairness audits of predictive models for advance care planning. Frontiers in Digital Health, 4, 943768. https://doi.org/10.3389/fdgth.2022.943768
Maney, D. L. (2016). Perils and pitfalls of reporting sex differences. Philosophical Transactions of the Royal Society B: Biological Sciences, 371(1688), 20150119. https://doi.org/10.1098/rstb.2015.0119
Marlow, S., & Swail, J. (2014). Gender, risk and finance: why can't a woman be more like a man?. Entrepreneurship & Regional Development, 26(1-2), 80-96. https://doi.org/10.1080/08985626.2013.860484
Mazurenko, O., Richter, J., Swanson-Kazley, A., & Ford, E. (2016). Examination of the relationship between management and clinician agreement on communication openness, teamwork, and patient satisfaction in the US hospitals. Journal of Hospital Administration, 5(4), 20-27. http://dx.doi.org/10.5430/jha.v5n4p20
Nyati, S. (2018). Transforming telematics in fleet management: Innovations in asset tracking, efficiency, and communication. International Journal of Science and Research (IJSR), 7(10), 1804-1810. Retrieved from https://www.ijsr.net/getabstract.php?paperid=SR24203184230
Prescott, S. L., & Logan, A. C. (2018). From authoritarianism to advocacy: lifestyle-driven, socially-transmitted conditions require a transformation in medical training and practice. Challenges, 9(1), 10. https://doi.org/10.3390/challe9010010
Raju, R. K. (2017). Dynamic memory inference network for natural language inference. International Journal of Science and Research (IJSR), 6(2). https://www.ijsr.net/archive/v6i2/SR24926091431.pdf
Sardana, J. (2022). Scalable systems for healthcare communication: A design perspective. International Journal of Science and Research Archive. https://doi.org/10.30574/ijsra.2022.7.2.0253
Sardana, J. (2022). The role of notification scheduling in improving patient outcomes. International Journal of Science and Research Archive. Retrieved from https://ijsra.net/content/role-notification-scheduling-improving-patient
Singh, V. (2022). Integrating large language models with computer vision for enhanced image captioning: Combining LLMS with visual data to generate more accurate and context-rich image descriptions. Journal of Artificial Intelligence and Computer Vision, 1(E227). http://doi.org/10.47363/JAICC/2022(1)E227
Singh, V. (2022). Visual question answering using transformer architectures: Applying transformer models to improve performance in VQA tasks. Journal of Artificial Intelligence and Cognitive Computing, 1(E228). https://doi.org/10.47363/JAICC/2022(1)E228
Slobogean, G. P., Giannoudis, P. V., Frihagen, F., Forte, M. L., Morshed, S., & Bhandari, M. (2015). Bigger data, bigger problems. Journal of orthopaedic trauma, 29, S43-S46. https://journals.lww.com/jorthotrauma/toc/2015/12001
Sukhadiya, J., Pandya, H., & Singh, V. (2018). Comparison of Image Captioning Methods. INTERNATIONAL JOURNAL OF ENGINEERING DEVELOPMENT AND RESEARCH, 6(4), 43-48. https://rjwave.org/ijedr/papers/IJEDR1804011.pdf
Weiss, D., & Eikemo, T. A. (2017). Technological innovations and the rise of social inequalities in health. Scandinavian journal of public health, 45(7), 714-719. https://doi.org/10.1177/1403494817711371
Yanisky-Ravid, S., & Hallisey, S. (2018). ‘Equality and Privacy by Design’: Ensuring Artificial Intelligence (AI) Is Properly Trained & Fed: A New Model of AI Data Transparency & Certification As Safe Harbor Procedures. Available at SSRN 3278490. https://dx.doi.org/10.2139/ssrn.3278490
Yoon, J., Zame, W. R., & Van Der Schaar, M. (2018). Estimating missing data in temporal data streams using multi-directional recurrent neural networks. IEEE Transactions on Biomedical Engineering, 66(5), 1477-1490.
Zhang, A., Xing, L., Zou, J., & Wu, J. C. (2022). Shifting machine learning for healthcare from development to deployment and from models to data. Nature biomedical engineering, 6(12), 1330-1345. https://www.nature.com/articles/s41551-022-00898-y
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Kawaljeet Singh Chadha

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain the copyright of their articles published in this journal. All articles are licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly cited.