Bias Mitigation in Clinical AI: Auditing Race/Gender Disparities in Sepsis Prediction Models

Kawaljeet Singh Chadha

Authors

Kawaljeet Singh Chadha Business Analyst II MI, McLaren Health Care, TX, USA

Keywords:

Clinical AI fairness, Sepsis prediction, Algorithmic bias, Healthcare equity, Bias mitigation strategies

Abstract

This paper exposes persistent race and gender biases in AI-based sepsis prediction models, arguing that these inequities undermine patient outcomes and demanding prioritization of fairness as a core clinical metric. An audit of multiple AI tools in urban hospitals revealed consistent accuracy gaps, notably more false negatives for Black, Hispanic, female, and non-binary patients, which delayed care and worsened clinical results. These disparities stem from imbalanced training data, distance miscalibration, and structural inequities embedded in clinical practice. The manuscript surveys algorithmic bias types, presents audit frameworks (Fairlearn, Aequitas), and evaluates mitigation strategies such as data rebalancing, fair regularization, threshold adjustment, and explainable tools like SHAP and LIME. It further argues that implementing AI in healthcare must be grounded in the ethical imperatives of beneficence, non-maleficence, and justice. Future research will focus on intersectional bias analysis and prospective audits integrated into electronic health records. The findings attribute an immediate need for institutional responsibility towards facilitating clinical AI systems that promote health equity among all populations.

References

Brender, N., Yzeiraj, B., & Fragniere, E. (2015). The management audit as a tool to foster corporate governance: an inquiry in Switzerland. Managerial Auditing Journal, 30(8/9), 785-811. https://doi.org/10.1108/MAJ-03-2014-1013

Buolamwini, J. A. (2017). Gender shades: intersectional phenotypic and demographic evaluation of face datasets and gender classifiers (Doctoral dissertation, Massachusetts Institute of Technology).

Chan, C. W., Farias, V. F., & Escobar, G. J. (2017). The impact of delays on service times in the intensive care unit. Management Science, 63(7), 2049-2072. https://doi.org/10.1287/mnsc.2016.2441

Chavan, A. (2021). Exploring event-driven architecture in microservices: Patterns, pitfalls, and best practices. International Journal of Software and Research Analysis. https://ijsra.net/content/exploring-event-driven-architecture-microservices-patterns-pitfalls-and-best-practices

Chavan, A. (2022). Importance of identifying and establishing context boundaries while migrating from monolith to microservices. Journal of Engineering and Applied Sciences Technology, 4, E168. http://doi.org/10.47363/JEAST/2022(4)E168

Chimakonam, J. O., & Ofana, D. E. (2022). How intercultural philosophy can contribute to social integration. Journal of Intercultural Studies, 43(5), 606-620. https://doi.org/10.1080/07256868.2022.2063824

Elias, A., & Paradies, Y. (2021). The costs of institutional racism and its ethical implications for healthcare. Journal of bioethical inquiry, 18(1), 45-58. https://link.springer.com/article/10.1007/s11673-020-10073-0

Grant, M., Wilford, A., Haskins, L., Phakathi, S., Mntambo, N., & Horwood, C. M. (2017). Trust of community health workers influences the acceptance of community-based maternal and child health services. African Journal of Primary Health Care and Family Medicine, 9(1), 1-8. https://hdl.handle.net/10520/EJC-96ce469f4

Gumede, W., Bob, U., de Beer, D., Lues, R., & Anelich, L. (2020). Position paper: priority setting for interventions in pre-and post-pandemic management: the case of covid-19. https://www.anelichconsulting.co.za/wp-content/uploads/2020/06/SATN-COVID-19-Position-Paper.pdf

Konneru, N. M. K. (2021). Integrating security into CI/CD pipelines: A DevSecOps approach with SAST, DAST, and SCA tools. International Journal of Science and Research Archive. Retrieved from https://ijsra.net/content/role-notification-scheduling-improving-patient

Kumar, A. (2019). The convergence of predictive analytics in driving business intelligence and enhancing DevOps efficiency. International Journal of Computational Engineering and Management, 6(6), 118-142. Retrieved from https://ijcem.in/wp-content/uploads/THE-CONVERGENCE-OF-PREDICTIVE-ANALYTICS-IN-DRIVING-BUSINESS-INTELLIGENCE-AND-ENHANCING-DEVOPS-EFFICIENCY.pdf

Lesko, C. R., Buchanan, A. L., Westreich, D., Edwards, J. K., Hudgens, M. G., & Cole, S. R. (2017). Generalizing study results: a potential outcomes perspective. Epidemiology, 28(4), 553-561. https://journals.lww.com/epidem/toc/2017/07000

Lionetti, F., Aron, A., Aron, E. N., Burns, G. L., Jagiellowicz, J., & Pluess, M. (2018). Dandelions, tulips and orchids: Evidence for the existence of low-sensitive, medium-sensitive and high-sensitive individuals. Translational psychiatry, 8(1), 24. https://www.nature.com/articles/s41398-017-0090-6

Lu, J., Sattler, A., Wang, S., Khaki, A. R., Callahan, A., Fleming, S., ... & Shah, N. H. (2022). Considerations in the reliability and fairness audits of predictive models for advance care planning. Frontiers in Digital Health, 4, 943768. https://doi.org/10.3389/fdgth.2022.943768

Maney, D. L. (2016). Perils and pitfalls of reporting sex differences. Philosophical Transactions of the Royal Society B: Biological Sciences, 371(1688), 20150119. https://doi.org/10.1098/rstb.2015.0119

Marlow, S., & Swail, J. (2014). Gender, risk and finance: why can't a woman be more like a man?. Entrepreneurship & Regional Development, 26(1-2), 80-96. https://doi.org/10.1080/08985626.2013.860484

Mazurenko, O., Richter, J., Swanson-Kazley, A., & Ford, E. (2016). Examination of the relationship between management and clinician agreement on communication openness, teamwork, and patient satisfaction in the US hospitals. Journal of Hospital Administration, 5(4), 20-27. http://dx.doi.org/10.5430/jha.v5n4p20

Nyati, S. (2018). Transforming telematics in fleet management: Innovations in asset tracking, efficiency, and communication. International Journal of Science and Research (IJSR), 7(10), 1804-1810. Retrieved from https://www.ijsr.net/getabstract.php?paperid=SR24203184230

Prescott, S. L., & Logan, A. C. (2018). From authoritarianism to advocacy: lifestyle-driven, socially-transmitted conditions require a transformation in medical training and practice. Challenges, 9(1), 10. https://doi.org/10.3390/challe9010010

Raju, R. K. (2017). Dynamic memory inference network for natural language inference. International Journal of Science and Research (IJSR), 6(2). https://www.ijsr.net/archive/v6i2/SR24926091431.pdf

Sardana, J. (2022). Scalable systems for healthcare communication: A design perspective. International Journal of Science and Research Archive. https://doi.org/10.30574/ijsra.2022.7.2.0253

Sardana, J. (2022). The role of notification scheduling in improving patient outcomes. International Journal of Science and Research Archive. Retrieved from https://ijsra.net/content/role-notification-scheduling-improving-patient

Singh, V. (2022). Integrating large language models with computer vision for enhanced image captioning: Combining LLMS with visual data to generate more accurate and context-rich image descriptions. Journal of Artificial Intelligence and Computer Vision, 1(E227). http://doi.org/10.47363/JAICC/2022(1)E227

Singh, V. (2022). Visual question answering using transformer architectures: Applying transformer models to improve performance in VQA tasks. Journal of Artificial Intelligence and Cognitive Computing, 1(E228). https://doi.org/10.47363/JAICC/2022(1)E228

Slobogean, G. P., Giannoudis, P. V., Frihagen, F., Forte, M. L., Morshed, S., & Bhandari, M. (2015). Bigger data, bigger problems. Journal of orthopaedic trauma, 29, S43-S46. https://journals.lww.com/jorthotrauma/toc/2015/12001

Sukhadiya, J., Pandya, H., & Singh, V. (2018). Comparison of Image Captioning Methods. INTERNATIONAL JOURNAL OF ENGINEERING DEVELOPMENT AND RESEARCH, 6(4), 43-48. https://rjwave.org/ijedr/papers/IJEDR1804011.pdf

Weiss, D., & Eikemo, T. A. (2017). Technological innovations and the rise of social inequalities in health. Scandinavian journal of public health, 45(7), 714-719. https://doi.org/10.1177/1403494817711371

Yanisky-Ravid, S., & Hallisey, S. (2018). ‘Equality and Privacy by Design’: Ensuring Artificial Intelligence (AI) Is Properly Trained & Fed: A New Model of AI Data Transparency & Certification As Safe Harbor Procedures. Available at SSRN 3278490. https://dx.doi.org/10.2139/ssrn.3278490

Yoon, J., Zame, W. R., & Van Der Schaar, M. (2018). Estimating missing data in temporal data streams using multi-directional recurrent neural networks. IEEE Transactions on Biomedical Engineering, 66(5), 1477-1490.

Zhang, A., Xing, L., Zou, J., & Wu, J. C. (2022). Shifting machine learning for healthcare from development to deployment and from models to data. Nature biomedical engineering, 6(12), 1330-1345. https://www.nature.com/articles/s41551-022-00898-y

Frontiers in Emerging Computer Science and Information Technology

Article Details Page