A Robust Hybrid Cost Sensitive Stacking Ensemble Model for Hepatitis Survival Prediction and Clinical Decision Support

Muhammad Sam'an(1*), Farikhin Farikhin(2)


(1) Universitas Muhammadiyah Semarang
(2) Diponegoro University
(*) Corresponding Author

Abstract


Chronic hepatitis continues to pose a significant global health challenge, frequently advancing to liver cirrhosis and hepatocellular carcinoma if not managed with precise prognostic interventions. The capacity to accurately predict patient survival is essential for optimizing resource allocation and treatment planning. Although Machine Learning (ML) has shown promise in medical diagnostics, standard algorithms often underperform when applied to hepatitis datasets characterized by severe class imbalance and high dimensionality. Conventional models tend to bias predictions toward the majority class (survival), resulting in a high rate of False Negatives for the minority class (mortality), which is clinically unacceptable. Moreover, single-classifier approaches often lack the generalization capability necessary for robust clinical deployment. To address these deficiencies, this study proposes a Hybrid Cost-Sensitive Stacking Ensemble Model (HCS-SEM). The framework integrates three strategic components: (1) a rigorous Split-First Synthetic Minority Oversampling Technique (SMOTE) protocol to resolve class skewness without data leakage; (2) a Chi-Square feature ranking mechanism to eliminate redundant clinical attributes; and (3) a Two-Tier Stacking Architecture employing Random Forest, SVM, and Gradient Boosting as base learners, optimized by a Logistic Regression meta-learner. Experimental validation on the UCI Hepatitis dataset demonstrates that HCS-SEM significantly outperforms standalone classifiers and traditional ensemble methods. The model achieves superior performance metrics, particularly in Sensitivity and F1-Score, confirmed by the Friedman Rank Test and Nemenyi post-hoc analysis. These findings suggest that the proposed HCS-SEM provides a robust, clinically viable tool for hepatitis prognosis, offering high-precision decision support for medical practitioners managing high-risk patients.

Keywords


Hepatitis Prognosis; Stacking Ensemble; Cost-Sensitive Learning; SMOTE; Clinical Decision Support

Full Text:

PDF

References


Simmonds P, Bukh J, Combet C, Deléage G, Enomoto N, Feinstone S, et al. Consensus Proposals for

a Unified System of Nomenclature of Hepatitis C Virus Genotypes *. Hepatology. 2005 10;42:962-73.

Available from: https://doi.org/10.1002/hep.20819.

Gower E, Estes C, Blach S, Razavi-Shearer K, Razavi H. Global epidemiology and genotype distribution

of the hepatitis C virus infection. Journal of Hepatology. 2014 11;61:S45-57. Available from: https:

//doi.org/10.1016/j.jhep.2014.07.027.

Zuhdi N. ndonesia Termasuk 20 Negara dengan Angka Hepatitis yang Tertinggi Global. Me-

dia Indonesia. 2023. Available from: https://mediaindonesia.com/humaniora/581686/

indonesia-termasuk-20-negara-dengan-angka-hepatitis-yang-tertinggi-global#

goog_rewarded.

Chien RN, Kao JH, Peng CY, Chen CH, Liu CJ, Huang YH, et al. Taiwan consensus statement on the

management of chronic hepatitis B. Journal of the Formosan Medical Association. 2019 1;118:7-38.

Available from: https://doi.org/10.1016/j.jfma.2018.11.008.

Hoffmann G, Bietenbeck A, Lichtinghagen R, Klawonn F. Using machine learning techniques to generate

laboratory diagnostic pathways—a case study. Journal of Laboratory and Precision Medicine. 2018

;3:58-8. Available from: https://doi.org/10.21037/jlpm.2018.06.01.

Wong GLH, Hui VWK, Tan Q, Xu J, Lee HW, Yip TCF, et al. Novel machine learning models outperform

risk scores in predicting hepatocellular carcinoma in patients with chronic viral hepatitis. JHEP Reports.

3;4:100441. Available from: https://doi.org/10.1016/j.jhepr.2022.100441.

Yağanoğlu M. Hepatitis C virus data analysis and prediction using machine learning. Data & Knowl-

edge Engineering. 2022 11;142:102087. Available from: https://doi.org/10.1016/j.datak.2022.

Ali N, Srivastava D, Tiwari A, Pandey A, Pandey AK, Sahu A. Predicting Life Expectancy of Hepatitis B

Patients using Machine Learning. In: 2022 IEEE International Conference on Distributed Computing

and Electrical Circuits and Electronics (ICDCECE). IEEE; 2022. p. 1-4. Available from: https://doi.

org/10.1109/ICDCECE53908.2022.9793025.

El-Salam SMA, Ezz MM, Hashem S, Elakel W, Salama R, ElMakhzangy H, et al. Performance of machine

learning approaches on prediction of esophageal varices for Egyptian chronic hepatitis C patients.

Informatics in Medicine Unlocked. 2019;17:100267. Available from: https://doi.org/10.1016/j.

imu.2019.100267.

Obaido G, Ogbuokiri B, Swart TG, Ayawei N, Kasongo SM, Aruleba K, et al. An Interpretable Machine

Learning Approach for Hepatitis B Diagnosis. Applied Sciences. 2022 11;12:11127. Available from:

https://doi.org/10.3390/app122111127.

Elreedy D, Atiya AF. A Comprehensive Analysis of Synthetic Minority Oversampling Technique

(SMOTE) for handling class imbalance. Information Sciences. 2019 12;505:32-64. Available from:

https://doi.org/10.1016/j.ins.2019.07.070.

Wongvorachan T, He S, Bulut O. A Comparison of Undersampling, Oversampling, and SMOTE Methods

for Dealing with Imbalanced Classification in Educational Data Mining. Information. 2023 1;14:54.

Available from: https://doi.org/10.3390/info14010054.

Oladimeji OO, Oladimeji A, Olayanju O. Machine Learning Models for Diagnostic Classification of

Hepatitis C Tests. Frontiers in Health Informatics. 2021 3;10:70. Available from: https://doi.org/

30699/fhi.v10i1.274.

Ali AM, Hassan MR, Aburub F, Alauthman M, Aldweesh A, Al-Qerem A, et al. Explainable Machine

Learning Approach for Hepatitis C Diagnosis Using SFS Feature Selection. Machines. 2023 3;11:391.

Available from: https://doi.org/10.3390/machines11030391.

KayvanJoo AH, Ebrahimi M, Haqshenas G. Prediction of hepatitis C virus interferon/ribavirin therapy

outcome based on viral nucleotide attributes using machine learning algorithms. BMC Research Notes.

12;7:565. Available from: https://doi.org/10.1186/1756-0500-7-565.

Farghaly HM, Shams MY, El-Hafeez TA. Hepatitis C Virus prediction based on machine learning

framework: a real-world case study in Egypt. Knowledge and Information Systems. 2023 6;65:2595-617.

Available from: https://doi.org/10.1007/s10115-023-01851-4.

Butt MB, Alfayad M, Saqib S, Khan MA, Ahmad M, Khan MA, et al. Diagnosing the Stage of Hepatitis

C Using Machine Learning. Journal of Healthcare Engineering. 2021 12;2021:1-8. Available from:

https://doi.org/10.1155/2021/8062410.


Article Metrics

Abstract view : 45 times
PDF - 11 times

DOI: https://doi.org/10.26714/jichi.v6i2.17519

Refbacks

  • There are currently no refbacks.


____________________________________________________________________________
Journal of Intelligent Computing and Health Informatics (JICHI)
ISSN 2715-6923 (print) | 2721-9186 (online)
Organized by
Department of Informatics
Faculty of Engineering
Universitas Muhammadiyah Semarang

W : https://jurnal.unimus.ac.id/index.php/ICHI
E : jichi.informatika@unimus.ac.id, ahmadilham@unimus.ac.id

View My Stats

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.