To predict credit risk we can use several machine learning models.
- Random Oversampling Model - Result
- SMOTE Oversampling - Result
- Undersampling - Result
- Combination (Over and Under) Sampling - Result
- Balanced Random Forest Classifier - Result
- Easy Ensemble AdaBoost Classifier - Result
- ALL MODEL Result Comparison -
All models show that 100% Precision on Low Risk but low numbers on High Risk Precision. High Risk Sensitivity range is between 59% ~ 69%. Low Risk Sensitivity range is between 45% ~ 91%. Only Low Risk Precision shows 100%. All models have low High Risk F1 score which can refer that imbalance between sensitivity and precision. Sensitivity and Precision are important measure for credit risk prediction, so I do not recommand any of these models to predict credit risk. If one of model must use, then either Balanced Random Forest Classifier or Easy Ensemble AdaBoost Classifier model should use for the prediction since those two model has highest Balanced Accuracy Score and F1 scores.