Enhancement of Backward Feature Elimination as a Pre-Processing Method for K Nearest Neighbor Algorithm Applied to Insurance Fraud Detection

Ma. Pauline Yvana B. Amores; Charisse Nicole P. Aberin; Vivien A. Agustin; Herminiño C. Lagunzad; Richard C. Regala; Raymund M. Dioses

doi:10.15379/ijmst.v10i1.2685

Authors

Ma. Pauline Yvana B. Amores, Student, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
Charisse Nicole P. Aberin, Student, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
Vivien A. Agustin, Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
Herminiño C. Lagunzad, Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
Richard C. Regala, Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
Raymund M. Dioses, Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.

DOI:

https://doi.org/10.15379/ijmst.v10i1.2685

Keywords:

Backward Feature Elimination, Confidence Interval, Fraud Detection, K-Nearest Neighbor Algorithm

Abstract

This study presents enhancements to the Backward Feature Elimination (BFE) method for insurance fraud detection with the K-Nearest Neighbor (KNN) algorithm. The research addresses issues in the baseline BFE process: over-reliance on p-values, the potential for misleading results, and suboptimal feature selection that leads to overfitting. To address these, the study integrates confidence intervals and feature importance into the BFE process, establishing a more robust and reliable criterion for feature selection. Feature engineering techniques are also introduced during preprocessing to enhance model performance. The modified BFE method outperforms the baseline model in recall, precision, and F1 score. Stratified K-Fold Cross-Validation, ROC-AUC score, and Coefficient of Variation (CV) confirm the consistency and robustness of the enhanced model across varying data subsets. Together, these changes reduce the dependence on p-values and improve model performance, supporting more accurate and robust fraud detection systems.
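
The consistency check named in the abstract (stratified K-fold cross-validation summarized by a coefficient of variation) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic, the function names (`stratified_folds`, `knn_predict`, `cross_validated_f1`, `coefficient_of_variation`) are hypothetical, a small pure-Python KNN stands in for a library classifier, and F1 is used as the per-fold score as an assumption.

```python
import random
import statistics
from collections import defaultdict


def stratified_folds(labels, k, seed=0):
    """Split indices into k folds so each class is spread evenly across folds."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)  # deal indices round-robin
    return folds


def knn_predict(train_X, train_y, point, n_neighbors=3):
    """Majority vote among the n_neighbors closest training points."""
    nearest = sorted(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], point)),
    )[:n_neighbors]
    votes = [train_y[i] for i in nearest]
    return max(set(votes), key=votes.count)


def f1_score(y_true, y_pred, positive=1):
    """F1 for the positive (fraud) class; 0.0 when there are no true positives."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def cross_validated_f1(X, y, k=5, n_neighbors=3, seed=0):
    """Return one F1 score per held-out fold."""
    scores = []
    for held_out in stratified_folds(y, k, seed):
        held = set(held_out)
        train_idx = [i for i in range(len(X)) if i not in held]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        preds = [knn_predict(train_X, train_y, X[i], n_neighbors) for i in held_out]
        scores.append(f1_score([y[i] for i in held_out], preds))
    return scores


def coefficient_of_variation(scores):
    """Sample standard deviation divided by the mean; lower means more stable."""
    mean = statistics.mean(scores)
    return statistics.stdev(scores) / mean if mean else float("inf")


# Synthetic, imbalanced toy data (assumption): 40 legitimate, 10 fraudulent claims.
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(40)]
X += [[rng.gauss(3, 1), rng.gauss(3, 1)] for _ in range(10)]
y = [0] * 40 + [1] * 10

folds = stratified_folds(y, 5)
scores = cross_validated_f1(X, y, k=5)
cv = coefficient_of_variation(scores)
print("per-fold F1:", [round(s, 3) for s in scores])
print("coefficient of variation:", round(cv, 3))
```

Stratification matters here because fraud cases are typically a small minority: dealing each class across the folds separately keeps at least some fraud cases in every held-out fold, so the per-fold scores, and the coefficient of variation computed from them, remain comparable.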
