Enhancement of Backward Feature Elimination as a Pre-Processing Method for K Nearest Neighbor Algorithm Applied to Insurance Fraud Detection

Authors

  • Ma. Pauline Yvana B. Amores Student, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
  • Charisse Nicole P. Aberin Student, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines
  • Vivien A. Agustin Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
  • Herminiño C. Lagunzad Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines
  • Richard C. Regala Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.
  • Raymund M. Dioses Thesis Adviser, College of Engineering - Pamantasan ng Lungsod ng Maynila, Philippines.

DOI:

https://doi.org/10.15379/ijmst.v10i1.2685

Keywords:

Backward Feature Elimination, Confidence Interval, Fraud Detection, K-Nearest Neighbor Algorithm

Abstract

This study presents novel enhancements to the Backward Feature Elimination (BFE) method for improved insurance fraud detection using the K-Nearest Neighbor (KNN) algorithm. The research addresses issues inherent in the baseline BFE process, such as the over-reliance on p-values, the potential for misleading results, and suboptimal feature selection leading to overfitting. To address these, the study integrates confidence intervals and feature importance into the BFE process, establishing a more robust and reliable criterion for feature selection. Moreover, feature engineering techniques are introduced during preprocessing to enhance model performance. The modified BFE method demonstrates superior performance over the baseline model regarding the recall, precision, and F1 score. Stratified K-Fold Cross-Validation, ROC-AUC Score, and Coefficient of Variation (CV) confirm the consistency and robustness of the enhanced model across varying data subsets. These innovations offer a comprehensive and reliable solution to feature selection in the BFE method, applied to the KNN model for effective insurance fraud detection. The study mitigates the issues related to p-value dependence and boosts model performance, paving the way for more accurate and robust fraud detection systems.

Downloads

Download data is not yet available.

Downloads

Published

2023-07-13

How to Cite

[1]
M. P. Y. B. . Amores, C. N. P. . Aberin, V. A. . Agustin, H. C. . Lagunzad, R. C. . Regala, and R. M. . Dioses, “Enhancement of Backward Feature Elimination as a Pre-Processing Method for K Nearest Neighbor Algorithm Applied to Insurance Fraud Detection”, ijmst, vol. 10, no. 1, pp. 927-935, Jul. 2023.

Most read articles by the same author(s)