Using isolation forest in anomaly detection: The case of credit card transactions

Soumaya Ounacer, Hicham Ait El Bour, Younes Oubrahim, Mohamed Yassine Ghoumari, Mohamed Azzouazi

Abstract


With the evolution of new technology especially in the domain of e-commerce and online banking, the payment by credit card has seen a significant increase. The credit card has become the most used tool for online shopping. This high rate in use brings about fraud and a considerable damage. It is very important to stop fraudulent transactions because they cause huge financial losses over time. The detection of fraudulent transactions is an important application in anomaly detection. There are different approaches to detecting anomalies namely SVM, logistic regression, decision tree and so on. However, they remain limited since they are supervised algorithms that require to be trained by labels in order to know whether the transactions are fraudulent or not. The goal of this paper is to have a credit card fraud detection system which is able to detect the highest number of new transactions in real time with high accuracy. We will also compare, in this paper, different unsupervised techniques for credit card fraud detection namely LOF, one class SVM, K-means and Isolation Forest so as to single out the best approach.

Full Text:

PDF

References


T. Banerjee, M. Mishra, N. C. Debnath, and P. Choudhury, “Implementing E-Commerce model for Agricultural Produce : A Research Roadmap,” vol. 7, no. 1, pp. 302–310, 2019.

HuaMing, Kishan G. Mehrotra Chilukuri K. Mohan, Anomaly Algorithms Principles and Detection.2017.

V. CHANDOLA, A. BANERJEE, and V. KUMAR “Anomaly Detection: a SURVEY,” Conform. Predict. Reliab. Mach. Learn. Theory, Adapt. Appl., vol. 41, no. 3, pp. 71–97, 2014.

R. A. Flarence, S. Bethu, V. Sowmya, K. Anusha, and B. S. Babu, “Importance of Supervised Learning in Prediction Analysis,” vol. 6, no. 1, pp. 201–214, 2018.

N. Venkateswaran, A. Shekhar, and S. Changder, “Using machine learning for intelligent shard sizing on the cloud,” vol. 7, no. 1, pp. 109–124, 2019.

D. Sabinasz, “Dealing with Unbalanced Classes in Machine Learning,” Deep Ideas. 2017.

H. Jihal, M. A. Talhaoui, A. Daif, and M. Azzouazi, “Predictive Analytics as A Service on Moroccan Tax Evasion,” vol. 7, pp. 90–92, 2018.

S. Maes and K. Tuyl, “Credit Card Fraud Detection Using Bayesian and Neural Networks,” no. August 2002, 2013.

Y. Sahin and E. Duman, “Detecting Credit Card Fraud by Decision Trees and Support Vector Machines,” vol. I, 2011.

N. Malini and M. Phil, “Analysis on Credit Card Fraud Identification Techniques based on KNN and Outlier Detection,” pp. 3–6, 2017.

M. M. Breunig, H. Kriegel, R. T. Ng, and J. Sander, “LOF : Identifying Density-Based Local Outliers,” pp. 1–12, 2000.

M. M. Breunig, H.-P. Kriegel, R. T. Ng, and J. Sander, “OPTICS-OF: Identifying Local Outliers,” no. September, pp. 262–270, 2010.

B. Sch and R. Williamson, “Support Vector Method for Novelty Detection,” pp. 582–588, 2000.

M. E. Celebi, H. A. Kingravi, and P. A. Vela, “A comparative study of efficient initialization methods for the k-means clustering algorithm,” Expert Syst. Appl., vol. 40, no. 1, pp. 200–210, 2013.

E. Lewinson, “Outlier Detection with Isolation Forest,” Towards Data Science. 2017.

F. T. Liu and K. M. Ting, “Isolation Forest.”2009.

F. T. Liu, K. M. Ting, and Z. H. Zhou, “Isolation forest,” Proc. - IEEE Int. Conf. Data Mining, ICDM, pp. 413–422, 2008.

F. T. Liu and K. M. Ting, “Isolation Forest: Isolation Forest,” 1980.

“Credit Card Fraud _ Kaggle, Anonymized credit card transactions labeled as fraudulent or genuine.” .




DOI: http://dx.doi.org/10.21533/pen.v6i2.533

Refbacks

  • There are currently no refbacks.


Copyright (c) 2019 soumaya ounacer, Hicham Ait El Bour, Younes Oubrahim, Mohamed Yassine Ghoumari, Mohamed Azzouazi

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

ISSN: 2303-4521

Digital Object Identifier DOI: 10.21533/pen

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License