Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Towards automated feature engineering for credit card fraud detection using multi-perspective HMMs

Published: 01 January 2020 Publication History

Abstract

Machine learning and data mining techniques have been used extensively in order to detect credit card frauds. However, most studies consider credit card transactions as isolated events and not as a sequence of transactions.
In this framework, we model a sequence of credit card transactions from three different perspectives, namely (i) The sequence contains or doesn’t contain a fraud (ii) The sequence is obtained by fixing the card-holder or the payment terminal (iii) It is a sequence of spent amount or of elapsed time between the current and previous transactions. Combinations of the three binary perspectives give eight sets of sequences from the (training) set of transactions. Each one of these sequences is modelled with a Hidden Markov Model (HMM). Each HMM associates a likelihood to a transaction given its sequence of previous transactions. These likelihoods are used as additional features in a Random Forest classifier for fraud detection.
Our multiple perspectives HMM-based approach offers automated feature engineering to model temporal correlations so as to improve the effectiveness of the classification task and allows for an increase in the detection of fraudulent transactions when combined with the state of the art expert based feature engineering strategy for credit card fraud detection.
In extension to previous works, we show that this approach goes beyond ecommerce transactions and provides a robust feature engineering over different datasets, hyperparameters and classifiers. Moreover, we compare strategies to deal with structural missing values.

References

[1]
Donato J.M., Schryver J.C., Hinkel G.C., Schmoyer R.L., Leuze M.R., Grandy N.W., Mining multi-dimensional data for decision support, Future Gener. Comput. Syst. 15 (1999).
[2]
Bolton R., Hand D.J., Unsupervised profiling methods for fraud detection, in: Credit Scoring and Credit Control VII, 2001.
[3]
Whitrow C., Hand D.J., Juszczak P., Weston D.J., Adams N.M., Transaction aggregation strategy for credit card fraud detection, Data Min. Knowl. Discov. 18 (1) (2008).
[4]
Bahnsen A.C., Aouada D., Stojanovic A., Ottersten B., Feature engineering strategies for credit card fraud detection, Expert Syst. Appl. (2016).
[5]
Chandola V., Banerjee A., Kumar V., Anomaly detection for discrete sequences: A survey, IEEE Trans. Knowl. Data Eng. (2012).
[6]
Pozzolo A.D., Boracchi G., Caelen O., Alippi C., Bontempi G., Credit card fraud detection: a realistic modeling and a novel learning strategy, IEEE Trans. Neural Netw. Learn. Syst. (2017).
[7]
Lucas Y., Portier P.-E., Laporte L., Calabretto S., Caelen O., He-Guelton L., Granitzer M., Multiple perspectives hmm-based feature engineering for credit card fraud detection, in: 34th ACM/SIGAPP Symposium on Applied Computing (SAC2019), 2019.
[8]
Jurgovsky J., Granitzer M., Ziegler K., Calabretto S., Portier P., He-Guelton L., Caelen O., Sequence classification for credit-card fraud detection, Expert Syst. Appl. (2018).
[9]
Maes S., Tuyls K., Vanschoenwinkel B., Manderick B., Credit card fraud detection using bayesian and neural networks, in: Proceedings of NF, 2002.
[10]
Bhattacharyya S., Jha S., Tharakunnel K., Westland J.C., Data mining for credit card fraud: A comparative study, Decis. Support Syst. 50 (3) (2011).
[11]
Bahnsen A.C., Stojanovic A., Aouada D., Ottersten B., Cost sensitive credit card fraud detection using bayes minimum risk, in: Proceedings of the 2013 12th International Conference on Machine Learning and Applications, 2013.
[12]
Pozzolo A.D., Caelen O., Borgne Y.L., Waterschoot S., Bontempi G., Learned lessons in credit card fraud detection from a practitioner perspective, Expert Syst. Appl. (2014).
[13]
Mahmoudi N., Duman E., Detecting credit card fraud by modified fisher discriminant analysis, Expert Syst. Appl. 42 (5) (2015).
[14]
Breiman L., Friedman J., Olshen R.A., Stone C.J., Classification and regression trees (p. 147), in: The Wadsworth Statistic / Probability Series, 1984.
[15]
Minegishi T., Niimi A., Proposal of credit card fraudulent use detection by online-type decision tree construction and verification of generality, Int. J. Inf. Secur. Res. 1 (4) (2011).
[16]
Saia R., Carta S., Evaluating the benefits of using proactive transformed-domain-based techniques in fraud detection tasks, Future Gener. Comput. Syst. 93 (2019).
[17]
Jha S., Guillen M., Westland J., Employing transaction aggregation strategy to detect credit card fraud, Expert Syst. Appl. 39 (16) (2012).
[18]
Krivko M., A hybrid model for plastic card fraud detection systems, Expert Syst. Appl. 37 (8) (2010).
[19]
Vlasselaer V.V., Bravo C., Caelen O., Eliassi-Rad T., Akoglu L., Snoeck M., Baesens B., Apate: a novel approach for automated credit card transaction fraud detection using network-based extensions, Decis. Support Syst. (2015).
[20]
Dietterich T., Machine learning for sequential data: A review, in: Structural, Syntactic, and Statistical Pattern Recognition, 2002.
[21]
Strivastava A., Kundu A., Sural S., Majundar A.K., Credit card fraud detection using hidden markov model, IEEE Trans. Dependabcle Secure Comput. (2008).
[22]
Graves A., Supervised Sequence Labelling with Recurrent Neural Networks, Vol. 385, Springer, 2012.
[23]
Wiese B., Omlin C., Credit card transactions, fraud detection and machine learning: Modelling time with lstm recurrent neural networks, in: Innovations in Neural Information Paradigms and Applications, Springer, 2009.
[24]
Rabiner L.R., Juang B.H., Hidden markov models for speech recognition, Technometrics (1991).
[25]
Baum L.E., An inequality and associated maximization technique, in: Statistial Estimation for Probabilistic Functions of Markov Processes, 1972.
[26]
Viterbi A.J., Error bounds for convolutional codes and an asymptotically optimum decoding algorithm, IEEE Trans. Inf. Theory IT-13 (1967).
[27]
Ali M.A., Azad M.A., Centeno M.P., Hao F., van Moorsel A., Consumer-facing technology fraud: Economics, attack methods and potential solutions, Future Gener. Comput. Syst. 100 (2019).
[28]
Davis J., Goadrich M., The relationship between precision-recall and roc curves, in: ICML ’06 Proceedings of the 23rd International Conference on Machine Learning, 2006.
[29]
Pozzolo A.D., Boracchi G., Caelen O., Alippi C., Bontempi G., Credit card fraud detection and concept-drift adaptation with delayed supervised information, in: 2015 International Joint Conference on Neural Networks (IJCNN), 2015.
[30]
Freund Y., Schapire R., A decision-theoretic generalization of on-line learning and an application to boosting, J. Comput. Syst. Sci. (1997).

Cited By

View all
  • (2024)An interpretable automated feature engineering framework for improving logistic regressionApplied Soft Computing10.1016/j.asoc.2024.111269153:COnline publication date: 1-Mar-2024
  • (2023)Credit Card Fraud Detector for Lower Ranged Transactions using AI AlgorithmsProceedings of the 5th International Conference on Information Management & Machine Intelligence10.1145/3647444.3647881(1-5)Online publication date: 23-Nov-2023
  • (2023)Credit card fraud detection in the era of disruptive technologiesJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2022.11.00835:1(145-174)Online publication date: 1-Jan-2023
  • Show More Cited By

Index Terms

  1. Towards automated feature engineering for credit card fraud detection using multi-perspective HMMs
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Future Generation Computer Systems
      Future Generation Computer Systems  Volume 102, Issue C
      Jan 2020
      1062 pages

      Publisher

      Elsevier Science Publishers B. V.

      Netherlands

      Publication History

      Published: 01 January 2020

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)An interpretable automated feature engineering framework for improving logistic regressionApplied Soft Computing10.1016/j.asoc.2024.111269153:COnline publication date: 1-Mar-2024
      • (2023)Credit Card Fraud Detector for Lower Ranged Transactions using AI AlgorithmsProceedings of the 5th International Conference on Information Management & Machine Intelligence10.1145/3647444.3647881(1-5)Online publication date: 23-Nov-2023
      • (2023)Credit card fraud detection in the era of disruptive technologiesJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2022.11.00835:1(145-174)Online publication date: 1-Jan-2023
      • (2023)Smart credit card fraud detection system based on dilated convolutional neural network with sampling techniqueMultimedia Tools and Applications10.1007/s11042-023-15730-182:20(31691-31708)Online publication date: 15-May-2023
      • (2023)Credit card fraud detection using ensemble data mining methodsMultimedia Tools and Applications10.1007/s11042-023-14698-282:19(29057-29075)Online publication date: 9-Mar-2023
      • (2023)An integration of deep learning model with Navo Minority Over-Sampling Technique to detect the frauds in credit cardsMultimedia Tools and Applications10.1007/s11042-023-14365-682:14(21757-21774)Online publication date: 25-Jan-2023
      • (2022)A two-stage hybrid credit risk prediction model based on XGBoost and graph-based deep neural networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2022.116624195:COnline publication date: 1-Jun-2022
      • (2022)Fraud detection and prevention in e-commerceElectronic Commerce Research and Applications10.1016/j.elerap.2022.10120756:COnline publication date: 1-Nov-2022
      • (2022)Interpretable data science for decision makingDecision Support Systems10.1016/j.dss.2021.113664150:COnline publication date: 22-Apr-2022
      • (2022)Predicting credit card fraud using multipurpose classification based on evolutionary rulesSecurity and Privacy10.1002/spy2.2395:5Online publication date: 9-Sep-2022
      • Show More Cited By

      View Options

      View options

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media