Using machine learning algorithms to detect frauds in telephone networks

  • Sergiu Apostu Dunarea de Jos University of Galati
Keywords: principal component analysis, logistic regression, random forest, word2vec

Abstract

This paper presents an analysis of voice traffic in telephone networks, based on machine learning algorithms to detect frauds made by callers. Starting from the raw data set that includes information about the call date, destination number, duration and caller's number, in our approach we were able to identify fraudulent calls in early stages. For balance, the data set was split in 2 parts: one for training and one for testing. To obtain mean’s values from dataset, a standardization technique was applied in order to scale the data before the dimensionality reduction using Principal Component Analysis. Then, the first two components were used as inputs for Logistic Regression and Random Forest models, having the caller as target. Finally, the target was moved on the destination file so as to identify the caller and the moment when the call has started based on a vector representation of words.

Published
2021-01-21
How to Cite
1.
Apostu S. Using machine learning algorithms to detect frauds in telephone networks. The Annals of “Dunarea de Jos“ University of Galati. Fascicle III, Electrotechnics, Electronics, Automatic Control, Informatics [Internet]. 21Jan.2021 [cited 28Apr.2024];43(3):16-0. Available from: https://www.gup.ugal.ro/ugaljournals/index.php/eeaci/article/view/4129
Section
Articles

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.