Using machine learning algorithms to detect frauds in telephone networks

  • Sergiu Apostu Dunarea de Jos University of Galati
Keywords: principal component analysis, logistic regression, random forest, word2vec


This paper presents an analysis of voice traffic in telephone networks, based on machine learning algorithms to detect frauds made by callers. Starting from the raw data set that includes information about the call date, destination number, duration and caller's number, in our approach we were able to identify fraudulent calls in early stages. For balance, the data set was split in 2 parts: one for training and one for testing. To obtain mean’s values from dataset, a standardization technique was applied in order to scale the data before the dimensionality reduction using Principal Component Analysis. Then, the first two components were used as inputs for Logistic Regression and Random Forest models, having the caller as target. Finally, the target was moved on the destination file so as to identify the caller and the moment when the call has started based on a vector representation of words.


Download data is not yet available.
How to Cite
Apostu S. Using machine learning algorithms to detect frauds in telephone networks. The Annals of “Dunarea de Jos“ University of Galati. Fascicle III, Electrotechnics, Electronics, Automatic Control, Informatics [Internet]. 21Jan.2021 [cited 28Nov.2021];43(3):16-0. Available from: