Automated analysis of pedophilic conversations using Machine Learning
Keywords:
Online grooming, Machine learning, Automatic classification, Mobile messagingAbstract
This research project is based on implementing a software solution to detect text messages with pedophilic intentions through mobile messaging applications. A corpus of this type of conversations is analyzed to select the most relevant features using natural language processing techniques and achieve a better performance in the predictive model with supervised Machine Learning classification algorithms. The Support Vector Machine algorithm was adopted as the text classification model, with the tests performed on this model very promising results were obtained with an accuracy of 80% in the classification of these messages, some of these had a lot of noise and grammatical errors so that the learning capacity of the model was affected, Therefore, after a processing stage of the data set and the necessary adjustments, it was possible to optimize the model a little more and reach 84% in terms of precision and accuracy, in addition to the F1 score that indicates a performance of 86% for the classification model built. Finally, the model is implemented in a Bot that is added to a Telegram group by connecting to its API for test conversation analysis to observe how well it performed the automatic classification of messages sent in the group.
Keywords: Online grooming, Machine learning, Automatic classification, Mobile messaging.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Editorial "Universo Sur"

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
La editorial "Universo Sur", de la Universidad de Cienfuegos, publica el contenido de la Revista "Universidad y Sociedad" bajo una Licencia Creative Commons Atribución-NoComercial-SinDerivar 4.0 Internacional.
© Podrá reproducirse, de forma parcial o total, el contenido de esta publicación, siempre que se haga de forma literal y se mencione la fuente.