Filtrado de SPAM en SMS mediante algoritmos de aprendizaje automático

Authors

Keywords:

Machine Learning, SMS, Super Vector Machine, logistic regression, KNN, RandomForest, AdaBoost

Abstract

One of the most common forms of communication using mobile phones is through SMS, or short message service. Financial institutions, television companies and the telephone operators are examples of companies that takes advantage of this type of communication; however, this technology is not exempt from unwanted messages or SPAM. This article describes both the application of automatic learning algorithms for SPAM's filter and an experimenting with a data set of 5,574 SMS to evaluate the performance of models using techniques such as Logistic Regression, Super Vector Machine, KNN, Random Forest and AdaBoost to filter and predict unwanted messages.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

Abdulhamid, S. M., Abd Latiff, M. S., Chiroma, H., Osho, O., Abdul-Salaam, G., Abubakar, A. I., Herawan, T. (2017). A Review on Mobile SMS Spam Filtering Techniques. IEEE Access, 5, 15650-15666. https://doi.org/10.1109/ACCESS.2017.2666785

Almeida, T. A., Alberto, T. C. (2013). Learning to block undesired comments in the blogosphere. 12th International Conference on Machine Learning and Applications (pp. 261-266). IEEE. https://doi.org/10.1109/ICMLA.2013.133

Almeida, T. A., Gómez Hildago, J. M., Yamakami, A. (2011). Contributions to the study of SMS spam filtering. Proceedings of the 11th ACM symposium on Document engineering - DocEng ’11, pp. 259-262. https://doi.org/10.1145/2034691.2034742

Almeida, T. A., Gómez Hidalgo, J. M., Silva, T. P. (2013). Towards SMS Spam Filtering: Results under a New Dataset. International Journal Of Information Security Science, 2(1), 1-18.

Bin, Z., Gang, Z., Yunbo, F., Xiaolu, Z., Weiqiang, J., Jing, D., Jiafeng, G. (2016). Behavior analysis based SMS spammer detection in mobile communication networks. First International Conference on Data Science in Cyberspace (DSC) (pp. 538-543). IEEE. https://doi.org/10.1109/DSC.2016.48

Chan, P. P. K., Yang, C., Yeung, D. S., Ng, W. W. Y. (2015). Spam filtering for short messages in adversarial environment. Neurocomputing, 155, 167-176. https://doi.org/10.1016/J.NEUCOM.2014.12.034

Firte, L., Lemnaru, C., Potolea, R. (2010). Spam detection filter using KNN algorithm and resampling. Proceedings of the 2010 IEEE 6th International Conference on Intelligent Computer Communication and Processing. IEEE. https://doi.org/10.1109/ICCP.2010.5606466

Guzella, T. S., Caminhas, W. M. (2009). A review of machine learning approaches to Spam filtering. Expert Systems with Applications, 36(7), 10206-10222. https://doi.org/10.1016/J.ESWA.2009.02.037

Ma, J., Zhang, Y., Liu, J., Yu, K., Wang, X. (2016). Intelligent SMS Spam Filtering Using Topic Model. International Conference on Intelligent Networking and Collaborative Systems (INCoS) (pp. 380-383). IEEE. https://doi.org/10.1109/INCoS.2016.47

Rafique, M. Z., Abulaish, M. (2012). Graph-based learning model for detection of SMS spam on smart phones. 8th International Wireless Communications and Mobile Computing Conference (IWCMC) (pp. 1046-1051). IEEE. https://doi.org/10.1109/IWCMC.2012.6314350

Shirani-Mehr, H. (2013). SMS Spam Detection using Machine Learning Approach. Retrieved from https://pdfs.semanticscholar.org/a083/c8ea8e898269927e1cc0a935477175179b58.pdf

Yu, Y., Chen, Y. (2012). A novel content based and social network aided online spam short message filter. Proceedings of the 10th World Congress on Intelligent Control and Automation (pp. 444-449). IEEE. https://doi.org/10.1109/WCICA.2012.6357916

Zhang, X., Xiong, G., Hu, Y., Zhu, F., Dong, X., Nyberg, T. R. (2016). A method of SMS spam filtering based on AdaBoost algorithm. 12th World Congress on Intelligent Control and Automation (WCICA) (pp. 2328-2332). IEEE. https://doi.org/10.1109/WCICA.2016.7578522

Published

2017-12-30

How to Cite

Pin, L. (2017). Filtrado de SPAM en SMS mediante algoritmos de aprendizaje automático. Maskana, 8(1), 109–117. Retrieved from https://publicaciones.ucuenca.edu.ec/ojs/index.php/maskana/article/view/1971

Issue

Section

First Congress of Computer Science