Filtrado de SPAM en SMS mediante algoritmos de aprendizaje automático
Keywords:
Machine Learning, SMS, Super Vector Machine, logistic regression, KNN, RandomForest, AdaBoostAbstract
One of the most common forms of communication using mobile phones is through SMS, or short message service. Financial institutions, television companies and the telephone operators are examples of companies that takes advantage of this type of communication; however, this technology is not exempt from unwanted messages or SPAM. This article describes both the application of automatic learning algorithms for SPAM's filter and an experimenting with a data set of 5,574 SMS to evaluate the performance of models using techniques such as Logistic Regression, Super Vector Machine, KNN, Random Forest and AdaBoost to filter and predict unwanted messages.
Downloads
Metrics
References
Abdulhamid, S. M., Abd Latiff, M. S., Chiroma, H., Osho, O., Abdul-Salaam, G., Abubakar, A. I., Herawan, T. (2017). A Review on Mobile SMS Spam Filtering Techniques. IEEE Access, 5, 15650-15666. https://doi.org/10.1109/ACCESS.2017.2666785
Almeida, T. A., Alberto, T. C. (2013). Learning to block undesired comments in the blogosphere. 12th International Conference on Machine Learning and Applications (pp. 261-266). IEEE. https://doi.org/10.1109/ICMLA.2013.133
Almeida, T. A., Gómez Hildago, J. M., Yamakami, A. (2011). Contributions to the study of SMS spam filtering. Proceedings of the 11th ACM symposium on Document engineering - DocEng ’11, pp. 259-262. https://doi.org/10.1145/2034691.2034742
Almeida, T. A., Gómez Hidalgo, J. M., Silva, T. P. (2013). Towards SMS Spam Filtering: Results under a New Dataset. International Journal Of Information Security Science, 2(1), 1-18.
Bin, Z., Gang, Z., Yunbo, F., Xiaolu, Z., Weiqiang, J., Jing, D., Jiafeng, G. (2016). Behavior analysis based SMS spammer detection in mobile communication networks. First International Conference on Data Science in Cyberspace (DSC) (pp. 538-543). IEEE. https://doi.org/10.1109/DSC.2016.48
Chan, P. P. K., Yang, C., Yeung, D. S., Ng, W. W. Y. (2015). Spam filtering for short messages in adversarial environment. Neurocomputing, 155, 167-176. https://doi.org/10.1016/J.NEUCOM.2014.12.034
Firte, L., Lemnaru, C., Potolea, R. (2010). Spam detection filter using KNN algorithm and resampling. Proceedings of the 2010 IEEE 6th International Conference on Intelligent Computer Communication and Processing. IEEE. https://doi.org/10.1109/ICCP.2010.5606466
Guzella, T. S., Caminhas, W. M. (2009). A review of machine learning approaches to Spam filtering. Expert Systems with Applications, 36(7), 10206-10222. https://doi.org/10.1016/J.ESWA.2009.02.037
Ma, J., Zhang, Y., Liu, J., Yu, K., Wang, X. (2016). Intelligent SMS Spam Filtering Using Topic Model. International Conference on Intelligent Networking and Collaborative Systems (INCoS) (pp. 380-383). IEEE. https://doi.org/10.1109/INCoS.2016.47
Rafique, M. Z., Abulaish, M. (2012). Graph-based learning model for detection of SMS spam on smart phones. 8th International Wireless Communications and Mobile Computing Conference (IWCMC) (pp. 1046-1051). IEEE. https://doi.org/10.1109/IWCMC.2012.6314350
Shirani-Mehr, H. (2013). SMS Spam Detection using Machine Learning Approach. Retrieved from https://pdfs.semanticscholar.org/a083/c8ea8e898269927e1cc0a935477175179b58.pdf
Yu, Y., Chen, Y. (2012). A novel content based and social network aided online spam short message filter. Proceedings of the 10th World Congress on Intelligent Control and Automation (pp. 444-449). IEEE. https://doi.org/10.1109/WCICA.2012.6357916
Zhang, X., Xiong, G., Hu, Y., Zhu, F., Dong, X., Nyberg, T. R. (2016). A method of SMS spam filtering based on AdaBoost algorithm. 12th World Congress on Intelligent Control and Automation (WCICA) (pp. 2328-2332). IEEE. https://doi.org/10.1109/WCICA.2016.7578522
Downloads
Published
How to Cite
Issue
Section
License
Copyright © Autors. Creative Commons Attribution 4.0 License. for any article submitted from 6 June 2017 onwards. For manuscripts submitted before, the CC BY 3.0 License was used.
You are free to:
Share — copy and redistribute the material in any medium or format |
Adapt — remix, transform, and build upon the material for any purpose, even commercially. |
Under the following conditions:
Attribution — You must give appropriate credit, provide a link to the licence, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licenser endorses you or your use. |
No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the licence permits. |