Cyber-bullying Detection based on Machine Learning Method (Case Study: Instagram Comment Section)

Eri Eli Lavindi

Abstract


Social media is popular communication platform for last decade. Social Platform such as Facebook, Instagram, and Twitter provide real-time and efficient way of communication overseas. The ease of using social media does not only provide positive benefits, but can also have a negative impact on its users. One of social media negative impact is cyber-bullying which define as a type of harassment through online media. The effect of cyber-bullying to the victim particularly is mental health disorder. Usually, being the victim of cyber-bullying can increase the stress and anxiety level, lower self-esteem, loneliness, sadness, and disappointment. This study evaluates the comment on Instagram post of Indonesia influencer to determine whether it classified as bullying or non-bullying. This study utilizes count vectorizer as feature extraction and compare several machine learning methods such as Naïve Bayes, SVM, and Random Forest. The evaluation result show that both Naïve Bayes and Random Forest are able achieve 77% accuracy. Moreover, Naïve Bayes method also generate higher percentage compared to other methods. This result indicate that Naïve Bayes are capable in detecting cyber-bullying comment in social media platform.

Keywords


Cyber-bullying, Machine Learning, Social media,Text Mining

Full Text:

PDF

References


APJI, “Profil Internet Indonesia 2022,” Apji.or.Od, no. June, 2022, [Online]. Available: apji.or.id

P. D. Nugraheni, “The New Face of Cyberbullying in Indonesia: How can We Provide Justice to the Victims?,” Indones. J. Int. Clin. Leg. Educ., vol. 3, no. 1, pp. 57–76, 2021, doi: 10.15294/ijicle.v3i1.43153.

N. A. Azeez, S. O. Idiakose, C. J. Onyema, and C. Van Der Vyver, “Cyberbullying Detection in Social Networks: Artificial Intelligence Approach,” J. Cyber Secur. Mobil., vol. 10, no. 4, pp. 745–774, 2021, doi: 10.13052/jcsm2245-1439.1046.

T. Ige and S. Adewale, “AI Powered Anti-Cyber Bullying System using Machine Learning Algorithm of Multinomial Naïve Bayes and Optimized Linear Support Vector Machine Interception of Cyberbully Contents in a Messaging System by Machine Learning Algorithm,” Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 5, pp. 5–9, 2022, doi: 10.14569/IJACSA.2022.0130502.

S. G. Handono, K. Laeheem, and R. Sittichai, “Factors related with cyberbullying among the youth of Jakarta, Indonesia,” Child. Youth Serv. Rev., vol. 99, no. August 2018, pp. 235–239, 2019, doi: 10.1016/j.childyouth.2019.02.012.

A. Perera and P. Fernando, “Accurate cyberbullying detection and prevention on social media,” Procedia Comput. Sci., vol. 181, pp. 605–611, 2021, doi: 10.1016/j.procs.2021.01.207.

S. Almutiry and M. Abdel Fattah, “Arabic CyberBullying Detection Using Arabic Sentiment Analysis,” Egypt. J. Lang. Eng., vol. 8, no. 1, pp. 39–50, 2021, doi: 10.21608/ejle.2021.50240.1017.

S. Kemp, “Digital-2022-Indonesia-February-2022-v01_compressed.pdf.” pp. 24–84, 2022. [Online]. Available: https://datareportal.com/reports/digital-2022-indonesia?msclkid=54849450ac3011eca46cf06ec644a888

J. Cristian, V. Louise, and S. Koka, “Prediksi Keberhasilan Lamaran Pekerjaan Dengan Count Vectorizer dan Logistic Regression Abstak,” vol. 4, pp. 16–25, 2022.

R. Kustiawan, A. Adiwijaya, and M. D. Purbolaksono, “A Multi-label Classification on Topic of Hadith Verses in Indonesian Translation using CART and Bagging,” J. Media Inform. Budidarma, vol. 6, no. 2, p. 868, 2022, doi: 10.30865/mib.v6i2.3787.

D. J. Ladani and N. P. Desai, “Stopword Identification and Removal Techniques on TC and IR applications: A Survey,” 2020 6th Int. Conf. Adv. Comput. Commun. Syst. ICACCS 2020, pp. 466–472, 2020, doi: 10.1109/ICACCS48705.2020.9074166.

J. Singh and V. Gupta, “Text stemming: Approaches, applications, and challenges,” ACM Comput. Surv., vol. 49, no. 3, pp. 1–46, 2016.

A. Averina, H. Hadi, and J. Siswantoro, “Analisis Sentimen Multi-Kelas Untuk Film Berbasis Teks Ulasan Menggunakan Model Regresi Logistik,” Teknika, vol. 11, no. 2, pp. 123–128, 2022, doi: 10.34148/teknika.v11i2.461.

N. Rezaeian and G. Novikova, “Persian text classification using naive bayes algorithms and support vector machine algorithm,” Indones. J. Electr. Eng. Informatics, vol. 8, no. 1, pp. 178–188, 2020, doi: 10.11591/ijeei.v8i1.1696.

R. Kashef, “A boosted SVM classifier trained by incremental learning and decremental unlearning approach,” Expert Syst. Appl., vol. 167, p. 114154, 2021, doi: 10.1016/j.eswa.2020.114154.

R. Yacouby and D. Axman, “Probabilistic Extension of Precision, Recall, and F1 Score for More Thorough Evaluation of Classification Models,” pp. 79–91, 2020, doi: 10.18653/v1/2020.eval4nlp-1.9.

D. M. W. Powers, “Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation,” pp. 37–63, 2020, [Online]. Available: http://arxiv.org/abs/2010.16061

J. A. Cottam, N. C. Heller, C. L. Ebsch, R. Deshmukh, P. MacKey, and G. Chin, “Evaluation of Alignment: Precision, Recall, Weighting and Limitations,” Proc. - 2020 IEEE Int. Conf. Big Data, Big Data 2020, pp. 2513–2519, 2020, doi: 10.1109/BigData50022.2020.9378064.

E. Dritsas and M. Trigka, “Lung Cancer Risk Prediction with Machine Learning Models,” Big Data Cogn. Comput., vol. 6, no. 4, 2022, doi: 10.3390/bdcc6040139.

R. Soleymani, E. Granger, and G. Fumera, “F-measure curves: A tool to visualize classifier performance under imbalance,” Pattern Recognit., vol. 100, 2020, doi: 10.1016/j.patcog.2019.107146.




DOI: http://dx.doi.org/10.32497/jaict.v8i1.4339

Refbacks

  • There are currently no refbacks.


ISSN: 2541-6340
Online ISSN: 2541-6359

Visitor: 

View My Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.