Classification of Hoax News Using the Naïve Bayes Method

Main Article Content

Rama Qubra
Rizal Adi Saputra

Abstract

The rampant dissemination of false and unsourced information, commonly known as hoaxes, has become a pervasive issue in the era of internet media. In the digital age, the widespread dissemination of false and unverified information has emerged as a critical concern within the realm of internet media. Hoax news can be used to influence elections, sway public opinion, and create political instability. The rapid evolution of information technology has contributed to the uncontrollable proliferation of hoax content, necessitating the development of intelligent systems for effective classification. This research focuses on implementing a robust classification system for identifying hoax news circulating through internet media. The method used in this program is the Naive Bayes method, specifically Naive Bayes Multinomial, which works with the assumption that each feature (word) is considered independent from the others. Text vectorization using CountVectorizer converts text into a numeric vector, which can be used by classification algorithms. This program uses a trained model to make predictions on testing data and calculate evaluation metrics such as accuracy, confusion matrix, and classification reports. By leveraging these methodologies, the study aims to enhance the accuracy and efficiency of distinguishing genuine news from deceptive hoaxes. The highest accuracy value obtained in this research was 94.73% with a division of 20% test data and 80% training data. True Negative (TN): 4555, False Positive (FP): 178 and False Negative (FN): 295, True Positive (TP): 3952

Article Details

How to Cite
Qubra, R., & Saputra, R. A. (2024). Classification of Hoax News Using the Naïve Bayes Method. International Journal Software Engineering and Computer Science (IJSECS), 4(1), 40–48. https://doi.org/10.35870/ijsecs.v4i1.2068
Section
Articles
Author Biographies

Rama Qubra, Universitas Halu Oleo

Universitas Halu Oleo, Kendari City, Southeast Sulawesi, Indonesia

Rizal Adi Saputra, Universitas Halu Oleo

Universitas Halu Oleo, Kendari City, Southeast Sulawesi, Indonesia

References

Pratiwi, I. Y. R., Asmara, R. A., & Rahutomo, F. (2018). Study of hoax news detection using naïve bayes classifier in Indonesian language. Proceedings of the 11th International Conference on Information and Communication Technology and System, ICTS 2017, 73–78. https://doi.org/10.1109/ICTS.2017.8265649

Santoso, H. A., Rachmawanto, E. H., Nugraha, A., Nugroho, A. A., Setiadi, D. R. I. M., & Basuki, R. S. (2020). Hoax classification and sentiment analysis of Indonesian news using Naive Bayes optimization. Telkomnika (Telecommunication Computing Electronics and Control), 18(2), 799–806. https://doi.org/10.12928/TELKOMNIKA.V18I2.14744

Muhabatin, H., Prabowo, C., Ali, I., Lukman Rohmat, C., Rizki Amalia, D., sitasi, C., & Rizki, D. (2021). Klasifikasi Berita Hoax Menggunakan Algoritma Naïve Bayes Berbasis PSO. Informatics for Educators and Professionals, 5(2), 156–165. https://turnbackhoax.id/

Febriyanty, N. E., Hariyadi, M. A., & Crysdian, C. (2023). Hoax Detection News Using Naïve Bayes and Support Vector Machine Algorithm. International Journal of Advances in Data and Information Systems, 4(2), 191–200. https://doi.org/10.25008/ijadis.v4i2.1306

Pasaribu, N. A., & Sriani. (2023). The Shopee Application User Reviews Sentiment Analysis Employing Naïve Bayes Algorithm. International Journal Software Engineering and Computer Science (IJSECS), 3(3), 194–204. https://doi.org/10.35870/ijsecs.v3i3.1699

Mustofa, H., & Mahfudh, A. A. (2019). Klasifikasi Berita Hoax Dengan Menggunakan Metode Naive Bayes. Walisongo Journal of Information Technology, 1(1), 1–12. https://doi.org/10.21580/wjit.2019.1.1.3915

Audina, M., Karyawati, A. E., Supriana, I. W., Suhartana, I. K. G., Astawa, I. G. S., & Santiyasa, I. W. (2022). Klasifikasi Berita Hoaks Covid-19 Menggunakan Kombinasi Metode K-Nearest Neighbor dan Information Gain. Jurnal Elektronik Ilmu Komputer Udayana, 10(4), 319–327. https://cekfakta.com

Ramadhan, N. G., Adhinata, F. D., Segara, A. J. T., & Rakhmadani, D. P. (2022). Deteksi Berita Palsu Menggunakan Metode Random Forest dan Logistic Regression. JURIKOM (Jurnal Riset Komputer), 9(2), 251. https://doi.org/10.30865/jurikom.v9i2.3979

Sagita, R., Enri, U., & Primajaya, A. (2020). Klasifikasi Berita Clickbait Menggunakan K-Nearest Neighbor (KNN). JOINS (Journal of Information System), 5(2), 230–239. https://doi.org/10.33633/joins.v5i2.3705

Dinesh, T., & Rajendran, T. (2021). Higher Classification of Fake Political News Using Decision Tree Algorithm Over Naive Bayes Algorithm. REVISTA GEINTEC-GESTAO INOVACAO E TECNOLOGIAS, 2-11, 1084–1096. https://doi.org/10.03.2021

Siti Nurul Hassanah, W., Puji Lestari, Y., & Adi Saputra, R. (2023). Digital Image Processing to Detect Cracks in Buildings Using Naïve Bayes Algorithm (Case Study: Faculty of Engineering, Halu Oleo University). Jurnal Informatika Dan Teknologi Informasi, 20(1), 1–14. https://doi.org/10.31515/telematika.v20i1.8925

Zaman, B., Justitia, A., Sani, K. N., & Purwanti, E. (2020). An Indonesian Hoax News Detection System Using Reader Feedback and Naïve Bayes Algorithm. Cybernetics and Information Technologies, 20(1), 82–94. https://doi.org/10.2478/cait-2020-0006

Kesumawati, A., & Thalib, A. K. (2018). Hoax Classification with Term Frequency-Inverse Document Frequency Using Non-Linear SVM and Naïve Bayes. Int. J. Advance Soft Compu. Appl, 10(3).

Ogilvie, T. L., Sharma, M., & Huu, Z. X. (2022). Fake news detection using naive bayes classifier and forward selection in the digital era. Machine Learning, 29(2/3), 131-163. https://doi.org/10.1023/a:1007465528199

Muhammad Habib, S., Haerani, E., Kurnia Gusti, S., Ramadhani, S., & Informatika UIN Sultan Syarif Kasim Riau Jl Soebrantas, T. H. (2022). Klasifikasi Berita Menggunakan Metode Naïve Bayes Classifier. Jurnal Nasional Komputasi Dan Teknologi Informasi, 5(2), 248–258.

Nigam, K., McCallum, A., Thrun, S., & Mitchell, T. M. (1998). Using em to classify text from labeled and unlabeled documents. https://doi.org/10.21236/ada350490

Carvalho, V. D. H. d., Nepomuceno, T. C. C., Poleto, T., Turet, J. G., & Costa, A. P. C. S. (2022). Mining public opinions on covid-19 vaccination: a temporal analysis to support combating misinformation. Tropical Medicine and Infectious Disease, 7(10), 256. https://doi.org/10.3390/tropicalmed7100256

Senhadji, S., & Ahmed, R. A. S. (2022). Fake news detection using naïve bayes and long short term memory algorithms. IAES International Journal of Artificial Intelligence (IJ-AI), 11(2), 746. https://doi.org/10.11591/ijai.v11.i2.pp746-752

Valentina, S., & Songpan, W. (2022). Features extraction based on probability weighting for fake news classification on social media. Current Applied Science and Technology, 23(2). https://doi.org/10.55003/cast.2022.02.23.014

Davardoost, F., Sangar, A. B., & Majidzadeh, K. (2022). An innovative model for extracting olap cubes from nosql database based on scalable naïve bayes classifier. Mathematical Problems in Engineering, 2022, 1-11. https://doi.org/10.1155/2022/2860735

Muhamad, H., Prasojo, C. A., Sugianto, N. A., Surtiningsih, L., & Cholissodin, I. (2017). Optimasi naïve bayes classifier dengan menggunakan particle swarm optimization pada data iris. Jurnal Teknologi Informasi Dan Ilmu Komputer, 4(3), 180. https://doi.org/10.25126/jtiik.201743251

Friedman, N., Geiger, D., & Goldszmidt, M. (1997). Untitled. Machine Learning, 29(2/3), 131-163. https://doi.org/10.1023/a:1007465528199

Taheri, S., Yearwood, J., Mammadov, M., & Seifollahi, S. (2013). Attribute weighted naive bayes classifier using a local optimization. Neural Computing and Applications, 24(5), 995-1002. https://doi.org/10.1007/s00521-012-1329-z.