Machine Learning Models to Identify and Classify Clickbait Headlines Accurately

https://doi.org/10.48185/jaai.v6i1.1148

Authors

  • Waleed Alanazi College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia
  • Abdulghani B. Alshaibani College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia
  • Badiea Abdulkarem Mohammed Al-Shaibani College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia
  • Zeyad Ghaleb Al-Mekhlafi College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia
  • Abdulraheem B. Alshaibani College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia
  • Anwar B. Alshaibani College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia

Keywords:

Machine learning, Clickbait headlines, classification, identification, news credibility

Abstract

One potential research problem related to clickbait data could be to investigate the impact of clickbait headlines on news consumption and perception of news credibility. The objective of using Machine Learning (ML) models to analyze clickbait data in this work is to determine an accurate model for identifying and classifying clickbait headlines, understand the features that make them successful, evaluate the model's performance in real-world scenarios, and compare the performance of different ML models to select the best one for clickbait classification. By achieving these objectives, the research could provide valuable insights into the mechanisms behind clickbait and the effectiveness of ML models in detecting and mitigating its impact. This research could inform the development of more effective algorithms and tools for combating clickbait and improving news literacy. The suggested methodology for detecting clickbait using machine learning involves collecting a large amount of clickbait and non-clickbait headlines, pre-processing and cleaning the data, identifying and extracting relevant features, selecting an appropriate ML algorithm, training and evaluating the model, making necessary adjustments, and deploying the final model in a production environment to detect clickbait in real-world data. The specific steps and details may vary depending on the task complexity and data availability.

Downloads

Download data is not yet available.

Author Biography

Badiea Abdulkarem Mohammed Al-Shaibani, College of Computer Science and Engineering, University of Ha’il, Hail, 11800, Saudi Arabia

Badiea Abdulkarem Mohammed received his BSc in Computer Science from Babylon University, Iraq in 2002, M.Tech in Computer Science from University of Hyderabad, India in 2007 and PhD from Universiti Sains Malaysia, Malaysia in 2018. He is currently an Assistant Professor in the College of Computer Science and Engineering at University of Hail, KSA. He is permanently Assistant Professor at Hodeidah University, Yemen. His research focuses on Wireless Networks, Mobile Networks, Vehicle networks, WSN, Cybersecurity, and Image Processing. He is an IEEE member, Member, IAENG member, and ASR member. In his research area, he has published many papers in reputed journals and conferences.

References

A. Chakraborty, B. Paranjape, S. Kakarla and N. Ganguly, "Stop Clickbait: Detecting and preventing clickbaits in online news media," 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), San Francisco, CA, USA, 2016, pp. 9-16, doi: 10.1109/ASONAM.2016.7752207.

J. Alghamdi, Y. Lin and S. Luo, “A Comparative Study of Machine Learning and Deep Learning Techniques for Fake News Detection,” Information, vol. 13, no. 12, pp. 576, 2022. https://doi.org/10.3390/info13120576

S. Mishra, P. Shukla and R. Agarwal, "Analyzing Machine Learning Enabled Fake News Detection Techniques for Diversified Datasets," Wireless Communications and Mobile Computing, vol. 2022, Article ID 1575365, 18 pages, 2022. https://doi.org/10.1155/2022/1575365

M. Al-Sarem, F. Saeed, Z. G. Al-Mekhlafi, B. A. Mohammed, M. Hadwan, T. Al-Hadhrami, M. T. Alshammari, A. Alreshidi and T. S. Alshammari, “An Improved Multiple Features and Machine Learning-Based Approach for Detecting Clickbait News on Social Networks,” Applied Sciences, vol. 11, no. 20, pp. 9487, 2021. doi: 10.3390/app11209487

I. Ahmad, M. A. Alqarni, A. A. Almazroi and A. Tariq, " Experimental Evaluation of Clickbait Detection Using Machine Learning Models," Intelligent Automation & Soft Computing, vol. 26, no. 6, pp. 1335-1344, 2020. doi: 10.32604/iasc.2020.013861.

B Siregar and I Habibie and E B Nababan and Fahmi, " Identification of Indonesian clickbait news headlines with long short-term memory recurrent neural network algorithm," ournal of Physics: Conference Series, vol. 1882, no. 1, pp. 012129, 2021. doi: 10.1088/1742-6596/1882/1/012129.

K. Sharifani, M. Amini, Y. Akbari and J. A. Godarzi, “Operating Machine Learning across Natural Language Processing Techniques for Improvement of Fabricated News Model” International Journal of Science and Information System Research, vol. 12, no. 9, pp. 20 - 44, 2022. Available at SSRN: https://ssrn.com/abstract=4251017

M. Dong, L. Yao, X. Wang, B. Benatallah and C. Huang, “Similarity-Aware Deep Attentive Model for Clickbait Detection,” In: Yang, Q., Zhou, ZH., Gong, Z., Zhang, ML., Huang, SJ. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2019. Lecture Notes in Computer Science, vol 11440. Springer, Cham. https://doi.org/10.1007/978-3-030-16145-3_5

B. Naeem, A. Khan, M. O. Beg and H. Mujtaba, “A deep learning framework for clickbait detection on social area network using natural language cues,” Journal of Computational Social Science, vol. 3, pp. 231–243, 2020. https://doi.org/10.1007/s42001-020-00063-y

D. M. Daoud and S. Abou El-Seoud, “An Effective Approach for Clickbait Detection Based on Supervised Machine Learning Technique”, International Journal of Online and Biomedical Engineering, vol. 15, no. 03, pp. 21–32, 2019.

M. Al-Sarem, F. Saeed, Z. G. Al-Mekhlafi, B. A. Mohammed, M. Hadwan, T. Al-Hadhrami, M. T. Alshammari, A. Alreshidi and T. S. Alshammari, “An Improved Multiple Features and Machine Learning-Based Approach for Detecting Clickbait News on Social Networks,” Applied Sciences, vol. 11, no. 20, pp. 9487, 2021. doi: 10.3390/app11209487

I. Ahmad, M. A. Alqarni, A. A. Almazroi and A. Tariq, " Experimental Evaluation of Clickbait Detection Using Machine Learning Models," Intelligent Automation & Soft Computing, vol. 26, no. 6, pp. 1335-1344, 2020. doi: 10.32604/iasc.2020.013861.

B Siregar and I Habibie and E B Nababan and Fahmi, " Identification of Indonesian clickbait news headlines with long short-term memory recurrent neural network algorithm," ournal of Physics: Conference Series, vol. 1882, no. 1, pp. 012129, 2021. doi: 10.1088/1742-6596/1882/1/012129.

K. Sharifani, M. Amini, Y. Akbari and J. A. Godarzi, “Operating Machine Learning across Natural Language Processing Techniques for Improvement of Fabricated News Model” International Journal of Science and Information System Research, vol. 12, no. 9, pp. 20 - 44, 2022. Available at SSRN: https://ssrn.com/abstract=4251017

M. Dong, L. Yao, X. Wang, B. Benatallah and C. Huang, “Similarity-Aware Deep Attentive Model for Clickbait Detection,” In: Yang, Q., Zhou, ZH., Gong, Z., Zhang, ML., Huang, SJ. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2019. Lecture Notes in Computer Science, vol 11440. Springer, Cham. https://doi.org/10.1007/978-3-030-16145-3_5

B. Naeem, A. Khan, M. O. Beg and H. Mujtaba, “A deep learning framework for clickbait detection on social area network using natural language cues,” Journal of Computational Social Science, vol. 3, pp. 231–243, 2020. https://doi.org/10.1007/s42001-020-00063-y

D. M. Daoud and S. Abou El-Seoud, “An Effective Approach for Clickbait Detection Based on Supervised Machine Learning Technique”, International Journal of Online and Biomedical Engineering, vol. 15, no. 03, pp. 21–32, 2019.

Tong Liu, Ke Yu, Lu Wang, Xuanyu Zhang, Hao Zhou, Xiaofei Wu, “Clickbait detection on WeChat: A deep model integrating semantic and syntactic,” information Knowledge-Based Systems, Vol. 245, pp. 108605, 2022. https://doi.org/10.1016/j.knosys.2022.108605

Mark Bronakowski, Mahmood Al-khassaweneh, and Ali Al Bataineh, “Automatic Detection of Clickbait Headlines Using Semantic Analysis and Machine Learning Techniques,” Applied Sciences, vol. 13, no. 4, pp. 2456, 2023. https://doi.org/10.3390/app13042456

Abdul Razaque, Bandar Alotaibi, Munif Alotaibi, Shujaat Hussain, Aziz Alotaibi, and Vladimir Jotsov, “Clickbait Detection Using Deep Recurrent Neural Network,” Applied Sciences, vol. 12, no. 1, pp. 504, 2022. https://doi.org/10.3390/app12010504

Valentina Apresjan and Alexander Orlov, “Pragmatic mechanisms of manipulation in Russian online media: How clickbait works (or does not),” Journal of Pragmatics, vo. 195, pp. 91-108, 2022. https://doi.org/10.1016/j.pragma.2022.02.003

Jiaming Zheng, Ke Yu, and Xiaofei Wu, “ A deep model based on Lure and Similarity for Adaptive Clickbait Detection,” Knowledge-Based Systems, Vol. 214, pp. 106714, 2021. https://doi.org/10.1016/j.knosys.2020.106714

A. Al Bataineh, D. Kaur and S. M. J. Jalali, "Multi-Layer Perceptron Training Optimization Using Nature Inspired Computing," in IEEE Access, vol. 10, pp. 36963-36977, 2022, doi: 10.1109/ACCESS.2022.3164669.

P. Rajapaksha, R. Farahbakhsh and N. Crespi, "BERT, XLNet or RoBERTa: The Best Transfer Learning Model to Detect Clickbaits," in IEEE Access, vol. 9, pp. 154704-154716, 2021, doi: 10.1109/ACCESS.2021.3128742.

Abdul Razaque; Bandar Alotaibi; Munif Alotaibi; Fathi Amsaad; Ansagan Manasov; Salim Hariri; Banu B. Yergaliyeva; Aziz Alotaib et al., "Blockchain-Enabled Deep Recurrent Neural Network Model for Clickbait Detection," in IEEE Access, vol. 10, pp. 3144-3163, 2022, doi: 10.1109/ACCESS.2021.3137078.

Christin, A. Counting clicks: Quantification and variation in web journalism in the United States and France. Am. J. Sociol. 2018, 123, 1382–1415.

Razaque, A.; Alotaibi, B.; Alotaibi, M.; Hussain, S.; Alotaibi, A.; Jotsov, V. Clickbait Detection Using Deep Recurrent Neural Network. Appl. Sci. 2022, 12, 504.

Thomas, P. Clickbait identification using neural networks. arXiv 2017, rXiv:1710.08721.

Kumar, V.; Khattar, D.; Gairola, S.; Kumar Lal, Y.; Varma, V. Identifying clickbait: A multi-strategy approach using neural networks. In Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, Ann Arbor, MI, USA, 8–12 July 2018; pp. 1225–1228.

Cao, X.; Le, T. Machine learning based detection of clickbait posts in social media. arXiv 2017, arXiv:1710.01977.

Published

2025-04-15

How to Cite

Alanazi, W., Alshaibani, A. B., Al-Shaibani, B. A. M., Al-Mekhlafi, Z. G., Alshaibani, A. B., & Alshaibani, A. B. (2025). Machine Learning Models to Identify and Classify Clickbait Headlines Accurately. Journal of Applied Artificial Intelligence, 6(1), 1–17. https://doi.org/10.48185/jaai.v6i1.1148

Most read articles by the same author(s)