Document Type : Research Paper


Computer Science Department, University of Technology, Baghdad, Iraq,


Due to the increasing electronic publishing sites for printed and certified newspapers, the reader faced the problem of reaching his goal by accessing these sites, which led to the neglect of a large section of important publications. The provision of an automated measure to verify the positive and negative articles based on the analysis of readers' comments on the articles is a necessity to see the important articles that are compatible with the corpus generated by us for inference. The project achieved the previous target and achieved a success rate. The Bag-of-Words Model (BoW) was used to obtain the repetition of the block of words to build the corpus. The proposed system is evaluated based on four metrics (Accuracy= 93%, Precision= 94%, Recall= 94% and F-measure= 94%). The effectiveness results obtain by this system was (Accuracy, Precision and Recall and F-measure).


[1] L. Mathe, and M. Caldwell, “A content analysis of readers’ comments on political and economic news articles from zimbabwe online media,” Communication, Vol. 8, No. 1, pp. 56-68, 2017.
[2] N. Diakopoulos, and M. Naaman, “Towards quality discourse in online news comments,” The 2nd int. Conference On Building Construction And Environment-BCEE2, China, 2015.
[3] F. S Faiaz, and H. K Milh, “News trends in the 2010 Iraq Council of Representatives elections in the electronic press,” (available in Arabic) Adab al-Farahidi, No. 24, pp. 133-142, 2016.
[4] K.H Hasan, “Patterns of use of electronic newspapers by the Iraqi public,” (available in Arabic) Medad Arts, Vol. 1, No. 16, pp. 759-784, 2019.
[5] A. AbdelRaouf, C. A. Higgins, T. Pridmore, and M. Khalil, “Building a multi- modal Arabic corpus (MMAC),” Document Analysis and Recognition, Vol. 13, No. 4, pp. 285-302, 2010.
[6] M. El-Haj, and R. Koulali, “Kalimat a Multipurpose Arabic Corpus,” Second workshop on Arabic corpus linguistics (WACL-2), 22-25, 2013.
[7] S. R. McMillen, “Threads of deliberation: a textual analysis of online news comments,” MSc. Thesis, Scripps College of Communication, Univ. of Ohio, 2013.
[8] C. W. Kim, H. Park, Y. Lee, and H. W. Park, “Towards the operationalization of controversial news: a study of online news articles and reader comments during the 2017 presidential election in south Korea,” Cement, concrete, and aggregates, Vol. 53, No. 3, pp. 1097-1108, 2018.
[9] N. Henrich, and B. Holmes, “Web news readers’ comments: towards developing a ethodology for using on-line comments in social inquiry,” Media and Communication Studies, Vol. 5, No. 1, pp. 1-4, 2013.
[10] M. Z. Kurdi, “Natural language processing and computational linguistics,” John & Sons, UK and USA, 2016.
[11] L. Deng, and Y. Liu, “Deep Learning in natural language processing,” Springer, USA and China, 2018.
[12] P. Goos, and D. Meintrup, “Statistics with JMP: graphs, descriptive statistics, and probability,” John Wiley & Sons, India, 2015.
[13] K. Shaalan, A. Hassanien, and F. Tolba, “Intelligent natural language processing: trends and applications,” Springer, United Arab Emirates and Egypt, 2018.
[14] D. S. Al-Azzawy, and F. M. Al-Rufaye, “Arabic words clustering by using K- means algorithm,” Annual Conference on New Trends in Information & Communications Technology Applications (NTICT), IEEE, 2017.
[15] J. Eisenstein, “Natural language processing,” Cambridge, MA: The MIT Press, 2018.
[16] K. Soania, “Attitudes of the student audience towards electronic journalism a field study on a sample of university students using the Internet” (available in Arabic) M.Sc. Thesis, Humanities Dept., Univ. of Muhammd Kider, Algeria, 2011.