Print ISSN: 1681-6900

Online ISSN: 2412-0758

Keywords : Random Forest

Textual Dataset Classification Using Supervised Machine Learning Techniques

Hanan Q. Jaleel; Jane J. Stephan; Sinan A. Naji

Engineering and Technology Journal, 2022, Volume 40, Issue 4, Pages 527-538
DOI: 10.30684/etj.v40i4.1970

Text classification has been a significant domain of study and research because of the increased volume of text datasets and documents available in digital format. Text classification is one of the major approaches used to arrange digital information via automatically allocating text dataset records or documents into predetermined classes depending on their contents. This paper proposes a technique that implements supervised machine learning algorithms such as KNN, Decision tree, Random Forest, Bernoulli Naive Bayes, and Multinomial Naive Bayes classifiers to classify a dataset into distinct classes. The proposed technique combines the above-mentioned machine learning classifiers with the TF-IDF feature extraction method as a vector space model to achieve more precise classification results. The proposed technique yields high accuracy, precision, recall, and f1-measure metric values for all the implemented classifiers. After comparing the obtained results of different classifiers, it is found that the Random Forest classifier is the best algorithm used to classify the textual dataset records with the highest accuracy value of 0.9995930.

Random Forest (RF) and Artificial Neural Network (ANN) Algorithms for LULC Mapping

Tay H. Shihab; Amjed N. Al-Hameedawi; Ammar M. Hamza

Engineering and Technology Journal, 2020, Volume 38, Issue 4A, Pages 510-514
DOI: 10.30684/etj.v38i4A.399

In this paper to make use of complementary potential in the mapping of LULC spatial data is acquired from LandSat 8 OLI sensor images are taken in 2019. They have been rectified, enhanced and then classified according to Random forest (RF) and artificial neural network (ANN) methods. Optical remote sensing images have been used to get information on the status of LULC classification, and extraction details. The classification of both satellite image types is used to extract features and to analyse LULC of the study area. The results of the classification showed that the artificial neural network method outperforms the random forest method. The required image processing has been made for Optical Remote Sensing Data to be used in LULC mapping, include the geometric correction, Image Enhancements, The overall accuracy when using the ANN methods 0.91 and the kappa accuracy was found 0.89 for the training data set. While the overall accuracy and the kappa accuracy of the test dataset were found 0.89 and 0.87 respectively