IR@KDU Repository

Binary and Multi-Class Classification Using Supervised Machine Learning Algorithms and Ensemble Model

Show simple item record

dc.contributor.author Asela, H
dc.date.accessioned 2021-12-24T06:31:31Z
dc.date.available 2021-12-24T06:31:31Z
dc.date.issued 2021
dc.identifier.uri http://ir.kdu.ac.lk/handle/345/5210
dc.description.abstract Classification is a vital aspect in data mining, where vast quantities of data are segregated into discrete classes. Models based on different statistical and machine learning approaches are used for this task. However, the classification performance depends on multiple factors like selected algorithm, domain and features of the dataset. The objective of this study is to evaluate the classification performance of widely used supervised machine learning algorithms; Decision Tree (DT), Naïve Bayes (NB) algorithm, Support Vector Classifier (SVC), KNearest Neighbour (KNN) algorithm and the Ensemble Model (EM) based on soft voting technique. These algorithms are tested on 6 datasets in different domains, and the datasets contain both multi-class and binary class data as well as balanced and imbalanced data. Accuracy, Precision and Recall are used as evaluation metrics to evaluate the classification performance in balanced datasets, where F1- measure is used in imbalanced dataset for the same task. The evaluation results indicate that EM outperformed single algorithms at most instances. When comparing single algorithms, KNN performed best with multi class classification, where SVC performed best in binary classification in balanced datasets. Also, KNN showed the best classification performance when it comes to imbalanced dataset. All the algorithms performed well when the data set is balanced. However, the classification performance in all models including EM is below expectation, when the data distribution is highly imbalanced. en_US
dc.language.iso en en_US
dc.subject classification en_US
dc.subject machine learning en_US
dc.subject supervised algorithms en_US
dc.subject ensemble model en_US
dc.subject soft voting classifier en_US
dc.title Binary and Multi-Class Classification Using Supervised Machine Learning Algorithms and Ensemble Model en_US
dc.type Article Full Text en_US
dc.identifier.journal KDU IRC, 2021 en_US
dc.identifier.issue Faculty of Computing en_US
dc.identifier.pgnos 130-136 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search IR@KDU


Browse

My Account