Binary and Multi-Class Classification Using Supervised Machine Learning Algorithms and Ensemble Model

Asela, H

dc.contributor.author	Asela, H
dc.date.accessioned	2021-12-24T06:31:31Z
dc.date.available	2021-12-24T06:31:31Z
dc.date.issued	2021
dc.identifier.uri	http://ir.kdu.ac.lk/handle/345/5210
dc.description.abstract	Classification is a vital aspect in data mining, where vast quantities of data are segregated into discrete classes. Models based on different statistical and machine learning approaches are used for this task. However, the classification performance depends on multiple factors like selected algorithm, domain and features of the dataset. The objective of this study is to evaluate the classification performance of widely used supervised machine learning algorithms; Decision Tree (DT), Naïve Bayes (NB) algorithm, Support Vector Classifier (SVC), KNearest Neighbour (KNN) algorithm and the Ensemble Model (EM) based on soft voting technique. These algorithms are tested on 6 datasets in different domains, and the datasets contain both multi-class and binary class data as well as balanced and imbalanced data. Accuracy, Precision and Recall are used as evaluation metrics to evaluate the classification performance in balanced datasets, where F1- measure is used in imbalanced dataset for the same task. The evaluation results indicate that EM outperformed single algorithms at most instances. When comparing single algorithms, KNN performed best with multi class classification, where SVC performed best in binary classification in balanced datasets. Also, KNN showed the best classification performance when it comes to imbalanced dataset. All the algorithms performed well when the data set is balanced. However, the classification performance in all models including EM is below expectation, when the data distribution is highly imbalanced.	en_US
dc.language.iso	en	en_US
dc.subject	classification	en_US
dc.subject	machine learning	en_US
dc.subject	supervised algorithms	en_US
dc.subject	ensemble model	en_US
dc.subject	soft voting classifier	en_US
dc.title	Binary and Multi-Class Classification Using Supervised Machine Learning Algorithms and Ensemble Model	en_US
dc.type	Article Full Text	en_US
dc.identifier.journal	KDU IRC, 2021	en_US
dc.identifier.issue	Faculty of Computing	en_US
dc.identifier.pgnos	130-136	en_US

Files in this item

Name:: 13.pdf
Size:: 518.8Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Computing [62]

Show simple item record