Enhanced Classification Accuracy on Naive Bayes Data Mining Models

Dahal, Keshav, Hossain, Alamgir, Mofizur Rahman, Chowdhury and Kabir, Faisal (2011) Enhanced Classification Accuracy on Naive Bayes Data Mining Models. International Journal of Computer Applications, 28 (3). pp. 9-16. ISSN 0975-8887

Full text not available from this repository. (Request a copy)
Official URL: http://dx.doi.org/10.5120/3371-4657


A classification paradigm is a data mining framework containing all the concepts extracted from the training dataset to differentiate one class from other classes existed in data. The primary goal of the classification frameworks is to provide a better result in terms of accuracy. However, in most of the cases we can not get better accuracy particularly for huge dataset and dataset with several groups of data . When a classification framework considers whole dataset for training then the algorithm may become unusuable because dataset consisits of several group of data. The alternative way of making classification useable is to identify a similar group of data from the whole training data set and then training each group of similar data. In our paper, we first split the training data using k-means clustering and then train each group with Naive Bayes Classification algorithm. In addition, we saved each model to classify sample or unknown or test data. For unknown data, we classify with the best match group/model and attain higher accuracy rate than the conventional Naive Bayes classifier.

Item Type: Article
Uncontrolled Keywords: Classification, naive bayes, clustering, classification accuracy
Subjects: G900 Others in Mathematical and Computing Sciences
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Ay Okpokam
Date Deposited: 18 Jan 2012 10:48
Last Modified: 13 Oct 2019 00:31
URI: http://nrl.northumbria.ac.uk/id/eprint/4564

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics