Learning reliable classifiers from small or incomplete data sets: the naive credal classifier 2

Corani, Giorgio and Zaffalon, Marco (2008) Learning reliable classifiers from small or incomplete data sets: the naive credal classifier 2. Journal of Machine Learning Research, 9. pp. 581-621. ISSN 1533-7928

Preview

Text
corani08b.pdf - Published Version
Download (30kB) | Preview

Official Website: http://jmlr.csail.mit.edu/papers/volume9/corani08b...

Abstract

In this paper, the naive credal classifier, which is a set-valued counterpart of naive Bayes, is extended to a general and flexible treatment of incomplete data, yielding a new classifier called naive credal classifier 2 (NCC2). The new classifier delivers classifications that are reliable even in the presence of small sample sizes and missing values. Extensive empirical evaluations show that, by issuing set-valued classifications, NCC2 is able to isolate and properly deal with instances that are hard to classify (on which naive Bayes accuracy drops considerably), and to perform as well as naive Bayes on the other instances. The experiments point to a general problem: they show that with missing values, empirical evaluations may not reliably estimate the accuracy of a traditional classifier, such as naive Bayes. This phenomenon adds even more value to the robust approach to classification implemented by NCC2.

Item Type:	Scientific journal article, Newspaper article or Magazine article
Uncontrolled Keywords:	imprecise probabilities, missing data, naive Bayes, naive credal classifier 2, Java
Subjects:	Mathematical sciences > Statistics > Statistical modelling Computer sciences > Artificial intelligence > Machine learning
Depositing User:	Giorgio Corani
Date Deposited:	12 May 2014 09:35
Last Modified:	06 Jun 2016 06:52
URI:	http://repository.supsi.ch/id/eprint/4728

Actions (login required)

View Item