Corani, Giorgio and Zaffalon, Marco (2008) Learning reliable classifiers from small or incomplete data sets: the naive credal classifier 2. Journal of Machine Learning Research, 9. pp. 581-621. ISSN 1533-7928
|
Text
corani08b.pdf - Published Version Download (30kB) | Preview |
Abstract
In this paper, the naive credal classifier, which is a set-valued counterpart of naive Bayes, is extended to a general and flexible treatment of incomplete data, yielding a new classifier called naive credal classifier 2 (NCC2). The new classifier delivers classifications that are reliable even in the presence of small sample sizes and missing values. Extensive empirical evaluations show that, by issuing set-valued classifications, NCC2 is able to isolate and properly deal with instances that are hard to classify (on which naive Bayes accuracy drops considerably), and to perform as well as naive Bayes on the other instances. The experiments point to a general problem: they show that with missing values, empirical evaluations may not reliably estimate the accuracy of a traditional classifier, such as naive Bayes. This phenomenon adds even more value to the robust approach to classification implemented by NCC2.
Item Type: | Scientific journal article, Newspaper article or Magazine article |
---|---|
Uncontrolled Keywords: | imprecise probabilities, missing data, naive Bayes, naive credal classifier 2, Java |
Subjects: | Mathematical sciences > Statistics > Statistical modelling Computer sciences > Artificial intelligence > Machine learning |
Depositing User: | Giorgio Corani |
Date Deposited: | 12 May 2014 09:35 |
Last Modified: | 06 Jun 2016 06:52 |
URI: | http://repository.supsi.ch/id/eprint/4728 |
Actions (login required)
View Item |