Academia.eduAcademia.edu

Semantic Annotation and Search

2016

Abstract

It is standard to perform classification tasks under the assumption that class labels are deterministic. In this context, the F-measure is an increasingly popular measure of performance for a classifier, and expresses a flexible trade-off between precision and recall. However, it may just as easily be advisable to remove this assumption and consider instances as belonging to each class with given probabilities. The presence of uncertainty in a training set may be due to subjectivity of a classification task or noise introduced during data collection. In this paper, we adapt the classical F-measure to the uncertain context and present an efficient, easy-to-implement algorithm for the optimization of this new “noisy ” F-measure within the maximum entropy modeling framework. We provide comprehensive theoretical justification along with numerical experiments that demon-strate the novelty and effectiveness of this approach.