In many supervised learning applications, additional information about the training data is available. Recently, Vapnik introduced Learning Using Privileged Information (LUPI), a learning paradigm that exploits such privileged (or additional) information, and described the SVM+ technique for processing it in batch mode. Following this method, we show how conformal predictors can be used to handle additional information. An application to a medical diagnostic problem is considered and the results are reported.
2021
Conformal Predictors (CP) are wrappers around ML models, providing error guarantees under weak assumptions on the data distribution. They are suitable for a wide range of problems, from classification and regression to anomaly detection. Unfortunately, their very high computational complexity limits their applicability to large datasets. In this work, we show that it is possible to speed up a CP classifier considerably, by studying it in conjunction with the underlying ML method, and by exploiting incremental and decremental learning. For methods such as k-NN, KDE, and kernel LSSVM, our approach reduces the running time by one order of magnitude, whilst producing exact solutions. With similar ideas, we also achieve a linear speed up for the harder case of bootstrapping. Finally, we extend these techniques to improve upon an optimization of k-NN CP for regression. We evaluate our findings empirically, and discuss when methods are suitable for CP optimization.
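One way to picture the incremental idea described above, for a k-NN-based conformal predictor: distances between examples do not depend on labels, so the pairwise distance matrix can be computed once and merely extended with one row and column per test point, instead of being rebuilt for every candidate label. The following pure-Python toy (1-D points; function names are our own, not the paper's) sketches only this caching step, not the full exact algorithm:

```python
# Toy sketch: cache pairwise distances once, then extend the matrix
# for a new test point instead of recomputing everything per label.

def pairwise(points):
    """Full pairwise distance matrix for 1-D points."""
    return [[abs(a - b) for b in points] for a in points]

def extend(dist, points, x_new):
    """Extend a cached distance matrix with one new point.

    Only len(points) new distances are computed; the candidate label
    of x_new is irrelevant, so all labels reuse this same matrix.
    """
    row = [abs(x_new - p) for p in points]
    return [d + [row[i]] for i, d in enumerate(dist)] + [row + [0.0]]

pts = [0.0, 1.0, 2.0]
cache = pairwise(pts)          # computed once for the training set
full = extend(cache, pts, 0.5) # cheap update per test point
```

The saving is that trying every candidate label for a test point costs one matrix extension rather than a full recomputation of all leave-one-out distances.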
ArXiv, 2021
The property of conformal predictors to guarantee the required accuracy rate makes this framework attractive in various practical applications. However, this property is achieved at the price of a reduction in precision. In the case of conformal classification, the systems can output multiple class labels instead of one. It is also known from the literature that the choice of nonconformity function has a major impact on the efficiency of conformal classifiers. Recently, it was shown that different model-agnostic nonconformity functions result in conformal classifiers with different characteristics. For a Neural Network-based conformal classifier, the inverse probability (or hinge loss) allows minimizing the average number of predicted labels, and margin results in a larger fraction of singleton predictions. In this work, we aim to further extend this study. We perform an experimental evaluation using 8 different classification algorithms and discuss when the previously observed relationship holds or not.
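The two model-agnostic nonconformity functions the abstract contrasts can be written down directly from a classifier's predicted class probabilities. A minimal sketch (the probability values below are made up for illustration, not the output of any actual model):

```python
# Two model-agnostic nonconformity functions over predicted class
# probabilities: inverse probability (hinge) and margin.

def inverse_probability(probs, true_label):
    """1 - P(true label): low when the model is confident in the truth."""
    return 1.0 - probs[true_label]

def margin(probs, true_label):
    """Highest competing probability minus the true-label probability.

    Negative values mean the true label dominates; margin tends to
    produce more singleton prediction sets.
    """
    other = max(p for lab, p in probs.items() if lab != true_label)
    return other - probs[true_label]

probs = {'cat': 0.7, 'dog': 0.2, 'bird': 0.1}
print(inverse_probability(probs, 'cat'))  # ≈ 0.3
print(margin(probs, 'cat'))               # ≈ -0.5 (true label dominates)
```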
IFIP Advances in Information and Communication Technology, 2011
HAL is a multidisciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
2004
Class imbalance is a widespread problem in many classification tasks such as medical diagnosis and text categorization. To overcome this problem, we investigate one-class SVMs, which can be trained to differentiate two classes on the basis of examples from a single class. We propose an improvement of one-class SVMs via a conformal kernel transformation, as described in the context of binary SVM classifiers by [2,3]. We tested this improved one-class SVM on a health care problem that involves discriminating the 11% of nosocomially infected patients from the 89% of non-infected patients. The results obtained are encouraging: compared with three other SVM-based approaches to coping with class imbalance, one-class SVMs achieved the highest sensitivity recorded so far on the nosocomial infection dataset. However, the price to pay is a concomitant decrease in specificity, and it is for domain experts to decide the proportion of false positive cases they are willing to accept in order to ensure treatment of all infected patients.
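The conformal kernel transformation mentioned above rescales a base kernel k(x, z) as c(x)·k(x, z)·c(z), where the conformal factor c magnifies the induced metric near selected points (e.g., near the approximate decision boundary). A minimal 1-D sketch under our own illustrative parameter choices, not the paper's exact construction:

```python
# Conformal transformation of an RBF kernel: k~(x, z) = c(x) k(x, z) c(z),
# where c(x) is large near chosen "boundary" points. All values are
# illustrative; real use would pick the points from a trained SVM.

import math

def rbf(x, z, gamma=1.0):
    """Base RBF kernel on 1-D inputs."""
    return math.exp(-gamma * (x - z) ** 2)

def conformal_factor(x, boundary_points, tau=2.0):
    """Conformal factor: peaks near the given boundary points."""
    return sum(math.exp(-tau * (x - b) ** 2) for b in boundary_points)

def conformal_kernel(x, z, boundary_points):
    """Transformed kernel; still symmetric and positive semi-definite."""
    return (conformal_factor(x, boundary_points)
            * rbf(x, z)
            * conformal_factor(z, boundary_points))
```

The effect is to spread apart points near the boundary in feature space, which is the mechanism [2,3] use to improve the resolution of the SVM decision surface where it matters most.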
Pattern Recognition, 2022
2021
We use influence functions from robust statistics to speed up full conformal prediction. Traditionally, conformal prediction requires retraining multiple leave-one-out classifiers to calculate p-values for each test point. By using influence functions, we are able to approximate this procedure and reduce its running time considerably.
2010
The Conformal Predictions framework is a recent development in machine learning to associate reliable measures of confidence with results in classification and regression. This framework is founded on the principles of algorithmic randomness (Kolmogorov complexity), transductive inference and hypothesis testing. While the formulation of the framework guarantees validity, the efficiency of the framework depends greatly on the choice of the classifier and appropriate kernel functions or parameters. While this framework has ...
2015
The report summarises some preliminary findings of WP1.4: Confidence Estimation and Feature Significance. It presents an application of conformal predictors, in transductive and inductive modes, to the large, high-dimensional, sparse and imbalanced data sets found in Compound Activity Prediction from the PubChem public repository. The report describes a version of conformal predictors called the Mondrian Predictor that keeps validity guarantees for each class. The experiments were conducted using several non-conformity measures extracted from underlying algorithms such as SVM, Nearest Neighbours and Naïve Bayes. The results show (1) that the Inductive Conformal Mondrian Prediction framework is quick and effective for large imbalanced data and (2) that its less strict i.i.d. requirements combine well with training set editing algorithms such as Cascade SVM. Among the algorithms tested with the Mondrian ICP framework, Cascade SVM with a Tanimoto+RBF kernel appeared to be the best-performing one, if the q...
2021
The property of conformal predictors to guarantee the required accuracy rate makes this framework attractive in various practical applications. However, this property is achieved at the price of a reduction in precision. In the case of conformal classification, the system can output multiple class labels instead of one. It is also known that the choice of nonconformity function has a major impact on the efficiency of conformal classifiers. Recently, it was shown that different model-agnostic nonconformity functions result in conformal classifiers with different characteristics. For a Neural Network-based conformal classifier, the inverse probability (or hinge loss) allows minimizing the average number of predicted labels, and margin results in a larger fraction of singleton predictions. In this work, we aim to further extend this study. We perform an experimental evaluation using 8 different classification algorithms and discuss when the previously observed relationship holds or not. A...
2011
Computer-aided decision support systems enable physicians to make more accurate clinical decisions and can significantly improve the quality of care provided to patients. However, predicting classification confidence, i.e., the degree of reliability of the resulting predictions, is a much-needed step in clinical decision making. A recently developed technique called conformal prediction utilizes the similarity between a new sample and the training samples in order to form confidence measures for predictions. However, the conventional conformal prediction method suffers from shortcomings, such as high computational complexity, that prevent its use in real-time applications. This paper introduces an alternative approach to conventional confidence prediction that addresses these and other disadvantages. Both real clinical and non-clinical datasets are employed to test and validate the capabilities of the proposed approach.
2019
In real-world scenarios, interpretable models are often required to explain predictions, and to allow for inspection and analysis of the model. The overall purpose of oracle coaching is to produce highly accurate, but interpretable, models optimized for a specific test set. Oracle coaching is applicable to the very common scenario where explanations and insights are needed for a specific batch of predictions, and the input vectors for this test set are available when building the predictive model. In this paper, oracle coaching is used for generating underlying classifiers for conformal prediction. The resulting conformal classifiers output valid label sets, i.e., the error rate on the test data is bounded by a preset significance level, as long as the labeled data used for calibration is exchangeable with the test set. Since validity is guaranteed for all conformal predictors, the key performance metric is efficiency, i.e., the size of the label sets, where smaller sets are more in...
This paper proposes a new method of probabilistic prediction, which is based on conformal prediction. The method is applied to the standard USPS data set and gives encouraging results.
Journal of Machine Learning Research, 2007
"The practical conclusions of the theory of probability can be substantiated as implications of hypotheses about the limiting, under the given constraints, complexity of the phenomena under study." (Epigraph, translated from the Russian.) Abstract Conformal prediction uses past experience to determine precise levels of confidence in new predictions. Given an error probability ε, together with a method that makes a prediction ŷ of a label y, it produces a set of labels, typically containing ŷ, that also contains y with probability 1 − ε. Conformal prediction can be applied to any method for producing ŷ: a nearest-neighbor method, a support-vector machine, ridge regression, etc.
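The procedure described above can be made concrete with a toy transductive conformal predictor: each candidate label for a test point is scored by a p-value, and the prediction set at significance ε keeps the labels whose p-value exceeds ε. This sketch uses a standard 1-nearest-neighbour nonconformity score on made-up 1-D data; it illustrates the general mechanism, not any particular paper's implementation:

```python
# Toy transductive conformal predictor with a 1-NN nonconformity score.

def knn_score(x, y, others):
    """Distance to nearest same-label example over distance to nearest
    other-label example; large values mean (x, y) looks strange."""
    same = [abs(x - xi) for xi, yi in others if yi == y]
    diff = [abs(x - xi) for xi, yi in others if yi != y]
    if not same or not diff or min(diff) == 0:
        return float('inf')
    return min(same) / min(diff)

def conformal_p_values(train, x_new, labels):
    """p-value per candidate label: the fraction of examples (test point
    included) at least as nonconforming as the test point."""
    pvals = {}
    for y in labels:
        aug = train + [(x_new, y)]
        scores = [knn_score(xi, yi, aug[:i] + aug[i + 1:])
                  for i, (xi, yi) in enumerate(aug)]
        pvals[y] = sum(1 for a in scores if a >= scores[-1]) / len(aug)
    return pvals

train = [(0.0, 'a'), (0.1, 'a'), (0.2, 'a'),
         (1.0, 'b'), (1.1, 'b'), (1.2, 'b')]
p = conformal_p_values(train, 0.05, ['a', 'b'])
pred_set = {y for y, v in p.items() if v > 0.2}  # significance 0.2
```

Under exchangeability, the true label lands outside the prediction set with probability at most the chosen significance level, which is exactly the 1 − ε guarantee stated in the abstract.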
Journal of Artificial Intelligence Research, 2011
In this paper we apply Conformal Prediction (CP) to the k-Nearest Neighbours Regression (k-NNR) algorithm and propose ways of extending the typical nonconformity measure used for regression so far. Unlike traditional regression methods which produce point predictions, Conformal Predictors output predictive regions that satisfy a given confidence level. The regions produced by any Conformal Predictor are automatically valid, however their tightness and therefore usefulness depends on the nonconformity measure used by each CP. In effect a nonconformity measure evaluates how strange a given example is compared to a set of other examples based on some traditional machine learning algorithm. We define six novel nonconformity measures based on the k-Nearest Neighbours Regression algorithm and develop the corresponding CPs following both the original (transductive) and the inductive CP approaches. A comparison of the predictive regions produced by our measures with those of the typical regression measure suggests that a major improvement in terms of predictive region tightness is achieved by the new measures.
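The "typical regression measure" the abstract refers to is the absolute residual |y − ŷ|. A minimal inductive CP sketch with a k-NN underlying model shows how such a measure turns into a prediction interval (1-D toy data and function names are our own assumptions, and the paper's six novel measures are not reproduced here):

```python
# Inductive conformal regression with a k-NN model and the standard
# absolute-residual nonconformity |y - y_hat|.

import math

def knn_predict(x, train, k=3):
    """Mean label of the k nearest training points (1-D inputs)."""
    neigh = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    return sum(y for _, y in neigh) / k

def icp_interval(x, proper, calibration, significance=0.1, k=3):
    """Prediction interval y_hat +/- q, where q is the calibration
    quantile of the nonconformity scores."""
    scores = sorted(abs(y - knn_predict(xi, proper, k))
                    for xi, y in calibration)
    n = len(scores)
    # index of the (1 - significance) quantile, counting the test point
    idx = math.ceil((n + 1) * (1 - significance)) - 1
    q = scores[min(idx, n - 1)]
    y_hat = knn_predict(x, proper, k)
    return (y_hat - q, y_hat + q)
```

The tightness of the interval is entirely governed by the nonconformity scores, which is why the choice of measure, the subject of the paper, matters so much.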
IFIP Advances in Information and Communication Technology, 2012
Current classification algorithms focus on vectorial data given in Euclidean or kernel spaces. Many real-world data, like biological sequences, are not vectorial and often non-Euclidean, given by (dis-)similarities only, calling for efficient and interpretable models. Current classifiers for such data require complex transformations and provide only crisp classifications without any measure of confidence, which is a standard requirement in the life sciences. In this paper we propose a prototype-based conformal classifier for dissimilarity data. Its model complexity is automatically adjusted and confidence measures are provided. In experiments on dissimilarity data we investigate its effectiveness with respect to accuracy and model complexity in comparison to different state-of-the-art classifiers.
Annals of Mathematics and Artificial Intelligence, 2014
Existing classification algorithms focus on vectorial data given in Euclidean space or on representations by means of positive semi-definite kernel matrices. Many real-world data, like biological sequences, are not vectorial, often non-Euclidean, and given only in the form of (dis-)similarities between examples, calling for efficient and interpretable models. Vectorial embeddings or transformations to obtain a valid kernel are limited, and current dissimilarity classifiers often lead to dense, complex models which are hard for domain experts to interpret. They also fail to provide additional information about the confidence of the classification. In this paper we propose a prototype-based conformal classifier for dissimilarity data. It is based on a prototype dissimilarity learner and extended by the conformal prediction methodology. It (i) can deal with dissimilarity data characterized by an arbitrary symmetric dissimilarity matrix, (ii) offers intuitive classification in terms of sparse prototypical class representatives, (iii) leads to state-of-the-art classification results supported by a confidence measure, and (iv) automatically adjusts its model complexity. In experiments on dissimilarity data we investigate its effectiveness with respect to accuracy and model complexity in comparison to different state-of-the-art classifiers.
2004
The Support Vector Machine (SVM) has been extended to build nonlinear classifiers using the kernel trick [1–3]. As a learning model, it offers among the best recognition performance of currently known methods because it is designed to achieve high performance on unlearned data. The SVM uses linear threshold elements to build a two-class classifier, learning the parameters of the linear threshold element from training samples based on "margin maximization". This paper reviews how to enhance generalization in learning classifiers. The SVM is introduced, then multiple regression analysis (MRA) and logistic regression analysis (LRA) are explained as statistical methods for building a classifier with a structure similar to that of the SVM. The same method as used for the SVM can be introduced in both MRA and LRA to enhance performance for unlearned samples. The paper then compares the SVM with these methods at the criter...
2020
In this paper we introduce a nearest-neighbor-based estimate of the prediction interval with prescribed conditional coverage probability and small length. In the special case when there is no feature vector, the problem reduces to estimating a confidence interval. For the confidence interval estimate, we show distribution-free strong consistency of the conditional coverage probability and of the excess length of the interval. For the prediction interval, the conditional coverage probability has the distribution-free strong consistency property, and, under weak conditions on the underlying distributions, strong consistency and a fast rate of convergence of the excess length are shown. As a consequence, we construct a confidence set estimate for classification. AMS Classification: 62G08, 62G20.
IFIP Advances in Information and Communication Technology, 2014
Unlike the typical classification setting, where each instance is associated with a single class, in multi-label learning each instance is associated with multiple classes simultaneously. Therefore the learning task in this setting is to predict the subset of classes to which each instance belongs. This work examines the application of a recently developed framework called Conformal Prediction (CP) to the multi-label learning setting. CP complements the predictions of machine learning algorithms with reliable measures of confidence. As a result, the proposed approach, instead of just predicting the most likely subset of classes for a new unseen instance, also indicates the likelihood of each predicted subset being correct. This additional information is especially valuable in the multi-label setting, where the overall uncertainty is extremely high.
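One common way to frame CP over subsets of classes, which may or may not match the paper's exact construction, is the label-powerset view: every subset of the label set is treated as a single candidate "class", and a p-value would be computed per subset. A toy enumeration (label names are made up for illustration):

```python
# Label-powerset framing for multi-label CP: enumerate every candidate
# subset of labels; a conformal predictor would attach a p-value to each.

from itertools import combinations

labels = ['sports', 'politics', 'tech']
candidates = [frozenset(c)
              for r in range(len(labels) + 1)
              for c in combinations(labels, r)]
print(len(candidates))  # 2**3 = 8 candidate subsets, incl. the empty set
```

The exponential number of candidate subsets is one source of the "extremely high" overall uncertainty the abstract mentions.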
IEEE Access, 2018
Proper tuning of hyper-parameters is essential to the successful application of SVM classifiers. Several methods have been used for this problem: grid search, random search, Estimation of Distribution Algorithms (EDAs), and bio-inspired metaheuristics, among others. The objective of this paper is to determine the optimal method among those that have recently reported good results: the Bat Algorithm, the Firefly Algorithm, the Fruit-fly Optimization Algorithm, Particle Swarm Optimization, the Univariate Marginal Distribution Algorithm (UMDA), and Boltzmann-UMDA. The criteria for optimality include measures of effectiveness, generalization, efficiency, and complexity. Experimental results on 15 medical diagnosis problems reveal that EDAs are the optimal strategy under such criteria. Finally, a novel performance index to guide the optimization process, which improves the generalization of the solutions while maintaining their effectiveness, is presented.