Support vector machines for classification
Radek Zka [email protected] http://bio.img.cas.cz/zikar
Support vector machines for classification
History Statistical learning SVM principles SVM applications SVM implementations Examples References
History
Vapnik, V., 1979, Estimation of dependencies based on empirical data Vapnik, V., 1995, The nature of statistical learning theory Microarray gene expression data analysis, protein structural class. ~1999-2000
Statistical learning
Data Hypothesis => errors
o
Expectation of the test error (empirical risk)
Learning machines
o o
NN SVR ~ regression SVC ~ classification:
SVM principles (SVC) I.
Training data (vector, scalar set)
[0.32, 0.2, 0.1], -1; [0.8, 0.9, 2.1], +1; [1.1, 3.1, 2.1]; +1,
Model (parameters - Lagrange multipliers, hyperplane parameters)
a1 = 0.57, a2 = 1.37,, w = [0.91, 0.81, 0.74], b = 1.2
Unclassified data (vector set) Classification using model parameters (scalars)
y1 = -1, y2 = +0.9, y3 = +1
SVM principles (SVC) II.
Data Functions Hyperplane Distance Margin Lagrangian Params of hyperplane Classification
SVM principles (SVC) III.
Linearly separable data Linearly non-separable data
o
Generalized optimal separating hyperplane Generalisation in high dimensional space Kernel functions
SVM applications
Pattern recognition
o
Features: words counts Features: expr. levels in diff. conditions Features: AA composition
DNA array expression data analysis
o
Protein classification
o
SVM implementations I.
SVMlight bsvm libsvm
- satyr.net2.private:/usr/local/bin
svm_learn, svm_classify
svm-train, svm-classify, svm-scale
- satyr.net2.private:/usr/local/bin
- satyr.net2.private:/usr/local/bin
svm-train, svm-predict, svm-scale, svm-toy
mySVM MATLAB svm toolbox
Differences: available Kernel functions, optimization, multiple class., user interfaces
SVM implementations II.
SVMlight
o
o
Simple text data format Fast, C routines
Multiple class. GUI: svm-toy Graphical interface 2D
bsvm
o
LIBSVM
o
MATLAB svm toolbox
o
Data format
Universal, simple, human readable text SVMlight
libsvm
o
2D gr. interface
bsvm
o
multi-class.
References
Steve R. Gunn: SVM for Classification and Regression (1998) Ch. J. C. Burges: A Tutorial on SVM for Pattern Recognition (1998) T. Evgeniou, M. Pontil, T. Poggio: Regularization Networks and SVM (2000) SVM for predicting protein structural class, BMC Bioinformatics, (2001), 2:3 Knowledge-based analysis of microarray gene expression data by using support vector machines, PNAS, 97, 262-267 SVM classification and validation of cancer tissue samples using microarray expression data, Bioinformatics, (2000), 10(16), 906-914 http://www.kernel-machines.org/publications.html