Last Modified: 11-Jun-2010
The library implements Optimized Cutting Plane Algorithm (OCAS) for training linear SVM classifiers from large-scale data. The computational effort of OCAS scales with O(m log m) where m is the sample size. In an extensive empirical evaluation OCAS significantly outperforms current state of the art SVM solvers, like SVM^light, SVM^perf and BMRM, achieving speedups of over 1,000 on some datasets over SVM^light and 20 over SVM^perf, while obtaining the same precise Support Vector solution. OCAS even in the early optimization steps shows often faster convergence than the so far in this domain prevailing approximative methods SGD and Pegasos. Effectively parallelizing OCAS we were able to train on a dataset of size 15 million examples (itself about 32GB in size) in just 671 seconds --- a competing string kernel SVM required 97,484 seconds to train on 10 million examples sub-sampled from this dataset.
 The library also implements COFFIN framework for
efficient training of translation invariant image classifiers from virtual
examples. As an example, we used the implemented framework to train a linear SVM
on a gender classification dataset of almost 5 million images on a plain
notebook with just 4GB of memory.
The library also implements COFFIN framework for
efficient training of translation invariant image classifiers from virtual
examples. As an example, we used the implemented framework to train a linear SVM
on a gender classification dataset of almost 5 million images on a plain
notebook with just 4GB of memory.
 Training translation invariant image classifiers from virtual examples.
 Training translation invariant image classifiers from virtual examples. Functions for computing image features based on Local Binary Patterns (LBP).
 Functions for computing image features based on Local Binary Patterns (LBP).1. Binary classification problem: OCAS solves the following unconstrained convex optimization task
 
 is the regularization constant and
 is the regularization constant and
 are training examples. The result is 
the parameter vector
 are training examples. The result is 
the parameter vector  of the linear rule
 of the linear rule
 which assigns vector
which assigns vector  a class label +1 and/or -1.
 a class label +1 and/or -1. 
2. Multi-class classification probem: OCAS solves the following unconstrained convex optimization task
 
 is the regularization constant,
 is the regularization constant, 
 is a finite set of labels and
 is a finite set of labels and
 are training
examples. The result is a set of parameter vectors
 are training
examples. The result is a set of parameter vectors  of the linear rule
 of the linear rule
 which assigns vector
which assigns vector  a class label
from
 a class label
from  .
. 
 Version 0.93, 2010-06-11, libocas_v093.zip
 Version 0.93, 2010-06-11, libocas_v093.zip