123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354 |
- = svm-toolkit
- Support-vector machines are a popular tool in data mining. This package
- includes an amended version of the Java implementation of the libsvm library
- (version 3.11). Additional methods and examples are provided to support
- standard training techniques, such as cross-validation, and simple
- visualisations. Training/testing of models can use a variety of built-in or
- user-defined evaluation methods, including overall accuracy, geometric mean,
- precision and recall.
- Copyright (c) 2011-12, Peter Lane
- = Install
- This software works with JRuby, in 1.9 mode.
- To install:
- $ jruby -S gem install svm-toolkit --source https://gem.fury.io/peterlane
- = Features
- == Current
- - All features of LibSVM 3.11 are supported, and many are augmented with Ruby wrappers.
- - Loading Problem definitions from file in Svmlight, Csv or Arff (simple subset) format.
- - Creating Problem definitions from values supplied programmatically in arrays.
- - Rescaling of feature values.
- - Integrated cost/gamma search for model with RBF kernel, taking advantage of multiple cores.
- - Contour plot visualisation of cost/gamma search results.
- - Model provides value of w-squared for hyperplane.
- - svm-demo application, a version of the svm_toy applet which comes with libsvm.
- - Model stores indices of training instances used as support vectors.
- - User-selected evaluation techniques supported in Model#evaluate_dataset and Svm.cross_validation_search.
- - Library provides evaluation classes for OverallAccuracy, GeometricMean, ClassPrecision, ClassRecall, MatthewsCorrelationCoefficient.
- == Planned
- - splitting problem sets for train/cross/test
- - support for sampling, SMOTE and related processes (perhaps in separate package)
- - active-learning
- == Acknowledgements
- The svm-toolkit is based on LibSVM, which is available from:
- http://www.csie.ntu.edu.tw/~cjlin/libsvm/
- The contour plot uses the PlotPackage library, available from:
- http://thehuwaldtfamily.org/java/Packages/Plot/PlotPackage.html
- Contributor:
- * {Knut Hellan}[https://github.com/khellan], the Matthews Correlation Coefficient.
|