Comparing Algorithms - Guide to the App

DocOfi
April 11, 2016

Comparing Algorithms - Guide to the App This presentation together with an App will give you a brief look at the performance of different algorithms found in the caret package by comparing the datapoints the model failed to correctly classify. The full article can be found at Comparing Algorithms.

Summary

  • We compared the performance of different classifiers on the famous (Fisher's or Anderson's) iris data set which gives the measurements in centimeters of the variables sepal length and width and petal length and width, respectively, for 50 flowers from each of 3 species of iris.
  • We focussed on the data points misclassified by the models
  • Identifying the datapoint visually is difficult. The App provides an invaluable assistance to identifying the miscalssified data points.

A table tells only half of the story

      Model Accuracy Kappa MissedPts
1 RanForest     96.0    94         3
2       GBM     94.7    92         4
3    NBayes     96.0    94         3
4   NeuralN     96.0    94         3
5  Multinom     98.7    98         1
6       SVM     93.3    90         5
  • The models perform quite similarly and their accuracy are not far from each other…
  • What makes them different from each other is not apparent

plot of chunk unnamed-chunk-2

  • knowing how many data points were misclassified is good, but…
  • Identifying the datapoints and the values they represent allow us to compare the models strategy for classification against each other
  • Using the App makes this very easy

Links