The Bit Plumber: Quantifying the Efficacy of Machine Learning Algorithms

There are many articles and books and talks and so on all making claims about machine learning algorithms. Out of those, how many actually QUANTIFY THE EFFICACY OF THE ALGORITHM?.

Below are some references which will hopefully save people some leg work and help others quantify the performance of their algorithms.

Articles
Books
1. Assessing and Improving Prediction and Classification by Timothy Masters
  1. Amazon
2. Evaluating and Comparing the Performance of Machine Learning Algorithms by Melanie Mitchell
  1. Part 1
  2. Part 2
3. Evaluating Learning Algorithms: A Classification Perspective by Japkowicz, Shah
  1. Amazon
    
    Customer Reviews
    
    Howard B. Bandy on July 7, 2014
    
    ... examples presented are all of stationary data ...
  2. Cambridge.Org
    
    Looking for an examination copy?
    
    If you are interested in the title for your course we can consider offering an examination copy. To register your interest please contact collegesales@cambridge.org providing details of the course you are teaching.
  3. Google
    
    Has table of contents and you can look at some randomly selected pages
  4. MohakShah.Com
    
    Email Address: eval@mohakshah.com
    
    For electronic editions, ...
    
    Computing Resources
    
    WEKA Machine Learning Toolkit
    
    Visualization Software for Clasifier Evaluation
4. Evaluation and Analysis of Supervised Learning Algorithms and Classifiers by Niklas Lavesson
5. Machine Learning and Data Mining: 14 Evaluation and Credibility by Pier Luca Lanzi
  1. slideshare.net
  2. Data Mining Course
    
    Classification: Evaluation

1 comment:

lugerpittJune 9, 2015 at 10:25 AM
Another reference for you

Max Kuhn and Kjell Johnson, 2013, Applied Predictive Modeling, Springer, http://appliedpredictivemodeling.com/
i. Ch 4. Introduces the problem of overfitting (model fits the data available, not the system the data came from)
ii. Ch 5 Measuring performance in regression models, Covers MSE and variations, discusses the tradeoff between variance and bias (do you want the character of the system correct (correct variance), or to estimate a specific parameter correctly (low bias)
iii. Ch 11 Measuring performance in classification models. Covers confusion matrices, Kappa (observed vs expected probability), Sensitivity/specificity, ROC, PPV/NPV, lift charts.

Sunday, June 7, 2015

Quantifying the Efficacy of Machine Learning Algorithms

1 comment: