Confusionmatrix

From Eigenvector Research Documentation Wiki

Revision as of 11:51, 27 December 2011

Purpose

Create a confusion matrix showing classification rates from a classification model or from a list of actual classes and a list of predicted classes.

Synopsis

[misclassed, classids, texttable] = confusionmatrix(model);  % create confusion matrix from classifier model
[misclassed, classids, texttable] = confusionmatrix(model, usecv);  % create confusion matrix from model using CV results
[misclassed, classids, texttable] = confusionmatrix(trueClass, predClass); % create confusion matrix from vectors of true and pred classes

Description

Confusionmatrix creates a table of True Positive, False Positive, True Negative, and False Negative rates (TP, FP, TN, FN), with one row for each class modeled in the input model. Input models must be of type PLSDA, SVMDA, KNN, or SIMCA.

The optional second parameter "usecv" selects the cross-validation based results in "model.detail.cvmisclassification" instead of the default self-prediction classifications in "model.classification".

Instead of a model, the input can be a pair of vectors giving the true class and the predicted class of each sample.

Classification rates are defined as:

TP: proportion of positive cases that were correctly identified
FP: proportion of negative cases that were incorrectly classified as positive
TN: proportion of negative cases that were classified correctly
FN: proportion of positive cases that were incorrectly classified as negative
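
The four definitions above can be sketched in a few lines of plain MATLAB. This is only an illustration, not the confusionmatrix source: it assumes that, for each class in turn, samples of that class are the positive cases and all other samples are the negative cases, and the vectors trueClass and predClass hold made-up labels.

```matlab
% Minimal sketch (not the PLS_Toolbox implementation): per-class rates
% computed directly from the TP/FP/TN/FN definitions.
trueClass = [1 1 1 2 2 2 2 3 3 3];   % hypothetical true class labels
predClass = [1 1 2 2 2 2 2 3 3 1];   % hypothetical predicted class labels

classes = unique(trueClass);          % one row per class present in trueClass
rates   = zeros(numel(classes), 4);   % columns: TP FP TN FN

for k = 1:numel(classes)
    isPos   = (trueClass == classes(k));   % samples that truly belong to class k
    predPos = (predClass == classes(k));   % samples predicted as class k

    nPos = sum(isPos);                     % number of positive cases
    nNeg = sum(~isPos);                    % number of negative cases

    rates(k, 1) = sum( isPos &  predPos) / nPos;   % TP rate
    rates(k, 2) = sum(~isPos &  predPos) / nNeg;   % FP rate
    rates(k, 3) = sum(~isPos & ~predPos) / nNeg;   % TN rate
    rates(k, 4) = sum( isPos & ~predPos) / nPos;   % FN rate
end

disp(rates)
```

Because TP and FN share the number of positive cases as their denominator, and FP and TN share the number of negative cases, TP + FN = 1 and FP + TN = 1 in every row. With only one class present there are no negative cases, so this sketch would return NaN for FP and TN.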


Inputs

  • model = previously generated classifier model or pred structure,
  • usecv = 0 or 1. 0 indicates confusion matrix should be based on self-prediction results, 1 indicates it is based on using cross-validation results (assuming they are available in the model),
  • trueClass = vector of numeric values indicating the true sample classes,
  • predClass = vector of numeric values indicating the predicted sample classes.

Outputs

  • misclassed = confusion matrix, nclasses x 4 array, one row per class; columns are the True Positive, False Positive, True Negative, and False Negative rates (TP FP TN FN),
  • classids = class names (identifiers),
  • texttable = cell array containing a text representation of the confusion matrix. The i-th element of the cell array, texttable{i}, is the i-th line of the text table. Note that this text representation of the confusion matrix is displayed if the function is called with no output assignment.

Example

Calling confusionmatrix with no output variables assigned, as in 'confusionmatrix(model)', displays the text table:

>> confusionmatrix(model)
Confusion Matrix:                                         
   Class:      TP          FP          TN          FN    
        K      1.00000     0.03774     0.96226     0.00000
       BL      0.77778     0.00000     1.00000     0.22222
       SH      1.00000     0.00000     1.00000     0.00000
       AN      1.00000     0.00000     1.00000     0.00000
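
As a quick sanity check on a table like the one above, the rate definitions imply that TP + FN and FP + TN should each equal 1 in every row. The sketch below copies the printed values into a matrix by hand (the variable names are illustrative) and verifies this within the rounding of the display.

```matlab
% Rates copied from the example output above; rows are K, BL, SH, AN.
rates = [1.00000 0.03774 0.96226 0.00000;
         0.77778 0.00000 1.00000 0.22222;
         1.00000 0.00000 1.00000 0.00000;
         1.00000 0.00000 1.00000 0.00000];

tol = 1e-4;   % printed values are rounded to five decimals

posOk = all(abs(rates(:,1) + rates(:,4) - 1) < tol);   % TP + FN = 1 per class
negOk = all(abs(rates(:,2) + rates(:,3) - 1) < tol);   % FP + TN = 1 per class
```

The same check can be applied to the numeric misclassed output, which uses the same column order (TP FP TN FN).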

See Also

confusiontable, plsda, svmda, knn, simca