To evaluate a classifier, we have to compare a output to another reference classification – generally a perfect classification, but in practice the output of another gold standard test – and cross tabulates the data into a 2×2 contingency table, comparing the two classifications.