The use of the area under the ROC curve in the evaluation of machine learning algorithms

(1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7), pp. 1145-1159.

View at publisher

Description

In this paper we investigate the use of the area under the receiver operating characteristic (ROC) curve (AUC) as a performance measure for machine learning algorithms. As a case study we evaluate six machine learning algorithms (C4.5, Multiscale Classifier, Perceptron, Multi-layer Perceptron, k-Nearest Neighbours, and a Quadratic Discriminant Function) on six "real world" medical diagnostics data sets. We compare and discuss the use of AUC to the more conventional overall accuracy and find that AUC exhibits a number of desirable properties when compared to overall accuracy: increased sensitivity in Analysis of Variance (ANOVA) tests; a standard error that decreased as both AUC and the number of test samples increased; decision threshold independent; and it is invariant to a priori class probabilities. The paper concludes with the recommendation that AUC be used in preference to overall accuracy for "single number" evaluation of machine learning algorithms.

Impact and interest:

6,083 citations in Scopus
5,089 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

4,276 since deposited on 20 Feb 2020
667 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 180272
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
ORCID iD:
Bradley, Andrew P.orcid.org/0000-0003-0109-6844
Measurements or Duration: 15 pages
Keywords: Accuracy measures, Cross-validation, Standard error, The area under the ROC curve (AUC), The ROC curve, Wilcoxon statistic, Receiver operating characteristics (ROC), Decision theory, Learning algorithms, Medical applications, Learning systems
DOI: 10.1016/S0031-3203(96)00142-2
ISSN: 0031-3203
Pure ID: 44494684
Divisions: Past > QUT Faculties & Divisions > Science & Engineering Faculty
Funding Information: Acknowledgements--The Author is grateful to Geoffrey Hawson and Michael Ray of the Prince Charles in Brisbane for allowing access to the post-operative heart bleeding data set used in this study. The work of Michael Ray and Geoffrey Hawson is kindly supported by the Prince Charles Hospital Private Practice Study, Education, and Research Trust Fund. Thanks are also due to Gary Glonek, Brian Lovell, Dennis Longstaff, and the anonymous referees for helpful comments on earlier drafts of this paper.
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 20 Feb 2020 13:36
Last Modified: 29 Mar 2026 06:27