Chrome Extension
WeChat Mini Program
Use on ChatGLM

The Precision-Recall Plot is More Informative Than the ROC Plot when Evaluating Binary Classifiers on Imbalanced Datasets.

PLoS ONE(2015)SCI 3区

Univ Bergen

Cited 4289|Views49
Abstract
Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets.
More
Translated text
PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Try using models to generate summary,it takes about 60s
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers

Adversarial Examples in the Physical World.

International Conference on Learning Representations 2016

被引用7261

Safety Verification of Deep Neural Networks.

Computer Aided Verification Lecture Notes in Computer Science 2017

被引用1195

Assessing Threat of Adversarial Examples on Deep Neural Networks.

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) 2016

被引用58

Adversarial Examples for Evaluating Reading Comprehension Systems.

Conference on Empirical Methods in Natural Language Processing 2017

被引用1848

Feature-Guided Black-Box Safety Testing of Deep Neural Networks.

Tools and Algorithms for the Construction and Analysis of Systems Lecture Notes in Computer Science 2018

被引用137

Deflecting Adversarial Attacks with Pixel Deflection

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018

被引用409

Learning to Attack: Adversarial Transformation Networks

THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE 2018

被引用206

Deep Networks with RBF Layers to Prevent Adversarial Examples

Artificial Intelligence and Soft Computing Lecture Notes in Computer Science 2018

被引用10

Practical Black-Box Attacks on Deep Neural Networks Using Efficient Query Mechanisms

Computer Vision – ECCV 2018 Lecture Notes in Computer Science 2018

被引用316

How to Steal a Machine Learning Classifier with Deep Learning

IEEE International Conference on Technologies for Homeland Security 2017

被引用60

Security of Smart Manufacturing Systems

Journal of Manufacturing Systems 2018

被引用387

Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:论文指出在处理不平衡数据集时,精确度-召回率(PRC)图比ROC图更能准确反映二元分类器的性能。

方法】:作者通过分析ROC图和PRC图在视觉解读和性能评价上的差异,说明了PRC图在评估分类器在不平衡数据集上的表现时更为可靠。

实验】:论文没有具体描述实验过程,但提出通过比较分类器在不同特异性水平下的表现,发现ROC图可能误导对分类性能的评价,而PRC图则能更准确地预测未来的分类性能。文中未提及使用的数据集名称。