ROC curve analysis: what is ROC analysis?

What is a ROC curve?

A ROC curve is a plot of the true positive rate (Sensitivity) as a function of the false positive rate (100-Specificity) for different cut-off points of a parameter. Each point on the ROC curve represents a sensitivity/specificity pair corresponding to a particular decision threshold. The Area Under the ROC curve (AUC) is a measure of how well a parameter can distinguish between two diagnostic groups (diseased/normal).

MedCalc creates a complete sensitivity/specificity report.

The ROC curve is a fundamental tool for diagnostic test evaluation.

Theory summary

The diagnostic performance of a test, that is, its accuracy in discriminating diseased cases from normal cases, is evaluated using Receiver Operating Characteristic (ROC) curve analysis (Metz, 1978; Zweig & Campbell, 1993). ROC curves can also be used to compare the diagnostic performance of two or more laboratory or diagnostic tests (Griner et al., 1981).

When you consider the results of a particular test in two populations, one population with a disease, the other population without the disease, you will rarely observe a perfect separation between the two groups. Indeed, the distribution of the test results will overlap, as shown in the following figure.

[Figure: overlapping distributions of test results in the populations with and without the disease]

For every possible cut-off point or criterion value you select to discriminate between the two populations, there will be some cases with the disease correctly classified as positive (TP = True Positive fraction), but some cases with the disease will be classified negative (FN = False Negative fraction). On the other hand, some cases without the disease will be correctly classified as negative (TN = True Negative fraction), but some cases without the disease will be classified as positive (FP = False Positive fraction).
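The classification described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `confusion_counts`, the rule that values at or above the cut-off count as positive, and the test values themselves are all made up for this example and do not come from the original text.

```python
def confusion_counts(diseased, healthy, cutoff):
    """Count the four outcomes when values >= cutoff are called positive.

    Assumption: higher test values point towards disease.
    """
    tp = sum(1 for x in diseased if x >= cutoff)  # diseased, test positive
    fn = len(diseased) - tp                       # diseased, test negative
    fp = sum(1 for x in healthy if x >= cutoff)   # healthy, test positive
    tn = len(healthy) - fp                        # healthy, test negative
    return {"TP": tp, "FN": fn, "FP": fp, "TN": tn}


# Hypothetical test results (illustrative values only)
diseased = [4.1, 5.3, 6.0, 6.8, 7.5, 8.2]
healthy = [2.0, 2.9, 3.5, 4.4, 5.1, 5.9]

counts = confusion_counts(diseased, healthy, cutoff=5.0)
print(counts)  # {'TP': 5, 'FN': 1, 'FP': 2, 'TN': 4}
```

Because the two hypothetical groups overlap between 4.1 and 5.9, no cut-off in this example can bring both FN and FP to zero.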

Schematic outcomes of a test

The different fractions (TP, FP, TN, FN) are represented in the following table.

                                    Disease
Test        Present                 n        Absent                  n        Total
Positive    True Positive (TP)      a        False Positive (FP)     c        a + c
Negative    False Negative (FN)     b        True Negative (TN)      d        b + d
Total                               a + b                            c + d

The following statistics can be defined:

Sensitivity                  =  a / (a + b)
Specificity                  =  d / (c + d)
Positive Likelihood Ratio    =  Sensitivity / (1 - Specificity)
Negative Likelihood Ratio    =  (1 - Sensitivity) / Specificity
Positive Predictive Value    =  a / (a + c)
Negative Predictive Value    =  d / (b + d)

  • Sensitivity: probability that a test result will be positive when the disease is present (true positive rate, expressed as a percentage).
    $\text{Sensitivity} = \frac{a}{a + b}$
  • Specificity: probability that a test result will be negative when the disease is not present (true negative rate, expressed as a percentage).
    $\text{Specificity} = \frac{d}{c + d}$
  • Positive likelihood ratio: ratio between the probability of a positive test result given the presence of the disease and the probability of a positive test result given the absence of the disease, i.e.
    $+\mathrm{LR} = \frac{\text{True positive rate}}{\text{False positive rate}} = \frac{\text{Sensitivity}}{1 - \text{Specificity}}$
  • Negative likelihood ratio: ratio between the probability of a negative test result given the presence of the disease and the probability of a negative test result given the absence of the disease, i.e.
    $-\mathrm{LR} = \frac{\text{False negative rate}}{\text{True negative rate}} = \frac{1 - \text{Sensitivity}}{\text{Specificity}}$
  • Positive predictive value: probability that the disease is present when the test is positive (expressed as a percentage).
    $\mathrm{PPV} = \frac{a}{a + c}$
  • Negative predictive value: probability that the disease is not present when the test is negative (expressed as a percentage).
    $\mathrm{NPV} = \frac{d}{b + d}$
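As a rough illustration, the six statistics can be computed directly from the four cell counts a, b, c and d of the table. The function name `diagnostic_stats` and the counts passed to it are hypothetical, chosen only to make the arithmetic easy to follow. Results are returned as fractions; multiply by 100 for percentages.

```python
def diagnostic_stats(a, b, c, d):
    """Compute the statistics from the 2x2 table counts.

    a = true positives, b = false negatives,
    c = false positives, d = true negatives.
    """
    sensitivity = a / (a + b)
    specificity = d / (c + d)
    false_positive_rate = c / (c + d)  # equals 1 - specificity
    false_negative_rate = b / (a + b)  # equals 1 - sensitivity
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "positive_lr": sensitivity / false_positive_rate,
        "negative_lr": false_negative_rate / specificity,
        "ppv": a / (a + c),
        "npv": d / (b + d),
    }


# Hypothetical counts, chosen only for illustration
stats = diagnostic_stats(a=80, b=20, c=10, d=90)
for name, value in stats.items():
    print(f"{name}: {value:.3f}")
```

This sketch does no input checking: a zero in any denominator (for example, no false positives when computing the positive likelihood ratio) raises `ZeroDivisionError`, so real data may need explicit handling of those cases.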

Sensitivity and specificity versus criterion value

When you select a higher criterion value, the false positive fraction decreases and specificity increases, but the true positive fraction and sensitivity decrease:

[Figure: sensitivity and specificity as a function of the criterion value]

When you select a lower threshold value, the true positive fraction and sensitivity increase. However, the false positive fraction also increases, and therefore the true negative fraction and specificity decrease.
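The trade-off can be seen by evaluating a few criterion values on the same data. The sketch below assumes that values at or above the cut-off are called positive; the helper `sens_spec` and the test results are hypothetical and serve only as an illustration.

```python
def sens_spec(diseased, healthy, cutoff):
    """Return (sensitivity, specificity) as fractions for one cut-off.

    Assumption: values >= cutoff are classified as positive.
    """
    sensitivity = sum(x >= cutoff for x in diseased) / len(diseased)
    specificity = sum(x < cutoff for x in healthy) / len(healthy)
    return sensitivity, specificity


# Hypothetical test results (illustrative values only)
diseased = [4.1, 5.3, 6.0, 6.8, 7.5, 8.2]
healthy = [2.0, 2.9, 3.5, 4.4, 5.1, 5.9]

rows = [(cutoff, *sens_spec(diseased, healthy, cutoff)) for cutoff in (3.0, 5.0, 7.0)]
for cutoff, sens, spec in rows:
    print(f"cutoff={cutoff:.1f}  sensitivity={sens:.2f}  specificity={spec:.2f}")
```

With these values, raising the cut-off from 3.0 to 7.0 lowers sensitivity at each step while specificity rises, which is the pattern described above.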

The ROC curve

In a Receiver Operating Characteristic (ROC) curve the true positive rate (Sensitivity) is plotted as a function of the false positive rate (100-Specificity) for different cut-off points. Each point on the ROC curve represents a sensitivity/specificity pair corresponding to a particular decision threshold. A test with perfect discrimination (no overlap in the two distributions) has a ROC curve that passes through the upper left corner (100% sensitivity, 100% specificity). Therefore, the closer the ROC curve is to the upper left corner, the higher the overall accuracy of the test (Zweig & Campbell, 1993).

[Figure: example ROC curve]
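A minimal sketch of how the points of a ROC curve, and the area under it, could be computed is shown here. It is not MedCalc's procedure: the function names, the use of every observed value as a cut-off, the trapezoidal rule for the area, and the data are all assumptions made for illustration. Rates are fractions between 0 and 1 rather than percentages.

```python
def roc_points(diseased, healthy):
    """Return (false positive rate, sensitivity) pairs, one per cut-off.

    Assumption: values >= cutoff are classified as positive, and each
    observed value is tried as a cut-off, from highest to lowest.
    """
    cutoffs = sorted(set(diseased) | set(healthy), reverse=True)
    points = [(0.0, 0.0)]  # cut-off above every value: nothing is positive
    for cutoff in cutoffs:
        tpr = sum(x >= cutoff for x in diseased) / len(diseased)
        fpr = sum(x >= cutoff for x in healthy) / len(healthy)
        points.append((fpr, tpr))
    return points


def auc_trapezoid(points):
    """Area under the curve through the points, by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Hypothetical test results (illustrative values only)
diseased = [4.1, 5.3, 6.0, 6.8, 7.5, 8.2]
healthy = [2.0, 2.9, 3.5, 4.4, 5.1, 5.9]

points = roc_points(diseased, healthy)
auc = auc_trapezoid(points)
print(f"AUC = {auc:.3f}")

# Two groups with no overlap give a curve through the upper left corner
perfect_auc = auc_trapezoid(roc_points([7.0, 8.0, 9.0], [1.0, 2.0, 3.0]))
print(f"AUC with perfect separation = {perfect_auc:.3f}")
```

The curve starts at (0, 0) for a cut-off above all values and ends at (1, 1) for the lowest cut-off; with perfectly separated groups it passes through (0, 1), the upper left corner, and the area reaches 1.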
