RNA-seq: Principles and Calculation of FPKM Normalization for mRNA Expression

06/26/2021 00:23 • Bioinformatics Techniques

mRNA Expression HT-Seq Normalization

RNA-Seq read counts produced by HT-Seq are normalized using two similar methods: FPKM and FPKM-UQ. Both methods divide by a quantity computed over the whole gene set of the sample, so normalized values are only meaningful in the context of the entire gene set. If only a subset of genes is investigated, users are encouraged to normalize from the raw read counts instead.

FPKM

The Fragments per Kilobase of transcript per Million mapped reads (FPKM) calculation normalizes read count by dividing it by the gene length and the total number of reads mapped to protein-coding genes.

Upper Quartile FPKM

The upper quartile FPKM (FPKM-UQ) is a modified FPKM calculation in which the total protein-coding read count is replaced by the 75th percentile read count value for the sample.
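As a rough illustration of where the 75th percentile read count comes from, the sketch below picks the upper-quartile value from per-gene read counts for one sample. The gene names and counts are hypothetical, and the inclusive quantile method from Python's `statistics` module is an assumption; the percentile convention used by a given pipeline may differ.

```python
from statistics import quantiles


def upper_quartile_count(read_counts):
    """Return the 75th percentile of per-gene read counts for one sample."""
    # quantiles(..., n=4) returns the three quartile cut points [Q1, Q2, Q3].
    # method="inclusive" is an assumption made for this sketch.
    _q1, _q2, q3 = quantiles(list(read_counts), n=4, method="inclusive")
    return q3


# Hypothetical raw read counts for five genes in a single sample.
counts = {
    "GENE_A": 1000,
    "GENE_B": 0,
    "GENE_C": 2000,
    "GENE_D": 500,
    "GENE_E": 4000,
}

rc_g75 = upper_quartile_count(counts.values())
print(rc_g75)
```

The resulting value takes the place of the total protein-coding read count in the FPKM denominator.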

Calculations

RC_g: Number of reads mapped to the gene
RC_pc: Number of reads mapped to all protein-coding genes
RC_g75: The 75th percentile read count value for genes in the sample
L: Length of the gene in base pairs, calculated as the summed length of all exons in the gene

Note: The read count is multiplied by a scalar (10⁹) during normalization to account for the kilobase and ‘million mapped reads’ units.
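The variable definitions above, together with the arithmetic in the example that follows, imply the formulas below. They are reconstructed from those definitions rather than quoted from the original protocol, so treat them as a summary.

```latex
\mathrm{FPKM} = \frac{RC_g \times 10^{9}}{RC_{pc} \times L}
\qquad
\mathrm{FPKM\text{-}UQ} = \frac{RC_g \times 10^{9}}{RC_{g75} \times L}
```

In the FPKM formula, the scalar factors as 10⁹ = 10³ × 10⁶: the 10³ converts L from base pairs to kilobases, and the 10⁶ converts RC_pc from reads to millions of reads. FPKM-UQ uses the same scalar and swaps RC_pc for RC_g75.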

Example

Sample 1: Gene A

Gene length: 3,000 bp
1,000 reads mapped to Gene A
1,000,000 reads mapped to all protein-coding regions
Read count in Sample 1 for 75th percentile gene: 2,000

FPKM for Gene A = (1,000)*(10^9)/[(3,000)*(1,000,000)] = 333.33

FPKM-UQ for Gene A = (1,000)*(10^9)/[(3,000)*(2,000)] = 166,666.67
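The arithmetic above can be checked with a few lines of Python. This is a minimal sketch: the function and parameter names are made up for illustration, and the inputs are the Gene A values from this example.

```python
def fpkm(rc_g, rc_pc, length_bp):
    """FPKM: gene read count scaled by gene length and total protein-coding reads."""
    return rc_g * 10**9 / (rc_pc * length_bp)


def fpkm_uq(rc_g, rc_g75, length_bp):
    """FPKM-UQ: same as FPKM, but the denominator uses the 75th percentile read count."""
    return rc_g * 10**9 / (rc_g75 * length_bp)


# Sample 1, Gene A: 3,000 bp long, 1,000 reads mapped to the gene,
# 1,000,000 protein-coding reads, 75th percentile read count of 2,000.
gene_a_fpkm = fpkm(rc_g=1_000, rc_pc=1_000_000, length_bp=3_000)
gene_a_fpkm_uq = fpkm_uq(rc_g=1_000, rc_g75=2_000, length_bp=3_000)

print(f"FPKM:    {gene_a_fpkm:.2f}")      # 333.33
print(f"FPKM-UQ: {gene_a_fpkm_uq:,.2f}")  # 166,666.67
```

FPKM-UQ values are much larger than FPKM values because the denominator is the read count of a single gene, not the total across all protein-coding genes.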

If you repost this article, please credit the source: https://www.ouq.net/rnaseqmrnafpkmnormalizationprotocol.html

