RNA-seq: Principle and Calculation of FPKM Normalization for mRNA Expression

06/26/2021 00:23 • Bioinformatics Technology

mRNA Expression HT-Seq Normalization

RNA-Seq expression level read counts produced by HT-Seq are normalized using two similar methods: FPKM and FPKM-UQ. Normalized values should be used only within the context of the entire gene set. Users are encouraged to normalize raw read count values if a subset of genes is investigated.

FPKM

The Fragments per Kilobase of transcript per Million mapped reads (FPKM) calculation normalizes read count by dividing it by the gene length and the total number of reads mapped to protein-coding genes.

Upper Quartile FPKM

The upper quartile FPKM (FPKM-UQ) is a modified FPKM calculation in which the total protein-coding read count is replaced by the 75th percentile read count value for the sample.
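As a rough illustration of where the 75th percentile value comes from, the sketch below takes the upper quartile of a set of per-gene read counts using Python's standard library. The gene names, counts, and the `upper_quartile` helper are hypothetical. Which genes enter the calculation (for example, whether zero-count genes are kept) and the exact interpolation rule are assumptions here, since the text above does not specify them.

```python
import statistics

def upper_quartile(read_counts):
    """Return the 75th percentile of a list of per-gene read counts."""
    # quantiles(n=4) returns the three quartile cut points; index 2 is Q3.
    # method="inclusive" interpolates linearly between observed values
    # (an assumption; a given pipeline may use a different rule).
    return statistics.quantiles(read_counts, n=4, method="inclusive")[2]

# Hypothetical per-gene read counts for one sample.
counts = {"geneA": 1000, "geneB": 0, "geneC": 250, "geneD": 4000, "geneE": 2000}

rc_g75 = upper_quartile(list(counts.values()))
print(rc_g75)
```

The resulting value plays the role of the denominator term that replaces the total protein-coding read count in FPKM-UQ.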

Calculations

RC_g: Number of reads mapped to the gene
RC_pc: Number of reads mapped to all protein-coding genes
RC_g75: The 75th percentile read count value for genes in the sample
L: Length of the gene in base pairs, calculated as the sum of the lengths of all exons in the gene

Note: The read count is multiplied by a scalar (10⁹ = 10³ × 10⁶) during normalization to account for the kilobase (10³ bp) and ‘million mapped reads’ (10⁶ reads) units.
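The formulas themselves do not appear in the text. The following is a reconstruction from the variable definitions and the 10⁹ scalar above, and should be read as a sketch consistent with the descriptions of FPKM and FPKM-UQ rather than a verbatim copy of the original equations:

```latex
\mathrm{FPKM} = \frac{RC_g \times 10^{9}}{RC_{pc} \times L}
\qquad
\mathrm{FPKM\text{-}UQ} = \frac{RC_g \times 10^{9}}{RC_{g75} \times L}
```

The two expressions differ only in the denominator term: $RC_{pc}$ for FPKM and $RC_{g75}$ for FPKM-UQ.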

Example

Sample 1: Gene A

Gene length: 3,000 bp
1,000 reads mapped to Gene A
1,000,000 reads mapped to all protein-coding genes
Read count in Sample 1 for 75th percentile gene: 2,000

FPKM for Gene A = (1,000)*(10^9)/[(3,000)*(1,000,000)] = 333.33

FPKM-UQ for Gene A = (1,000)*(10^9)/[(3,000)*(2,000)] = 166,666.67
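The arithmetic above can be checked with a minimal Python sketch. The function names `fpkm` and `fpkm_uq` are illustrative choices, not part of HT-Seq or any particular pipeline; the inputs are the Gene A values from the example.

```python
def fpkm(rc_g, gene_length_bp, rc_pc):
    """FPKM: reads on the gene, scaled by gene length and total protein-coding reads."""
    return rc_g * 1e9 / (gene_length_bp * rc_pc)

def fpkm_uq(rc_g, gene_length_bp, rc_g75):
    """FPKM-UQ: same as FPKM, but the 75th percentile read count replaces the total."""
    return rc_g * 1e9 / (gene_length_bp * rc_g75)

# Gene A in Sample 1, using the numbers from the example.
gene_a_fpkm = fpkm(rc_g=1000, gene_length_bp=3000, rc_pc=1_000_000)
gene_a_fpkm_uq = fpkm_uq(rc_g=1000, gene_length_bp=3000, rc_g75=2000)

print(f"{gene_a_fpkm:.2f}")       # 333.33
print(f"{gene_a_fpkm_uq:,.2f}")   # 166,666.67
```

Because the 75th percentile read count is usually far smaller than the total protein-coding read count, FPKM-UQ values are on a much larger scale than FPKM values and the two should not be compared directly.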

Original post from this site. If you repost it, please credit the source: https://www.ouq.net/951.html
