SCENIC: cisTarget databases download

List of databases for the cisTarget family of tools (e.g. RcisTarget, SCENIC/pySCENIC, and cisTopic).

To choose the appropriate database for your analysis, start by selecting the species and ranking type (i.e. what do you want to analyze: genes or regions?).

Note that the download size is typically over 1 GB (100 GB for mammal region databases), so we recommend downloading the files with zsync_curl (see the Help with downloads).


Related files:

  • sha256sum.txt: To confirm whether the file was successfully downloaded
  • TF annotation: Annotation of the motifs or ChIP-seq tracks in each collection to transcription factors (30-100 MB)
    • Human TFs (motif collection v8 / v9); (ChIP-seq v1 hg19 / hg38)
    • Mouse TFs (motif collection v8 / v9)
    • Fly TFs (motif collection v8 / v9); (ChIP-seq v1 dm6)
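
The checksum comparison mentioned for sha256sum.txt can be sketched in Python as follows. This is a minimal sketch that assumes sha256sum.txt uses the usual `sha256sum` layout (one `<hex digest>  <file name>` pair per line, with bare file names); the function names are hypothetical and not part of any cisTarget tool.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_checksum_lines(lines):
    """Parse '<hex digest>  <file name>' lines (assumed sha256sum layout)."""
    expected = {}
    for line in lines:
        parts = line.strip().split(maxsplit=1)
        if len(parts) == 2:
            digest, name = parts
            # sha256sum marks binary mode with a leading '*' on the file name.
            expected[name.lstrip("*")] = digest.lower()
    return expected


def is_download_intact(file_path, checksum_path):
    """Return True if the file's digest matches its entry in the checksum list."""
    with open(checksum_path, encoding="utf-8") as handle:
        expected = parse_checksum_lines(handle)
    name = Path(file_path).name
    if name not in expected:
        raise KeyError(f"{name} is not listed in {checksum_path}")
    return sha256_of_file(file_path) == expected[name]
```

Reading in chunks matters here because the databases can be tens of gigabytes; a `False` result suggests the download should be resumed or repeated.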

Column info:

Species:

  • Human (Homo sapiens)
  • Mouse (Mus musculus)
  • Fly (Drosophila melanogaster)

Ranking type:

  • Region: The ranking contains regions (i.e. for analyses of region-sets from ATAC-seq, ChIP-seq, …)
  • Genes: The ranking contains genes.

Distance: For gene rankings only. Indicates the search space around the TSS of the gene in which the motif is scored:

  • 500bpUp: 500bp upstream of TSS
  • TSS+/-10kb: 10kb around the TSS (total: 20kb)
  • TSS+/-5kb: 5kb around the TSS (total: 10kb)
  • 5kbUp,FullTx: 5kb upstream of the TSS, plus the transcript introns
  • 500bpUp100Dw: 500bp upstream of TSS, and 100bp downstream.

Motif or track collection:

  • Motifs – Version 8 (mc8nr): 20003 motifs
  • Motifs – Version 9 (mc9nr): 24453 motifs
  • TF ChIP-seq – Version 1 (tc_v1):
    • dm6: 1503 tracks
    • hg19: 3040 tracks
    • hg38: 2993 tracks

nOrt: Number of orthologous species used to select the regions based on conservation. If in doubt about which version to use, 7 species is normally appropriate for most analyses.

Genome: Genome version used to construct the ranking. For region-based analyses it is important that this version matches your data! Gene annotation version is shown in parenthesis.

Database name: Database name (add the extensions to obtain specific file names, e.g. .feather or .feather.zsync).
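
As a small illustration of how the extensions combine with a database name, the helper below derives both file names. The helper and the database name used in the example are hypothetical placeholders, not entries from the list.

```python
def database_file_names(database_name):
    """Return the ranking file name and its zsync control file name."""
    feather_name = f"{database_name}.feather"
    return {
        "feather": feather_name,
        "zsync": f"{feather_name}.zsync",
    }


# "example-db.genes_vs_motifs.rankings" is a made-up name for illustration only.
names = database_file_names("example-db.genes_vs_motifs.rankings")
print(names["feather"])
print(names["zsync"])
```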

Download URL: Link to the database (.feather file, and its size).


List of databases:

Original content of this site. If you repost it, please credit the source: https://www.ouq.net/1460.html
