SCENIC: cisTarget databases download

List of databases for the cisTarget family of tools (e.g. RcisTarget, SCENIC/pySCENIC, and cisTopic).

To choose the appropriate database for your analysis, start by selecting the species and the ranking type (i.e. what you want to analyze: genes or regions).

Note that the download size is typically over 1 GB (100 GB for mammal region databases). We recommend downloading the files with zsync_curl (see the Help with downloads).


Related files:

  • sha256sum.txt: To confirm whether the file was successfully downloaded
  • TF annotation: Annotation to transcription factors for the motifs or ChIP-seq tracks in each collection (30-100 MB)
    • Human TFs (motif collection v8 / v9); (ChIP-seq v1 hg19 / hg38)
    • Mouse TFs (motif collection v8 / v9)
    • Fly TFs (motif collection v8 / v9); (ChIP-seq v1 dm6)
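
The checksum check mentioned above can be sketched in Python. This is a minimal sketch, assuming that sha256sum.txt uses the usual `sha256sum` output layout (one "<hex digest>  <file name>" pair per line); the function names are hypothetical and not part of any cisTarget tool.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte databases never have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_sha256sum(text):
    """Parse sha256sum-style lines into {file name: digest}.

    Assumption: each line is "<hex digest>  <file name>"; a leading '*'
    (binary-mode marker) and any directory prefix are dropped.
    """
    expected = {}
    for line in text.splitlines():
        parts = line.split(maxsplit=1)
        if len(parts) == 2:
            digest, name = parts
            expected[Path(name.strip().lstrip("*")).name] = digest.lower()
    return expected


def verify_download(file_path, checksum_path):
    """Return True if the file's digest matches its entry in the checksum list."""
    expected = parse_sha256sum(Path(checksum_path).read_text())
    name = Path(file_path).name
    if name not in expected:
        raise KeyError(f"{name} is not listed in {checksum_path}")
    return sha256_of_file(file_path) == expected[name]
```

Calling `verify_download("<database>.feather", "sha256sum.txt")` on a downloaded file returns `False` when the download was truncated or corrupted, in which case the file should be fetched again.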

Column info:

Species:

  • Human (Homo sapiens)
  • Mouse (Mus musculus)
  • Fly (Drosophila melanogaster)

Ranking type:

  • Region: The ranking contains regions (i.e. for analyses of region-sets from ATAC-seq, ChIP-seq, …)
  • Genes: The ranking contains genes.

Distance: For gene rankings only. Indicates the search space around the TSS of the gene in which the motif is scored:

  • 500bpUp: 500bp upstream of TSS
  • TSS+/-10kb: 10kb around the TSS (total: 20kb)
  • TSS+/-5kb: 5kb around the TSS (total: 10kb)
  • 5kbUp,FullTx: 5kb upstream of TSS and transcript introns
  • 500bpUp100Dw: 500bp upstream of TSS, and 100bp downstream.
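
To make the distance labels concrete, the sketch below converts a label into a strand-aware genomic interval around a TSS. It is an illustration only: the function name, the coordinate convention (plain integer positions, interval given as start and end) and the clipping at position 0 are assumptions, not the exact boundaries used to build the databases. The 5kbUp,FullTx label is left out because it also depends on the transcript structure.

```python
# (upstream bp, downstream bp) relative to the TSS, taken from the list above.
WINDOWS = {
    "500bpUp": (500, 0),
    "TSS+/-10kb": (10_000, 10_000),
    "TSS+/-5kb": (5_000, 5_000),
    "500bpUp100Dw": (500, 100),
}


def search_space(tss, strand, label):
    """Return (start, end) of the search space for a gene.

    "Upstream" means lower coordinates on the '+' strand and higher
    coordinates on the '-' strand, so the window is mirrored for
    reverse-strand genes.
    """
    upstream, downstream = WINDOWS[label]
    if strand == "+":
        start, end = tss - upstream, tss + downstream
    elif strand == "-":
        start, end = tss - downstream, tss + upstream
    else:
        raise ValueError(f"strand must be '+' or '-', got {strand!r}")
    # Assumption: clip at 0 so windows never run off the chromosome start.
    return max(start, 0), end


print(search_space(100_000, "+", "500bpUp100Dw"))  # (99500, 100100)
print(search_space(100_000, "-", "500bpUp100Dw"))  # (99900, 100500)
```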

Motif or track collection:

  • Motifs – Version 8 (mc8nr): 20003 motifs
  • Motifs – Version 9 (mc9nr): 24453 motifs
  • TF ChIP-seq – Version 1 (tc_v1):
    • dm6: 1503 tracks
    • hg19: 3040 tracks
    • hg38: 2993 tracks

nOrt: Number of orthologous species used to select the regions based on conservation. If in doubt about which version to use, 7 species is normally appropriate for most analyses.

Genome: Genome version used to construct the ranking. For region-based analyses it is important that this version matches your data! Gene annotation version is shown in parenthesis.

Database name: Database name (add the extensions to obtain specific file names, e.g. .feather or .feather.zsync).
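
As a small illustration of the naming rule, the helper below derives the two file names from a database name. The helper and the name `example_db` are hypothetical placeholders; substitute a real database name from the list.

```python
def database_files(database_name):
    """Return the file names obtained by adding the documented extensions."""
    return {
        "feather": f"{database_name}.feather",      # the ranking database itself
        "zsync": f"{database_name}.feather.zsync",  # control file used by zsync_curl
    }


# "example_db" is a placeholder, not a real database name.
print(database_files("example_db"))
```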

Download URL: Link to the database (.feather file, and its size).


List of databases:

Original content of this site. If you reproduce it, please credit the source: https://www.ouq.net/1460.html
