生物信息学相关数据库

生物信息学数据库可以分为4大类:即基因组数据库、核酸和蛋白质一级结构数据库、生物大分子三维空间结构数据库,当前研究比较热点的集中于基因组、miRNA、LncRNA、circRNA等分子的查询,以及蛋白或蛋白修饰变化(甲基化、乙酰化等)与DNA启动子、miRNA、LncRNA、circRNA的互作,LncRNA与miRNA、mRNA、circRNA等相互的结合调控,目前各种数据库大概有上百种,没有系统性针对性的数据库,以下是我们对数据的整理,通过数据库查询分类、数据库功能及用途、示例结合分析、数据库优化等这四大项,进行阐述和演示数据库的查询和使用,希望对您的实验项目有所帮助

 基因查询数据库:

查询获取你的基因信息及相关序列信息

①NCBI:https://www.ncbi.nlm.nih.gov/

②UCSC:http://genome.ucsc.edu/

③Ensembl:http://www.ensembl.org/index.html

④EBI:http ://www.ebi.ac.uk/

⑤NIG:http://www.nig.ac.jp/

MiRNA查询数据库:

①miRBase: http://www.mirbase.org

②microRNA.org:http://www.microrna.org/

③deepBase: http://deepbase.sysu.edu.cn/

④starBase: http://starbase.sysu.edu.cn/

⑤targetScan:http://www.targetscan.org/vert_70/

⑥TarBase: http://www.tarbase.com/

⑦miRanda: http://www.microrna.org/microrna/home.do

⑧RNAhybrid:https://bibiserv.cebitec.uni-bielefeld.de/

⑨CoGeMiR:http://cogemir.tigem.it/

⑩miRNApath:http://lgmb.fmrp.usp.br/mirnapath/tools.php

LncRNA查询数据库:

①Ensembl:http://www.ensembl.org/index.html

②LncRNAdb:http://www.lncrnadb.org/

③LNCipedia:https://lncipedia.org/

④CHIPbase:http://rna.sysu.edu.cn/chipbase/

⑤starBase: http://starbase.sysu.edu.cn/

circRNA查询数据库:

①circBase:http://www.circbase.org/

②CIRCpedia:http://www.picb.ac.cn/rnomics/circpedia/

③deepbase:http://rna.sysu.edu.cn/deepBase/

④starbase:http://starbase.sysu.edu.cn/index.php

常用数据库功能用途介绍:

基因数据库功能:

1. NCBI:

The National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information


生物信息学相关数据库_第1张图片

数据库功能:

Submit:NCBI collects submissions of data for the world's largest public repository of biological and scientific information

Download:The majority of NCBI data are available for downloading, either directly from the NCBI FTP site or by using software tools to download custom datasets

Learn:NCBI creates a variety of educational products including courses, workshops, webinars, training materials and documentation. NCBI educational events are free and open to everyone. All NCBI educational materials are available for anyone to re-use and distribute.

Develop:NCBI provides a variety of resources that allow developers to access and manipulate NCBI data in their applications.

Analyze:NCBI provides a wide variety of data analysis tools that allow users to manipulate, align, visualize and evaluate biological data.

2.UCSC Genome Browser:

The UCSC Genome Browser is developed and maintained by the Genome Bioinformatics Group, a cross-departmental team within the UCSC Genomics Institute. the website has grown to include a broad collection of vertebrate and model organism assemblies and annotations, along with a large suite of tools for viewing, analyzing and downloading data.


生物信息学相关数据库_第2张图片

数据库功能:

Genome Browser:interactively visualize genomic data

BLAT:rapidly align sequences to the genome

Table Browser:download data from the Genome Browser database

Variant Annotation Integrator:get functional effect predictions for variant calls

Data Integrator:combine data sources from the Genome Browser database

Gene Sorter:find genes that are similar by expression and other metrics

Genome Browser in a Box (GBiB):run the Genome Browser on your laptop or server

In-Silico PCR:rapidly align PCR primer pairs to the genome

LiftOver:convert genome coordinates between assemblies

VisiGene:interactively view in situ images of mouse and frog


MiRNA数据库:

1. miRBase

the microRNA database


生物信息学相关数据库_第3张图片

数据库功能:

• The miRBase database is a searchable database of published miRNA sequences and annotation. • The miRBase Registry provides miRNA gene hunters with unique names for novel miRNA genes prior to publication of results. 


生物信息学相关数据库_第4张图片

2. microRNA.org :

Targets and Expression,Predicted microRNA targets & target downregulation scores. Experimentally observed expression patterns.

数据库功能:

1. mirSVR predicted target site scoring method: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites

2. microRNA target predictions: The microRNA.org resource: targets and expression.

3. miRanda application: Human MicroRNA targets.

4. miRanda algorithm: MicroRNA targets in Drosophila.


LncRNA数据库:

1. Ensembl genome browser

Ensembl is a genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation. Ensembl annotate genes, computes multiple alignments, predicts regulatory function and collects disease data. Ensembl tools include BLAST, BLAT, BioMart and the Variant Effect Predictor (VEP) for all supported species


生物信息学相关数据库_第5张图片

数据库功能:

Variant Effect Predictor

Gene expression in Ensembl

Retrieving sequences

Compare genes across species

SNPs and other variants for my gene

Use my own data in Ensembl


生物信息学相关数据库_第6张图片

2. LncRNAab :

Long Noncoding RNA Database v2.0- The Reference Database For Functional Long Noncoding RNAs


生物信息学相关数据库_第7张图片

数据库功能:

nucleotide sequences

genomic context

gene expression data derived from the Illumina Body Atlas

structural information

subcellular localization

conservation

function with referenced literature


3. LNCipedia.org v. 4.1:

A comprehensive compendium of human long non-coding RNAs


生物信息学相关数据库_第8张图片


circRNA数据库:

1. circBase:

Circular RNA ( circ RNA) is a recent addition to the growing list of types of noncoding RNA.Here you can explore public circ RNA datasets and download the custom python scripts needed to dis cover cicRNAs in your own RNA-seq data


生物信息学相关数据库_第9张图片

数据库功能(Database function)

• Sequence-based search

• Search the database by identifier, gene description, genomic position, or their lists.

• Retrieve dataset slices by defining a set of conditions (table browser).

• Export tables in a variety of formats.

• Export FASTA files containing genomic sequence.


2. CIRCpedia:

CIRCpedia is an integrative database, aiming to annotating alternative back-splicing and alternative splicing in circRNAs across different cell lines. Through employing an upgraded circRNA characterization pipeline (CIRCexplorer2), thousands of alternative back-splicing and alternative splicing events in circRNAs were identified. All these identified alternative back-splicing and alternative splicing in circRNAs, together with novel exons, are formatted and classified for being easily searched, browsed and downloaded from CIRCpedia


生物信息学相关数据库_第10张图片

示例分析:

基因查询:以H19为例

UCSC数据库

1. 打开主页面

2. 点击Genome Browser,选择种属,

3. 对话框中输入基因,点击“GO”

4. 即可查询到基因的相关信息


生物信息学相关数据库_第11张图片
生物信息学相关数据库_第12张图片
生物信息学相关数据库_第13张图片

数据库优化:

UCSC数据库可查询到基因的信息,以及该基因在不同物种中,序列的保守性等数据


2. miRNA查询:

miRBase使用:以has-mir-9为例

1. 输入网址,打开主页面

2. “search by miRNA name or keyword’对话框中输入miRNA名称

3. 点击“GO”查询

4. 根据您的物种需要,点击即可获取该miRNA的相关信息

5. 点击“Get sequence”,即可获取序列信息


生物信息学相关数据库_第14张图片

数据库优化:

MiRbase是一款非常强大的miRNA查询数据库,可查询miRNA相关信息外,还可以做与mRNA的结合预测分析,详细请您进一步探知



LncRNA查询:以LncRNA H19为例

Ensembl genome browser数据库:

1. 打开主页面

2. 选取种属,对话框输入查询LncRNA

3. 点击进入,即可获取LncRNAH19的相关信息


生物信息学相关数据库_第15张图片
生物信息学相关数据库_第16张图片

数据库优化:Ensembl数据库是一款可查询LncRNA不同剪接变体及详细信息的数据库,对于LncRNA有多种剪接变体来说,可查询获取得到确切的研究变体序列



CircRNA查询:

CircRNA数据库:以CDR1(小脑变性相关蛋白1)为例,查询环状RNA信息


生物信息学相关数据库_第17张图片



生物信息学相关数据库_第18张图片

数据库优化:circbase可查询基因转录对应的环状RNA信息外,还可以直接通过输入环状RNA的ID或是名称进行查询,可得到详细的环状RNA的信息

你可能感兴趣的:(生物信息学相关数据库)