blast数据库含义

blast的数据库里面有这几个数据库,每一个的具体含义:

https://ncisf.org/index.php?q=software-databases/blast-databases

A list of the databases available on the cluster, including information about the database, it's source, update method and description.

All databases are located in /sw/db

Name
Type
Update Method
Source
Description 
nt nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/nt.* nucleotide sequence database, with entries from all traditional divisions of GenBank, EMBL, and DDBJ excluding bulk divisions (gss, sts, pat, est, and htg divisions. wgs entries are also excluded. Not non-redundant.
nr protein Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/nr.* non-redundant protein squence database with 
entries from GenPept, Swissprot, PIR, PDF, PDB
and NCBI RefSeq
swissprot protein Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/swissprot.tar.gz swiss-prot sequence databases (last major update), 
it's parent database is nr.
human_genomic nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/human_genomic.* Human RefSeq (NC_######) chromosome records 
with gap adjusted concatenated NT_ contigs
est_human nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/est_human.* Alias and mask files for human subset of the est
database.   These alias and mask files need all volumes 
of est to function properly.
pataa protein Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/pataa.* Patent protein sequence database.  Directly from 
USPTO or from EU/Japan Patent Agencies via EMBL/DDBJ
patnt nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/patnt.* Patent nucleotide sequence database.  Directly from 
USPTO or from EU/Japan Patent Agencies via EMBL/DDBJ
pdbaa protein Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/pdbaa.* Protein sequneces from PDB protein structures, it's parent 
database is nr.
pdbnt nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/pdbnt/* Nucleotide sequences from pdb nucleic acid structures.  
It's parent database is nt.  They are NOT the protein coding 
sequences for the corresponding pdbaa entries.
sts nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/sts.* Sequences from the STS division of GenBank, EMBL, and DDBJ
vector nucleic Automatic - NCBI formatted. ftp://ftp.ncbi.nih.gov/blast/db/vector.* Vector sequence database.  (Note that for vector screening, 
NCBI recommend using the UniVec database, please contact 
[email protected] should you require this database).

你可能感兴趣的:(blast数据库含义)