|
The National Center for Biotechnology Information (NCBI) at the National Library
of Medicine maintains several databases of genetic sequence data. A Full Record
in MEDLINE® may be linked to any of ten databases.
Click the link to get information about the genetic sequence data reported
in the source article.
Conserved Domains
Conserved Domains is a database of protein domains. The source databases for
Conserved Domains are Pfam, Smart, and COG.*
Genome Database
The Genome database provides views for a variety of genomes, complete chromosomes,
sequence maps with contigs, and integrated genetic and physical maps. The database
is organized in six major organism groups: Archaea, Bacteria, Eukaryotae, Viruses,
Viroids, and Plasmids and includes complete chromosomes, organelles, and plasmids
as well as draft genome assemblies.*
Nucleotide
The Nucleotide database contains sequence data from GenBank, EMBL, and DDBJ,
the members of the tripartite, international collaboration of sequence databases.
EMBL is the European Molecular Biology Laboratory at Hinxton Hall, UK; DDBJ is
the DNA Database of Japan in Mishima, Japan. Sequence data are also incorporated
from the Genome Sequence Data Base (GSDB), Santa Fe, NM. Patent sequences are
incorporated through arrangements with the U.S. Patent and Trademark Office (USPTO)
and via the collaborating international databases from other international patent
offices.*
OIMM Database
OMIM (Online Mendelian Inheritance in Man) is a database of human genes and
genetic disorders authored and edited by Dr. Victor A. McKusick and his colleagues
at Johns Hopkins University and elsewhere. It was developed for the World Wide
Web by NCBI.
The database contains textual information and references. It contains copious
links to MEDLINE and sequence records in the Entrez system. It also contains links
to additional related resources at NCBI and elsewhere.*
PopSet Database
The PopSet database contains aligned sequences submitted as a set resulting
from a population, phylogenetic, or mutation study. These alignments describe
such events as evolution and population variation. The PopSet database contains
both nucleotide and protein sequence data.*
ProbeSet
Probe is a public registry of nucleic acid reagents designed for use in a wide
variety of biomedical research applications, together with information on reagent
distributors, probe effectiveness, and computed sequence similarities.*
Protein Database
The Protein database contains sequence data from the translated coding regions
from DNA sequences in GenBank, EMBL, and DDBJ as well as protein sequences submitted
to Protein Information Resource (PIR), SWISS-PROT, Protein Research Foundation
(PRF), and Protein Data Bank (PDB) (sequences from solved structures).*
SNP
The Single Nucleotide Polymorphism Database (dbSNP) is a central repository
database for both single-base nucleotide substitutions and short deletion and
insertion polymorphisms.*
Structure
The Structure database or Molecular Modeling Database (MMDB) contains experimental
data from crystallographic and NMR structure determinations. The data for MMDB
are obtained from the Protein Data Bank (PDB). The NCBI has cross-linked structural
data to bibliographic information, to the sequence databases, and to the NCBI
taxonomy.*
Taxonomy Database
The Taxonomy database contains the names of all organisms that are represented
in the NCBI genetic database by at least one nucleotide or protein sequence.*
UniGene
The UniGene database contains gene-oriented clusters of transcript sequences.
UniGene is an experimental system for automatically partitioning GenBank sequences
into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains
sequences that represent a unique gene, as well as related information such as
the tissue types in which the gene has been expressed and map location.*
UniSTS
The UniSTS database contains Sequence Tagged Sites (STS) derived from STS mapping
data.
UniSTS
integrates marker and mapping data from a variety of public resources. Data sources include dbSTS, RHdb, GDB, various human maps, and various mouse maps.*
3D Domain
3D Domains are compact structural domains identified automatically in MMDB,
Entrez's macromolecular three-dimensional structure database.*
|