| Database | Type | Location on the Helix Systems | Last Updated | ||
EMBOSS databasesAccessible via EMBOSS web interface& EMBOSS command-line | |||||
| EST EST division of Genbank |
Nuc | /fdb/embossdb/est.new | 30 Oct 2009 (Updated bimonthly after Genbank release | ||
| Gb_New All sequences added to Genbank since last major release |
Nuc | /fdb/embossdb/gbnew.new | 24 Nov 2009 (Updated daily | ||
| Genbank The NIH Genetic Sequence Database, an annotated collection of all publicly available DNA sequences. More information at NCBI. |
Nuc | /fdb/embossdb/genbank.new | 21 Oct 2009 (Updated bimonthly after Genbank release | ||
| Refseqnt NCBI's comprehensive, integrated, non-redundant set of sequences, including genomic DNA, transcript (RNA) for major research organisms. |
Nuc | /fdb/embossdb/refseqnt.new | 18 Nov 2009 (Updated weekly | ||
| Prints Protein fingerprints, groups of conserved motifs used to characterize a protein family. |
Patterns | used internally by Emboss | 18 Nov 2009 (Updated after new Prints release | ||
| Prosite A database/dictionary of protein sites and patterns. More information at Expasy. |
Patterns | used internally by Emboss | 18 Nov 2009 (Updated every 2 months | ||
| REBASE About restriction enzymes, recognition sequences, cleavage sites... More information at REBASE. |
Enzymes | used internally by Emboss | 18 Nov 2009 (Updated every month | ||
| Transfac Transcription factor database, most recent version from www.biobase.de. |
Info | used internally by Emboss | 04 Sep 2009 (Updated with new Transfac release. | ||
| GenPept GenPept is produced by parsing the corresponding GenBank release for translated coding regions of GenBank sequences. More information at NCI, Frederick |
Prot | /fdb/embossdb/genpept.new | 28 Oct 2009 (Updated bimonthly after Genbank release | ||
| GP_New All sequences added to GenPept since last major release |
Prot | /fdb/embossdb/gpnew.new | 24 Nov 2009 (Updated daily | ||
| Refseqaa NCBI's comprehensive, integrated, non-redundant set of protein sequences for major research organisms. |
Prot | /fdb/embossdb/refseqaa.new | 18 Nov 2009 (Updated weekly | ||
| UniProt (Swissprot + Trembl) A highly-annotated, curated protein sequence database. Minimal redundancy and high level of integration with other databases. More information at Expasy |
Prot | /fdb/embossdb/uniprot | 04 Nov 2009 (Updated weekly | ||
Blast databasesAccessible via Blast (Helix)Blast (Biowulf) | |||||
| Drosophila Drosophila sequences |
Nuc | /fdb/blastdb/drosoph.nt | 25 Jul 2005 (Updated weekly | ||
| E.Coli E.Coli sequences |
Nuc | /fdb/blastdb/ecoli.nt | 25 Jul 2005 (Updated weekly | ||
| E.Coli E.Coli sequences |
Nuc | /fdb/blastdb/ecoli.nt | 25 Jul 2005 (Updated weekly | ||
| EST - human Human sequences from the EST division of Genbank |
Nuc | /fdb/blastdb/est_human | 19 Nov 2009 (Updated weekly | ||
| EST - mouse Mouse sequences from the EST division of Genbank. |
Nuc | /fdb/blastdb/est_mouse | 19 Nov 2009 (Updated weekly | ||
| EST - others Non-human, non-mouse sequences from the EST division of Genbank |
Nuc | /fdb/blastdb/est_others | 19 Nov 2009 (Updated weekly | ||
| HTGs High throughput genome sequences |
Nuc | /fdb/blastdb/htgs | 17 Nov 2009 (Updated weekly | ||
| Human Genome Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | /fdb/blastdb/hs_genome | 01 May 2006 (Updated after new build release | ||
| Human Genome RNA Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | /fdb/blastdb/hs_genome.rna | 11 Aug 2009 (Updated after build release | ||
| Mito Mitochondrial sequences |
Nuc | /fdb/blastdb/mito.nt | 25 Jul 2005 (Updated weekly | ||
| Mouse Genome Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | /fdb/blastdb/mouse_genome | 25 Mar 2008 (Updated after new build release | ||
| Mouse Genome RNA Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | /fdb/blastdb/mouse_genome.rna | 26 Mar 2008 (Updated after release | ||
| NCBI nt All GenBank+EMBL+DDBJ (but no EST, STS, GSS, HTG). No longer nonredundant. |
Nuc | /fdb/blastdb/nt | 18 Nov 2009 (Updated weekly | ||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
Nuc | /fdb/blastdb/pdbnt | 19 Nov 2009 (Updated weekly | ||
| Refseq Human Genomic Refseq Human (NC_######) chromosome records with gap adjusted concatenated NT_ contigs |
Nuc | /fdb/blastdb/human_genomic | 19 Nov 2009 (Updated weekly | ||
| Refseq Human RNA A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Nuc | /fdb/blastdb/human.rna | 23 Nov 2009 (Updated weekly | ||
| Refseq Mouse RNA A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Nuc | /fdb/blastdb/mouse.rna | 23 Nov 2009 (Updated weekly | ||
| Refseq Other Genomic RefSeq chromosome records (NC_######) for organisms other than human |
Nuc | /fdb/blastdb/other_genomic | 17 Nov 2009 (Updated weekly | ||
| UniVec Core A non-redundant database of sequences commonly attached to cDNA or genomic DNA during the cloning process. Includes only oligonucleotides and vectors consisting of bacterial, phage, viral, yeast or synthetic sequences, but not vectors that include sequences of mammalian origin. |
Nuc | /fdb/blastdb/univec_core | 28 Apr 2008 (Updated once | ||
| Yeast Yeast sequences |
Nuc | /fdb/blastdb/yeast.nt | 25 Jul 2005 (Updated weekly | ||
| Drosophila Drosophila sequences |
Prot | /fdb/blastdb/drosoph.aa | 25 Jul 2005 (Updated weekly | ||
| E.Coli E.Coli sequences |
Prot | /fdb/blastdb/ecoli.aa | 19 Mar 2006 (Updated weekly | ||
| Human Genome Proteins Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Prot | /fdb/blastdb/hs_genome.protein | 11 Aug 2009 (Updated after build release | ||
| Mito Mitochondrial sequences |
Prot | /fdb/blastdb/mito.aa | 25 Jul 2005 (Updated weekly | ||
| Mouse Genome Proteins Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Prot | /fdb/blastdb/mouse_genome.protein | 26 Mar 2008 (Updated weekly | ||
| NCBI nr NCBI's nonredundant Genbank CDS translations + PDB + SwissProt |
Prot | /fdb/blastdb/nr | 19 Nov 2009 (Updated weekly | ||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
Prot | /fdb/blastdb/pdbaa | 19 Nov 2009 (Updated weekly | ||
| Refseq Human Proteins A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Prot | /fdb/blastdb/human.protein | 23 Nov 2009 (Updated weekly | ||
| Refseq Mouse Proteins A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Prot | /fdb/blastdb/mouse.protein | 23 Nov 2009 (Updated weekly | ||
| SwissProt A highly-annotated, curated protein sequence database. Minimal redundancy and high level of integration with other databases. More information at Expasy |
Prot | /fdb/blastdb/swissprot | 19 Nov 2009 (Updated weekly | ||
| Yeast Yeast sequences |
Prot | /fdb/blastdb/yeast.aa | 25 Jul 2005 (Updated weekly | ||
Fasta databasesAccessible via Fasta, BLAT. User programs. | |||||
| Drosophila Drosophila sequences |
Nuc | /fdb/fastadb/drosoph.nt.fas | 25 Jul 2005 (Updated weekly | ||
| E.Coli E.Coli sequences |
Nuc | /fdb/fastadb/ecoli.nt.fas | 25 Jul 2005 (Updated weekly | ||
| EST - human Human sequences from the EST division of Genbank. |
Nuc | /fdb/fastadb/est_human.fas | 24 Nov 2009 (Updated weekly | ||
| EST - mouse Mouse sequences from the EST division of Genbank. |
Nuc | /fdb/fastadb/est_mouse.fas | 24 Nov 2009 (Updated weekly | ||
| Human Genome Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | /fdb/genome/human-apr2006/ | 01 May 2006 (Updated after new build release | ||
| Human Genome RNA Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | /fdb/fastadb/hs_genome.rna.fas | 28 Apr 2006 (Updated after build release | ||
| Mito Mitochondrial sequences |
Nuc | /fdb/fastadb/mito.nt.fas | 25 Jul 2005 (Updated weekly | ||
| Mouse Genome Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | /fdb/genome/mouse-mar2006/ | 26 Mar 2008 (Updated after new build release | ||
| Mouse Genome RNA Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | /fdb/fastadb/mouse_genome.rna.fas | 25 Mar 2008 (Updated after release | ||
| NCBI nt All GenBank+EMBL+DDBJ (but no EST, STS, GSS, HTG). No longer nonredundant. |
Nuc | /fdb/fastadb/nt.fas | 24 Nov 2009 (Updated weekly | ||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
Nuc | /fdb/fastadb/pdb.nt.fas | 24 Nov 2009 (Updated weekly | ||
| Refseq Human Genomic Refseq Human (NC_######) chromosome records with gap adjusted concatenated NT_ contigs |
Nuc | /fdb/fastadb/ref.human.genomic.fas | 24 Nov 2009 (Updated weekly | ||
| Refseq Human RNA A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Nuc | /fdb/fastadb/ref.human.rna.fas | 24 Nov 2009 (Updated weekly | ||
| Refseq Mouse RNA A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Nuc | /fdb/fastadb/ref.mouse.rna.fas | 24 Nov 2009 (Updated weekly | ||
| Refseq Other Genomic RefSeq chromosome records (NC_######) for organisms other than human |
Nuc | /fdb/fastadb/ref.other.genomic.fas | 24 Nov 2009 (Updated weekly | ||
| Yeast Yeast sequences |
Nuc | /fdb/fastadb/yeast.nt.fas | 25 Jul 2005 (Updated weekly | ||
| Drosophila Drosophila sequences |
Prot | /fdb/fastadb/drosoph.aa.fas | 25 Jul 2005 (Updated weekly | ||
| E.Coli E.Coli sequences |
Prot | /fdb/fastadb/ecoli.aa.fas | 25 Jul 2005 (Updated weekly | ||
| Human Genome Proteins Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Prot | /fdb/fastadb/hs_genome.protein.fas | 28 Apr 2006 (Updated after build release | ||
| Mito Mitochondrial sequences |
Prot | /fdb/fastadb/mito.aa.fas | 25 Jul 2005 (Updated weekly | ||
| Mouse Genome Proteins Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Prot | /fdb/fastadb/mouse_genome.protein.fas | 25 Mar 2008 (Updated weekly | ||
| NCBI nr NCBI's nonredundant Genbank CDS translations + PDB + SwissProt |
Prot | /fdb/fastadb/nr.aa.fas | 24 Nov 2009 (Updated weekly | ||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
Prot | /fdb/fastadb/pdb.aa.fas | 24 Nov 2009 (Updated weekly | ||
| Refseq Human Proteins A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Prot | /fdb/fastadb/ref.human.protein.fas | 24 Nov 2009 (Updated weekly | ||
| Refseq Mouse Proteins A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Prot | /fdb/fastadb/ref.mouse.protein.fas | 24 Nov 2009 (Updated weekly | ||
| SwissProt A highly-annotated, curated protein sequence database. Minimal redundancy and high level of integration with other databases. More information at Expasy |
Prot | /fdb/fastadb/swissprot.aa.fas | 24 Nov 2009 (Updated weekly | ||
| Yeast Yeast sequences |
Prot | /fdb/fastadb/yeast.aa.fas | 25 Jul 2005 (Updated weekly | ||
Mascot databasesAccessible via Mascot search engine | |||||
| MSDB A nonredundant protein sequence database designed specifically for mass-spec applications. |
Prot | biospec.nih.gov | 20 Jun 2008 (Updated weekly | ||
| NCBI nr NCBI's nonredundant Genbank CDS translations + PDB + SwissProt |
Prot | biospec.nih.gov | 24 Nov 2009 (Updated weekly | ||
| NIH-Specific A collection of NIH-specific databases requested by NIH Mascot users. |
Prot | biospec.nih.gov | 13 Nov 2009 (Updated as requested | ||
| Sp_Trembl SwissProt + Trembl (a computer-annotated supplement of SwissProt) |
Prot | biospec.nih.gov | 08 Nov 2009 (Updated weekly | ||
| SwissProt A highly-annotated, curated protein sequence database. Minimal redundancy and high level of integration with other databases. More information at Expasy |
Prot | biospec.nih.gov | 08 Nov 2009 (Updated weekly | ||
PDB databasesAccessible via Molecules R Us or direct access to coordinate files.NIH users can NFS-mount the PDB databases on their own machines -- contact staff@helix.nih.gov for more info. | |||||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
3-D | /pdb/pdb | 25 Nov 2009 (Updated daily | ||
CSD databasesAccessible via Quest | |||||
| Cambridge Structural Database Crystal structure information for over 165,000 organic and organometallic compounds. More info at CCDC. |
3-D | /local/csd | 03 Apr 2009 (Updated every 3 months | ||
PFAM databasesAccessible via HMMER (Biowulf, Helix) | |||||
| PFAM A collection of multiple sequence alignments and hidden Markov models. More information at PFAM home page |
Families | /fdb/fastadb/pfam | 23 Mar 2009 (Updated every 3 months | ||
WU-Blast databasesAccessible via WU-Blast | |||||
| Drosophila Drosophila sequences |
Prot | /fdb/wublastdb/drosoph.aa | 25 Aug 2009 (Updated weekly | ||
| E.Coli E.Coli sequences |
Prot | /fdb/wublastdb/ecoli.aa | 25 Aug 2009 (Updated weekly | ||
| Human Genome Proteins Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Prot | /fdb/wublastdb/hs_genome.protein | 01 May 2006 (Updated after build release | ||
| Mito Mitochondrial sequences |
Prot | /fdb/wublastdb/mito.aa | 29 Oct 2008 (Updated weekly | ||
| Mouse Genome Proteins Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Prot | /fdb/wublastdb/mouse_genome.protein | 25 Mar 2008 (Updated weekly | ||
| NCBI nr NCBI's nonredundant Genbank CDS translations + PDB + SwissProt |
Prot | /fdb/wublastdb/nr | 24 Nov 2009 (Updated weekly | ||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
Prot | /fdb/wublastdb/pdb.aa | 24 Nov 2009 (Updated weekly | ||
| Refseq Human Proteins A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Prot | /fdb/wublastdb/ref.human.protein | 24 Nov 2009 (Updated weekly | ||
| Refseq Mouse Proteins A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Prot | /fdb/wublastdb/ref.mouse.protein | 24 Nov 2009 (Updated weekly | ||
| SwissProt A highly-annotated, curated protein sequence database. Minimal redundancy and high level of integration with other databases. More information at Expasy |
Prot | /fdb/wublastdb/swissprot.aa | 24 Nov 2009 (Updated weekly | ||
| Yeast Yeast sequences |
Prot | /fdb/wublastdb/yeast.aa | 25 Aug 2009 (Updated weekly | ||
| Drosophila Drosophila sequences |
Nuc | /fdb/wublastdb/drosoph.nt | 25 Aug 2009 (Updated weekly | ||
| E.Coli E.Coli sequences |
Nuc | /fdb/wublastdb/ecoli.nt | 25 Aug 2009 (Updated weekly | ||
| EST - human Human sequences from the EST division of Genbank. |
Nuc | /fdb/wublastdb/est_human | 24 Nov 2009 (Updated weekly | ||
| EST - mouse Mouse sequences from the EST division of Genbank. |
Nuc | /fdb/wublastdb/est_mouse | 24 Nov 2009 (Updated weekly | ||
| Human Genome Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | /fdb/wublastdb/hs_genome | 01 May 2006 (Updated weekly | ||
| Human Genome RNA Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | /fdb/wublastdb/hs_genome.rna | 01 May 2006 (Updated after build release | ||
| Mito Mitochondrial sequences |
Nuc | /fdb/wublastdb/mito.nt | 25 Aug 2009 (Updated weekly | ||
| Mouse Genome Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | /fdb/wublastdb/mouse_genome | 25 Mar 2008 (Updated after new build release | ||
| Mouse Genome RNA Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | /fdb/wublastdb/mouse.rna | 25 Mar 2008 (Updated after release | ||
| NCBI nt All GenBank+EMBL+DDBJ (but no EST, STS, GSS, HTG). No longer nonredundant. |
Nuc | /fdb/wublastdb/nt | 24 Nov 2009 (Updated weekly | ||
| Protein Data Bank An archive of experimentally determined three-dimensional strtures of biological macromolecules. More information at the PDB. |
Nuc | /fdb/wublastdb/pdb.nt | 24 Nov 2009 (Updated weeekly | ||
| Refseq Human RNA A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Nuc | /fdb/wublastdb/ref.human.rna | 24 Nov 2009 (Updated weekly | ||
| Refseq Mouse RNA A comprehensive, integrated, non-redundant set of sequences. More info at NCBI |
Nuc | /fdb/wublastdb/ref.mouse.rna | 24 Nov 2009 (Updated weekly | ||
| Yeast Yeast sequences |
Nuc | /fdb/wublastdb/yeast.nt | 25 Aug 2009 (Updated weekly | ||
MySQL databasesAccessible via NIH mirror of UCSC Genome BrowserAlso available for direct MySQL queries from the Biowulf cluster nodes. | |||||
| Chicken Genome May 2006 assembly from WUSTL. |
Nuc | NIH mirror of UCSC Genome Browser | 30 Dec 2007 (Updated weekly | ||
| Cow Genome Mar 2005 assembly from the Baylor Sequencing Center |
Nuc | NIH mirror of UCSC genome browser | 24 Mar 2008 (Updated weekly | ||
| Dog Genome May 2005 assembly from the Broad Institute |
Nuc | NIH mirror of UCSC genome browser | 30 Dec 2007 (Updated weekly | ||
| Human Genome Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Nuc | NIH mirror of UCSC genome browser | 23 Jul 2009 (Updated weekly | ||
| Mouse Genome Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Nuc | NIH mirror of UCSC genome browser | 01 Oct 2009 (Updated weekly | ||
| Rat Genome May 2006 build, rn4, from the Rat Genome Sequencing Consortium |
Nuc | NIH mirror of UCSC genome browser | 29 Dec 2007 (Updated weekly | ||
| Rhesus genome Jan 2006 assembly from the Baylor Sequencing Center. |
Nuc | NIH mirror of UCSC genome browser | 29 Dec 2007 (Updated weekly | ||
| Zebrafish genome Mar 2006 assembly from the Sanger Center. |
Nuc | NIH mirror of UCSC genome browser | 30 Dec 2007 (Updated weekly | ||
| Chicken Genome May 2006 assembly from WUSTL. |
Prot | NIH mirror of UCSC Genome Browser | 30 Dec 2007 (Updated weekly | ||
| Cow Genome Aug 2006 assembly from the Baylor Sequencing Center |
Prot | NIH mirror of UCSC genome browser | 24 Mar 2008 (Updated weekly | ||
| Dog Genome May 2005 assembly from the Broad Institute |
Prot | NIH mirror of UCSC genome browser | 30 Dec 2007 (Updated weekly | ||
| Human Genome Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Prot | NIH mirror of UCSC genome browser | 23 Jul 2009 (Updated weekly | ||
| Mouse Genome Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Prot | NIH mirror of UCSC genome browser | 01 Oct 2009 (Updated weekly | ||
| Rat Genome May 2006 build, rn4, from the Rat Genome Sequencing Consortium |
Prot | NIH mirror of UCSC genome browser | 29 Dec 2007 (Updated weekly | ||
| Rhesus genome Jan 2006 assembly from the Baylor Sequencing Center. |
Prot | NIH mirror of UCSC genome browser | 29 Dec 2007 (Updated weekly | ||
| Zebrafish genome Mar 2006 assembly from the Sanger Center. |
Prot | NIH mirror of UCSC genome browser | 30 Dec 2007 (Updated weekly | ||
| Chicken Genome May 2006 assembly from WUSTL. |
Annotations | NIH mirror of UCSC Genome Browser | 30 Dec 2007 (Updated weekly | ||
| Cow Genome Mar 2005 assembly from the Baylor Sequencing Center |
Annotations | NIH mirror of UCSC genome browser | 24 Mar 2008 (Updated weekly | ||
| Dog Genome May 2005 assembly from the Broad Institute |
Annotations | NIH mirror of UCSC genome browser | 30 Dec 2007 (Updated weekly | ||
| Drosophila genome April 2006 assembly |
Annotations | NIH mirror of UCSC genome browser | 17 Jul 2009 (Updated weekly | ||
| Human Genome Build 36, hg18 (April 2006) from the International Human Genome Consortium |
Annotations | NIH mirror of UCSC genome browser | 23 Jul 2009 (Updated weekly | ||
| Mouse Genome Build 37, mm9, Jul 2007 from the Mouse Genome Consortium |
Annotations | NIH mirror of UCSC genome browser | 01 Oct 2009 (Updated weekly | ||
| Rat Genome May 2006 build, rn4, from the Rat Genome Sequencing Consortium |
Annotations | NIH mirror of UCSC genome browser | 29 Dec 2007 (Updated weekly | ||
| Rhesus genome Jan 2006 assembly from the Baylor Sequencing Center. |
Annotations | NIH mirror of UCSC genome browser | 29 Dec 2007 (Updated weekly | ||
| Zebrafish genome Mar 2006 assembly from the Sanger Center. |
Annotations | NIH mirror of UCSC genome browser | 30 Dec 2007 (Updated weekly | ||

