Reference genomes, protein and nucleotide sequences databases, and other bio resources are now available on Discovery and Endeavour. If you need a specific release that is not currently included in the pages below, please submit a help ticket and we will try to make those resources available to you.
A set of ready-to-use reference sequences and annotations for commonly analyzed organisms, sourced from iGenomes.Genbank
The GenBank sequence database is an open access, annotated collection of all publicly available nucleotide sequences and their protein translations.Genome Taxonomy Database (GTDB)
The Genome Taxonomy Database (GTDB) is an initiative to establish a standardized microbial taxonomy based on genome phylogeny.Pfam Database
The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs).TIGRFAMs
TIGRFAMs is a resource consisting of curated multiple sequence alignments, Hidden Markov Models (HMMs) for protein sequence classification, and associated information designed to support automated annotation of (mostly prokaryotic) proteins.UniProt
The Universal Protein Resource (UniProt), a collaboration between the European Bioinformatics Institute (EBI), the SIB Swiss Institute of Bioinformatics, and the Protein Information Resource (PIR)