RefSeq release 95: naming evidence added to all relevant WP proteins

RefSeq release 95: naming evidence added to all relevant WP proteins

RefSeq release 95 is accessible online, via FTP and through NCBI’s Entrez programming utilities, E-utilities.

This full release incorporates genomic, transcript, and protein data available, as of July 8, 2019 and contains 206,416,381 records, including 146,381,777 proteins, 27,212,750 RNAs, and sequences from 93,618 organisms.

The release is provided in several directories as a complete dataset and as divided by logical groupings.

Special announcements:

  1. New human genome Annotation Release 109.20190607

    As previously announced, Annotation Release 109.20190607 includes the MANE Select subset of transcripts. The annotation products are available in the sequence databases and on the FTP site.

  2. Naming evidence added to all relevant prokaryotic RefSeq protein (WP) names

    We have added a new comment to all WP proteins that received their names from a curated Hidden Markov Model-based or BLAST-based protein family, or a conserved domain architecture. Publications, gene symbols and EC numbers are also propagated from these naming evidence when available. For example, see WP_004152100.1 or WP_003106506.1. More details about the protein family used as the naming evidence are available from the Evidence Accession hyperlink shown in Entrez Protein, and more information on the naming of microbial proteins is available here.

Leave a Reply