Tag: Eukaryote Genome Annotation Pipeline (EGAP)

Try Out a Development Version of NCBI’s Publicly Available Annotation Tool, EGAPx

Try Out a Development Version of NCBI’s Publicly Available Annotation Tool, EGAPx

Latest release now available 

Are you generating genomes for vertebrates, arthropods, or plants, and looking for a way to generate high-quality genome annotation? NCBI is working on a public version of the NCBI Eukaryotic Genome Annotation Pipeline (EGAPx), and the latest developmental release is now available for testing and feedback. Continue reading “Try Out a Development Version of NCBI’s Publicly Available Annotation Tool, EGAPx”

RefSeq Release 226 is Available!

RefSeq Release 226 is Available!

Check out RefSeq release 226, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. The release is provided in several directories as a complete dataset and also divided by logical groupings.

What’s included in this release?

As of September 13, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 472,512,852 records
  • 355,355,673 proteins
  • 65,576,846 RNAs
  • Sequences from 155,792 organisms

Continue reading “RefSeq Release 226 is Available!”

New RefSeq Annotations Now Available!

New RefSeq Annotations Now Available!

In February and March, the NCBI Eukaryotic Genome Annotation Pipeline released forty-six new annotations in RefSeq!

New Annotations
  • Aedes albopictus (Asian tiger mosquito)
  • Anolis carolinensis (green anole)
  • Armigeres subalbatus (mosquito)
  • Bacillus rossius redtenbacheri (walking stick)
  • Bolinopsis microptera (comb jelly)
  • Bombyx mori (domestic silkworm)
  • Bubalus kerabau (carabao)
  • Candoia aspera (snake)
  • Cavia porcellus (domestic guinea pig) 
  • Continue reading “New RefSeq Annotations Now Available!”
Now Available: RefSeq Release 223

Now Available: RefSeq Release 223

Check out RefSeq release 223, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of March 4, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 425,594,654 records
  • 316,329,937 proteins
  • 60,886,133 RNAs
  • sequences from 147,591 organisms 

Continue reading “Now Available: RefSeq Release 223”

New RefSeq Annotations Now Available!

New RefSeq Annotations Now Available!

During October to January, the NCBI Eukaryotic Genome Annotation Pipeline released seventy new annotations in RefSeq!

New Annotations
  • Alnus glutinosa (eudicot)
  • Amyelois transitella (moth)
  • Anolis sagrei ordinatus (Brown anole)
  • Apis cerana (Asiatic honeybee)
  • Balaenoptera ricei (Rice’s whale)
  • Bombus pascuorum (bee)
  • Bos javanicus (banteng)
  • Bos taurus (cattle) 

Continue reading “New RefSeq Annotations Now Available!”