Tag: Eukaryote Genome Annotation Pipeline (EGAP)
With the latest release of EGAPx, we’re excited to announce that you can now submit genome assemblies with EGAPx annotations directly to GenBank. We’re making it easier for researchers to share richly annotated eukaryotic genomes, complete with structural and functional features generated by the EGAPx pipeline.
What’s new?
- Easily integrate your EGAPx annotations into GenBank: You can now attach the EGAPx-generated ASN.1 annotation file as part of a submission package.
Continue reading “GenBank Now Supports EGAPx-Based Annotation” →
Check out RefSeq release 229, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. The release is provided in several directories as a complete dataset and also divided by logical groupings.
What’s included in this release?
As of March 3, 2025, this full release incorporates genomic, transcript, and protein data containing:
- 522,879,448 records
- 399,577,538 proteins
- 68,985,910 RNAs
- Sequences from 164,117 organisms
Continue reading “RefSeq Release 229 is Now Available!” →
Latest release now available
Are you generating genomes for vertebrates, arthropods, or plants, and looking for a way to generate high-quality genome annotation? NCBI is working on a public version of the NCBI Eukaryotic Genome Annotation Pipeline (EGAPx), and the latest developmental release is now available for testing and feedback.
Continue reading “Try Out a Development Version of NCBI’s Publicly Available Annotation Tool, EGAPx” →
Check out RefSeq release 226, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. The release is provided in several directories as a complete dataset and also divided by logical groupings.
What’s included in this release?
As of September 13, 2024, this full release incorporates genomic, transcript, and protein data containing:
- 472,512,852 records
- 355,355,673 proteins
- 65,576,846 RNAs
- Sequences from 155,792 organisms
Continue reading “RefSeq Release 226 is Available!” →
In February and March, the NCBI Eukaryotic Genome Annotation Pipeline released forty-six new annotations in RefSeq!
New Annotations
- Aedes albopictus (Asian tiger mosquito)
- Anolis carolinensis (green anole)
- Armigeres subalbatus (mosquito)
- Bacillus rossius redtenbacheri (walking stick)
- Bolinopsis microptera (comb jelly)
- Bombyx mori (domestic silkworm)
- Bubalus kerabau (carabao)
- Candoia aspera (snake)
- Cavia porcellus (domestic guinea pig)
Continue reading “New RefSeq Annotations Now Available!” →
Check out RefSeq release 223, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.
What’s included in this release?
As of March 4, 2024, this full release incorporates genomic, transcript, and protein data containing:
- 425,594,654 records
- 316,329,937 proteins
- 60,886,133 RNAs
- Sequences from 147,591 organisms
Continue reading “Now Available: RefSeq Release 223” →
From October through January, the NCBI Eukaryotic Genome Annotation Pipeline released seventy new annotations in RefSeq!
New Annotations
- Alnus glutinosa (eudicot)
- Amyelois transitella (moth)
- Anolis sagrei ordinatus (brown anole)
- Apis cerana (Asiatic honeybee)
- Balaenoptera ricei (Rice’s whale)
- Bombus pascuorum (bee)
- Bos javanicus (banteng)
- Bos taurus (cattle)
Continue reading “New RefSeq Annotations Now Available!” →