Check out RefSeq release 226, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. The release is provided in several directories as a complete dataset and also divided by logical groupings.
What’s included in this release?
As of September 13, 2024, this full release incorporates genomic, transcript, and protein data containing:
- 472,512,852 records
- 355,355,673 proteins
- 65,576,846 RNAs
- Sequences from 155,792 organisms