Skip to content

Comments

Ignore unknown bases ('N's) when detecting DB type#739

Merged
yannickwurm merged 1 commit intowurmlab:masterfrom
tadast:tt/unknown-bases
Apr 16, 2024
Merged

Ignore unknown bases ('N's) when detecting DB type#739
yannickwurm merged 1 commit intowurmlab:masterfrom
tadast:tt/unknown-bases

Conversation

@tadast
Copy link
Collaborator

@tadast tadast commented Apr 16, 2024

Some FASTAs might have a lot of unknown bases before meaningful data starts e.g. https://www.ncbi.nlm.nih.gov/nuccore/CM000994.3 Keep scanning the file until we reach meaningful bases to run the determination algorithm against.

Closes #727

Some FASTAs might have a lot of unknown bases before meaningful data
starts e.g. https://www.ncbi.nlm.nih.gov/nuccore/CM000994.3
Keep scanning the file until we reach meaningful bases to run the
determination algorithm against.

Closes wurmlab#727
@tadast tadast force-pushed the tt/unknown-bases branch from 2182828 to 8bdd2fd Compare April 16, 2024 11:34
@yannickwurm yannickwurm merged commit 00db003 into wurmlab:master Apr 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

genome fastas beginning with Ns aren't correctly identified

2 participants