Composite Dbs

The document discusses the proliferation of primary sequence databases (dbs) and the challenges in choosing the most accurate, up-to-date, and comprehensive options. It highlights various dbs such as NRL-3D, PIR, SWISS-PROT, and composite dbs like NRDB, OWL, and MIPSX, each with unique features and limitations. The document suggests that composite dbs can streamline searches by amalgamating multiple sources, thus improving efficiency and reducing redundancy.

Uploaded by

sadia.202204062

An embarras de richesses

The proliferation of primary sequence dbs gives rise to a number of questions:

Do they all have the same format?
Which is the most accurate?
Which is the most up-to-date?
Which is the most comprehensive?
Given the choice, which should we use?

Of the protein sequence dbs, NRL-3D is the least comprehensive because it reflects only the contents of PDB, yet it has the advantage of relating directly to structural information.

PIR (1-4) is the most comprehensive resource, but the quality of its annotations is still relatively poor.

SWISS-PROT, on the other hand, is a highly structured db that provides excellent annotations, but its sequence coverage is poor compared to PIR.

Choosing the right db to search can seem an impossible choice; so is it, perhaps, better to search them all?

Composite Protein Sequence Dbs

One solution to the problem of proliferating primary dbs is to compile a composite, i.e. a db that amalgamates a variety of different primary sources.

Composite dbs render sequence searching much more efficient, because they obviate the need to interrogate multiple resources.
The interrogation process is streamlined still further if the composite has been designed to be non-redundant, as this means that the same sequence need not be searched more than once.
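
The idea of amalgamating several sources so that each sequence is searched only once can be sketched as follows. This is a minimal illustration, not the procedure used by any real composite db: the source contents, entry identifiers, and the function names `build_composite` and `search` are all hypothetical, and "searching" is reduced to a plain substring test.

```python
def build_composite(sources):
    """Merge {source_name: {entry_id: sequence}} into one table keyed by sequence.

    Identical sequences from different sources collapse into a single key,
    and the list of (source, entry_id) pairs records where each one came from.
    """
    composite = {}
    for source_name, entries in sources.items():
        for entry_id, sequence in entries.items():
            key = sequence.upper()
            composite.setdefault(key, []).append((source_name, entry_id))
    return composite


def search(composite, motif):
    """Return every distinct sequence containing the motif, with its origins."""
    return [(seq, origins) for seq, origins in composite.items() if motif in seq]


# Toy data (hypothetical identifiers and sequences, for illustration only).
sources = {
    "SWISS-PROT": {"sp1": "MKTAYIAKQR", "sp2": "MSDNELLK"},
    "PIR": {"pir1": "MKTAYIAKQR", "pir2": "MGGSHHHH"},
}

composite = build_composite(sources)
hits = search(composite, "TAYI")

print(len(composite))  # number of distinct sequences after merging
print(hits)
```

Four entries across two sources reduce to three distinct sequences, so the shared sequence is examined once while its provenance in both sources is retained.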
Different strategies can be used to create composite
resources.
The final product depends on the chosen data sources and the
criteria used to merge them; e.g.

A composite resource will be non-identical if it eliminates only identical sequence copies during the amalgamation process.

But if both identical and highly similar sequences are removed (e.g. those entries that differ by only one residue), then the resulting db will be more truly non-redundant.
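
The difference between the two criteria can be made concrete with a small sketch. It rests on simplifying assumptions that are not taken from any of the dbs described here: "highly similar" is interpreted as equal length with at most one substituted residue (insertions and deletions are ignored), the first copy encountered is the one kept, and the function names and toy sequences are hypothetical.

```python
def differs_by_at_most_one(a, b):
    """True if a and b have equal length and at most one mismatched residue."""
    if len(a) != len(b):
        return False
    return sum(x != y for x, y in zip(a, b)) <= 1


def non_identical(sequences):
    """Drop exact duplicates only, keeping the first occurrence of each."""
    seen = set()
    kept = []
    for seq in sequences:
        if seq not in seen:
            seen.add(seq)
            kept.append(seq)
    return kept


def non_redundant(sequences):
    """Drop exact duplicates and sequences within one substitution of a kept one.

    Greedy and order-dependent: each sequence is compared with those already
    kept, so the cost grows quadratically with the number of sequences.
    """
    kept = []
    for seq in non_identical(sequences):
        if not any(differs_by_at_most_one(seq, k) for k in kept):
            kept.append(seq)
    return kept


# Toy input: one exact duplicate and one single-residue variant.
sequences = ["MKTAYIAKQR", "MKTAYIAKQR", "MKTAYIAKQK", "MSDNELLK"]

print(non_identical(sequences))  # ['MKTAYIAKQR', 'MKTAYIAKQK', 'MSDNELLK']
print(non_redundant(sequences))  # ['MKTAYIAKQR', 'MSDNELLK']
```

The non-identical result still contains the single-residue variant, whereas the stricter criterion removes it.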
The choice of different sources and the application of different
redundancy criteria have led to the emergence of different
composites, each of which has its own particular format.
The main dbs are outlined below.

NRDB: The Non-Redundant Db is built at the NCBI.

The db is a composite of GenPept (derived from automatic
GenBank CDS translations), PDB sequences, SWISS-PROT,
SPupdate (the weekly updates of SWISS-PROT), PIR and
GenPeptupdate (the daily updates of GenPept).

This db is thus comprehensive and contains up-to-date information.
However, strictly speaking, it is not non-redundant but non-identical, i.e. only identical sequence copies are removed from the resource.
OWL: A non-redundant protein sequence db built at the University of Leeds in collaboration with the Daresbury Laboratory in Warrington.

The db is a composite of four major primary sources: SWISS-PROT, PIR 1-4, GenBank (CDS translations) and NRL-3D.

MIPSX: A merged db produced at the Max-Planck Institut in Martinsried.

The db contains information from the following resources: PIR 1-4; MIPS preliminary entries (MIPSOwn); MIPS/PIR preliminary entries (PIRMOD); MIPS preliminary translations (MIPSTrn); MIPS yeast entries (MIPSH); NRL-3D; SWISS-PROT; EMTrans; GBTrans; Kabat; and PSeqIP.
SWISS-PROT + TrEMBL: At the EBI, the combination of SWISS-PROT and TrEMBL provides a resource that is both comprehensive and minimally redundant.

This db has the advantage of containing fewer errors than do those mentioned above.
