Database

We need to organize biological data into databases because scientists produce large amounts of it. Sharing data in databases allows it to help other researchers, even if a particular piece of data was not useful for the original scientist's paper. Biological databases organize data into standardized records with fields for items like identifiers, sequences, descriptions, and references. This allows the data to be stored efficiently and shared in easy-to-access ways.

Uploaded by

filymascolo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views2 pages

Database

Uploaded by

filymascolo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

BIOLOGICAL DATABASE

We need to organize our data because we produce a lot of it. Scientific literature is not somehow to
share data, but is used to share stories. There is the personal interpretation of our data. Data are not
the main part. We need to share data in easy ways. There may be data not useful in my paper, so I
won’t use it and it will be lost, but in database it can help other scientist with their research.
DATABASE ORGANIZATION
Flat collection. Items that are somehow identical  same type of elements, same features in
common. For protein I want for sure to store the sequence but is not enough: when was sequenced,
how, where etc… EMBL Databank: Idea to store items like this in a typical file, called tabbed
and in order to store data in optimized form. How can a human record a gene of 20.000 characters?
A database is a collection of file, which one is a record of a protein. Flat collection of elements
identical. Each file is organized in smart and economical way. Every info is on a line: what it and
plenty of option readable from AI also. A field is typical a line (ID ex). I could have different
update for a single line, that wasn’t possible before. If I have no space left for description of line 1
data, I can keep writing and start writing in the next line and so on, and that next line is not another
description of the same data, but is the remaining of the previous that wasn’t possible to write in
due to no space left.
Ex of EMBL record: ID, Acc number, dates, description, keywords, taxonomy, ref block 1, ref bloc
2, ref block 3 (grouped in references), comment.
At least we want the sequence, so it’s put in feature field the sequence itself. It’s written in way that
not only machines can read it but also humans, ex the seq is split in group of 10 nucleotides with
spaces and at the end of the line the number of index nucleotide (60, 120, 180, 240…). Additional
field may be CDS (coding sequence) that tells where the sequence starts coding, and so a program
that does translation can use this information (20-1729). A file written in the 80’s is still readable.
DDBJ is national storage for sequences made in java. At a certain point database stopped competing
vs each other and started collaboration, stored in INSDC International Nucleotide Sequence
Database Collaboration (DDBJ, EML, NCI…)
ENA European Nucleotide Archive
EMBL DB + SRA = ENA {vedi questa storia che la chiede sicuro l’ha detto Luca}
Today we usually take the information by a web interface. Be able to distinguish the web interface
to the database, that is only a program that answers a query. Before w/o internet no nightly updates,
that were monthly or 3-monthly, and data didn’t move via internet but in suitcases in trains xD lol
lmao so funny kill me pls I hate this life.
PROTEIN SEQUENCE DATABASE – DATABANKS
Electronic version of ATLAS of protein sequence and structure (1965?).
Swissprot (1986) but usable in sequences rich in annotation (descry of function, domain
structure…)
TrEmble (1996) useful for protein not in swissprot with no annotation… ex protein that could exist
but are not found and no proved they exist.
SECONDARY DATABASE
Ex tremble comes not from experimental work but from translation of already existing.
If I want to sequence 20 nt, then another seq a gene, another mitochondrial genome, an other trnas,
there’s a lot of chaos, and we need to put order in this disorder, trying to put together the pieces. Ex
if I put all sequencing in a database, I may see that in a species there may be billions genes, but it’s
not possible, how many genes have a human or a chimpanzee? Not billions for sure, there’s a ort of
redundancy. I put all this stuff in a program and I somehow it organizes.
HOW TO ORGANIZE DATA
It’s not an informatic issue but most a logic problem. We have to make data easily understandable;
today we produce a huge amount of data, in an afternoon we can produce a larger amount of data
done in a year, thanks to an experiment. We can study the expression of several genes in several cell
lines in several condition.
New model is to organize ex human names in index, because some humans may share the same
number (ok facciamo finta di sì) ad the same address. By doing this, if I ask database “who lives in
Via Roma 21?” the query will compare only one time Via Roma 21 for each line, in the sense that
will appear in database only one time and not 4 or 5, and is associated to 4 or 5 indexes associated
with 4 or 5 people. In this way I can reduce computational stress and fasten the research.

M Lec 01 & 02 Biological Database
No ratings yet
M Lec 01 & 02 Biological Database
50 pages
Sec1 Introduction To Bioinformatics
No ratings yet
Sec1 Introduction To Bioinformatics
20 pages
Biological Database
No ratings yet
Biological Database
8 pages
Unit Ii
No ratings yet
Unit Ii
23 pages
Seminar Bioinformatics
No ratings yet
Seminar Bioinformatics
13 pages
Bioinformatics Databases Explained
No ratings yet
Bioinformatics Databases Explained
5 pages
Capture D'écran . 2023-03-14 À 00.15.22
No ratings yet
Capture D'écran . 2023-03-14 À 00.15.22
54 pages
02-A-Introduction To Biological Databases
No ratings yet
02-A-Introduction To Biological Databases
52 pages
Overview of Sequence Databases
No ratings yet
Overview of Sequence Databases
135 pages
Bioinformatics Lecture 1
No ratings yet
Bioinformatics Lecture 1
48 pages
Rese Rach
No ratings yet
Rese Rach
37 pages
Understanding Biological Databases
No ratings yet
Understanding Biological Databases
47 pages
Bioinformatics Database Basics
No ratings yet
Bioinformatics Database Basics
18 pages
Biological - Databases Class Work 60
No ratings yet
Biological - Databases Class Work 60
60 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Database
No ratings yet
Database
40 pages
Lecture3 4
No ratings yet
Lecture3 4
73 pages
Introduction to Biological Databases
No ratings yet
Introduction to Biological Databases
49 pages
Biological Databases in Bioinformatics
No ratings yet
Biological Databases in Bioinformatics
29 pages
Lec4 Databases
No ratings yet
Lec4 Databases
29 pages
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
No ratings yet
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
48 pages
Databases Class Work
No ratings yet
Databases Class Work
48 pages
Bioinformatics PPT Section B Data Storage and Retrival Group 3
No ratings yet
Bioinformatics PPT Section B Data Storage and Retrival Group 3
36 pages
Intro to Biological Databases
No ratings yet
Intro to Biological Databases
14 pages
2024.HF BioInformatics Lec3p
No ratings yet
2024.HF BioInformatics Lec3p
11 pages
Lecture Bioinfo Databases
No ratings yet
Lecture Bioinfo Databases
27 pages
Bi 5&10mark Q&A Mse 1
No ratings yet
Bi 5&10mark Q&A Mse 1
14 pages
Biological Databases
No ratings yet
Biological Databases
41 pages
Lesson 01 Intro DataBases V2
No ratings yet
Lesson 01 Intro DataBases V2
38 pages
Module 2 (Bioinformatics)
No ratings yet
Module 2 (Bioinformatics)
81 pages
Introduction To Databases
No ratings yet
Introduction To Databases
21 pages
Bioinformatics Week 1: Play Video Starting At:4:13 and Follow Transcript4:13
No ratings yet
Bioinformatics Week 1: Play Video Starting At:4:13 and Follow Transcript4:13
7 pages
Lecture 5-6 - Databases NR
No ratings yet
Lecture 5-6 - Databases NR
35 pages
Bioinformatics for Researchers
No ratings yet
Bioinformatics for Researchers
23 pages
Biological Databases: - Bio-Informatics
No ratings yet
Biological Databases: - Bio-Informatics
16 pages
BIOINFORMATICS
No ratings yet
BIOINFORMATICS
13 pages
Introduction to Bioinformatics Basics
No ratings yet
Introduction to Bioinformatics Basics
35 pages
Lecture1 BIOF242 Shuvadeep
No ratings yet
Lecture1 BIOF242 Shuvadeep
38 pages
Bioinformatics Database and Applications
100% (3)
Bioinformatics Database and Applications
82 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
Overview of Biological Databases
No ratings yet
Overview of Biological Databases
50 pages
Cannataro 2014
No ratings yet
Cannataro 2014
10 pages
? Bioinformatics Study Note
No ratings yet
? Bioinformatics Study Note
4 pages
Biological Databases
No ratings yet
Biological Databases
19 pages
Unit II Major Databases in Bioinformatics
No ratings yet
Unit II Major Databases in Bioinformatics
54 pages
Biological Databases PDF
No ratings yet
Biological Databases PDF
13 pages
Biological Databases
No ratings yet
Biological Databases
13 pages
#1 L1 BioDatabases
No ratings yet
#1 L1 BioDatabases
89 pages
Biological Databases
No ratings yet
Biological Databases
17 pages
Bioinformatics Database Guide
No ratings yet
Bioinformatics Database Guide
7 pages
Bioinformatics: Overview and Applications
No ratings yet
Bioinformatics: Overview and Applications
24 pages
Biological Databases
No ratings yet
Biological Databases
3 pages
Overview of Bioinformatics Databases
50% (2)
Overview of Bioinformatics Databases
5 pages
Bioinfo Lecture 2
No ratings yet
Bioinfo Lecture 2
29 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
Essential Info Notes-1
No ratings yet
Essential Info Notes-1
57 pages
Presentation 11
No ratings yet
Presentation 11
20 pages
المحاضرة 2
No ratings yet
المحاضرة 2
16 pages
Unit II Bioinformatics
No ratings yet
Unit II Bioinformatics
25 pages
SF9 Senior High School Progress Report
100% (1)
SF9 Senior High School Progress Report
2 pages
Ethiopian Airlines Maintenance Exam Questions
100% (1)
Ethiopian Airlines Maintenance Exam Questions
5 pages
Thesis Help for Smoking Topics
100% (2)
Thesis Help for Smoking Topics
8 pages
Lei Ilima Girls Club Project: Calendar of Events
No ratings yet
Lei Ilima Girls Club Project: Calendar of Events
4 pages
The Chimney Effect in Fire Investigation - Understanding Its Impact On Fire Behavior
No ratings yet
The Chimney Effect in Fire Investigation - Understanding Its Impact On Fire Behavior
11 pages
Stages of Hypovolemic Shock
No ratings yet
Stages of Hypovolemic Shock
25 pages
Resource Governor
No ratings yet
Resource Governor
70 pages
VMware Cloud Director Availability in AVS
No ratings yet
VMware Cloud Director Availability in AVS
24 pages
Module 3 Protocols and Models
No ratings yet
Module 3 Protocols and Models
46 pages
ADMN 2506A Business Statistics Midterm
No ratings yet
ADMN 2506A Business Statistics Midterm
5 pages
The Stick, Regulation As A Tool of Goverment
No ratings yet
The Stick, Regulation As A Tool of Goverment
11 pages
D20 Modern - WOTC - Past - Oef BM We
96% (25)
D20 Modern - WOTC - Past - Oef BM We
101 pages
Types of Learning
No ratings yet
Types of Learning
3 pages
Excerpt - Ursoi Race
100% (2)
Excerpt - Ursoi Race
2 pages
From Natural Language To Simulations Applying AI To Automate Simulation Modelling of Logistics Systems
No ratings yet
From Natural Language To Simulations Applying AI To Automate Simulation Modelling of Logistics Systems
25 pages
Synchronous Machines Overview and Analysis
No ratings yet
Synchronous Machines Overview and Analysis
58 pages
Rio+20: Sustainable Development Goals
No ratings yet
Rio+20: Sustainable Development Goals
13 pages
Yamaha Ventura Phazer 8GC - 28197 - J1
No ratings yet
Yamaha Ventura Phazer 8GC - 28197 - J1
212 pages
EE 311 Module 2: Resistance
No ratings yet
EE 311 Module 2: Resistance
13 pages
Level II - Ata 36-21-30 Air Systems
100% (1)
Level II - Ata 36-21-30 Air Systems
88 pages
SGM722XMS
No ratings yet
SGM722XMS
18 pages
LIC
No ratings yet
LIC
33 pages
LESSON 4 - Modern Dance
No ratings yet
LESSON 4 - Modern Dance
45 pages
21 00202 Proposed Construction of Basketball Court Roofing at Bugallon Plaza
No ratings yet
21 00202 Proposed Construction of Basketball Court Roofing at Bugallon Plaza
84 pages
Park Hyatt Abu Dhabi Anti-Slip Treatment
No ratings yet
Park Hyatt Abu Dhabi Anti-Slip Treatment
1 page
Adaptec Ultra160 Driver Guide Win98
No ratings yet
Adaptec Ultra160 Driver Guide Win98
4 pages
Speedometer
No ratings yet
Speedometer
58 pages
ATL001 CTA1 Foundations 01 TASA 2009 and The Code of Professional Conduct
No ratings yet
ATL001 CTA1 Foundations 01 TASA 2009 and The Code of Professional Conduct
21 pages
High School Unified Manual 2024-25
No ratings yet
High School Unified Manual 2024-25
209 pages
TH 0622
No ratings yet
TH 0622
8 pages

Database

Uploaded by

Database

Uploaded by

BIOLOGICAL DATABASE

You might also like