BIO Code Report

This document discusses using Biopython to analyze a COVID-19 DNA sequence. It shows how to import modules, parse the FASTA format DNA sequence, transcribe it to mRNA, translate the mRNA to an amino acid sequence, split the sequence at stop codons to identify proteins, and use ProtParam to analyze properties of the identified proteins such as molecular weight and flexibility.

Uploaded by

Sai Sangavi

COVID-19 DNA sequence data analysis using Python.

Major Modules Used:

Biopython
Squiggle
Pandas

Importing Modules:

from __future__ import division

from Bio.SeqUtils import ProtParam
import warnings
import pandas as pd
from Bio import SeqIO
from Bio.Data import CodonTable

We will use Bio.SeqIO from Biopython to parse the DNA sequence data (FASTA format). It provides a simple, uniform interface for reading and writing assorted sequence file formats.

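for sequence in SeqIO.parse(r'[Link]', "fasta"):
    # Print each record's sequence and its length
    print(sequence.seq)
    print(len(sequence), 'nucleotides')

# Read the file again as a single record
DNAsequence = SeqIO.read(r'[Link]', "fasta")
print(DNAsequence)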
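
For readers new to the format: a FASTA file is a header line starting with ">" followed by one or more lines of sequence. The sketch below shows that parsing idea in plain Python. It is only an illustration, not Biopython's implementation; the function name read_fasta and the sample record are made up for this example.

```python
from io import StringIO


def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text handle."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Hypothetical record whose sequence is wrapped over two lines
sample = StringIO(">example_record\nATTAA\nAGGTT\n")
records = list(read_fasta(sample))
for name, seq in records:
    print(name, len(seq), "nucleotides")
```

Joining the wrapped lines is the step that matters here: the record length is the length of the joined sequence, not of any single line.
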
Since the input sequence is FASTA (DNA), and the coronavirus is an RNA virus, we need to:
Transcribe DNA to RNA (ATTAAAGGTT… => AUUAAAGGUU…)
Translate RNA to an amino acid sequence (AUUAAAGGUU… => IKGLYLPR*Q…)
In the current scenario, the .fna file starts with ATTAAAGGTT. Calling transcribe() replaces each T (thymine) with U (uracil), so we get an RNA sequence that starts with AUUAAAGGUU. The transcribe() method converts the DNA to mRNA.
DNA = DNAsequence.seq
mRNA = DNA.transcribe()
print(mRNA)
print('Size : ', len(mRNA))

The only difference between the DNA and the mRNA is that the base T (thymine) is replaced with U (uracil).
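
As a rough plain-Python illustration of that substitution (a sketch of the idea, not Biopython's transcribe() implementation; the helper name is chosen here for illustration), the first ten bases quoted above can be converted like this:

```python
def transcribe(dna):
    """Return the RNA spelling of a coding-strand DNA string (T -> U)."""
    return dna.upper().replace("T", "U")


rna = transcribe("ATTAAAGGTT")
print(rna)  # AUUAAAGGUU
```

The length is unchanged, which is why the size printed for the mRNA matches the nucleotide count of the DNA.
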
Next, we translate the mRNA sequence to an amino acid sequence using the translate() method. We get something like IKGLYLPR*Q (* marks a so-called STOP codon, which effectively acts as a separator between proteins).
Amino_Acid = mRNA.translate(table=1, cds=False)
print('Amino Acid', Amino_Acid)
print("Length of Protein:", len(Amino_Acid))
print("Length of Original mRNA:", len(mRNA))
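
To make the translation step concrete, here is a minimal plain-Python sketch that reads an mRNA string three bases at a time and looks each codon up in the standard genetic code. It is an illustration under stated assumptions, not Biopython's translate(): the names are hypothetical, and silently dropping a trailing incomplete codon is a choice made for this sketch.

```python
from itertools import product

# Standard genetic code, listed in U, C, A, G order for each codon position
BASES = "UCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna):
    """Translate an mRNA string codon by codon; '*' marks a stop codon."""
    usable = len(mrna) - len(mrna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[mrna[i:i + 3]] for i in range(0, usable, 3))


protein = translate("AUUAAAGGUU")
print(protein)       # IKG
print(len(protein))  # 3, i.e. 10 bases // 3
```

Because every three bases become one letter, the amino acid sequence is about one third the length of the mRNA, which is what the two length printouts above compare.
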

The standard genetic code is traditionally represented as an RNA codon table because, when proteins are made in a cell by ribosomes, it is mRNA that directs protein synthesis. The mRNA sequence is determined by the sequence of genomic DNA. Here are some features of codons:
Most codons specify an amino acid
Three “stop” codons mark the end of a protein
One “start” codon, AUG, marks the beginning of a protein and also encodes the amino acid methionine.

Figure: A series of codons in part of a messenger RNA (mRNA) molecule. Each codon consists of three nucleotides, usually corresponding to a single amino acid. The nucleotides are abbreviated with the letters A, U, G, and C. This is mRNA, which uses U (uracil). DNA uses T (thymine) instead. This mRNA molecule will instruct a ribosome to synthesize a protein according to this code.


print(CodonTable.unambiguous_rna_by_name['Standard'])
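
The codon features listed above can be checked with a few lines of plain Python. This is a sketch that rebuilds the standard code by hand rather than reading it from Biopython's CodonTable, and the variable names are chosen here for illustration.

```python
from itertools import product

# Standard genetic code in U, C, A, G order; '*' denotes a stop codon
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
codon_table = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)

stop_codons = sorted(c for c, aa in codon_table.items() if aa == "*")
sense_codons = [c for c, aa in codon_table.items() if aa != "*"]

print(len(codon_table), "codons in total")
print(len(sense_codons), "codons specify an amino acid")
print("Stop codons:", stop_codons)
print("AUG encodes:", codon_table["AUG"])
```
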
Now we extract the proteins (chains of amino acids) by splitting the sequence at each stop codon, marked by * (asterisk). Then we remove any chain shorter than 20 amino acids, since 20 is taken here as the length of the smallest known functional protein.

Proteins = Amino_Acid.split('*')
df = pd.DataFrame(Proteins)
[Link]()
print('Total proteins:', len(df))

def conv(item):
    # Length of one amino acid chain
    return len(item)

def to_str(item):
    # Plain-string form of one chain
    return str(item)

df['sequence_str'] = df[0].apply(to_str)
df['length'] = df[0].apply(conv)
df.rename(columns={0: "sequence"}, inplace=True)
[Link]()
# Keep only chains of at least 20 amino acids
functional_proteins = df.loc[df['length'] >= 20]
print('Total functional proteins:', len(functional_proteins))
print(functional_proteins.describe())
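
The split-and-filter step can also be sketched without pandas. The example below is a simplified illustration, assuming a made-up toy sequence and a hypothetical helper name; the threshold is lowered to 3 only so that the toy input produces a visible result.

```python
def split_chains(amino_acids, min_length=20):
    """Split at stop codons ('*') and keep chains of at least min_length."""
    chains = amino_acids.split("*")
    kept = [chain for chain in chains if len(chain) >= min_length]
    return chains, kept


# Toy sequence: two adjacent stops produce an empty chain in the middle
toy = "IKGLYLPR*Q**MKV"
chains, kept = split_chains(toy, min_length=3)
print(chains)  # ['IKGLYLPR', 'Q', '', 'MKV']
print(kept)    # ['IKGLYLPR', 'MKV']
```

Note that adjacent stop codons yield empty chains, so the total count before filtering includes zero-length entries.
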

Protein Analysis with the ProtParam Module in Biopython:

poi_list = []
MW_list = []

for record in Proteins[:]:
    print("\n")
    X = ProtParam.ProteinAnalysis(str(record))
    POI = X.count_amino_acids()
    poi_list.append(POI)
    MW = X.molecular_weight()
    MW_list.append(MW)
    print("Protein of Interest = ", POI)
    # A zero-length chain has no residues to take percentages of
    try:
        print("Amino acids percent = ", str(X.get_amino_acids_percent()))
    except ZeroDivisionError:
        pass
    print("Molecular weight = ", MW)
    try:
        print("Aromaticity = ", X.aromaticity())
    except ZeroDivisionError:
        pass
    print("Flexibility = ", X.flexibility())
    try:
        print("Secondary structure fraction = ", X.secondary_structure_fraction())
    except ZeroDivisionError:
        pass
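
The ZeroDivisionError handlers above make more sense with a small example. Aromaticity is the relative frequency of Phe, Trp and Tyr in a chain, so an empty chain (produced by two adjacent stop codons) has nothing to divide by. The sketch below is a plain-Python analogue written for illustration, not ProtParam's code, and the toy chains are made up.

```python
def aromaticity(chain):
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    aromatic = sum(chain.count(aa) for aa in "FWY")
    return aromatic / len(chain)  # ZeroDivisionError if the chain is empty


toy_chains = ["IKGLYLPR", "", "FWYA"]
for chain in toy_chains:
    try:
        print(repr(chain), aromaticity(chain))
    except ZeroDivisionError:
        print(repr(chain), "skipped: empty chain")
```

Filtering out empty chains before the loop would be an alternative to catching the exception for each property.
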

As the above code produces output for all 775 proteins, we have attached only one of the output screens.
