Structure and Function of DNA
Structure and Function of DNA
Genome
Dr. Avinash Rawal
Learning Objectives
• Summarize the basic chemical composition of a nucleotide:
nitrogenous base, phosphate and deoxyribose sugar
• Recall the 4 nitrogenous bases of DNA: Adenine, Guanine, Cytosine
and Thymine
• Explain the difference between purine and pyrimidine bases and list
which bases are purines and which are pyrimidines
• Synthesize the complementary DNA strand for a given DNA sequence
• Explain Chargaff's rule and use it to calculate composition of a DNA
strand
• Explain the relationship between DNA composition and melting
temperature
• Describe the features of the DNA strands and the DNA double helix
Learning Objectives
• Describe the experiments that allowed us to conclude DNA is the hereditary
material for cells
• Outline the central dogma of molecular biology
• Explain how prokaryotic DNA is packaged and differentiate this from how
eukaryotic DNA is typically packed
• Recall the basic composition of a nucleosome
• Explain the relationship between nucleosomes and nuclease digestion
patterns
• Define the 10 nm and 30 nm fibers
• Describe how nucleosomes are packaged up to give rise to the 30 nm solenoid
fiber
• Describe how the solenoid fiber is further packaged to give rise to looped
chromatin domains
• Explain how the state of DNA differs at different points of the cell cycle (i.e.
Learning Objectives
• Describe the structure of DNA during interphase
• Describe the structure of DNA during mitosis
• Explain the purpose of centromeres and their role in cell division
• Define the kinetochore
• Describe DNA telomeres structure and explain their purpose
• Outline how DNA telomeres are made
• Describe the technique of karyotyping including its purpose, how it is
performed, and the types of cells commonly used
• Explain what is meant by epigenetics
• List examples of types of epigenetic modifications that occur in
humans
• Define heterochromatin and euchromatin
Learning Objectives
• Recognize the site of DNA methylation in humans
• Discuss the effect that DNA methylation has on gene expression
• Explain Fragile X disorder and how it results from inappropriate
methylation
• Recognize the clinical presentation of Fragile X disorder
• Define what is meant by Barr bodies and X-inactivation and the role
DNA methylation plays in the process
• Describe the major modifications that occur to core histone proteins
and their effect on gene expression
• Explain how HDACs and HATs affect the expression of DNA
Learning Objectives
• Describe the basic features of a human gene and the function of each part
• Describe the types of DNA sequences that are part of the human genome
(single copy, repetitive, etc.)
• Explain what is meant by single copy DNA and its role in the cell
• Explain what is meant by tandem repeat sequences and their role in the
cell
• Explain what is meant by interspersed DNA repeats and their role in the
cell
• Explain what is meant by transposable elements and how this relates to
interspersed DNA repeats
• Explain how retrotransposons copy themselves
• Describe how retrotransposons can cause DNA mutations
• Explain what is meant by pseudogenes
DNA
• DNA or deoxyribonucleic acid is a cell’s
instruction manual
• It encodes all the genetic information
necessary to produce a functional organism ATC G
• It is the heredity material ournitrogenousbases
Mnemonic PyramidCUT
• Pyrimidine (Thymine and Cytosine): nitrogenous bases with a single
ring
Single Strand:
The Structure of DNA
• So each chain consists of a backbone of
sugar and phosphates alternating. Each
sugar has a nitrogenous base attached si 3
• The final structure incorporates two single
chains held together by hydrogen bonds
between the nitrogenous bases Two Strands held together by hydrogen bonds
• A bonds to T (2 Hydrogen bonds) between nitrogenous bases:
• G bonds to C (3 Hydrogen bonds)
• The two strains run antiparallel to one 3 5
another
• In other words, one strand is starting with the
phosphate (aka 5’ end) at the end of the
chain, the other has the sugar (aka 3’ end)
• The ultimate result is the double helix
structure
Terminology: 3’ and 5’ ends
• The phosphate end is referred to as the 5’
end
ÉI
• The –OH end on the sugar is referred to as
the 3’ end
• This nomenclature is based on organic
chemistry numbering of the 5 carbons that
make up the deoxyribose sugar
• Also, thanks to the predictable pairings of A
with T and C with G, if we are given the
sequence of one strand of DNA we can
determine the other
Which of the following sequences is complementary
for this DNA sequence
5'-CTCGTACCGTTA –3’ ?
A. 5'-TAACGGTACGAG-3‘
B. 5'-CTCGTACCGTTA-3‘
C. 5'- ATTGCCATGCTC-3‘
D. 5'-GAGCATGGCAAT-3'
Lookforvideosaboutthis
Chargaff’s Rule
• Since in a double helix of DNA, adenine always pairs with thymine and
guanine always pairs with cytosine, if information is given about one
base, we can calculate the others
• Chargaff’s Rule: A=T and G=C, Hence, the # of purines= # of pyrimidines
(A+G=T+C)
• So we can predict % of other bases if given a % of one base
• Example: If a DNA sample from an organism has 21% guanine
• It has 21% cytosine (G=C)
• A+T = 100- 42= 58%
• A = 29% and T=29%
• Also, if % bases do not match Chargaff’s rule, then the DNA structure is not
a complete double stranded structure
• So it might be or have regions that are single stranded
Chargaff’s Rule Practice Question
ii
19
Packaging DNA inside the Nucleus
• Each human cell contains approximately 6 billion base pairs that make
up our 46 individual chromosomes
• So how do our cells organize and manage our huge DNA molecules?
• How do we fit all this DNA into a cell?
• How to does the cell provide access to the parts being actively used for
making proteins?
Prokaryotic DNA & Packaging
• Prokaryotes package DNA in bacterial
chromosomes and plasmids
• Bacterial chromosomes: Circular DNA molecule,
containing some bound proteins, that is localized
in a special region of the bacteria called the
nucleoid
• Negatively supercoiled
• folded into extensive loops held by RNA and proteins
• Bacterial plasmids: small, circular molecules of
DNA that carry genes for both their own
replication and one or more cellular function
• Four classes of E. Coli plasmids Image:
• F (fertility) factors https://www.ck12.org/book/ck-12-biology-advanced-
concepts/section/11.5/
• Col (colicinogenic) factors
• R (resistance) factors
• Cryptic plasmids
21
Eukaryotes package DNA in chromatin and June13ᵗʰ202
chromosomes in ofourcells DNA everynucleus
a 7andahalffeet
mm
22
The packaging of DNA inside the nucleus
• DNA is first wound up around
proteins known as histones to give
O rise to a structure known as a
nucleosome (beads on a string)
• Gives rise to a 10 nm fiber
• These nucleosomes are then
packed together to form
chromatin fibers
• Gives rise to a 30 nm fiber aka
solenoid fiber
• These fibers can be further
condensed when cells divide (aka
mitosis and meiosis) to give rise to
chromosomes
Structure of a Nucleosome
• Nucleosomes consist of DNA wound
around proteins known as histones
• Histones are small, positively charged
proteins
• This allows them to readily interact with the
negative DNA phosphate backbone
• Thus they consist of a high amount of amino
acids Lysine and Arginine
• Humans have 5 abundant histones: H1,
H2A, H2B, H3 and H4
• H2A, H2B, H3 and H4 are core histones
• This is the structure that the DNA wraps around
• H1 is a linker histone
• This protein binds linker DNA outside the
nucleosome structure
Structure of a Nucleosome
34
Figure 4-71 Molecular Biology of the Cell (© Garland Science 2008)
Telomeres
• Telomeres are sequences at the ends
of chromosomes, and are required for
the replication of linear DNA
molecules.
• The sequences are similar among
eukaryotes, with repeats containing
clusters of G residues on one strand.
• They are repeated hundreds or
thousands of times and end with a 3ʹ
overhang of single-stranded DNA.
• The telomere sequences of some
organisms (including humans) form
loops at the ends.
• They bind a protein complex (shelterin)
that protects the chromosome termini
from degradation.
Telomeres
• The ends of linear chromosomes can’t be
replicated by DNA polymerase.
• Instead, telomerase, which uses reverse
transcriptase activity, replicates telomeric DNA
sequences.
• Maintenance of telomeres appears to be
important in determining the lifespan and
reproductive capacity of cells.
• Studies of telomeres and telomerase may provide
new insights into aging and cancer.
Karyotyping
Cell
• Karyotyping allows us to Iainamanita
examine chromosomes in a cell EE
• Common cell types karyotyped
• T lymphocytes from blood
• Chorionic villi
• Amniotic fluid
• The cells are grown in tissue
culture and arrested in mitosis
by microtubule disrupting
drugs
• Synchronize and enrich for
a population of cells in
mitosis
Karyotype from a male patient with Down syndrome, showing
• Hypotonic lysis to release three copies of chromosome 21. From Thompson & Thompson
chromosomes
Summary: Eukaryote DNA Packaging in the Nucleus
dotranscriptionnow
cannot
you
11nm 30nm
2nm
700nm
300nm
1400nm
Sperm Nucleus Structure and Chromatin
Packaging
• The chromatin within a sperm cell is
organized differently compared to a
somatic cell
• Histone-rich chromatin located
towards the periphery
• Protamine-rich chromatin located
towards the center
• Some regions of the sperm chromatin
replaces histones with protamines
• Protamines are small, arginine-rich,
nuclear proteins
• Allows for enhanced packaging into
toroid (doughnut loop-shaped)
structures
• Makes the chromatin more compact
for packaging in the sperm head
Top image from: Zini and Libman, CMAJ August 29, 2006 175 (5) 495-500; DOI:
Toroid
https://doi.org/10.1503/cmaj.060218 http://www.cmaj.ca/content/175/5/495/tab-figures-data
Bottom image from: William Steven Ward, Am J Clin Exp Urol 2018;6(2):87-92 www.ajceu.us /ISSN:2330-
1910/AJCEU0075900,
https://pdfs.semanticscholar.org/1147/a321ab07e4921ee420803567d8ca4a37c263.pdf
Epigenetic Modifications and Regulation of
Gene Expression
• Epigenetic: The term that refers to any
factor that can affect gene function without
change in the genotype
• Some typical epigenetic factors involve
alterations that change genome structure
and affect gene expression without
changing the primary DNA sequence:
• DNA methylation,
• Chromatin structure
• Histone modifications
• Transcription factor binding
• Side Note: As you learn more about
gene expression and protein
production you will appreciate that
there are lots of levels of regulation
for this process, not just those we will
discuss here that are epigenetic
Euchromatin vs Heterochromatin
• Euchromatin is genomic DNA that is • Heterochromatin
chromatin that
is highly folded
is usually
more loosely packed and diffuse. transcriptionally inactive
• Most of transcriptionally active genes • Heterochromatin is highly organized and
are in euchromatin. unusually resistant to gene expression
• Transcriptionally active genes often • DNA is packaged so tightly that transcription is
inhibited
have acteylated histones associated • DNA in heterochromatin is often
with their DNA methylated on cytosines in CpG
• During interphase the euchromatin is sequences
decondensed and distributed • During interphase, heterochromatin
throughout the nucleus. remains highly condensed and is
transcriptionally inactive
DNA Methylation
• Methylated DNA is often associated with
heterochromatin
• Major site of Methylation in mammals
including humans is on cytosine of 5’-CG-3’,
usually in 5’ noncoding sequences
• Methylation patterns are inherited after DNA
replication
• X-chromosome inactivation – one of the X
chromosomes in cells from females is highly
condensed, transcriptionally inactive
heterochromatin
• The inactive X chromosome:
• Barr body
• The Barr body is extensively methylated
• So generally speaking, methylation of DNA is
associated with turning off gene expression
aka gene silencing
Disease of Inappropriate DNA Methylation:
Fragile X Disorder aka Martin-Bell Syndrome
• Disorder related to a mutation in
FMR1 gene
• FMR1 is located on the X
chromosome so disease is inherited
in a X-linked dominant fashion
• This disorder is also an example of
a trinucleotide repeat disease (a
disease caused by an inappropriate
number of repeats in a gene)
• The mechanism of disease is due
to inappropriate DNA methylation
Fragile X (aka Martin-Bell Syndrome)
• Within the promoter of the FMR1
gene is a region that consists of
repeated CGG sequences
• These repeats are normal and don’t
cause any issues with FMR1 expression
• A problem occurs however if the CGG
region becomes too large (aka too
many CGG repeats)
• Once the CGG repeats >200, it triggers
the cell to consider these CGGs
methylation sites and so it
hypermethylates the FMR1 promoter
• This methylation silences the FMR1 gene
and so it is not expressed
• Since FMR1 is required for various
functions in the body, its absence causes
the syndrome Fragile X disorder
Fragile X Syndrome (Martin-Bell Syndrome) GG
● Clinical features
• Intellectual Disability
• Delayed language development, autistic behavior
• Coarse facial features (long and narrow face, prominent
forehead, jaw, big ears)
• Macroorchidism: Testicles grow to a total average volume of
50 ml (normal 15 to 20 mL) in adulthood (rarely occurs before
puberty)
• Deep plantar creases
• Beyond average height and head circumference
• Mitral valve prolapse: can lead to mitral regurgitation
• Hyperactivity
• Focal seizures
DNA
RNA
PYotein
getsynthesizedinthe
Ribosomes nucleolus
• A human gene consists of:
• A promoter region (5’ Regulatory Region) • Exons
• Regulates the timing and amount of expression of a • Protein coding sequences within the gene
particular gene, specifically how much mRNA is made • Introns
• Located in the 5’ end • Sequences between exons that are removed from the
mature mRNA
• NOT made into mRNA • 3’UTR
• 5’ UTR • Contains regulator regions that affect gene expression
• Responsible for recruiting ribosome to make RNA into and export
protein • Made into RNA but not into protein
• Located downstream from promoter and upstream (5’ • 3’ Regulatory Sequences
side) of gene • Can also be other regulatory sites in the 3’ end of the
• Made into mRNA but not into protein gene that are not made into RNA
Organization of the Human Genome
• Genetic linkage analysis and
physical mapping of cloned
genomic and cDNA sequences
established maps of the
human genome, which
provided a background for
genomic sequencing.
• A major surprise from the Only 1-2% of our genome
human genome sequence actually encodes proteins
was the unexpectedly low
number of genes: only 20,000
to 25,000.
• But alternative splicing in
human genes allows a
single gene to specify
more than one protein.
• So what is in the human About 90% of an
average human
genome? gene consists of
introns.
Single Copy Elements
• Only 1-2% of DNA sequences codes for proteins, but 75% of DNA is
transcribed into RNA.
• Single copy DNA à Present once, or very few times in the genome.
• About 50% of the genome, and includes the 1.5% that are genes
59
thisapparently
Won'taskquestionsabout hastloslides
Pseudogenes
• Pseudogenes are nonfunctional segments of DNA that resemble functional genes.
Pseudogenes can arise in two ways:
• Duplication of a segment of DNA results in the transfer of a block of DNA to a new
location in the genome.
• Duplication by reverse transcription of an mRNA, followed by integration of the cDNA
copy into a new chromosomal site.
• Duplication of a gene by reverse transcription usually yields an inactive gene copy called a
processed pseudogene—they lack introns and the normal sequences that direct
transcription.