Download lecture_11(LP)

Document related concepts

Oncogenomics wikipedia , lookup

DNA wikipedia , lookup

DNA barcoding wikipedia , lookup

RNA-Seq wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Frameshift mutation wikipedia , lookup

DNA repair wikipedia , lookup

Comparative genomic hybridization wikipedia , lookup

Mitochondrial DNA wikipedia , lookup

Mutagen wikipedia , lookup

DNA sequencing wikipedia , lookup

DNA profiling wikipedia , lookup

Zinc finger nuclease wikipedia , lookup

Human genome wikipedia , lookup

Mutation wikipedia , lookup

Primary transcript wikipedia , lookup

Metagenomics wikipedia , lookup

Genome evolution wikipedia , lookup

Genetic engineering wikipedia , lookup

Nucleosome wikipedia , lookup

DNA polymerase wikipedia , lookup

Cancer epigenetics wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Gene wikipedia , lookup

DNA damage theory of aging wikipedia , lookup

United Kingdom National DNA Database wikipedia , lookup

Replisome wikipedia , lookup

Genealogical DNA test wikipedia , lookup

DNA vaccination wikipedia , lookup

Nucleic acid analogue wikipedia , lookup

Gel electrophoresis of nucleic acids wikipedia , lookup

SNP genotyping wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

Designer baby wikipedia , lookup

Nucleic acid double helix wikipedia , lookup

Extrachromosomal DNA wikipedia , lookup

DNA supercoil wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Point mutation wikipedia , lookup

Non-coding DNA wikipedia , lookup

Molecular cloning wikipedia , lookup

Epigenomics wikipedia , lookup

Cre-Lox recombination wikipedia , lookup

Cell-free fetal DNA wikipedia , lookup

Bisulfite sequencing wikipedia , lookup

No-SCAR (Scarless Cas9 Assisted Recombineering) Genome Editing wikipedia , lookup

Genome editing wikipedia , lookup

Microsatellite wikipedia , lookup

Genomics wikipedia , lookup

Microevolution wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Genomic library wikipedia , lookup

History of genetic engineering wikipedia , lookup

Deoxyribozyme wikipedia , lookup

Helitron (biology) wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
Announcements
-A reading assignment on DNA hybridization has been
posted on the course website.
-The second midterm exam (Monday, May 19th) will cover
material through today’s lecture and next weeks quiz
section.
-My office hours: Fridays, by appointment between 1:00 &
2:30 PM.
-Emailing questions to course TA’s…
If you don’t have much background
with molecular biology techniques:
-The textbooks that are on reserve in the undergraduate
library cover these techniques very well:
-Chapter 20 (specifically, pages 715-735) in Griffiths, et al.
-Chapter 9 (specifically, pages 301-335) in Hartwell, et al.
-Chapter 19 in Klug, et al.
-I have provided links on the course website that direct you
to sites that provide information on gel electrophoresis,
Southern blotting and northern blotting.
-I have placed a file on the course website that describes
gel electrophoresis and Southern blotting .
-The TA’s can also help-but please, not by email.
Today…
-DNA cloning
-DNA hybridization
-Sequencing
-Polymerase chain reaction
The Yeast Adenine Biosynthetic Pathway
ADE4 ADE5*
A
B
ADE8 ADE6 ADE7 ADE2 ADE1
X
D
E
F
Y
G
C
ADE13
ADE3*
H
red
ADE17
pigment
ADE16
I
ADE17
ADE16
J
ADE12
K
ADE13
AMP
Given that many different genes are involved in adenine
biosynthesis, what do all of these enzymes “look” like?
--how are they different?
--what is the sequence of amino acids?
--what is their 3-D structure?
--how do the enzymes work?
--do humans have the same enzymes as yeast?
Bigger question:
How can we figure out what any gene actually encodes?
How can we get our hands on the piece of DNA so that we
can sequence it and determine the amino acid sequence of
the protein?
DNA cloning
Dolly
Clone: cells or organisms
that are genetically identical
because they are related by
non-sexual reproduction—i.e.,
by mitosis not meiosis.
Copy Cat
DNA clone: identical copies
yeast
of a DNA fragment, usually
generated by propagation in
bacteria
Each colony is >106 identical cells.
DNA cloning—the idea
A tiny fraction of a genome:
1. Break the genome into gene-sized pieces
2. Introduce DNA into E. coli
3. Culture each bacterium
separately to generate clones
4. Study each gene in
isolation
Problems to solve:
» How can the new DNA survive and replicate in E. coli?
» How can we “select” for bacteria with the new DNA?
» How do we find the one bacterium with the gene we
want to study--the one with the ADE2 gene?
Plasmids as vectors for transformation
Non-essential DNA
molecules that can
replicate independently of
the chromosome
Selectable marker
AmpR or TetR
ori
because they have an
origin of replication
A gene that confers resistance
to a drug such as ampicillin or
tetracycline.
For cloning: “recombine” the foreign DNA into a plasmid.
In principal, vectors solve two of the problems—maintenance
& selection.
But how does cloning work in practice?
Restriction endonucleases
Recognize and cut specific sequences in DNA
Restriction enzyme EcoRI:
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
EcoRI site in DNA:
--GACTGAATTCTGATC---CTGACTTAAGACTAG-Reversed by DNA ligase
--GACTG
--CTGACTTAA
AATTCTGATC-GACTAG--
sticky ends
Using EcoRI to make recombinant DNA
(Watson & Crick strands shown)
cut with EcoRI
partial digest with EcoRI
--GACTG
--CTGACTTAA
AATTCTGATC-GACTAG--
sticky ends
Making a DNA library
Tetracycline-sensitive E. coli
DNA ligase to repair
phosphodiester
backbone
transformation
(CaCl2, heat shock)
This is a
clone
tetracycline plate
A DNA library
Each clone may have a different piece
of the“foreign” genome in it in the
form of a recombinant DNA molecule
“Genomic DNA library”:
• start with genomic DNA
• expect all the chromosomal DNA to be represented
• expect about equal representation of all sequences
Practice question
The yeast genome is ~12,000kb and the ADE2 gene is ~2kb.
1. If the average insert size in the yeast genomic library
is 12 kb, what fraction of the E. coli colonies are expected
1/1000
to contain the ADE2 gene? ___________
2. If you looked at 10,000 colonies how many are
10
expected to contain the ADE2 gene? ___________
3. At 200 colonies per plate, how many plates would be
needed to ensure that you have 10 different clones with the
50
ADE2 gene? ___________
This principle is called “coverage”--the number of times, on average,
that the genome (or particular gene) is represented in the library
Quiz Section this week and next:
Genetic Analysis in Caenorhabditis elegans
Goals: to illustrate the logic
of genetic analysis
Mutant analysis (AKA Genetic Analysis)
The use of mutants to understand how a
biological process normally works
• Start with “unknown” system (e.g., metabolic
pathway, embryonic development, behavior,
etc.)
Locomotion
• Generate mutations that affect the “unknown” unc-93,
unc-29,
system
(i.e., athat
“break” the “unknown”
Is the
mutation
loss-of-function
or gain-of-function?
The etc.
system)
answer
influences the interpretation of the phenotype
Paralyzed?
• Study the mutant phenotypes to reveal the
hyperactive?
functions of the genes
etc.
• Map the genes
• Identify the genes (more on this later) K+-channel, Ach
Receptor, etc
Loss-of-function mutations are usually recessive
Amount of
protein
activity
threshold
WT phenotype
mutant phenotype
AA
Aa
aa
Loss-of-function mutation simply decreases the activity of
the protein; threshold determines whether dominant or
not.
Gain-of-function mutations are usually dominant
threshold
Amount of
protein
activity
mutant phenotype
wt phenotype
aa
Aa
AA
One type of gain of gain-of-function mutation simply
increases the activity of the protein; threshold determines
whether dominant or not.
Example from QS - acetylcholine receptor (unc-29)
acetylcholine
+
outside
Wild-type
UNC-29
closed
open
inside
LOF - reduced gene function, GOF - increased gene function.
Phenotypes are often opposite.
What would be the phenotype
of a LOF mutation? - receptor
missing or defective
(equivalent to being always
closed)
What would be the phenotype of
a GOF mutation? - receptor
activated in absence of Ach
(equivalent to being always open)
A real ACh receptor (top view)
pentamer with four
types of subunit:
a2, b, d, g
All four types of subunits (a, b, d, g) are required for function.
What happens if any of them is knocked-out by mutation (LOF)?
What (probably) happens if one has GOF and another has LOF?
Example from QS: potassium channel gene unc-93
mostly
closed
unc-93 is expressed
in muscle (body and
egg-laying)
outside
Wild-type
UNC-93
inside
gate
K+
outside
unc-93
(cb1500)
mutant
How could you make new
LOF mutations in unc-93?
Excess K+ flow
prevents muscle
excitation
inside
K+
unc-93 is featured
again in QS 7.
Cloning by complementation!
Finding the yeast ADE2 gene in an E. coli library by
complementation of an E. coli adenine mutant phenotype.
Hypothesis: if the intermediates in these pathways are the
same, then the enzymes should be interchangeable.
E. coli purE+ gene
HC
N
CH
C
H2 N
N
R
(AIR)
*
Yeast ADE2 gene
-OOC
C
N
CH
C
H2 N
N
R
(CAIR)
*Both species need the same basic
enzyme for this step of adenine synthesis.
Cloning a gene based on its function
“cloning by complementation”
» Start with (recessive) E. coli purE- mutant strain
purE-
»Transform with DNA library made from (dominant) ADE2
wild-type yeast strain
»Ask: which plasmid clone rescues the mutant phenotype?
portion of
yeast genome
Library
ADE2
purE-
E. coli
no growth
purE-
purE-
no growth
-adenine plate
portion of
yeast genome
ADE2 must
be on this
plasmid!
purE-
growth!
-adenine plate
Practice Questions
Does that colony really have an ADE2
plasmid? Could it not be a purE+
revertant or a contaminant?
1. What phenotype would the E. coli have if the plasmid
was lost? (How to lose a plasmid? How exactly would
you do the experiment?)
back to being ade- (unable to grow on -adenine plates)!
2. What would they expect if the the original yeast
genomic DNA library had been made from ade2
mutant yeast? no colonies on the -adenine plate
3. Why is this type of functional cloning considered an
example of “complementation”?
4. Can this experiment be done using the human “ADE2”
gene?
The cell division cycle
Cell cycle regulation— MUCH of
what we know comes from genetic
analysis using temperaturesensitive yeast mutants
restrictive
temperature
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Can complement
some yeast
mitotic defects
with human DNA!
Lee Hartwell
2001 Nobel Prize
Working with DNA
What else might we
want to know about the
ADE2 gene?
QuickTime™ and a
TIFF (LZW) decompressor
are needed to see this picture.
What other questions
could we ask?
Wesequence
need to know
the
--what is the
of amino
sequence of the ADE2
acids?
gene3-D structure?
--what is their
--how do the enzymes work?
--how do the mutants differ from
But first, a little review. . . .
WT?
P
S
S
P
P
S
S
P
P
S
S
S
P
P
S
P
S
S
S
P
P
S
S
P
P
S
S
S
P
P
Review: DNA Structure
S
Nucleotide: repeating unit
phosphate
group
S
nitrogenous
base
P
O
P
S
OH
P
N
O
S
O
P
S
S
P
P
S
S
P
P
S
S
S
P
P
S
P
S
S
S
P
P
S
S
S
S
P
P
S
S
P
P
S
S
P
P
S
P
S
denaturation
breaks these
H-bonds
restriction
enzymes break these
covalent bonds
H C
2
4’
5’
N
O
C H
H
H C
C
3’
OH
H
C
1’
H
2’
deoxyribose
sugar
Review: DNA synthesis
DNA polymerase
requires . . .
template strand
3'
5'
A A T G C G A C A G T A G A
T T A C G C T G T C A T C T
OH
OH
3’ OH
primer
activated
nucleotides
5'
5'
G
OH
5'
5'
C
OH
T
5'
OH
A
to make
the
nascent
strand
OH
DNA denaturation (and renaturation)
The strands of DNA can come apart . . . and go back together!
..TAGAGTTCTGTACGTTCGTGATGCCATGCCTGGAAC..
..ATCTCAAGACATGCAAGCACTACGGTACGGACCTTG..
Heat to 95˚
(or NaOH)
Cool to ~65˚C (+ some salt)
5’..TAGAGTTCTGTACGTTCGTGATGCCATGCCTGGAAC..
+
..ATCTCAAGACATGCAAGCACTACGGTACGGACCTTG..5’
It’s reversible!
DNA denaturation (and renaturation)
In a complex mixture of different DNA molecules,
complementary ssDNA strands can find each other and
reform dsDNA.
denaturation
NaOH or
>95˚C
65˚C +
salt
renaturation
base pairing provides the specificity to reform dsDNA
DNA (or RNA) hybridization
Annealing single strands from different sources
..TAGAGTTCTGTACGTTCGTGATGCCATGCCTGGAAC..
..ATCTCAAGACATGCAAGCACTACGGTACGGACCTTG..
heat
5’..TAGAGTTCTGTACGTTCGTGATGCCATGCCTGGAAC..
+
..ATCTCAAGACATGCAAGCACTACGGTACGGACCTTG..5’
cool
5’GTACGTTCGTGAT
GTACGTTCGTGAT
..ATCTCAAGACATGCAAGCACTACGGTACGGACCTTG..5’
a “hybrid” DNA molecule.
Add excess
oligonucleotides
with a sequence
complementary
to one of the
strands
Some notes on hybridization
-Probe and target need not be the same length
both okay
-Small mismatches are tolerated if overall match length is
substantial
okay
-Small probes (25-30 bases) can work if conditions (salt,
temperature) are adjusted. Mismatches much more
significant for small probes.
DNA sequencing
Uses DNA polymerase
The sequence obtained is for the strand
being synthesized
Can determine ~600-800 bases in one “read”
With 3,000,000,000 bp in the human
genome, that’s a lot of sequencing
reactions!
Fred Sanger
Nobel Prize for
protein
sequencing
(1958) and for
DNA sequencing
(1980)!
Addition of a nucleotide to a 3’OH
new
nucleotide
at 3’ end
3’ end
activated
nucleotide
H2O
O
2
O
P
O-
O-
+
Reagents for Sanger DNA sequencing
• DNA fragment to be sequenced
heat
• 1 primer: short ssDNA complementary to ONE region
of the template DNA
• dNTPs
All 4: dATP, dCTP, dGTP, dTTP
• DNA polymerase from E. coli
Question: How does synthesizing a strand of DNA
give us information on its sequence of
bases??????
These are
• small amount of each dideoxyNTP
the key
• Split the mixture
among four tubes
C
A
G
T
• Add a small amount of 1 dideoxy nucleotide to each
reaction
What’s a dideoxy?
dye molecule
H H
A = green
T = red
C = blue
G = black
No 3’ OH. “chain terminator”
• Incubate until all synthesis is complete
• Analyze newly synthesized strands on a gel
QuickTime™ and a
Cinepak Codec by Radius decompressor
are needed to see this picture.
Products of the four synthesis reactions
“C”
reaction
Primer +
T C T T T T A A G C A T T C
A G A A >>>>>>>>>>
A GA A A A TTC
5
Primer
“A”
reaction
T C T T T T A A G C A T T C
A G A A >>>>>>>>>>
AGAAA
AGAAAA
A GA A A A TTCGTA
A GA A A A TTCGTA A
1
2
8
9
“G”
reaction
“T”
reaction
Primer +
T C T T T T A A G C A T T C
A G A A >>>>>>>>>>
A GA A A A TTCG
6
A GA A A A TTCGTA AG
10
T C T T T T A A G C A T T C
A G A A >>>>>>>>>>
AGAAAAT
3
A GA A A A TT
4
A GA A A A TTCGT
7
Analysis of the new strands on a gel
ddA ddC ddG ddT
+etc
+11
+10
+9
+8
+7
+6
+5
+4
+3
+2
+1
primer
-
mix
all 4
reactions
3’
A
T
G
G
A
C
T
T
A
5’-A
+
Read bottom up
Laser scan of a sequencing gel
ssDNA fragments run past a laser where the color is detected.
Sequencing our clone
The vector sequence is known; insert is to be sequenced
 use vector sequence for initial sequencing primers…
Ampr
ori end
primer
ori
vector
Amp end
primer
insert
+ insert
ADE2 gene
then use the new sequence info to extend the sequencing
What if . . .
. . . you wanted to know what was
different about the sequence of
the ADE2 gene from an ade2
mutant strain?
ori
ampR
vector
insert
Do we need to start with a library
from an ade2 mutant?
ADE2 gene
Not necessarily; if we already know the sequence of
ADE2 we can use PCR to obtain DNA for the ade2
mutant allele.
The polymerase chain reaction
Uses DNA polymerase
Makes unlimited quantities of the DNA of interest
Only requires a single template molecule—very sensitive
Uses two DNA primers
Reagents for PCR
1. target DNA
genomic DNA
2. Two specific primers—
AAGT
CACG
“ left” and “right”
short (17-20 nt) oligonucleotides that are complementary
to sequences on either side of the region of interest
3. All 4 dNTPs
dATP, dCTP, dGTP, dTTP
4. DNA polymerase from the heat-loving bacterium,
Thermus aquaticus doesn’t die at 95°C!
DNA products at successive cycles of PCR
starting dsDNA
denature
anneal
synthesize
cycle 1
cycle 3
repeat
cycle 2
cycle 4
After 4 cycles, half of
the products are DNA
fragments of a specific
size—the size of the
DNA that lies between
the two primers. By 30
cycles there would be
228 of the DNA
molecules from the
initial DNA molecule
Primers for PCR
…must “point inward” to the region to be amplified
5’  3’
“forward
primer”
5’
3’
region to be amplified
3’
5’
“reverse
primer”
Primers for PCR
…must “point inward” to the region to be amplified
5’  3’
region to be amplified
5’
3’
3’
5’
AACGTATTTTAACGATTTCGATATCCAGTTTAATTCCGAGTGTA
TTGCATAAAATTGCTAAAGCTATAGGTCAAATTAAGGCTCACAT
Primers for PCR
…must “point inward” to the region to be amplified
5’  3’
region to be amplified
5’
3’
3’
5’
AACGTATTTTAACGATTTCGATATCCAGTTTAATTCCGAGTGTA
TTGCATAAAATTGCTAAAGCTATAGGTCAAATTAAGGCTCACAT
Primers for PCR
…must “point inward” to the region to be amplified
5’  3’
region to be amplified
5’
3’
3’
5’
AACGTATTTTAACGATTTCGATATCCAGTTTAATTCCGAGTGTA
TTGCATAAAATTGCTAAAGCTATAGGTCAAATTAAGGCTCACAT5’
5’AACGTATTTTAACGATTTCGATATCCAGTTTAATTCCGAGTGTA
TTGCATAAAATTGCTAAAGCTATAGGTCAAATTAAGGCTCACAT
Practice question
DNA sequence from exon 1 of the huntingtin gene
5’
GTCCCTCAAGTCCTTCCAGCAGCAGCAGCTTGAGCCGCCAC
CGC
A. What
6 nucleotides from the gray boxes could be used
CAGGGAGTTCAGGAAGGTCGTCGTCGTCGTTGTCGGCGGTG
as primers
GCG to amplify the region of the genome shown
above?
“left (forward) primer”? 5’ ________3’
CCCTCA
GTGGCG
“right (reverse) primer”? 5’ ________3’
B. What size will the DNA fragment be after PCR
amplification using the two primers above? 33 base pairs
C. Huntington’s disease is a dominant autosomal trait that is
fatal. It usually strikes in middle age, by which time affected
individuals may have already passed on the disease allele.
Individuals have different numbers of the triplet CAGs in the
first exon in the “huntingtin” gene.
5’
GTCCCTCAAGTCCTTCCAGCAGCAGCAGCTTGAGCCGCCAC
CGC
CAGGGAGTTCAGGAAGGTCGTCGTCGTCGAACTCGGCGGT
GGCG
How many CAG repeats are present in this allele? 4
If the CAG were in the coding portion of exon 1, how
would this sequence appear in the huntingtin protein?
…S L K S F Q Q Q Q L E P P P…
D. Below is a gel in which the PCR fragments of different
individuals are shown. The region amplified used the
same primers as in part A. Why do people have 2 bands?
Because each individual represented is heterozygous for a
huntingtin gene with a different number of CAG repeats
Triplet
repeat
number
65
50
35
20
5
no disease
disease
E. What is the
correlation
between the
number of repeats
and the presence
of disease?
individuals with a
single Huntingtin
allele of more than
~ 35 CAG repeats
display the disease
phenotype.
F. Examine the pedigree below and determine the probability of disease
among the 3 females in generation III. Notice that the middle branch of
the family refused DNA testing. Numbers (i.e. 7,21) refer to the number
of CAG repeats.
45 years old
40
7,21
7,46 11,15
46,11
36
5
5
14,46 12,12 26
12,46 7,26
5
12,26
G. Discussion questions: Would you want test results if you were II-5 or II-6?
Results on the previous page indicate that individuals with a single Huntingtin
allele of more than ~ 35 CAG repeats will display the disease phenotype.
Individual II-5 will have this disease, whereas individual II-6 will not.
Would you want your child, III-3, tested if you weren’t willing to be tested?
What are the different concerns of each individual? These are personal
choices, and the answers given for the other questions should provide
sufficient information for you to consider these issues.