U.S. patent application number 16/017570 was filed with the patent office on 2018-12-20 for compositions of toehold primer duplexes and methods of use.
This patent application is currently assigned to President and Fellows of Harvard College. The applicant listed for this patent is President and Fellows of Harvard College. Invention is credited to Peng Yin, David Yu Zhang.
Application Number | 20180363045 16/017570 |
Document ID | / |
Family ID | 44913435 |
Filed Date | 2018-12-20 |
United States Patent
Application |
20180363045 |
Kind Code |
A1 |
Zhang; David Yu ; et
al. |
December 20, 2018 |
COMPOSITIONS OF TOEHOLD PRIMER DUPLEXES AND METHODS OF USE
Abstract
Provided herein are primers and primer systems having improved
specificity and kinetics over existing primers, and methods of use
thereof.
Inventors: |
Zhang; David Yu; (Houston,
TX) ; Yin; Peng; (Brookline, MA) |
|
Applicant: |
Name |
City |
State |
Country |
Type |
President and Fellows of Harvard College |
Cambridge |
MA |
US |
|
|
Assignee: |
President and Fellows of Harvard
College
Cambridge
MA
|
Family ID: |
44913435 |
Appl. No.: |
16/017570 |
Filed: |
June 25, 2018 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
14553165 |
Nov 25, 2014 |
10036059 |
|
|
16017570 |
|
|
|
|
13882231 |
Jul 1, 2013 |
9284602 |
|
|
PCT/US2011/058178 |
Oct 27, 2011 |
|
|
|
14553165 |
|
|
|
|
61407291 |
Oct 27, 2010 |
|
|
|
Current U.S.
Class: |
1/1 |
Current CPC
Class: |
C12Q 1/6832 20130101;
C07H 21/02 20130101; C12Q 1/6848 20130101; C12Q 1/686 20130101;
C12Q 1/6853 20130101; C12Q 1/6832 20130101; C12Q 2537/1373
20130101; C12Q 2537/161 20130101; C12Q 2525/161 20130101; C12Q
1/6832 20130101; C12Q 2537/1373 20130101; C12Q 2525/301 20130101;
C12Q 2525/161 20130101; C12Q 1/6848 20130101; C12Q 2537/1373
20130101; C12Q 2525/301 20130101; C12Q 2525/161 20130101; C12Q
1/6848 20130101; C12Q 2537/1373 20130101; C12Q 2537/161 20130101;
C12Q 2525/161 20130101 |
International
Class: |
C12Q 1/6853 20060101
C12Q001/6853; C07H 21/02 20060101 C07H021/02; C12Q 1/6848 20060101
C12Q001/6848; C12Q 1/6832 20060101 C12Q001/6832; C12Q 1/686
20060101 C12Q001/686 |
Goverment Interests
FEDERALLY SPONSORED RESEARCH
[0002] This invention was made with government support under
OD007292 awarded by National Institutes of Health, under
N00014-11-1-0914 and N00014-10-1-0827 awarded by U.S. Department of
Defense, Office of Naval Research, and under CCF 1054898 awarded by
National Science Foundation. The government has certain rights in
the invention.
Claims
1. A partially double-stranded primer comprised of first and second
nucleic acid strands arranged into (1) one double-stranded
target-non-specific region, (2) one double-stranded target-specific
region, and (3) one single-stranded target-specific region
contributed to by the first nucleic acid strand, wherein the
double-stranded target-non-specific region has a standard free
energy approximately equal to the standard free energy for the
single-stranded target-specific region bound to a target nucleic
acid.
2. The partially double-stranded primer of claim 1, wherein the
second nucleic acid strand comprises a non-extendable nucleotide at
its 3' end and/or the first nucleic acid strand comprises a
non-natural nucleotide at or near the 3' end of its
target-non-specific region.
3. The partially double-stranded primer of claim 2, wherein the
non-extendable nucleotide is a non-natural nucleotide or a dideoxy
nucleotide, optionally wherein the non-natural nucleotide is iso-C,
iso-G or deoxyuridine.
4. (canceled)
5. The partially double-stranded primer of claim 1, wherein the
double-stranded target non-specific region is about 4-21
nucleotides in length and/or the single-stranded target-specific
region is about 4-20 nucleotides in length.
6. (canceled)
7. The partially double-stranded primer of claim 1, wherein the
first and second nucleic acid strands are comprised of DNA or
RNA.
8. A system comprising a nucleic acid target, a polymerase, and a
partially double-stranded primer comprising a first and a second
nucleic acid strand arranged into (1) one double-stranded
target-non-specific region, (2) one double-stranded target-specific
region, and (3) one single-stranded target-specific region
contributed to by the first nucleic acid strand.
9. The system of claim 8, wherein the nucleic acid target is a
single-stranded.
10. The system of claim 8, wherein the nucleic acid target is DNA
or RNA.
11. The system of claim 8, wherein the system comprises a plurality
of different partially double-stranded primers.
12. The system of claim 8, wherein the system comprises at least
two partially double-stranded primers which together can be used to
amplify a region of the nucleic acid target.
13. The system of claim 8, wherein the nucleic acid target
comprises repetitive sequence, secondary structure and/or high GC
content.
14. The system of claim 8, wherein the nucleic acid target is
present in a plurality of different nucleic acid targets,
optionally wherein the nucleic acid target is present as a single
copy or in low copy in a plurality of different nucleic acid
targets.
15. (canceled)
16. A method comprising contacting a partially double-stranded
primer of claim 1 to a sample, and detecting hybridization of the
primer to a target in the sample.
17. The method of claim 16, wherein the partially double-stranded
primer is labeled with a detectable moiety, optionally wherein the
detectable moiety comprises a fluorophore or a radioisotope.
18. (canceled)
19. The method of claim 16, wherein the target is present as a
single copy in the sample.
20. A method comprising hybridizing a single-stranded
target-specific region of a first strand of a partially
double-stranded primer of claim 1 to a nucleic acid target, thereby
dissociating the first strand of the primer from a second strand of
the primer, and extending the first strand at its 3' end, in a
target-complementary manner, in the presence of a polymerase.
21. A method comprising performing a nucleic acid synthesis
reaction in the presence of a nucleic acid target, a polymerase,
and one or more partially double-stranded primers of claim 1.
22. The method of claim 21, wherein the nucleic acid synthesis
reaction is a nucleic acid amplification reaction, optionally a
polymerase chain reaction (PCR), or wherein the nucleic acid
synthesis reaction is a transcription reaction, optionally a
reverse transcription reaction.
23.-26. (canceled)
27. A kit comprising one or more partially double-stranded primers
of claim 1, and one or more nucleic acid synthesis reagents.
28.-29. (canceled)
30. A kit comprising a first single-stranded nucleic acid in a
first container, and a second single-stranded nucleic acid that is
complementary to a region of the first single-stranded nucleic
acid, in a second container, wherein, when the first and second
single-stranded nucleic acids are hybridized to each other, a
partially double-stranded nucleic acid is formed that comprises (1)
a double-stranded target-non-specific region, (2) a double-stranded
target-specific region, and (3) a single-stranded target-specific
region contributed to by the first nucleic acid, wherein the first
single-stranded nucleic acid comprises a non-natural nucleotide
and/or the second single-stranded nucleic acid comprises a
non-extendable nucleotide at its 3' end.
31.-33. (canceled)
34. A single stranded primer that partially self-hybridizes to form
(1) one or more double-stranded target-non-specific region, (2) one
or more double-stranded target-specific region, (3) one or more
single-stranded target-specific region, and (4) one or more hairpin
loop regions, wherein the one or more double-stranded
target-non-specific region has a concentration-adjusted standard
free energy approximately equal to the concentration-adjusted
standard free energy for the one or more single-stranded
target-specific region bound to a target nucleic acid.
35. A method of performing a multiplexed nucleic acid amplification
reaction comprising amplifying multiple unique nucleic acid
molecules using a primer of claim 1.
Description
RELATED APPLICATIONS
[0001] This application is a continuation application of U.S.
application Ser. No. 14/553,165, filed Nov. 25, 2014 which is a
divisional application of U.S. application Ser. No. 13/882,231,
filed Jul. 1, 2013, which is a national stage filing under 35
U.S.C. .sctn. 371 of international application number
PCT/US2011/058178, filed Oct. 27, 2011, which was published under
PCT Article 21(2) in English and claims priority under 35 U.S.C.
.sctn. 119(e) from U.S. provisional application Ser. No.
61/407,291, filed Oct. 27, 2010, each of which is herein
incorporated by reference in its entirety.
FIELD OF INVENTION
[0003] The embodiments described herein relate to partially
double-stranded nucleic acid primers and their use in, for example,
nucleic acid synthesis methods.
BACKGROUND OF INVENTION
[0004] Nucleic acids are vital information carriers of biology, and
the detection, amplification, and identification of nucleic acids
has formed the basis for a vast sector of biotechnology. In
particular, methods such as the polymerase chain reaction (PCR)
(Saiki et al. Science 239, 487-491 (1988)) have been used all over
the world as a reliable means of amplifying DNA, while reverse
transcriptase methods have been used to probe the transcriptome.
The operation of DNA polymerase, RNA polymerase, and reverse
transcriptase typically uses a short oligonucleotide fragment known
as a primer to direct the portion of a long target to be replicated
or transcribed.
[0005] Although the specificity of nucleic acid hybridization is
frequently sufficient to direct enzymatic activity for most target
sequences, targets with repetitive sequence, secondary structure,
and high G/C content are difficult to amplify with high yield.
Furthermore, high backgrounds of other nucleic acids can frequently
lead to incorrect amplification, such as in the case of single copy
human genome amplification. Finally, multiplexed amplification,
such as from a DNA chip pool, can be difficult to achieve due to
the large number of orthogonal amplification reactions that must
occur simultaneously. Similar problems exist for transcription and
for reverse transcription.
SUMMARY OF INVENTION
[0006] The hybridization of nucleic acids is specific at the single
nucleotide level. For example, cytosine preferentially binds to
guanine, and adenine preferentially binds to thymine or uracil.
However, for nucleic acid molecules composed of many nucleotides,
the specificity of hybridization is reduced, and nucleic acids with
near complementary sequences will bind almost as strongly as
perfect complementary sequences. Given a heterogeneous mixture of
target nucleic acid of interest ("targets") and nucleic acids with
sequences that differ from the target by, for example, one
nucleotide ("spurious targets"), a significant portion of primers
complementary to the target will hybridize instead to spurious
targets.
[0007] Because the correct targets bind with a slightly higher
affinity to a primer having a complementary sequence, given enough
time, correct targets will eventually displace spurious targets in
binding to a complementary primer. Though, this process is very
slow, and would take months at the nanomolar concentrations typical
of many experimental systems.
[0008] In order to mitigate the propensity of complementary primers
binding to spurious targets it is often necessary to operate
nucleic acid primer-based experimental systems near the melting
temperature of the primer/target complex. Because this melting
temperature is generally much higher than the temperature at which
most biological systems naturally operate, this high temperature
requirement precludes the experimental system from operating under
normal biological conditions. Additionally, because the melting
temperature will vary from target to target, the requisite narrow
temperature range for such experimental systems restricts the
simultaneous use of multiple primers to detect a plurality of
targets.
[0009] Provided herein are primers (e.g., primer duplexes and
hairpin primer duplexes) that, in embodiments, are able to rapidly
bind to nucleic acid targets with high specificity at a broad range
of temperatures. These primers may be used, for example, in nucleic
acid synthesis reactions (e.g., PCR), microarray analyses, imaging
methods, and single nucleotide polymorphism (SNP) analyses. The
primers may also be used in nucleic acid detection assays where
they function primarily as "probes". Accordingly, regardless of the
application, the primers of the invention may be referred herein to
interchangeably as "probes". Regardless of the application (or
method of use), the primers of the invention overcome problems
commonly experienced when specific hybridization is required in the
presence of spurious targets, and more particularly when such
spurious targets are present in excess.
[0010] The primers provided herein possess several unique
properties that facilitate their use in combination with enzymes
that act upon nucleic acids. First, the primers are
thermodynamically designed to bind with high specificity to only
their intended targets, and they show high discrimination against
even single-nucleotide changes. Second, the specificity of the
primers enables PCR, transcription, and reverse transcription of
traditionally difficult targets, such as those having significant
sequence repetition, secondary structure, and/or high G/C content.
The high degree of specificity that can be achieved with these
primers further enables accurate processing even in high nucleic
acid backgrounds such as single-copy human genome amplification.
Third, the partially double-stranded nature of the primers means
that they are unlikely to interact with each other, and
consequently they are amenable to highly multiplexed replication,
transcription, and/or reverse transcription reactions. Finally, the
hybridization of these primers to targets is relatively robust to
temperature and salinity, and therefore the primers may be of
significantly greater length than standard primers, which in turn
provides further enhanced specificity and primer design
flexibility.
[0011] In some embodiments, the nucleic acid primers discussed here
are rationally designed so that the standard free energy for
hybridization (e.g., theoretical standard free energy) between the
specific target nucleic acid molecule and the primer is close to
zero, while the standard free energy for hybridization between a
spurious target (even one differing from the specific (actual)
target by as little as a single nucleotide) and the primer is high
enough to make their binding unfavorable by comparison. The
inventors accomplished this by designing a primer having (a) a
"toehold" single-stranded target specific region, (b) a "branch
migration" double-stranded target specific region, and (c) a
"balance" double-stranded target non-specific region.
[0012] In some embodiments, the primer may be comprised of a single
strand that self hybridizes to form double-stranded regions. In
some embodiments, the primer may be comprised of two strands. As an
example of the latter embodiment, the primers is comprised of a
first or complement strand and a second or protector strand. The
complement strand, as its name implies, is partially complementary
to the target of interest and will hybridize to the target. The
protector strand, on the other hand, is designed to not hybridize
to the target and rather to compete with the target (or spurious
target) for binding of the complement.
[0013] The "toehold" region is present in the complement strand, is
complementary to a target sequence and not complementary to a
protector region. The "balance" region in the complement strand
(i.e., the complement balance region) is complementary to part of
the protector (i.e., to the protector balance region) and not
complementary to target sequence. The hybridization energy of
toehold to target is matched or nearly matched to the hybridization
energy of complement balance region to protector balance region
(adjusting for various other thermodynamic considerations). The
sequence of the balance region is rationally designed to achieve
this matching under desired conditions of temperature and primer
concentration. As a result, the equilibrium for the actual target
and primer rapidly approaches 50% target:primer::protector:primer
(or whatever ratio is desired), while equilibrium for the spurious
target and primer greatly favors protector:primer. The abundant
free primer in the presence of specific target facilitates its
highly sensitive and specific detection.
[0014] In some embodiments, the nucleic acid primers discussed here
are designed so that the concentration-adjusted free energy for
hybridization between the specific target nucleic acid molecule and
the primer is close to zero, while the concentration-adjusted
standard free energy for hybridization between a spurious target
and the primer is high enough to make their binding unfavorable by
comparison. "Concentration-adjusted free energy," as used herein,
refers to .DELTA.G +(.DELTA.n)RT ln(c), where R is the universal
gas constant, T is temperature in Kelvins, c is concentration of
the primer, and .DELTA.n is the change in the number of molecules
through the course of the reaction (.DELTA.n=-1 for standard
hybridization, .DELTA.n=0 for two-stranded primer
hybridization.
[0015] Aspects of the invention therefore provide the primer
compositions comprising the primers, compositions comprising the
complement and protector strands (for example in kits), methods of
making the primers, and methods of using the primers in assays or
reactions including without limitation nucleic acid synthesis
and/or detection assays or reactions.
[0016] Thus, in one aspect, the invention provides a partially
double-stranded primer comprised of (a) first nucleic acid strand
(also referred to herein as a complement strand) and second nucleic
acid strand (also referred to herein as a protector strand),
wherein the first and second strands when hybridized to each other
are arranged into (1) a double-stranded target-non-specific
(balance) region, (2) a double-stranded target-specific (branch
migration) region, and (3) a single-stranded target-specific
(toehold) region contributed to by the first nucleic acid strand,
wherein the double-stranded target-non-specific region has a
standard free energy approximately equal to the standard free
energy for the single-stranded target-specific region bound to a
target nucleic acid. The partially double-stranded primer may
comprise one or more double-stranded target-non-specific regions,
one or more double-stranded target-specific regions, and/or one or
more single-stranded target-specific regions. In some embodiments,
the partially double-stranded primer may comprise one or two
double-stranded target-non-specific (balance) regions, one or more
double-stranded target-specific (branch migration) regions, and/or
one or more single-stranded target-specific (toehold) regions.
[0017] In some embodiments, the second nucleic acid strand
comprises a non-extendable nucleotide at its 3' end and/or the
first nucleic acid strand comprises a non-natural nucleotide at or
near the 3' end of its target-non-specific region. In some
embodiments, the non-extendable nucleotide is a non-natural
nucleotide or a dideoxy nucleotide. In some embodiments, the
non-natural nucleotide is iso-C, iso-G or deoxyuridine. These
examples are intended as non-limiting.
[0018] In some embodiments, the double-stranded target non-specific
region is about 4-20 nucleotides in length. The double-stranded
target non-specific region may be longer than 20 nucleotides, such
as for example 4-21 nucleotides in length. In some embodiments, it
may be about 12-192 nucleotides in length.
[0019] In some embodiments, the single stranded target specific
region is about 4-20 nucleotides in length. The single stranded
target specific region may be longer than 20 nucleotides, such as
for example 4-21 nucleotides in length. In some embodiments, it may
be about 12-192 nucleotides in length.
[0020] In some embodiments, the double-stranded target non-specific
region and the single stranded target specific region have similar
or identical proportions of A/T nucleotides (and typically similar
or identical proportions of G/C nucleotides). In some embodiments,
the first and second nucleic acid strands are comprised of DNA or
RNA or a combination thereof.
[0021] In another aspect, the invention provides a single-stranded
primer that partially self-hybridizes to form (1) a double-stranded
target-non-specific region, (2) a double-stranded target-specific
region, (3) single-stranded target-specific region, and (4) a
hairpin loop region, wherein the one or more double-stranded
target-non-specific region has a concentration-adjusted standard
free energy approximately equal to the concentration-adjusted
standard free energy for the one or more single-stranded
target-specific region bound to a target nucleic acid.
[0022] In another aspect, the invention provides a composition
comprising the any of the afore-mentioned primers. The composition
may further comprise a carrier such as a buffer, optionally
comprising a preservative, one or more salts, etc. The composition
may also comprise an excess of single-stranded protector strands,
wherein each protector strand comprises a protector balance region
and a protector branch migration region. The single stranded
protector strands may each comprise a non-extendable and/or
non-naturally occurring nucleotide, preferably at its 3' end.
[0023] In another aspect, the invention provides a system
comprising a nucleic acid target, a polymerase, and any of the
foregoing primers. In some embodiments, the primer is a partially
double-stranded primer comprising a first and a second nucleic acid
strand arranged into (1) a double-stranded target-non-specific
region, (2) a double-stranded target-specific region, and (3) a
single-stranded target-specific region contributed to by the first
nucleic acid strand.
[0024] In some embodiments, the nucleic acid target is a
single-stranded. In some embodiments, the nucleic acid target is
DNA or RNA. In some embodiments, the nucleic acid target comprises
repetitive sequence, secondary structure and/or high GC content. In
some embodiments, the nucleic acid target is present in a plurality
of different nucleic acids. In some embodiments, the nucleic acid
target is present as a single copy or in low copy (e.g., less than
0.001%, less than 0.01%, less than 0.1%, or less than 1%) in a
plurality of different nucleic acids.
[0025] In some embodiments, the system comprises a plurality of any
of the foregoing primers such as a plurality of different partially
double-stranded primers. In some embodiments, the system comprises
at least two of the foregoing primers, such as at least two
partially double-stranded primers, which together can be used to
amplify a region of the nucleic acid target.
[0026] In another aspect, the invention provides a composition
comprising the any of the afore-mentioned systems. The composition
may further comprise a carrier such as a buffer, optionally
comprising a preservative, one or more salts, one or more enzymes
such as a polymerase, nucleotides suitable for nucleic acid
synthesis, etc. The composition may also comprise an excess of
single-stranded protector strands, wherein each protector strand
comprises a protector balance region and a protector branch
migration region. The single stranded protector strands may each
comprise a non-extendable and/or non-naturally occurring
nucleotide, preferably at its 3' end.
[0027] In another aspect, the invention provides a method
comprising contacting any of the foregoing primers, including any
of the foregoing partially double-stranded primers to a sample, and
detecting hybridization of the primer to a target in the
sample.
[0028] In some embodiments, the primer such as the partially
double-stranded primer is labeled with a detectable moiety. In some
embodiments, the detectable moiety comprises a fluorophore or a
radioisotope.
[0029] The target will typically be a nucleic acid. In some
embodiments, the target is a single-stranded nucleic acid. In some
embodiments, the target is DNA or RNA. In some embodiments, the
target is a nucleic acid that comprises repetitive sequence,
secondary structure and/or high GC content. In some embodiments,
the target is present in a plurality of different nucleic acids. In
some embodiments, the target is present as a single copy or in low
copy (e.g., less than 0.001%, less than 0.01%, less than 0.1%, or
less than 1%) in a plurality of different nucleic acids.
[0030] In some embodiments, this and other methods described herein
are performed at a temperature below the melting temperature of the
complement strand-target complex. In some embodiments, this and
other methods described herein are performed at a temperature
between and including room temperature up to and including
50.degree. C., or up to and including 40.degree. C., or up to and
including 30.degree. C. In some embodiments, this and other methods
described herein are performed at about 37.degree. C. In some
embodiments, this and other methods described herein are performed
in an excess of protector strand that comprises a protector balance
region and a protector branch migration region and that is
identical to the protector strand in the partially double-stranded
primer. In this and other methods described herein, the primer may
be any of the foregoing primers including the partially
double-stranded primers.
[0031] In another aspect, the invention provides a method
comprising hybridizing a single-stranded target-specific (toehold)
region of a first (complement) strand of any of the foregoing
partially double-stranded primers to a nucleic acid target, thereby
dissociating the first strand of the primer from the second
(protector) strand of the primer, and extending the first strand at
its 3' end, in a target-complementary manner, in the presence of a
polymerase.
[0032] In another aspect, the invention provides a method
comprising performing a nucleic acid synthesis reaction in the
presence of a nucleic acid target, a polymerase, and one or more of
the foregoing partially double-stranded primers.
[0033] In some embodiments, the nucleic acid synthesis reaction is
a nucleic acid amplification reaction. In some embodiments, the
nucleic acid amplification reaction is polymerase chain reaction
(PCR). In some embodiments, the nucleic acid synthesis reaction is
a transcription reaction. In some embodiments, the transcription
reaction is a reverse transcription reaction.
[0034] In some embodiments, two partially double-stranded primers
are used.
[0035] In another aspect, the invention provides a method of
performing a multiplexed nucleic acid amplification reaction
comprising amplifying multiple unique nucleic acid molecules using
any of the foregoing primers including the partially
double-stranded primer.
[0036] In another aspect, the invention provides a kit comprising
one or more (including a plurality) of any of the foregoing
partially double-stranded primers, and one or more nucleic acid
synthesis reagents such as enzymes, nucleotides, salts, EDTA, a
buffer, etc.
[0037] In some embodiments, the one or more nucleic acid synthesis
reagents is selected from the group consisting of a buffer,
nucleotides, and a polymerase.
[0038] In some embodiments, the kit further comprises an excess of
protector strand that is identical to the protector strand
comprised in the primer.
[0039] In some embodiments, the kit further comprises instructions
for use.
[0040] In another aspect, the invention provides a kit comprising a
first single-stranded (complement) nucleic acid in a first
container, and a second single-stranded (protector) nucleic acid
that is complementary to a region of the first single-stranded
nucleic acid, in a second container, wherein, when the first and
second single-stranded nucleic acids are hybridized to each other,
a partially double-stranded nucleic acid is formed that comprises
(1) a double-stranded target-non-specific region, (2) a
double-stranded target-specific region, and (3) a single-stranded
target-specific region contributed to by the first nucleic acid,
wherein the first single-stranded nucleic acid comprises a
non-natural nucleotide and/or the second single-stranded nucleic
acid comprises a non-extendable nucleotide at its 3' end.
[0041] In some embodiments, the kit further comprises instructions
for use. In some embodiments, the kit further comprises one or more
nucleic acid synthesis reagents such as those recited above. In
some embodiments, the one or more nucleic acid synthesis reagents
is selected from the group consisting of a buffer, nucleotides, and
a polymerase.
[0042] In some embodiments, the protector strand is provided in the
kit in an amount (e.g., a molar amount) that is greater than the
amount (e.g., a molar amount) of complement strand in the kit.
[0043] In some embodiments of the foregoing aspects and inventions,
particularly those relating to two strand primers, the nucleotide
sequence of the primer is selected such that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 -.DELTA.G.sub.3
|.ltoreq..DELTA.G.sub.R ,
wherein: .DELTA.G.sub.1 is the standard free energy of
hybridization of the protector balance region to the complement
balance region; .DELTA.G.sub.2 is the standard free energy of
hybridization of the protector balance region to the sequence
immediately adjacent in the first direction to the target nucleic
acid sequence, if any; .DELTA.G.sub.3 is the standard free energy
of hybridization of the toehold region to the second target nucleic
acid sequence; and .DELTA.G.sub.R is 3.5 kcal/mol.
[0044] In one aspect, provided herein is a primer duplex system
comprising a complement strand and a protector strand, wherein the
protector strand comprises a nucleic acid having: a protector
branch migration region having a first end, a second end, and a
sequence that corresponds to a first target nucleic acid sequence
having a first end and a second end, wherein the first end of the
protector branch migration region and the first end of the first
target nucleic acid sequence are either both 5' or else both 3';
and a protector balance region immediately adjacent to the first
end of the protector branch migration region having a sequence that
does not correspond to sequence immediately adjacent to the first
end of the first target nucleic acid sequence, if any; and the
complement primer comprises a nucleic acid having: a complement
branch migration region having a first end and a second end, and a
sequence that is complementary to the protector branch migration
region, wherein the first end of the complement branch migration
region and the first end of the first target nucleic acid sequence
are either both 5' or else both 3'; a toehold region that is:
immediately adjacent to the first end of the complement branch
migration region; and complementary to a second target nucleic acid
sequence that is immediately adjacent to the second end of the
first target nucleic acid sequence; and a complement balance region
that: is immediately adjacent to the second end of the complement
branch migration region; is complementary to the protector balance
region; and has a sequence such that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 -.DELTA.G.sub.3
|.ltoreq..DELTA.G.sub.R ,
wherein: .DELTA.G.sub.1 is the standard free energy of
hybridization of the protector balance region to the complement
balance region; .DELTA.G.sub.2 is the standard free energy of
hybridization of the protector balance region to the sequence
immediately adjacent in the first direction to the target nucleic
acid sequence, if any; .DELTA.G.sub.3 is the standard free energy
of hybridization of the toehold region to the second target nucleic
acid sequence; and .DELTA.G.sub.R is 3.5 kcal/mol.
[0045] In another aspect, provided herein is a primer duplex system
comprising a nucleic acid having a protector strand, a hairpin
region and a complement strand, wherein: the protector strand
comprises a protector branch migration region and a protector
balance region, wherein: the protector branch migration region has:
a first end; a second end; and a sequence that corresponds to a
first target nucleic acid sequence having a first end and a second
end, wherein the first end of the protector branch migration region
and the first end of the first target nucleic acid sequence are
either both 5' or else both 3'; and the protector balance region
has: a first end; a second end immediately adjacent to the first
end of the protector branch migration region; and a sequence that
does not correspond to sequence immediately adjacent to the first
end of the first target nucleic acid sequence, if any; the hairpin
region comprises: a first end; and a second end immediately
adjacent to the first end of the protector balance region; and the
complement strand comprises a complement balance region, a
complement branch migration region, and a toehold region, wherein:
the complement balance region has: a first end; a second end
immediately adjacent to the first end of the hairpin region; and a
sequence that is complementary to the protector balance region; the
complement branch migration region has: a first end; a second end
immediately adjacent to the first end of the complement balance
region; and a sequence that is complementary to the protector
branch migration region, wherein the first end of the complement
branch migration region and the first end of the first target
nucleic acid sequence are either both 5' or else both 3'; the
toehold region is: immediately adjacent to the first end of the
complement branch migration region; and complementary to a second
target nucleic acid sequence that is immediately adjacent to the
second end of the first target nucleic acid sequence; and the
complement balance region has a sequence such that: |.DELTA.G.sub.1
-.DELTA.G.sub.2 -.DELTA.G.sub.3 +.DELTA.G.sub.4 +RT ln
(c)|.ltoreq..DELTA.G.sub.R , wherein: .DELTA.G.sub.1 is the
standard free energy of hybridization of the protector balance
region to the complement balance region; .DELTA.G.sub.2 is the
standard free energy of hybridization of the protector balance
region to the sequence immediately adjacent in the first direction
to the target nucleic acid sequence, if any; and .DELTA.G.sub.3 is
the standard free energy of hybridization of the toehold region to
the second target nucleic acid sequence; .DELTA.G.sub.4 is the
standard free energy of confinement of the hairpin region; R is the
ideal gas constant; T is the temperature at which the primer duplex
system is to be used; c is the concentration at which the primer
duplex system is to be used; and .DELTA.G.sub.R is 3.5
kcal/mol.
[0046] In yet another aspect, provided herein system having, in 3'
to 5' order, a first protector strand, a first hairpin region, a
complement strand, a second hairpin region and a second protector
strand, wherein: the first protector strand comprises: a first
protector branch migration region having a sequence that
corresponds to a first target nucleic acid sequence; and a first
protector balance region that: is immediately 5' to the first
protector branch migration region; and has a sequence that does not
correspond to sequence immediately 5' to the first target nucleic
acid sequence, if any; the first hairpin region is immediately 5'
to the first protector balance region; the complement strand
comprises: a first complement balance region that: is immediately
5' to the first hairpin region; and has a sequence complementary to
the sequence of the first protector balance region; a first
complement branch migration region that: is immediately 5' to the
first complement balance region; and has a sequence complementary
to a first protector branch migration region; a toehold region
that: is immediately 5' to the first complement branch migration
region; and has a sequence that is complementary to a second target
nucleic acid sequence that is immediately 3' to the first target
nucleic acid sequence; a second complement branch migration region
that: is immediately 5' to the toehold region; and has a sequence
complementary to a third target nucleic acid sequence that is
immediately 3' to the second target nucleic acid sequence; a second
complement balance region that: is immediately 5' to the second
complement branch migration region; has a sequence that is not
complementary to sequence immediately 3' to the third target
nucleic acid sequence, if any; the second hairpin region is
immediately 5' to the second complement balance region; and the
second protector strand comprises: a second protector balance
region that: is immediately 5' to the second hairpin region; and
has a sequence complementary to the second complement balance
region; and a second protector branch migration region that: is
immediately 5' to the second protector balance region; and has a
sequence complementary to the second complement branch migration
region; wherein the first complement balance region and the second
complement balance region have sequences such that: |.DELTA.G.sub.1
-.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G.sub.4 -.DELTA.G.sub.5
+.DELTA.G.sub.6 +.DELTA.G.sub.7 +RT ln (c)| .DELTA.G.sub.R ,
wherein: .DELTA.G.sub.1 is the standard free energy of
hybridization of the first protector balance region to the first
complement balance region; .DELTA.G.sub.2 is the standard free
energy of hybridization of the first complement balance region to
the sequence immediately 5' to the first target nucleic acid
sequence, if any; .DELTA.G.sub.3 is the standard free energy of
hybridization of the second protector balance region to the second
complement balance region; .DELTA.G.sub.4 is the standard free
energy of hybridization of the second complement balance region to
the sequence immediately 3' to the third target nucleic acid
sequence, if any; .DELTA.G.sub.5 is the standard free energy of
hybridization of the toehold region to the second target nucleic
acid sequence; .DELTA.G.sub.6 is the standard free energy of
confinement of the first hairpin region; .DELTA.G.sub.7 is the
standard free energy of confinement of the second hairpin region; R
is the ideal gas constant; T is the temperature at which the primer
duplex system is to be used; and c is the concentration at which
the primer duplex system is to be used; and .DELTA.G.sub.R is 3.5
kcal/mol.
[0047] In still another aspect, provided herein is a primer duplex
system comprising a hairpin primer and a protector strand, wherein:
the hairpin primer comprises a nucleic acid having: a first
protector strand having: a first protector branch migration region
having a sequence that corresponds to a first target nucleic acid
sequence; and a first protector balance region that: is immediately
5' to the first protector branch migration region; and has a
sequence that does not correspond to sequence immediately 5' to the
first target nucleic acid sequence, if any; a hairpin region
immediately 5' to the first protector balance region; a complement
strand having: a first complement balance region that: is
immediately 5' to the first hairpin region; and has a sequence
complementary to the sequence of the first protector balance
region; a first complement branch migration region that: is
immediately 5' to the first complement balance region; and has a
sequence complementary to a first protector branch migration
region; a toehold region that: is immediately 5' to the first
complement branch migration region; and has a sequence that is
complementary to a second target nucleic acid sequence that is
immediately 3' to the first target nucleic acid sequence; a second
complement branch migration region that: is immediately 5' to the
toehold region; and has a sequence complementary to a third target
nucleic acid sequence that is immediately 3' to the second target
nucleic acid sequence; a second complement balance region that: is
immediately 5' to the second complement branch migration region;
has a sequence that is not complementary to sequence immediately 3'
to the third target nucleic acid sequence, if any; and the
protector comprises a nucleic acid having: a second protector
strand having: a second protector balance region that has a
sequence complementary to the second complement balance region; and
a second protector branch migration region that: is immediately 5'
to the second protector balance region; and has a sequence
complementary to the second complement branch migration region;
wherein the first complement balance region and the second
complement balance region have sequences such that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G.sub.4
-.DELTA.G.sub.5 +.DELTA.G.sub.6 |.ltoreq..DELTA.G.sub.R ,
wherein: .DELTA.G.sub.1 is the standard free energy of
hybridization of the first protector balance region to the first
complement balance region; .DELTA.G.sub.2 is the standard free
energy of hybridization of the first complement balance region to
the sequence immediately 5' to the first target nucleic acid
sequence, if any; .DELTA.G.sub.3 is the standard free energy of
hybridization of the second protector balance region to the second
complement balance region; .DELTA.G.sub.4 is the standard free
energy of hybridization of the second complement balance region to
the sequence immediately 3' to the third target nucleic acid
sequence, if any; .DELTA.G.sub.5 is the standard free energy of
hybridization of the toehold region to the second target nucleic
acid sequence; .DELTA.G.sub.6 is the standard free energy of
confinement of the hairpin region; and .DELTA.G.sub.R is 3.5
kcal/mol.
[0048] In a further aspect, provided herein is a primer duplex
system comprising a protector strand and a hairpin primer, wherein:
the protector strand comprises a nucleic acid having: a first
protector strand having: a first protector branch migration region
having a sequence that corresponds to a first target nucleic acid
sequence; and a first protector balance region that: is immediately
5' to the first protector branch migration region; and has a
sequence that does not correspond to sequence immediately 5' to the
first target nucleic acid sequence, if any; the hairpin primer
comprises a nucleic acid having: a complement strand having: a
first complement balance region that has a sequence complementary
to the sequence of the first protector balance region; a first
complement branch migration region that: is immediately 5' to the
first complement balance region; and has a sequence complementary
to a first protector branch migration region; a toehold region
that: is immediately 5' to the first complement branch migration
region; and has a sequence that is complementary to a second target
nucleic acid sequence that is immediately 3' to the first target
nucleic acid sequence; a second complement branch migration region
that: is immediately 5' to the toehold region; and has a sequence
complementary to a third target nucleic acid sequence that is
immediately 3' to the second target nucleic acid sequence; a second
complement balance region that: is immediately 5' to the second
complement branch migration region; has a sequence that is not
complementary to sequence immediately 3' to the third target
nucleic acid sequence, if any; a hairpin region immediately 5' to
the second complement balance region; and a second protector strand
having: a second protector balance region that: is immediately 5'
to the second hairpin region; and has a sequence complementary to
the second complement balance region; and a second protector branch
migration region that: is immediately 5' to the second protector
balance region; and has a sequence complementary to the second
complement branch migration region; wherein the first complement
balance region and the second complement balance region have
sequences such that: |.DELTA.G.sub.1 -.DELTA.G.sub.2
+.DELTA.G.sub.3 -.DELTA.G.sub.4 -.DELTA.G.sub.5 +.DELTA.G.sub.6
|.ltoreq..DELTA.G.sub.R , wherein: .DELTA.G.sub.1 is the standard
free energy of hybridization of the first protector balance region
to the first complement balance region; .DELTA.G.sub.2 is the
standard free energy of hybridization of the first complement
balance region to the sequence immediately 5' to the first target
nucleic acid sequence, if any; .DELTA.G.sub.3 is the standard free
energy of hybridization of the second protector balance region to
the second complement balance region; .DELTA.G.sub.4 is the
standard free energy of hybridization of the second complement
balance region to the sequence immediately 3' to the third target
nucleic acid sequence, if any; and .DELTA.G.sub.5 is the standard
free energy of hybridization of the toehold region to the second
target nucleic acid sequence; .DELTA.G.sub.6 is the standard free
energy of confinement of the hairpin region; and .DELTA.G.sub.R is
3.5 kcal/mol.
[0049] In another aspect, provided herein is a primer duplex system
comprising a first protector strand, a complement strand and a
second protector strand, wherein: the first protector strand
comprises a nucleic acid having: a first protector branch migration
region having a sequence that corresponds to a first target nucleic
acid sequence; and a first protector balance region that: is
immediately 5' to the first protector branch migration region; and
has a sequence that does not correspond to sequence immediately 5'
to the first target nucleic acid sequence, if any; the complement
strand comprises a nucleic acid having: a first complement balance
region that: is immediately 5' to the first hairpin region; and has
a sequence complementary to the sequence of the first protector
balance region; a first complement branch migration region that: is
immediately 5' to the first complement balance region; and has a
sequence complementary to a first protector branch migration
region; a toehold region that: is immediately 5' to the first
complement branch migration region; and has a sequence that is
complementary to a second target nucleic acid sequence that is
immediately 3' to the first target nucleic acid sequence; a second
complement branch migration region that: is immediately 5' to the
toehold region; and has a sequence complementary to a third target
nucleic acid sequence that is immediately 3' to the second target
nucleic acid sequence; a second complement balance region that: is
immediately 5' to the second complement branch migration region;
has a sequence that is not complementary to sequence immediately 3'
to the third target nucleic acid sequence, if any; and the second
protector strand comprises: a second protector balance region that
has a sequence complementary to the second complement balance
region; and a second protector branch migration region that: is
immediately 5' to the second protector balance region; and has a
sequence complementary to the second complement branch migration
region; wherein the first complement balance region and the second
complement balance region have sequences such that: |.DELTA.G.sub.1
-.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G.sub.4 -.DELTA.G.sub.5 -RT
ln (c)| .DELTA.G.sub.R , wherein: .DELTA.G.sub.1 is the standard
free energy of hybridization of the first protector balance region
to the first complement balance region; .DELTA.G.sub.2 is the
standard free energy of hybridization of the first complement
balance region to the sequence immediately 5' to the first target
nucleic acid sequence, if any; .DELTA.G.sub.3 is the standard free
energy of hybridization of the second protector balance region to
the second complement balance region; .DELTA.G.sub.4 is the
standard free energy of hybridization of the second complement
balance region to the sequence immediately 3' to the third target
nucleic acid sequence, if any; and .DELTA.G.sub.5 is the standard
free energy of hybridization of the toehold region to the second
target nucleic acid sequence; R is the ideal gas constant; T is the
temperature at which the primer duplex system is to be used; c is
the concentration at which the primer duplex system is to be used;
and .DELTA.G.sub.R is 3.5 kcal/mol.
[0050] In yet another aspect, provided herein is a primer duplex
system comprising, in 3' to 5' order, a first protector strand, a
first hairpin region, a complement strand, a second hairpin region
and a second protector strand, wherein: the first protector strand
has a sequence that corresponds to a first target nucleic acid
sequence; the first hairpin region is immediately 5' of the first
protector strand; the complement strand comprises: a first
complement branch migration region that: is immediately 5' of the
first hairpin region; and has a sequence complementary to the
sequence of the first protector strand; a toehold region that: is
immediately 5' of the first complement branch migration region; and
has a sequence complementary to a second target nucleic acid
sequence that is immediately 3' of the first target nucleic acid
sequence; and a second complement branch migration region that: is
immediately 5' of the toehold region; and has a sequence
complementary to a third target nucleic acid sequence that is
immediately 3' of the second nucleic acid sequence; the second
hairpin region is immediately 5' of the second complement branch
migration region; and the second protector strand has a sequence
that is complementary to the sequence of the second complement
branch migration region.
[0051] In any one of the foregoing aspects, .DELTA.G.sub.R may be
2.0 kcal/mol, 1.0 kcal/mol, or 0.5 kcal/mol; and/or c may be about
10 nM; and/or T may be about 293 K or about 338 K; and/or the
toehold region may be between 4 and 20 nucleotides in length,
between about 4 and 15 nucleotides in length, or between about 4
and 10 nucleotides in length; and/or the first end of the protector
branch migration region may be 5' or 3'; and/or the primer duplex
system may further comprise a functionalized fluorescent group or
dye; and/or the primer duplex system may be immobilized on a solid
support; and/or the hairpin region may be no greater than 20
nucleotides in length or no greater than 10 nucleotides in length;
and/or the sequence of the hairpin region may be selected from the
group consisting of a poly-adenosine sequence, poly-deoxyadenosine
sequence, a poly-5'-methyluridine sequence, a poly-thymidine
sequence, a poly-guanosine sequence, a poly-deoxyguanosine
sequence, a poly-cytidine sequence, a poly-deoxycytidine sequence,
a poly-uridine sequence, and a poly-deoxyuridine sequence; and/or
the first target nucleic acid sequence and/or the second target
nucleic acid sequence may be sequences that naturally occur in an
organism or a virus; and/or the first target nucleic acid sequence
and/or the second target nucleic acid sequence may be sequences
that naturally occur in a micro-RNA.
[0052] In one aspect, provided herein is a method of detecting a
target nucleic acid in a sample comprising: contacting a target
nucleic acid with a primer duplex system of any one of the
embodiments described herein; and detecting the formation of a
complex between the target nucleic acid and at least a part of the
primer duplex system. In some embodiments, the primer duplex system
further comprises a functionalized fluorescent group or dye. In
some embodiments, the primer duplex system is immobilized on a
solid support. In some embodiments, the contacting occurs in a
cell. In some embodiments, the target nucleic acid is a nucleic
acid that naturally occurs in an organism or a virus. In some
embodiments, the target nucleic acid is a micro-RNA.
[0053] In another aspect, provided herein is a method of amplifying
a sequence contained within a target nucleic acid comprising:
forming a solution comprising: a target nucleic acid; a primer
duplex system of any one of the embodiments described herein; and
reagents for performing an amplification reaction; and incubating
the solution under conditions such that a sequence contained within
the target nucleic acid is amplified. In some embodiments, the
target nucleic acid is a nucleic acid that naturally occurs in an
organism or a virus.
[0054] These and other aspects and embodiments of the invention
will be explained in greater detail herein.
BRIEF DESCRIPTION OF DRAWINGS
[0055] FIGS. 1A, 1B, 2A, 2B, 3A, 3B, 4-8, 9A, and 9B depict
exemplary nucleic acid probe systems.
[0056] FIGS. 10, 11, 12A, 12B, 13, and 14 depict exemplary methods
of using nucleic acid probe systems.
[0057] FIGS. 15A and 15B depict highly specific polymerase chain
reaction (PCR) using the primer duplexes provided herein.
[0058] FIGS. 16A-16E show experimental demonstrations of primer
hybridization with single nucleotide discrimination.
[0059] FIGS. 17A-17D show additional experimental results and
statistics on the single-base discrimination abilities of primer
duplexes.
[0060] FIGS. 18A-18B show experimental results using duplex primers
to improve the PCR yield of a quasi-repetitive target.
DETAILED DESCRIPTION OF INVENTION
[0061] A significant challenge in probe-based nucleic acid assays
is that nucleic acids having sequences similar to that of a target
will hybridize to the target's complement with strong
thermodynamics and fast kinetics. However, as described herein, the
kinetics and thermodynamics of strand displacement reactions can be
partially decoupled, so that reactions that are only slightly
thermodynamically favorable or even unfavorable can nonetheless
have kinetics as fast as the hybridization of two complementary
strands. The compositions and methods described herein take
advantage of this decoupling mechanism to provide nucleic acid
probe systems with improved specificity and kinetics.
[0062] Provided herein are highly specific nucleic acid probe
systems and methods of using such probe systems. In certain
embodiments, the nucleic acid probe systems described herein
comprise complement probes having regions complementary to a target
sequence that are protected from hybridization to spurious targets
by protector regions complementary to a portion of the complement
probes. The free energy of the binding reaction between the target
and the protected probe is finely controlled via the rationally
designed bases of one or more balancing regions, which have
sequences that do not correspond to the target nucleic acid
sequence or its complement. In certain embodiments, a protector and
a complement probe form regions on a single nucleic acid molecule
and are separated from one another by one or more nucleic acid
hairpins.
[0063] The methods and compositions described herein possess
several unique properties that facilitate their use in
hybridization assays. First, the nucleic acid probe systems
described herein reliably convert small sequence differences
between targets and spurious targets into large differences in
binding affinity and reaction rates between hybridization of the
target vs. spurious target with the probe. Second, the nucleic acid
probe systems described herein can be designed to operate at any of
a wide range of temperatures and salt concentrations, and can
therefore function reliably under many different experimental
conditions. Third, use of the nucleic acid probe systems described
herein can result in hybridization reactions that are kinetically
fast even at room temperature, which facilitates rapid and
high-throughput analysis of nucleic acids. Fourth, the nucleic acid
probe systems described herein are rationally designed, and
therefore are unlikely to interact unfavorably or in unexpected
ways with other biomolecules.
[0064] Accordingly, provided herein are primer compositions,
methods of making such compositions, and methods of their use. The
embodiments described herein are premised in part on the discovery
that primer (e.g., a pair of partially hybridized primers, or a
single self-hybridizing primer) that are partially double-stranded
and partially single-stranded, when used in a nucleic acid
synthesis reaction for example, are able to discriminate between
fully complementary targets and those having one or more mismatches
(i.e., spurious targets). As demonstrated herein, the primer
duplexes described herein are superior to standard primers in, for
example, PCR reactions using spurious targets such as those having
quasi-repetitive sequences.
[0065] The primer duplexes herein comprise a single-stranded region
referred to herein as a "toehold" from which the primer duplex
initiates binding to a target, a double-stranded "balance region"
which spontaneously dissociates so that a single primer strand does
not complete hybridization (along the full length of the primer) to
the target, and a double-stranded branch migration region, in
between the toehold and balance regions, which is fully
complementary to a target nucleic acid sequence. Mechanistically,
it is thought that hybridization to a target begins at the toehold
and continues along the length of the complement strand until the
primer is no longer "double-stranded". This assumes complementarity
between the target and the branch region as well. As used herein, a
nucleic acid "region" or "domain" is a consecutive stretch of
nucleotides of any length. When nucleotide mismatches exist between
the "target" and the complement strand, displacement of the second
strand (i.e., the protector strand) is thermodynamically
unfavorable and the association between the complement strand and
the "target" is reversed. It is to be understood that in this
latter description, the "target" is actually a spurious target
since it comprises nucleotide differences or mismatches from the
complement strand.
[0066] Because the standard free energy favors a complete match
(fully complementary) between the target sequence of the nucleic
acid and branch migration plus toehold regions of the primer rather
than a mismatch (e.g., single nucleotide change), the first
(complement) strand of the primer will bind stably to a target in
the absence of a mismatch but not in the presence of a mismatch. If
a mismatch exists between the first (complement) strand of the
primer and the target, the primer duplex prefers to reform via
newly exposed single-stranded balance regions. In this way, the
frequency of beginning a nucleic acid synthesis reaction at an
incorrect position in a target (or in a sample, for that matter) is
reduced. This type of discrimination is typically not possible
using the standard single-stranded primers of the prior art because
in those reactions there is no competing nucleic acid strand (such
as the protector strand) to which a mismatched primer strand would
prefer to bind. In some embodiments, the primers described herein
may be significantly longer than conventional primers (e.g., those
used for polymerase chain reaction (PCR) amplification) because the
instant primers rely on the presence of a competing, protector
strand for specificity rather than on melting temperature to
discriminate between complementary and mismatched sequences.
Accordingly, the instant primers may be selected and used in a
manner that is temperature independent.
[0067] The primer duplexes described herein therefore improve
specificity of for example nucleic acid synthesis reactions and, in
some embodiments, allow for a greater degree of multiplexing of
primers. Preliminary experiments, the results of which are provided
herein, show that the PCR yield of quasi-repetitive targets can be
significantly improved using the primer duplexes provided herein as
compared to standard primers (e.g., 75% vs. 30%). The primer
duplexes described herein also provide for specific nucleic acid
detection and amplification from a heterogeneous population of
nucleic acids, such as for example, detecting and amplifying a
bacterial DNA from a sample comprised of human DNA, which has broad
applicability in detection of rare organisms such as biowarfare
agents.
Primer Duplexes
[0068] As used herein, the primers of the invention may be referred
to as "primer duplexes" to covey that they may be provided and/or
exist in a conformation in which they comprise double-stranded
regions. Accordingly, the terms "primer" and "primer duplex" may be
used interchangeably.
[0069] The primer duplexes provide improved specificity and
kinetics over existing primers. A "primer duplex" herein refers to
a primer comprising a first strand (referred to herein as a
"complement strand") and a second strand (referred to herein as a
"protector strand") partially complementary to the first strand. In
some embodiments, the complement strand and the protector strand
are separate single-stranded nucleic acid molecules (FIGS. 1A and
1B). In other embodiments, the complement strand and the protector
strand are connected to each other and separated by a hairpin
region to form contiguous regions of a single nucleic acid molecule
(FIGS. 2A and 2B). As used herein, a "hairpin region" is a
single-stranded loop of nucleotides connecting two double-stranded
regions of a nucleic acid. The general structure of exemplary
primer duplexes is illustrated in the Figures and described herein.
It is to be understood that, in most instances, when reference is
made to a complement region or a protector region (or vice versa),
each region is typically within a single "primer duplex" (or a
single primer system). For example, a complement balance region in
a primer of the invention is complementary to a protector balance
region in the same primer such that a complement balance region of
one primer of the invention does not hybridize to a protector
balance region of different physically separate primer.
[0070] In embodiments in which the primer of the invention consists
of only a single strand, the complement "strand" may be referred to
as the complement region, and the protector "strand" may be
referred to as the protector region.
[0071] In certain embodiments, the complement strand (or region)
comprises a toehold region, a complement branch migration region,
and a complement balance region, while the protector strand (or
region) comprises a protector branch migration region and a
protector balance region. As used herein, a nucleic acid "region"
is a consecutive stretch of nucleotides of any length. Toehold and
branch migration regions are each designed to be complementary to,
and thus "base-pair" with (e.g., hybridize to), adjacent regions in
a target nucleic acid. A region of a complement strand that
base-pairs with a region in a target nucleic acid is referred to as
a "target-specific" region. Balance regions are designed to be not
complementary to, and thus to not base-pair with, a target nucleic
acid. Balance regions therefore are referred to as
"target-non-specific" regions. In certain aspects, when the
complement strand (or region) and the protector strand (or region)
are hybridized to each other (are double-stranded), a primer duplex
is formed. Thus, in some aspects, a primer duplex comprises a
target-specific single-stranded toehold region, a target-specific
double-stranded branch migration region, and a target-non-specific
double-stranded balance region (FIGS. 1A and 1B). In some
instances, the primer duplex may also comprise a hairpin loop, as
described in greater detail below.
[0072] The primer duplexes described herein may be designed to
hybridize specifically with a target nucleic acid. The efficacy of
a primer, for example, in a nucleic acid amplification reaction,
depends on the specificity, efficiency, and fidelity of the primer.
Typical nucleic acid primers often bind to spurious targets with a
thermodynamic and kinetic profile comparable to that of the same
primer binding to its intended, specific target nucleic acid,
except between the melting temperatures of the mismatched duplex
and the perfectly hybridized duplex. Accordingly, mismatched and
perfect duplexes can be distinguished by their melting
temperatures. The primers of the invention, in contrast,
distinguish between spurious and true target in a relatively
temperature-independent manner.
[0073] A "spurious target" herein refers to a nucleic acid molecule
that differs from a target nucleic acid molecule by at least one
nucleotide within the region hybridizing to the complement strand.
For example, TCGACGGGG is a spurious target, if the target is
TCGAAGGGG. In certain embodiments, a spurious target comprises at
least 2, at least 3, at least 4, or more nucleotide changes
relative to the target. Primer binding to spurious targets reduces
the fidelity (accuracy) of, e.g., nucleic acid amplification. The
primer duplexes presented herein are designed to alter the standard
free energy of strand displacement with spurious targets,
permitting discrimination between correct targets and spurious
targets, including spurious targets that differ from a correct
target by only one nucleotide. As described herein, the protector
strand is responsible for altering the standard free energy to
allow the complement strand to discriminate between correct and
spurious targets.
[0074] The primers described herein are rationally designed to
facilitate strand displacement reactions with finely tuned kinetics
and thermodynamics such that kinetics and thermodynamics of strand
displacement reactions are partially decoupled. As a result of this
decoupling, reactions only slightly thermodynamically favorable or
even unfavorable can nonetheless have kinetics as fast as the
hybridization of two complementary strands.
[0075] For example, at 37.degree. C. and 1 M Na.sup.+, the
concentration-adjusted standard free energy for hybridization of a
primer to a perfectly complementary (correct or specific) target
(i.e., 100% nucleotide match) is between 1.9 kcal/mol and 6.6
kcal/mol more favorable than the concentration-adjusted standard
free energy for hybridization of the same primer to a spurious
target for every nucleotide that the spurious target differs from
the intended target. In certain embodiments, the present primer
duplexes use toehold exchange strand displacement reactions to
translate this 1.9 to 6.6 kcal/mol difference in
concentration-adjusted standard free energy to an optimal
discrimination between the target and spurious targets. An example
of the thermodynamics/kinetics of primer duplex binding to a target
nucleic acid is described as follows in reference to FIGS. 3A and
3B.
[0076] For purposes of this example, the target nucleic acid has at
least two regions, (1) and (2). In certain embodiments, region (1)
may be about 10 to about 200 (including 14-200 or 20-200)
nucleotides long, while region 2 may be smaller, for example, about
4 to about 20 nucleotides long. As used herein, the terms
"nucleotide" and "bases" are used interchangeably. The protector
strand includes a protector branch migration region adjacent to a
protector balance region (3). The protector branch migration region
corresponds to target region 1, while the protector balance region
(3) does not correspond to region (1) or region (2) or any region
immediately 5' of the target regions. A nucleic acid sequence,
domain or region is "immediately adjacent to", "immediately 5" or
"immediately 3" to another sequence if the two sequences are part
of the same nucleic acid molecule and if no bases separate the two
sequences. The complement strand includes a complement balance
region (3), a complement branch migration region (1), and a toehold
region (2). The complement balance region (3) is complementary to
the protector balance region (3), the complement branch migration
region (1) is complementary to the protector branch migration
region and target region (1) (i.e., the protector branch migration
region and the target region (1) are identical in sequence and this
both bind to the complement branch migration region (1), and the
toehold region (2) is complementary to target region (2).
[0077] In certain embodiments, the balance region is designed so
that its concentration-adjusted standard free energy
(.DELTA.G.sub.3:3 ) is the same or about the same as the
concentration-adjusted standard free energy for the toehold region
bound to target region (2) (.DELTA.G.sub.2:2 ). In some instances,
for a 10 nanomolar (nM) primer used in a reaction at 37.degree. C.,
|.DELTA.G.sub.2:2 | and |.DELTA.G.sub.3:3 (the vertical bars
denoting absolute value) should each be less than about 11.3
kcal/mol to ensure dissociation of the full protector strand from
the target.
[0078] In some embodiments, when the primer duplex interacts with a
specific (correct) target nucleic acid molecule (FIG. 3A), the
dissociation of (3):(3) and the association of (2):(2) balance one
another, and the (1):(1) hybridization thermodynamics are identical
for the target nucleic acid and for the protector strand
interacting with the complement strand. The total free energy
change between the two states is relatively small (e.g., about 1
kcal/mol), and the reaction quickly (e.g., less than a minute)
reaches an equilibrium of about 50:50. In certain embodiments, the
balance region may be designed to have standard free energy very
close to that of the toehold region binding to target region 2, so
that the equilibrium balance is, for example, 60:40 or 70:30. In
some embodiments, the design of the balance region may also take
into account other contributors to free energy change during the
reaction, such as hybridization between the protector balance
region and upstream target sequences (which in some instances is
negligible), confinement of a hairpin (if present), intended
temperature of use, and intended primer concentration.
[0079] In some embodiments, when the primer duplex instead
interacts with a spurious target nucleic acid molecule (FIG. 3B),
the dissociation of (3):(3) and the association of (2m):(2) are not
balanced because spurious target region (2m) is not fully
complementary to the toehold region. The equilibrium is
consequently shifted to the state in which the primer duplex does
not bind the spurious target.
[0080] Explained another way, in some instances, the free energy of
the complement strand bound to the protector strand is
.DELTA.G.sub.3:3 +.DELTA.G.sub.1:1 (ignoring contribution from the
optional hairpin region and other considerations), which balances
the free energy of the complement strand bound to specific target,
.DELTA.G.sub.2:2 +.DELTA.G.sub.1:1 . In this example, this is
because the balance region (3) of the primer duplex has been
designed to have a concentration-adjusted standard free energy
equal to (or approximately equal to) that of target region (2).
When the primer duplex interacts with a spurious target having a
single-nucleotide (base) change in target region (2m), the system's
free energy .DELTA.G.sub.2m:2 +.DELTA.G.sub.1:1 is less negative
than that of the primer duplex, and therefore disfavored in
equilibrium.
[0081] As used herein, the term "approximately equal to" in
reference to standard free energy means that the first referenced
free energy is within 10% of the second referenced free energy. In
some embodiments, a first free energy that is approximately equal
to a second free energy is within about +3 kcal/mol to about -3
kcal/mol of the second free energy. It is to be understood that the
differences between the first and second true energies may be less
than or about 1 kcal/mol, less than or about 2 kcal/mol, less than
or about 3 kcal/mol, less than or about 3.5 kcal/mol, or more, in
some embodiments.
[0082] Although FIG. 3B illustrates a single nucleotide change
corresponding to region (2)/(2m) of a target nucleic acid molecule,
the present primer duplexes can also discriminate between a
specific target and a spurious target having a nucleotide change in
region (1). When the spurious target has a single-base change in
target region 1, then the primer duplex's standard free energy
after binding becomes .DELTA.G.sub.2:2 +G.sub.1m:1 , where
.DELTA.G.sub.1m:1 is the standard free energy of the mismatched
target region (1) binding to the primer duplex's complement region
(1). Because of the single-base change, the primer duplex's free
energy is less negative than .DELTA.G.sub.3:3 +.DELTA.G.sub.1:1
(free energy of complement primer bound to protector), so
equilibrium is shifted to the state in which the primer duplex does
not bind the spurious target region. Standard free energies can be
calculated theoretically based on the knowledge in the art and the
teachings provided herein.
Complement and Protector Strands, Regions or Domains
[0083] The complement domains of the nucleic acid probe systems
described herein each include a plurality of regions, including a
toehold region and one or more complement target regions. Both the
toehold region and the one or more complement target regions have
nucleic acid sequences that are complementary to nucleic acid
sequences of the target nucleic acid. The toehold region and the
complement target region are therefore able to base-pair with and
therefore form a complex with a sequence of a target nucleic acid
when the nucleic acid probe system is contacted with a target
nucleic acid under appropriate hybridization conditions. The
complement domains may also include one or more complement balance
regions. The one or more complement balance regions are rationally
designed. Thus, the sequences of the one or more complement balance
regions are not designed to be complementary to a target nucleic
acid sequence.
[0084] A toehold region is complementary to (and thus hybridizes
to) a sequence in the target nucleic acid molecule; however, a
toehold region does not hybridize to a protector strand. Thus, when
the complement strand is hybridized to the target nucleic acid
molecule, the toehold region is also hybridized to the target
nucleic acid molecule, but when the complement strand is hybridized
to the protector strand, the toehold region remains
single-stranded. A toehold region may be positioned at the 3' end
or the 5' end of the complement strand (e.g., is an extension of
the 3' end or 5' end of the complement strand).
[0085] In certain embodiments, a toehold region is about 4
nucleotides to about 20 nucleotides in length, about 4 nucleotides
to about 15 nucleotides in length, or about 4 nucleotides to about
10 nucleotides in length. In some embodiments, a toehold region is
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20
nucleotides in length. In some embodiments, the toehold region is
greater than 20 nucleotides in length, including for example less
than or about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85,
90, 95, 100 or more nucleotides.
[0086] The complement branch migration region is complementary to a
sequence in the target nucleic acid molecule and to the protector
branch migration region. Thus, when the complement strand
hybridizes to a target nucleic acid molecule, the complement branch
migration region hybridizes to the target nucleic acid. When the
complement strand hybridizes to its protector strand, the
complement branch migration region hybridizes to the protector
branch migration region.
[0087] In certain embodiments, a branch migration region is no more
than 200, 100, 75, 50, 40, 30, 25 or 20 nucleotides in length. In
some embodiments, a branch migration region is about 10 nucleotides
to about 200 nucleotides in length. In certain embodiments, a
branch migration region is about 10 nucleotides to about 150
nucleotides, about 10 nucleotides to about 100 nucleotides, or
about 10 nucleotides to about 50 nucleotides in length. In
particular embodiments, a branch migration region is 10, 11, 12,
13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29,
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46,
47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63,
64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80,
81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97,
98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111,
112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124,
125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137,
138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150,
151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163,
164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176,
177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189,
190, 191, 192, 193, 194, 195, 196, 197, 198, 199, or 200
nucleotides in length. In particular embodiments, a branch
migration region may be more than 200 nucleotides in length,
depending on the target nucleic acid molecule of interest.
[0088] The balance regions of a complement strand and a protector
strand are complementary to each other (i.e., form a
double-stranded nucleic acid) but are non-complementary to the
target of interest (i.e., neither forms a double-stranded nucleic
acid with the target). Thus, when a complement strand hybridizes to
a target nucleic acid molecule, the complement balance region does
not hybridize to the target nucleic acid molecule. When the
complement strand hybridizes to its protector strand, the
complement balance region hybridizes to the protector balance
region.
[0089] The design of the balance region is dependent on the design
of the toehold region. In some embodiments, the balance region is
designed such that the thermodynamic profile of the balance region
is comparable to that of the toehold region. In some embodiments,
the thermodynamic profile is based on a theoretic model, using for
example, Mfold software available at the bioinfo website of RPI.
The number and/or nature of nucleotides within a balance region is
comparable to that of the toehold region. For example, if a toehold
region is comprised of 40% A and T nucleotides and 60% G and C
nucleotides, then the balance region should also be comprised of
40% A and T nucleotides and 60% G and C nucleotides. In
embodiments, the balance region is designed such that no more than
three consecutive nucleotides are complementary to a sequence on
the target nucleic acid to avoid binding of the balance region to
the target nucleic acid.
[0090] In some embodiments, the length of a balance region is short
enough so that the complement and protector spontaneously
dissociate from each other. In some embodiments, a balance region
is about 4 nucleotides to about 20 nucleotides in length, about 4
nucleotides to about 15 nucleotides in length, or about 4
nucleotides to about 10 nucleotides in length. In some embodiments,
a balance region is 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19, or 20 nucleotides in length. In some embodiments, a
balance region is greater than 20 nucleotides, including for
example less than about 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75,
80, 85, 90, 95, 100 or more nucleotides. In some embodiments, the
number of consecutive nucleotides that are complementary to a
nucleotide sequence within the target nucleic acid may be greater
than three provided that the balance region does not bind to the
target nucleic acid.
[0091] In some embodiments, for example those where the primer
duplex contains two separate nucleic acid strands, the design of a
balance region does not depend on the concentration of the primer
duplex or the temperature at which the primer duplex is
formed/used. In some embodiments, a balance region is designed such
that the standard free energy for the reaction in which the
protector strand is displaced from the complement strand by the
target nucleic acid molecule is close to zero kcal/mol. As used
herein, "close to zero" means the standard free energy for the
reaction is within 3.5 kcal/mol from 0 kcal/mol. In certain
embodiments, the standard free energy of this displacement reaction
is within 3.5, 3.0, 2.5, 2.0, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3,
0.2, or 0.1 kcal/mol of zero kcal/mol.
[0092] In other embodiments, for example those where the primer
duplexes is formed by a single nucleic acid molecule (e.g., a
hairpin region separating the complement strand (or region or
domain) and the protector strand (or region or domain)), the design
of a balance region will be dependent on the primer duplex
concentration as well as reaction temperature. In such embodiments,
a balance region is designed so that the standard free energy for
the reaction in which the protector strand is displaced from the
complement strand by the target nucleic acid plus RT ln(c) is close
to zero kcal/mol, where R is the universal gas constant
(0.0019858775(34) kcal/molK), T is the temperature at which the
primer duplex is used, and c is the concentration at which primer
duplex is used. In some embodiments, the temperature at which the
primer duplexes are used is about 273 K (0.degree. C.), 277 K, 283
K, 288 K, 293K, 298 K, 303 K, 308 K, 313 K, 318 K, 323 K, 328 K,
333 K, 338 K, 343 K, 348 K, 353 K, 358 K or 363 K (90.degree. C.).
In some embodiments the concentration (c) at which the primer
duplexes are used is about 1 nM, 2 nM, 3 nM, 4 nM, 5 nM, 10 nM, 15
nM, 20 nM, 25 nM, 30 nM, 35 nM, 40 nM, 45 nM, 50 nM, 55 nM, 60 nM,
65 nM, 70 nM, 75 nM, 80 nM, 85 nM, 90 nM, 95 nM, 100 nM, 125 nM,
150 nM, 175 nM, 200 nM, 225 nM, 250 nM, 300 nM, 350 nM, 400 nM, 450
nM, 500 nM, 600 nM, 700 nM, 800 nM, 900 nM or 1 .mu.M. In certain
embodiments, the standard free energy of this displacement reaction
plus RT ln(c) is within 3.5, 3.0, 2.5, 2.0, 0.9, 0.8, 0.7, 0.6,
0.5, 0.4, 0.3, 0.2, or 0.1 kcal/mol of zero kcal/mol.
[0093] In some embodiments, a primer duplex may include one or more
hairpin regions that connect the complement strand to the protector
strand. In certain embodiments, the hairpin region of a primer
duplex can be of any length. In some embodiments, the hairpin
region is more than 30, 25, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11,
10, 9, 8, 7, 6, 5, 4 or 3 nucleotides in length. In some
embodiments, the sequence of the hairpin is not complementary to a
sequence of the target nucleic acid molecule.
[0094] In certain embodiments, the hairpin region has a
poly-mononucleotide sequence, such as a poly-adenosine sequence,
poly-deoxyadenosine sequence, a poly-5'-methyluridine sequence, a
poly-thymidine sequence, a poly-guanosine sequence, a
poly-deoxyguanosine sequence, a poly-cytidine sequence, a
poly-deoxycytidine sequence, a poly-uridine sequence or a
poly-deoxyuridine sequence.
[0095] The primer duplex described herein may be one of at least
two orientations. For example, in one orientation, the toehold
region is located at the 5' end, immediately adjacent to the
complement branch migration region (i.e., no intervening
nucleotides between the two regions), and the complement balance
region is located at the 3' end, immediately adjacent to the
complement branch migration region. In this orientation, the
protector balance region is at the 5' end of the protector strand,
immediately adjacent to the protector branch migration region (FIG.
1A). In another orientation, the toehold region is located at the
3' end, immediately adjacent to the complement branch migration
region, and the complement balance region is located at the 5' end,
immediately adjacent to the complement branch migration region. In
this orientation, the protector balance region is at the 3' end of
the protector strand, immediately adjacent to the protector branch
migration region (FIG. 1B).
[0096] Regardless of orientation, the sequence of the complement
balance region is such that such that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 -.DELTA.G.sub.3
|.ltoreq..DELTA.G.sub.R ,
where: [0097] .DELTA.G.sub.1 is the standard free energy of
hybridization of the protector balance region to the complement
balance region; [0098] .DELTA.G.sub.1 is the standard free energy
of hybridization of the protector balance region to the sequence
immediately adjacent in the first direction to the target nucleic
acid sequence, if any; [0099] .DELTA.G.sub.3 is the standard free
energy of hybridization of the toehold region to the second target
nucleic acid sequence; and [0100] .DELTA.G.sub.R is 3.5
kcal/mol.
[0101] In some embodiments, a primer duplex comprises a complement
strand longer than the protector, the difference in length being
dependent on the length of the toehold region of the complement
strand. The lengths of the primers are designed such that
hybridization of the complement to the target of interest has a
standard free energy (.DELTA.G ) close to zero. Release of the
protector strand (from the primer duplex) ensures that this
hybridization reaction is entropically neutral and robust to
concentration. As a result, in some embodiments, this reaction at
room temperature (e.g., about 25.degree. C. or about 298 K)
parallels the specificity of hybridization achieved at near melting
temperature across many conditions.
[0102] As intended herein, a .DELTA.G (change in standard free
energy) "close to zero" refers to an absolute value (amount) less
than or about 1 kcal/mol, less than or about 2 kcal/mol, less than
or about 3 kcal/mol, or less than or about 3.5 kcal/mol. In some
embodiments, the standard free energy of a balance region or
toehold region is >-1 kcal/mol to <1 kcal/mol>-3 kcal/mol
to <3 kcal/mol or >-3.5 kcal/mol to <3.5 kcal/mol.
[0103] The primer duplexes may be prepared at a ratio of protector
strand to complement strand of about 2:1 to about 5:1. In some
embodiments, the ratio of protector strand to complement strand is
about 2:1, about 3:1, about 4:1, or about 5:1. In some embodiments,
the ratio of protector strand to complement strand is 2:1, 2.1:1,
2.2:1, 2.3:1, 2.4:1, 2.5:1, 2.6:1, 2.7:1, 2.8:1, 2.9:1, 3:1, 3.1:1,
3.2:1, 3.3:1, 3.4:1, 3.5:1, 3.6:1, 3.7:1, 3.8:1, 3.9:1, 4:1, 4.2:1,
4.3:1, 4.4:1, 4.5:1, 4.6:1, 4.7:1, 4.8:1, 4.9:1, or 5:1. The primer
duplexes may also be used together with excess protector strand in
any of the assays or reactions described herein. The protector
strand may be in about equal to or more than 2-, 5-, 10-, 20-, 50-,
100-, or 500-fold molar excess relative to the primer.
Hairpin Primer Duplex Systems (e.g., Single-Stranded Systems)
[0104] In certain embodiments, a primer duplex includes a single
nucleic acid that comprises a complement region or domain, a
hairpin region, and a protector region or domain. In some
embodiments, the complement domain hybridizes to the protector
domain, forming a primer duplex having in intervening hairpin-loop
region. Like other primer systems disclosed here, such hairpin
primer systems are designed to specifically hybridize to a target
nucleic acid molecule. Herein, a "hairpin primer duplex system" or
a "hairpin system" includes a complement balance region, a
complement branch migration region, a toehold region, a protector
balance region, and a protector branch migration region. As
described above, a complement balance region is complementary to a
protector balance region; a complement branch migration region is
complementary to a protector branch migration region and a target
nucleic acid region; and a toe hold region is complementary to a
target nucleic acid region. A protector branch migration region
corresponds to a target nucleic acid region and is complementary to
a complement branch migration region. Because the hairpin primer
duplex systems described herein are formed by a single nucleic acid
molecule, the design of the complement balance region will be
dependent on the temperature and concentration at which the primer
system is to be used, as described herein. It is to be understood
that though the sequence of the target nucleic acid molecule may be
used to describe the characteristics of the primer systems, in some
embodiments, the target nucleic acid itself may or may not be a
component of the primer system (e.g., two-stranded or
single-stranded systems).
[0105] For primer duplexes having a hairpin region, the standard
free energy of the confinement of the hairpin region may be
considered when determining the standard free energy for the
reaction in which the protector strand is displaced from the
complement strand by the target nucleic acid. Approximate values
for the standard free energy of hairpin confinement for hairpins of
various sizes are provided in Table 1 (from SantaLucia and Hicks,
Annu. Rev. Biophys. Biomol. Struct., 33:414-440, (2004)).
TABLE-US-00001 Hairpin Size .DELTA.G.degree. of Hairpin Confinement
3 nt 3.5 kcal/mol 4 nt 3.5 kcal/mol 5 nt 3.3 kcal/mol 6 nt 4.0
kcal/mol 7 nt 4.2 kcal/mol 8 nt 4.3 kcal/mol 9 nt 4.5 kcal/mol 10
nt 4.6 kcal/mol 12 nt 5.0 kcal/mol 14 nt 5.1 kcal/mol 16 nt 5.3
kcal/mol 18 nt 5.5 kcal/mol 20 nt 5.7 kcal/mol 25 nt 6.1 kcal/mol
30 nt 6.3 kcal/mol
[0106] The standard free energy of the confinement of the hairpin
regions having lengths not provided in Table 1 (e.g., a length of
n) can be estimated using the following equation:
.DELTA.G (loop-n)=.DELTA.G (loop-x)+2.44RT ln(n/x)
[0107] where .DELTA.G (loop-n) is the unknown standard free energy
of the confinement of a hairpin region of n nucleotides in length,
.DELTA.G (loop-x) is the known standard free energy of the
confinement of a hairpin region of n nucleotides in length (e.g.,
as provided in Table 1), R is the ideal gas constant, and T is the
temperature at which the primer duplex is to be used. Additional
information on the calculation of standard free energies of hairpin
region confinement is provided in SantaLucia and Hicks, id., which
is hereby incorporated by reference in its entirety.
[0108] The hairpin primer duplex systems described herein may be
one of at least two orientations. For example, in one orientation
shown in FIG. 2A, the toehold region is located at the 5' end of
the nucleic acid molecule. The 5' end of the complement branch
migration region is immediately adjacent to the 3'end of the
toehold region; the 5' end of the complement balance region is
immediately adjacent to the 3' end of the complement branch
migration region; the 5' end of the hairpin region is immediately
adjacent to the 3' end of the complement balance region; the 5' end
of the protector balance region is immediately adjacent to the 3'
end of the hairpin region; and the 5' end of the protector branch
migration region is immediately adjacent to the 3' end of the
protector balance region. In this orientation, when the nucleic
acid molecule is subjected to conditions that permit annealing, the
hairpin region forms a loop that extends from the complement
balance region and the protector balance region. In another
orientation shown in FIG. 2B, the toehold region is located at the
3' end of the nucleic acid molecule. The 3' end of the complement
branch migration region is immediately adjacent to the 5'end of the
toehold region; the 3' end of the complement balance region is
immediately adjacent to the 5' end of the complement branch
migration region; the 3' end of the hairpin region is immediately
adjacent to the 5' end of the complement balance region; the 3' end
of the protector balance region is immediately adjacent to the 5'
end of the hairpin region; and the 3' end of the protector branch
migration region is immediately adjacent to the 5' end of the
protector balance region.
[0109] Regardless of orientation, the complement balance region of
the hairpin primer duplex system has a sequence such that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 -.DELTA.G.sub.3 +.DELTA.G.sub.4 +RT
ln(c)|.ltoreq..DELTA.G.sub.R ,
where: [0110] .DELTA.G.sub.1 is the standard free energy of
hybridization of the protector balance region to the complement
balance region; [0111] .DELTA.G.sub.2 is the standard free energy
of hybridization of the protector balance region to the sequence
immediately adjacent in the first direction to the target nucleic
acid sequence, if any; and [0112] .DELTA.G.sub.3 is the standard
free energy of hybridization of the toehold region to the second
target nucleic acid sequence; [0113] .DELTA.G.sub.4 is the standard
free energy of confinement of the hairpin region; [0114] R is the
ideal gas constant; [0115] T is the temperature at which the primer
system is to be used; [0116] c is the concentration at which the
primer system is to be used; and [0117] .DELTA.G.sub.R is 3.5
kcal/mol.
Other Primer Systems
[0118] Additional primer systems are depicted in FIGS. 4-7. Each of
these primer systems include a complement domain and two protector
domains. The complement domain has a single toehold region that is
flanked by two complement branch migration regions and two
complement balance regions. Each of the protector domains include a
protector branch migration region that has a sequence complementary
to the sequence of one of the complement branch migration regions,
and a protector balance region that has a sequence that is
complementary to the sequence of one of the complement balance
regions. The difference between the primer systems of FIGS. 4-7 is
in the number and location of the hairpin regions, which in turn
affects the design of the complement balance regions. Like the
other primer systems disclosed here, these primer systems are
designed to specifically hybridize to a target nucleic acid. Though
the sequence of the target nucleic acid is used to describe the
characteristics of a primer system, in some embodiments, the target
nucleic acid is not a component of a primer system.
[0119] As depicted in FIG. 4, a primer system may have, in 3' to 5'
order, a first protector domain, a first hairpin region, a
complement domain, a second hairpin region and a second protector
domain.
[0120] In such embodiments, the first protector domain includes a
first protector branch migration region and a first protector
balance region. The first protector branch migration region has a
sequence that corresponds to a first target nucleic acid sequence.
The first protector balance region is immediately 5' to the first
protector branch migration region and has a sequence that does not
correspond to sequence immediately 5' to the first target nucleic
acid sequence, if any, on the target nucleic acid.
[0121] In this embodiment, the first hairpin region is immediately
5' to the first protector balance region.
[0122] The complement domain of such target nucleic acids comprises
a first complement balance region, a first complement branch
migration region, a toehold region, a second complement branch
migration region and a second complement balance region. The first
complement balance region is immediately 5' to the first hairpin
region and has a sequence complementary to the sequence of the
first protector balance region. The first complement branch
migration region is immediately 5' to the first complement balance
region and has a sequence complementary to a first protector branch
migration region. The toehold region is immediately 5' to the first
complement branch migration region and has a sequence that is
complementary to a second target nucleic acid sequence that is
immediately 3' to the first target nucleic acid sequence on the
target nucleic acid. The second complement branch migration region
is immediately 5' to the toehold region and has a sequence
complementary to a third target nucleic acid sequence that is
immediately 3' to the second target nucleic acid sequence on the
target nucleic acid. The second complement balance region is
immediately 5' to the second complement branch migration region and
has a sequence that is not complementary to sequence immediately 3'
to the third target nucleic acid sequence, if any, on the target
nucleic acid.
[0123] In such embodiments, the second hairpin region is
immediately 5' to the second complement balance region.
[0124] In this embodiment the second protector domain includes a
second protector balance region and a second protector branch
migration region. The second protector balance region is
immediately 5' to the second hairpin region and has a sequence
complementary to the second complement balance region. The second
protector branch migration region is immediately 5' to the second
protector balance region and has a sequence complementary to the
second complement branch migration region.
[0125] According to this embodiment, the first complement balance
region and the second complement balance region have sequences such
that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G.sub.4
-.DELTA.G.sub.5 +.DELTA.G.sub.6 +.DELTA.G.sub.7 RT
ln(c)|.ltoreq..DELTA.G.sub.R ,
where: [0126] .DELTA.G.sub.1 is the standard free energy of
hybridization of the first protector balance region to the first
complement balance region; [0127] .DELTA.G.sub.2 is the standard
free energy of hybridization of the first complement balance region
to the sequence immediately 5' to the first target nucleic acid
sequence, if any; [0128] .DELTA.G.sub.3 is the standard free energy
of hybridization of the second protector balance region to the
second complement balance region; [0129] .DELTA.G.sub.4 is the
standard free energy of hybridization of the second complement
balance region to the sequence immediately 3' to the third target
nucleic acid sequence, if any; [0130] .DELTA.G.sub.5 is the
standard free energy of hybridization of the toehold region to the
second target nucleic acid sequence; [0131] .DELTA.G.sub.6 is the
standard free energy of confinement of the first hairpin region;
[0132] .DELTA.G.sub.R is the standard free energy of confinement of
the second hairpin region; [0133] R is the ideal gas constant;
[0134] T is the temperature at which the primer system is to be
used; and [0135] c is the concentration at which the primer system
is to be used; and [0136] .DELTA.G.sub.R is 3.5 kcal/mol.
[0137] As depicted in FIG. 5, in certain embodiments, a primer
system may have a hairpin primer and a protector, where the hairpin
primer is a nucleic acid that includes a first protector domain, a
first hairpin region, a complement domain and the protector is a
nucleic acid that includes a second protector domain.
[0138] In such embodiments, the first protector domain includes a
first protector branch migration region and a first protector
balance region. The first protector branch migration region has a
sequence that corresponds to a first target nucleic acid sequence.
The first protector balance region is immediately 5' to the first
protector branch migration region and has a sequence that does not
correspond to sequence immediately 5' to the first target nucleic
acid sequence, if any, on the target nucleic acid.
[0139] In this embodiment, the hairpin region is immediately 5' to
the first protector balance region.
[0140] The complement domain of such target nucleic acids comprises
a first complement balance region, a first complement branch
migration region, a toehold region, a second complement branch
migration region and a second complement balance region. The first
complement balance region is immediately 5' to the hairpin region
and has a sequence complementary to the sequence of the first
protector balance region. The first complement branch migration
region is immediately 5' to the first complement balance region and
has a sequence complementary to a first protector branch migration
region. The toehold region is immediately 5' to the first
complement branch migration region and has a sequence that is
complementary to a second target nucleic acid sequence that is
immediately 3' to the first target nucleic acid sequence on the
target nucleic acid. The second complement branch migration region
is immediately 5' to the toehold region and has a sequence
complementary to a third target nucleic acid sequence that is
immediately 3' to the second target nucleic acid sequence on the
target nucleic acid. The second complement balance region is
immediately 5' to the second complement branch migration region and
has a sequence that is not complementary to sequence immediately 3'
to the third target nucleic acid sequence, if any, on the target
nucleic acid.
[0141] In this embodiment, the second protector domain includes a
second protector balance region and a second protector branch
migration region. The second protector balance region has a
sequence complementary to the second complement balance region. The
second protector branch migration region is immediately 5' to the
second protector balance region and has a sequence complementary to
the second complement branch migration region.
[0142] According to this embodiment, the first complement balance
region and the second complement balance region have sequences such
that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G
.sub.4-.DELTA.G.sub.5 +.DELTA.G.sub.6 |.ltoreq..DELTA.G.sub.R ,
where: [0143] .DELTA.G.sub.1 is the standard free energy of
hybridization of the first protector balance region to the first
complement balance region; [0144] .DELTA.G.sub.2 is the standard
free energy of hybridization of the first complement balance region
to the sequence immediately 5' to the first target nucleic acid
sequence, if any; [0145] .DELTA.G.sub.3 is the standard free energy
of hybridization of the second protector balance region to the
second complement balance region; [0146] .DELTA.G.sub.4 is the
standard free energy of hybridization of the second complement
balance region to the sequence immediately 3' to the third target
nucleic acid sequence, if any; [0147] .DELTA.G.sub.5 is the
standard free energy of hybridization of the toehold region to the
second target nucleic acid sequence; [0148] .DELTA.G.sub.6 is the
standard free energy of confinement of the hairpin region; and
[0149] .DELTA.G.sub.R is 3.5 kcal/mol.
[0150] As depicted in FIG. 6, in certain embodiments a primer
system may have a protector and a hairpin primer, where the
protector is a nucleic acid that includes a first protector domain
and the hairpin primer is a nucleic acid that includes a complement
domain, hairpin region and a second protector domain.
[0151] In such embodiments, the first protector domain includes a
first protector branch migration region and a first protector
balance region. The first protector branch migration region has a
sequence that corresponds to a first target nucleic acid sequence.
The first protector balance region is immediately 5' to the first
protector branch migration region and has a sequence that does not
correspond to sequence immediately 5' to the first target nucleic
acid sequence, if any, on the target nucleic acid.
[0152] The complement domain of such target nucleic acids comprises
a first complement balance region, a first complement branch
migration region, a toehold region, a second complement branch
migration region and a second complement balance region. The first
complement balance region has a sequence complementary to the
sequence of the first protector balance region. The first
complement branch migration region is immediately 5' to the first
complement balance region and has a sequence complementary to a
first protector branch migration region. The toehold region is
immediately 5' to the first complement branch migration region and
has a sequence that is complementary to a second target nucleic
acid sequence that is immediately 3' to the first target nucleic
acid sequence on the target nucleic acid. The second complement
branch migration region is immediately 5' to the toehold region and
has a sequence complementary to a third target nucleic acid
sequence that is immediately 3' to the second target nucleic acid
sequence on the target nucleic acid. The second complement balance
region is immediately 5' to the second complement branch migration
region and has a sequence that is not complementary to sequence
immediately 3' to the third target nucleic acid sequence, if any,
on the target nucleic acid.
[0153] According to such embodiments, the hairpin region is
immediately 5' to the second complement balance region.
[0154] In this embodiment the second protector domain includes a
second protector balance region and a second protector branch
migration region. The second protector balance region is
immediately 5' to the hairpin region and has a sequence
complementary to the second complement balance region. The second
protector branch migration region is immediately 5' to the second
protector balance region and has a sequence complementary to the
second complement branch migration region.
[0155] According to this embodiment, the first complement balance
region and the second complement balance region have sequences such
that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G.sub.4
-.DELTA.G.sub.5 +.DELTA.G.sub.6 |.ltoreq..DELTA.G.sub.R ,
where: [0156] .DELTA.G.sub.1 is the standard free energy of
hybridization of the first protector balance region to the first
complement balance region; [0157] .DELTA.G.sub.2 is the standard
free energy of hybridization of the first complement balance region
to the sequence immediately 5' to the first target nucleic acid
sequence, if any; [0158] .DELTA.G.sub.3 is the standard free energy
of hybridization of the second protector balance region to the
second complement balance region; [0159] .DELTA.G.sub.4 is the
standard free energy of hybridization of the second complement
balance region to the sequence immediately 3' to the third target
nucleic acid sequence, if any; and [0160] .DELTA.G.sub.5 is the
standard free energy of hybridization of the toehold region to the
second target nucleic acid sequence; [0161] .DELTA.G.sub.6 is the
standard free energy of confinement of the hairpin region; and
[0162] .DELTA.G.sub.R is 3.5 kcal/mol.
[0163] As depicted in FIG. 7, in certain embodiments a primer
system may have a first protector, a complement primer and a second
protector, where the first protector is a nucleic acid that
includes a first protector domain, the complement primer is a
nucleic acid that includes a complement domain, and the second
protector is a nucleic acid that includes a second protector
domain.
[0164] In such embodiments, the first protector domain includes a
first protector branch migration region and a first protector
balance region. The first protector branch migration region has a
sequence that corresponds to a first target nucleic acid sequence.
The first protector balance region is immediately 5' to the first
protector branch migration region and has a sequence that does not
correspond to sequence immediately 5' to the first target nucleic
acid sequence, if any, on the target nucleic acid.
[0165] The complement domain of such target nucleic acids comprises
a first complement balance region, a first complement branch
migration region, a toehold region, a second complement branch
migration region and a second complement balance region. The first
complement balance region has a sequence complementary to the
sequence of the first protector balance region. The first
complement branch migration region is immediately 5' to the first
complement balance region and has a sequence complementary to a
first protector branch migration region. The toehold region is
immediately 5' to the first complement branch migration region and
has a sequence that is complementary to a second target nucleic
acid sequence that is immediately 3' to the first target nucleic
acid sequence on the target nucleic acid. The second complement
branch migration region is immediately 5' to the toehold region and
has a sequence complementary to a third target nucleic acid
sequence that is immediately 3' to the second target nucleic acid
sequence on the target nucleic acid. The second complement balance
region is immediately 5' to the second complement branch migration
region and has a sequence that is not complementary to sequence
immediately 3' to the third target nucleic acid sequence, if any,
on the target nucleic acid.
[0166] In this embodiment the second protector domain includes a
second protector balance region and a second protector branch
migration region. The second protector balance region has a
sequence complementary to the second complement balance region. The
second protector branch migration region is immediately 5' to the
second protector balance region and has a sequence complementary to
the second complement branch migration region.
[0167] According to this embodiment, the first complement balance
region and the second complement balance region have sequences such
that:
|.DELTA.G.sub.1 -.DELTA.G.sub.2 +.DELTA.G.sub.3 -.DELTA.G.sub.4
-.DELTA.G.sub.5 -RT ln(c)|.ltoreq..DELTA.G.sub.R ,
where: [0168] .DELTA.G.sub.1 is the standard free energy of
hybridization of the first protector balance region to the first
complement balance region; [0169] .DELTA.G.sub.2 is the standard
free energy of hybridization of the first complement balance region
to the sequence immediately 5' to the first target nucleic acid
sequence, if any; [0170] .DELTA.G.sub.3 is the standard free energy
of hybridization of the second protector balance region to the
second complement balance region; [0171] .DELTA.G.sub.4 is the
standard free energy of hybridization of the second complement
balance region to the sequence immediately 3' to the third target
nucleic acid sequence, if any; and [0172] .DELTA.G.sub.5 is the
standard free energy of hybridization of the toehold region to the
second target nucleic acid sequence; [0173] R is the ideal gas
constant; [0174] T is the temperature at which the primer system is
to be used; [0175] c is the concentration at which the primer
system is to be used; and [0176] .DELTA.G.sub.R is 3.5
kcal/mol.
Primer Duplex Systems Lacking Balance Domains
[0177] In some embodiments, primer systems may lack balance
domains. Such nucleic acids will hybridize with a target nucleic
acid with fast kinetics if the target nucleic acid has a sequence
complementary to the sequence of the toehold region of the primer
system, but with slow kinetics if the target nucleic is mutated so
that it does not contain a sequence complementary to the toehold
region of the primer system. Such primer systems are therefore
useful, for example, for locating difference and/or mutations in
nucleic acid targets using kinetic discrimination.
[0178] As depicted in FIG. 8, in certain embodiments a primer
system may include a nucleic acid having, in 3' to 5' order, a
first protector domain, a first hairpin region, a complement
domain, a second hairpin region and a second protector domain. The
first protector domain of such primer systems has a sequence that
corresponds to a first target nucleic acid sequence. The first
hairpin region is immediately 5' of the first protector domain. The
complement domain has a first complement branch migration region, a
toehold region and a second complement branch migration region. The
first complement branch migration region is immediately 5' of the
first hairpin region and has a sequence complementary to the branch
migration sequence of the first protector domain. The toehold
region is immediately 5' of the first complement branch migration
region and has a sequence complementary to a second target nucleic
acid sequence that is immediately 3' of the first target nucleic
acid sequence on the target nucleic acid molecule. The second
complement branch migration region is immediately 5' of the toehold
region and has a sequence complementary to a third target nucleic
acid sequence that is immediately 3' of the second nucleic acid
sequence. The second hairpin region is immediately 5' of the second
complement branch migration region. The second protector domain has
a sequence that is complementary to the sequence of the second
complement branch migration region.
Primer Modifications Generally
[0179] Each primer described herein may be comprised of DNA, RNA,
or analogs thereof, and/or combinations thereof. In certain
embodiments, a primer comprises one or more non-natural
nucleotides. The incorporation of non-natural nucleotides in the
primers can further augment the performance of the primer duplexes.
In particular, the protector strand, while not intended to serve to
initiate transcription, may happen to be complementary to other
regions of the target or other background molecules, and may
spuriously initiate replication/transcription. To prevent this, the
use of a non-natural nucleotide or a dideoxy nucleotide at the 3'
end of the second protector strand may serve to prevent unintended
priming by that strand. Examples of non-natural nucleotides
include, but are not limited to, iso-C, iso-G, deoxyuridine (see
also Krueger et al. Chem Biol. 16:242-48 (2009), the teachings
which relating to non-natural nucleotides are incorporated by
reference herein).
[0180] In some embodiments, for example, in a polymerase chain
reaction (PCR) where a repeated primed enzymatic function is used,
the extended complement strand can become a target for subsequent
primer hybridization. To preserve the specificity of primer
hybridization for subsequent rounds of amplification, a balance
region of a primer cannot be replicated. Introducing a non-natural
nucleotide at the interface between the branch migration and
balance regions of the complement strand, for example, may prevent
the balance region from being replicated.
[0181] In certain embodiments, the primers described herein serve
as starting points for polymerase extensions. To facilitate
analysis of amplified (nucleic acid) fragments, labeled primers can
also be used in PCR reactions. Labeled primers are those that are
coupled (or conjugated) to a detectable moiety. Examples include
fluorescent dyes, radioactive labels, and identifiable metals,
nucleic acid sequences, and proteins. When a reaction is carried
out with fluorescently labeled primers, amplicons (nucleic acid
products) with a fluorescent label may be generated.
[0182] The primers described herein can be synthesized by any
method known in the art (see, e.g., Ogilvie et al. J. Amer. Chem.
Soc. 99 (23): 7741-7743; Reese, C. B. Tetrahedron 34(21): 3143
(1978); Efimov et al. Nucleic Acids Res. 11(23): 8369-8387 (1983);
Garegg et al. Tetrahedron Lett. 27(34): 4051 (1986); Beaucage et
al. Tetrahedron 48(12): 2223 (1992); Efimov et al. Nucleosides,
Nucleotides & Nucleic Acids 26 (8-9): 1087-93 (2007),
incorporated herein by reference).
Target Nucleic Acid Molecules
[0183] A "target" can be a single-stranded (ss) or double-stranded
(ss) nucleic acid. Target nucleic acids can be, for example, DNA,
RNA, or the DNA product of RNA subjected to reverse transcription.
In some embodiments, a target may be a mixture (chimera) of DNA and
RNA. In other embodiments, a target comprises artificial nucleic
acid analogs, for example, peptide nucleic acids (Nielsen et al.
Science 254(5037): 1497-500 (1991)) or locked nucleic acids (Alexei
et al. Tetrahedron 54(14): 3607-30 (1998)). In some embodiments, a
target may be naturally occurring (e.g., genomic DNA) or it may be
synthetic (e.g., from a genomic library). As used herein, a
"naturally occurring" nucleic acid sequence is a sequence that is
present in nucleic acid molecules of organisms or viruses that
exist in nature in the absence of human intervention. In some
embodiments, a target is genomic DNA, messenger RNA, ribosomal RNA,
micro-RNA, pre-micro-RNA, pro-micro-RNA, viral DNA, viral RNA or
piwi-RNA. In certain embodiments, a target nucleic acid is a
nucleic acid that naturally occurs in an organism or virus. In some
embodiments, a target nucleic is the nucleic acid of a pathogenic
organism or virus. In certain embodiments the presence or absence
of a target nucleic acid in a subject is indicative that the
subject has a disease or disorder or is predisposed to acquire a
disease or disorder. In certain embodiments the presence or absence
of a target nucleic acid in a subject is indicative that the
subject will respond well or poorly to a treatment, such as a drug,
to treat a disease or disorder.
[0184] The terms "polynucleotide," "nucleic acid" and "nucleic acid
molecule" are used interchangeably. They refer to a polymeric form
of nucleotides of any length, either deoxyribonucleotides or
ribonucleotides, or analogs thereof. Polynucleotides may have any
three-dimensional structure, and may perform any function. The
following are non-limiting examples of polynucleotides: coding or
non-coding regions of a gene or gene fragment, loci (locus) defined
from linkage analysis, exons, introns, messenger RNA (mRNA),
transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant
polynucleotides, branched polynucleotides, plasmids, vectors,
isolated DNA of any sequence, isolated RNA of any sequence, nucleic
acid probes, and primers. A polynucleotide may comprise modified
nucleotides, such as methylated nucleotides and nucleotide analogs.
If present, modifications to the nucleotide structure may be
imparted before or after assembly of the polymer. A polynucleotide
may be further modified, such as by conjugation with a labeling
component. The term "recombinant" polynucleotide means a
polynucleotide of genomic, cDNA, semi-synthetic, or synthetic
origin which either does not occur in nature or is linked to
another polynucleotide in a non-natural arrangement. The term
"isolated nucleic acid" refers to a polynucleotide of natural or
synthetic origin or some combination thereof, which (1) is not
associated with the cell in which the "isolated nucleic acid" is
found in nature, and/or (2) is operably linked to a polynucleotide
to which it is not linked in nature.
[0185] A nucleic acid may also encompass single- and
double-stranded DNA and RNA, as well as any and all forms of
alternative nucleic acid containing modified bases, sugars, and
backbones. The term "nucleic acid" thus will be understood to
include, but not be limited to, single- or double-stranded DNA or
RNA (and forms thereof that can be partially single-stranded or
partially double-stranded), cDNA, aptamers, peptide nucleic acids
("PNA"), 2'-5' DNA (a synthetic material with a shortened backbone
that has a base-spacing that matches the A conformation of DNA;
2'-5' DNA will not normally hybridize with DNA in the B form, but
it will hybridize readily with RNA), and locked nucleic acids
("LNA"). Nucleic acid analogues include known analogues of natural
nucleotides that have similar or improved binding, hybridization of
base-pairing properties. "Analogous" forms of purines and
pyrimidines are well known in the art, and include, but are not
limited to aziridinylcytosine, 4-acetylcytosine, 5-fluorouracil,
5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil,
5-carboxymethylaminomethyluracil, inosine, N6-isopentenyladenine,
1-methyladenine, 1-methylpseudouracil, 1-methylguanine,
1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,
2-methylguanine, 3-methylcytosine, 5-methylcytosine,
N.sup.6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil,
5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine,
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine,
uracil-5-oxyacetic acid methylester, pseudouracil, queosine,
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil,
5-methyluracil, uracil-5-oxyacetic acid, and 2,6-diaminopurine. DNA
backbone analogues provided herein include phosphodiester,
phosphorothioate, phosphorodithioate, methylphosphonate,
phosphoramidate, alkyl phosphotriester, sulfamate, 3'-thioacetal,
methylene(methylimino), 3'-N-carbamate, morpholino carbamate, and
peptide nucleic acids (PNAs), methylphosphonate linkages or
alternating methylphosphonate and phosphodiester linkages
(Strauss-Soukup, 1997, Biochemistry 36:8692-8698), and
benzylphosphonate linkages, as discussed in U.S. Pat. No.
6,664,057; see also OLIGONUCLEOTIDES AND ANALOGUES, A PRACTICAL
APPROACH, edited by F. Eckstein, IRL Press at Oxford University
Press (1991); Antisense Strategies, Annals of the New York Academy
of Sciences, Volume 600, Eds. Baserga and Denhardt (NYAS 1992);
Milligan, 1993, J. Med. Chem. 36:1923-1937; Antisense Research and
Applications (1993, CRC Press). The nucleic acids herein can be
extracted from cells or synthetically prepared according to any
means known to those skilled in the art; for example, the nucleic
acids can be chemically synthesized or transcribed or reverse
transcribed from cDNA or mRNA, among other sources.
[0186] As used herein, two nucleic acids or nucleic acid regions
"correspond" to one another if they are both complementary to the
same nucleic acid sequence. Two nucleic acids or nucleic acid
regions are "complementary" to one another if they base-pair with
each other to form a double-stranded nucleic acid molecule.
[0187] A target nucleic acids utilized herein can be any nucleic
acid, for example, human nucleic acids, bacterial nucleic acids, or
viral nucleic acids. A target nucleic acid sample can be, for
example, a nucleic acid sample from one or more cells, tissues, or
bodily fluids. Target samples can be derived from any source
including, but not limited to, eukaryotes, plants, animals,
vertebrates, fish, mammals, humans, non-humans, bacteria, microbes,
viruses, biological sources, serum, plasma, blood, urine, semen,
lymphatic fluid, cerebrospinal fluid, amniotic fluid, biopsies,
needle aspiration biopsies, cancers, tumors, tissues, cells, cell
lysates, crude cell lysates, tissue lysates, tissue culture cells,
buccal swabs, mouthwashes, stool, mummified tissue, forensic
sources, autopsies, archeological sources, infections, nosocomial
infections, production sources, drug preparations, biological
molecule productions, protein preparations, lipid preparations,
carbohydrate preparations, inanimate objects, air, soil, sap,
metal, fossils, excavated materials, and/or other terrestrial or
extra-terrestrial materials and sources. The sample may also
contain mixtures of material from one source or different sources.
For example, nucleic acids of an infecting bacterium or virus can
be amplified along with human nucleic acids when nucleic acids from
such infected cells or tissues are amplified using the disclosed
methods. Types of useful target samples include eukaryotic samples,
plant samples, animal samples, vertebrate samples, fish samples,
mammalian samples, human samples, non-human samples, bacterial
samples, microbial samples, viral samples, biological samples,
serum samples, plasma samples, blood samples, urine samples, semen
samples, lymphatic fluid samples, cerebrospinal fluid samples,
amniotic fluid samples, biopsy samples, needle aspiration biopsy
samples, cancer samples, tumor samples, tissue samples, cell
samples, cell lysate samples, crude cell lysate samples, tissue
lysate samples, tissue culture cell samples, buccal swab samples,
mouthwash samples, stool samples, mummified tissue samples, autopsy
samples, archeological samples, infection samples, nosocomial
infection samples, production samples, drug preparation samples,
biological molecule production samples, protein preparation
samples, lipid preparation samples, carbohydrate preparation
samples, inanimate object samples, air samples, soil samples, sap
samples, metal samples, fossil samples, excavated material samples,
and/or other terrestrial or extra-terrestrial samples.
[0188] In some embodiments, a target nucleic acids utilized herein
comprise repetitive sequence, secondary structure, and/or a high
G/C content.
[0189] In certain embodiments, a target nucleic acid molecule of
interest is about 100 to about 1,000,000 nucleotides (nt) in
length. In some embodiments, the target is about 100 to about 1000,
about 1000 to about 10,000, about 10,000 to about 100,000, or about
100,000 to about 1,000,000 nucleotides in length. In some
embodiments, the target is about 100, about 200, about 300, about
400, about 500, about 600, about 700, about 800, about 900, about
1,000, about 2,000, about 3,000, about 4,000, about 5,000, about
6,000, about 7,000, about 8,000, about 9000, about 10,000, about
20,000, about 30,000, about 40,000, about 50,000, about 60,000,
about 70,000, about 80,000, about 90,000, about 100,000, about
200,000, about 300,000, about 400,000, about 500,000, about
600,000, about 700,000, about 800,000, about 900,000, or about
1,000,000 nucleotides in length. It is to be understood that the
target nucleic acid may be provided in the context of a longer
nucleic acid (e.g., such as a coding sequence or gene within a
chromosome or a chromosome fragment).
[0190] In certain embodiments, a target of interest is linear,
while in other embodiments, a target is circular (e.g., plasmid
DNA, mitochondrial DNA, or plastid DNA).
Combined Primer-Target Systems
[0191] In some embodiments, provided herein are primer-target
systems. A primer-target system comprises one or more nucleic acid
targets, a polymerase, and one or more primers (e.g., primer duplex
and/or hairpin primer duplex). The term "primer" encompasses any
one of the primers or primer systems described herein (e.g.,
single-stranded primers, double-stranded primer duplexes, and
hairpin primer duplexes). In certain embodiments, the primer-target
systems described herein comprise a plurality of different primers.
In some embodiments, a primer-target system can comprise at least
two primers, which can be used to identify and, for example
amplify, a target nucleic acid molecule. A target nucleic acid
molecule may be present amongst a plurality of non-target nucleic
acid molecules, for example, as a single copy or in low copy
number. Any one of the primer-target systems described herein may
comprises conditions similar to those used in nucleic acid
amplification or sequencing reactions (e.g., similar reagents,
reaction temperature, etc.).
Methods of Use
[0192] The primer systems described herein are able to discriminate
a specific target from spurious targets either through a
thermodynamic mechanism or through a kinetic mechanism. In
distinguishing the target from spurious targets using a
thermodynamic mechanism (described below), the strand displacement
reaction is run to completion and the target is distinguished from
the spurious targets based on differences in equilibrium binding
affinity. To distinguish the target from spurious targets using a
kinetic mechanism (described below), the strand displacement
reaction is stopped before reaching equilibrium, and the
differential rate in reaction completion is used to distinguish the
targets from spurious targets.
[0193] Thermodynamic Separation
[0194] The general strategy for both the thermodynamic and the
kinetic mechanism is to use toehold exchange strand displacement
reactions. In general, toehold exchange involves extending a
target's complement with an additional region that is not
complementary to the target, and pre-hybridizing a protector strand
to this extended complement strand and a large number of bases
adjacent to the extended region, but not to a single-stranded
toehold region.
[0195] FIG. 9 depicts one implementation of toehold exchange. In
this implementation, a two-stranded primer duplex system (as
described above and depicted in FIG. 1A) is used to distinguish
between a target nucleic acid and a spurious target. The complement
branch migration region and the toehold region of the complement
strand have sequences that correspond to a first target sequence
and a second target sequence, respectively. A complement balance
region was designed to be the same length and nucleotide base
distribution as the toehold region. In this way, the standard free
energy of the strand displacement reaction shown in FIG. 9A between
the correct target and the protected complement is roughly .DELTA.G
=0 kcal/mol.
[0196] The strand displacement reaction can be written as:
Target+Complement/Protector.revreaction.Target/Complement+Protector
.DELTA.G relates to the equilibrium constant K.sub.eq by the
following relation:
.DELTA.G =RT ln(K.sub.eq).
[0197] For a reaction with .DELTA.G =0, the equilibrium constant
(K.sub.eq) is 1. The equilibrium constant also relates to the
equilibrium; for this reaction, K.sub.eq=[TC] [P]/[T] [PC]=1. For
an assay where [PC] and [P] and are in excess of [T], [TC]/[T]=1,
meaning that exactly half of all target molecules are hybridized at
equilibrium. In the example shown in FIG. 9A, .DELTA.G =+0.1
kcal/mol, corresponding to a K.sub.eq=0.85, which means that 46% of
the target molecules are hybridized at equilibrium.
[0198] The protector strand correspondingly changes the standard
free energy of the strand displacement reaction with spurious
targets. In the example shown in FIG. 9B, the spurious target
differs from the correct target by a single base, which results in
the strand displacement reaction with the same two-stranded nucleic
acid primer system having a .DELTA.G of +3.7 kcal/mol, which
corresponds to K.sub.eq=1.9*10.sup.-3. At equilibrium, only 0.19%
of the spurious target will be hybridized to the complement. Thus,
the exemplary nucleic acid primer system depicted in FIG. 8
preferentially binds to its target versus a spurious target having
only a single nucleotide mismatch by more than 200-fold.
[0199] FIG. 10 is a plot of the equilibrium binding affinity as a
function of the standard free energy of the reaction. When the
standard free energy of the reaction is very negative (as in the
case in a pure hybridization reaction), both the target and the
spurious targets bind very strongly, and it is difficult to
distinguish between the two, leading to false positives. On the
other hand, when the standard free energy of the reaction is very
positive, the primer binds to the target very weakly, leading to
false negatives. Designing a primer duplex system to have a
standard free energy of near zero results in an optimal
discrimination between targets and spurious targets, thereby
minimizing false positives and false negatives.
[0200] Kinetic Separation
[0201] Kinetic separation relies on the differential kinetics of
toehold exchange. The kinetics of the toehold exchange reaction
depend on the binding strengths of the toeholds regions. Each
kcal/mol of difference in toehold binding energy can affect
kinetics by a factor of 5.4 (FIG. 11), so the +3.6 kcal/mol
mismatch shown in FIG. 9B would yield a kinetic slowdown of
434.
[0202] Unlike in thermodynamic discrimination, kinetic
discrimination occurs only when the mismatch is in the toehold
region. Spurious targets differing from the correct target at a
position complementary to the complement branch migration region
are unlikely to yield significantly different reaction kinetics. As
a consequence, methods that use the kinetic mechanism of
distinguishing target from spurious target are useful in
conjunction with thermodynamic separation as a means of pinpointing
the locations of target/primer mismatches.
[0203] Significantly, primer duplex systems lacking complement
balance regions, such as the primer systems depicted in FIG. 8, can
be used in methods that exploit the kinetic mechanism to pinpoint
the location of target/primer mismatches.
[0204] Microarrays
[0205] Nucleic acid microarrays are often used for high-throughput
nucleic acid detection, but often are unable to distinguish between
closely related nucleic acid sequences. In some embodiments, the
primer duplex systems described herein can be used in nucleic acid
microarrays in order to, for example, improve the specificity of
microarray analysis. In some instance, microarrays assays can be
performed using methods well known in the art, with the exception
that the primer duplex systems described herein can be used in
place of conventional nucleic acid primers.
[0206] For example, as depicted in FIG. 12A, in certain
embodiments, a hairpin primer duplex system from can be directly
synthesized or immobilized on a microarray chip using standard
techniques. In other embodiments, a two-stranded primer duplex
system can be used in a nucleic acid microarray. In some
embodiment, hairpin structures including two photocleavable bases
at predefined positions can be synthesized as in FIG. 12A.
Subsequent exposure to light cleaves the hairpin, yielding the
two-stranded complexes functionalized to the array surface (FIG.
12B). Other methods, such as use of nicking or restriction enzymes,
can also be used to prepare two-stranded complexes.
[0207] Nucleic Acid Synthesis Reactions, Including Amplification
Reactions
[0208] Primer duplexes and systems disclosed here can be used in
some embodiments to improve the specificity of a primer-based
amplification reaction, including polymerase chain reaction (PCR),
strand displacement amplification, or transcription mediated
amplification, by substituting a primer duplex system described
herein for the nucleic acid primers in a primer based amplification
reaction known in the art.
[0209] For example, as depicted in FIG. 13, by using as PCR primers
the hairpin primer duplex systems of the type depicted in FIG. 4,
it is possible to improve the specificity of PCR for a variety of
(e.g., biotechnological) applications. In this example, a target
nucleic acid sequence is amplified by forming a solution comprising
a primer duplex system with the target nucleic acid and standard
reagents for performing an amplification reaction and incubating
the solution under conditions such that an amplification reaction
occurs. In certain embodiments, non-natural bases are incorporated
into a hairpin primer duplex primer systems in order to prevent
replication of the hairpin itself.
[0210] In some embodiments, the primer duplexes described herein
can be adapted for use in amplifying target nucleic acids that
typically require amplification by any one or more of the following
PCR methods: allele-specific PCR, assembly PCR, asymmetric PCR,
helicase-dependent amplification, intersequence-specific PCR
(ISSR), inverse PCR, ligation-mediated PCR, methylation-specific
PCR (MSP), miniprimer PCR, multiplex PCR, nested PCR,
overlap-extension PCR, quantitative PCR (Q-PCR), reverse
transcription PCR (RT-PCR), solid phase PCR, thermal asymmetric
interlaced PCR (TAIL-PCR), or touchdown PCR. In some instances, the
primer duplexes and methods described herein may be used or adapted
for use in any one of the foregoing PCR methods or may substitute
(used instead of) any one of the foregoing PCR methods. A brief
description of each of the foregoing PCR methods is presented
below.
[0211] Allele-specific PCR is a diagnostic or cloning technique
based on single-nucleotide polymorphisms (SNPs) (single-base
differences in DNA). It typically requires prior knowledge of a DNA
sequence, including differences between alleles.
[0212] Assembly PCR or polymerase cycling assembly (PCA) is an
artificial synthesis of long DNA sequences by performing PCR on a
pool of long oligonucleotides with short overlapping segments. The
oligonucleotides alternate between sense and antisense directions,
and the overlapping segments determine the order of the PCR
fragments, thereby selectively producing the final long DNA product
(Stemmer et al. Gene 164(1): 49-53 (1995)).
[0213] Asymmetric PCR preferentially amplifies one DNA strand in a
double-stranded DNA target. It can be used in sequencing and
hybridization probing where amplification of only one of the two
complementary strands is required (Innis et al. Proc. Natl. Acad.
Sci. USA 85(24): 9436-40 (1988)).
[0214] Helicase-dependent amplification is similar to traditional
PCR, but typically uses a constant temperature rather than cycling
through denaturation and annealing/extension cycles. DNA helicase,
an enzyme that unwinds DNA, is used in place of thermal
denaturation (Vincent et al. EMBO Reports 5(8): 795-800
(2004)).
[0215] Intersequence-specific PCR (ISSR) is a PCR method for DNA
fingerprinting that amplifies regions between simple sequence
repeats to produce a unique fingerprint of amplified fragment
lengths (Zietkiewicz et al. Genomics 20(2): 176-83 (1994)).
[0216] Inverse PCR is commonly used to identify the flanking
sequences around genomic inserts. It involves a series of DNA
digestions and self-ligation, resulting in known sequences at
either end of the unknown sequence (Ochman et al. Genetics 120 (3):
621-623 (1988)).
[0217] Ligation-mediated PCR uses small DNA linkers ligated to the
DNA of interest and multiple primers annealing to the DNA linkers;
it has been used for DNA sequencing, genome walking, and DNA
footprinting (Mueller et al. Science 246(4931): 780-786
(1988)).
[0218] Methylation-specific PCR (MSP) is used to detect methylation
of CpG islands in genomic DNA. DNA is first treated with sodium
bisulfite, which converts unmethylated cytosine bases to uracil,
which is recognized by primers as thymine.
[0219] Miniprimer PCR uses a thermostable polymerase (S-Tbr) and is
used to amplify conserved DNA sequences, such as the 16S (or
eukaryotic 18S) rRNA gene (Isenbarger et al. Applied and
Environmental Microbiology 74(3): 840-9. (2008)).
[0220] Multiplex-PCR targets multiple genes at once, gaining
additional information from a single test-run that otherwise would
require several times the reagents and more time to perform.
[0221] Nested PCR increases the specificity of DNA amplification,
by reducing background due to non-specific amplification of DNA.
Two sets of primers are used in two successive PCRs. In the first
reaction, one pair of primers is used to generate DNA products,
which besides the intended target, may still consist of
non-specifically amplified DNA fragments. The product(s) are then
used in a second PCR with a set of primers whose binding sites are
completely or partially different from and located 3' of each of
the primers used in the first reaction.
[0222] Overlap-extension PCR or splicing by overlap extension (SOE)
is a genetic engineering technique that is used to splice together
two or more DNA fragments that contain complementary sequences. It
is used to join DNA pieces containing genes, regulatory sequences,
or mutations; the technique enables creation of specific and long
DNA constructs.
[0223] Quantitative PCR (Q-PCR) is used to measure the quantity of
a PCR product (commonly in real-time). It quantitatively measures
starting amounts of DNA, cDNA, or RNA. Q-PCR is commonly used to
determine whether a DNA sequence is present in a sample and the
number of its copies in the sample.
[0224] Reverse Transcription PCR (RT-PCR) is used for amplifying
DNA from RNA. Reverse transcriptase reverse transcribes RNA into
cDNA, which is then amplified by PCR. RT-PCR is widely used in
expression profiling, to determine the expression of a gene or to
identify the sequence of an RNA transcript, including transcription
start and termination sites. If the genomic DNA sequence of a gene
is known, RT-PCR can be used to map the location of exons and
introns in the gene. The 5' end of a gene (corresponding to the
transcription start site) is typically identified by RACE-PCR
(Rapid Amplification of cDNA Ends).
[0225] Solid Phase PCR encompasses multiple meanings, including
polony amplification (where PCR colonies are derived in a gel
matrix, for example), bridge PCR (primers are covalently linked to
a solid-support surface), conventional solid phase PCR (where
asymmetric PCR is applied in the presence of solid support bearing
primer with sequence matching one of the aqueous primers) and
enhanced solid phase PCR (where conventional solid phase PCR can be
improved by employing high melting temperature (T.sub.m) and nested
solid support primer with optional application of a thermal `step`
to favor solid support priming).
[0226] Thermal asymmetric interlaced PCR (TAIL-PCR) is used for
isolation of an unknown sequence flanking a known sequence. Within
the known sequence, TAIL-PCR uses a nested pair of primers with
differing annealing temperatures; a degenerate primer is used to
amplify in the other direction from the unknown sequence (Liu et
al. Genomics 25 (3): 674-81. (1995)).
[0227] Touchdown PCR (step-down PCR) is a variant of PCR that aims
to reduce nonspecific background by gradually lowering the
annealing temperature as PCR cycling progresses. The annealing
temperature at the initial cycles is usually a few degrees
(3-5.degree. C.) above the Tm of the primers used, while at the
later cycles, it is a few degrees (3-5.degree. C.) below the primer
Tm. The higher temperatures give greater specificity for primer
binding, and the lower temperatures permit more efficient
amplification from the specific products formed during the initial
cycles.
[0228] The temperature of the reaction solutions may be
sequentially cycled between a denaturing state, an annealing state,
and an extension state for a predetermined number of cycles. The
actual times and temperatures can be enzyme, primer, and target
dependent.
[0229] For any given reaction, denaturing states can range in
certain embodiments from about 75.degree. C. to about 100.degree.
C. The annealing temperature and time can influence the specificity
and efficiency of primer binding to a particular locus within a
target nucleic acid and may be important for particular PCR
reactions.
[0230] For any given reaction, annealing states can range in
certain embodiments from about 20.degree. C. to about 75.degree. C.
In some embodiments, the annealing state can be performed at about
20.degree. C. to about 25.degree. C., about 25.degree. C. to about
30.degree. C., about 30.degree. C. to about 35.degree. C., or about
35.degree. C. to about 40.degree. C., about 40.degree. C. to about
45.degree. C., about 45.degree. C. to about 50.degree. C. In
certain embodiments, the annealing state can be performed at room
temperature (e.g., 20.degree. C. or 25.degree. C.). In some
embodiments, the annealing state can be performed at a temperature
of 20.degree. C., 21.degree. C., 22.degree. C., 23.degree. C.,
24.degree. C., 25.degree. C., 26.degree. C., 27.degree. C.,
28.degree. C., 29.degree. C., 30.degree. C., 31.degree. C.,
32.degree. C., 33.degree. C., 34.degree. C., 35.degree. C.,
36.degree. C., 37.degree. C., 38.degree. C., 39.degree. C.,
40.degree. C., 41.degree. C., 42.degree. C., 43.degree. C.,
44.degree. C., 45.degree. C., 46.degree. C., 47.degree. C.,
48.degree. C., 49.degree. C., or 50.degree. C.
[0231] Extension temperature and time may impact the allele product
yield and are understood to be an inherent property of the enzyme
under study. For a given enzyme, extension states can range in
certain embodiments from about 60.degree. C. to about 75.degree.
C.
[0232] In any of the foregoing embodiments, any DNA or RNA
polymerase (enzyme that catalyzes polymerization of nucleotides
into a nucleic acid strand) may be utilized, including thermostable
polymerases and reverse transcriptases (RTases). Examples include
Bacillus stearothermophilus pol I, Thermus aquaticus (Taq) pol I,
Pyrccoccus furiosus (Pfu), Pyrococcus woesei (Pwo), Thermus flavus
(Tfl), Thermus thermophilus (Tth), Thermus litoris (Tli) and
Thermotoga maritime (Tma). These enzymes, modified versions of
these enzymes, and combination of enzymes, are commercially
available from vendors including Roche, Invitrogen, Qiagen,
Stratagene, and Applied Biosystems. Representative enzymes include
PHUSION.RTM. (New England Biolabs, Ipswich, Mass.), Hot
MasterTaq.TM. (Eppendorf), PHUSION.RTM. Mpx (Finnzymes),
PyroStart.RTM. (Fermentas), KOD (EMD Biosciences), Z-Taq (TAKARA),
and CS3AC/LA (KlenTaq, University City, Mo.).
[0233] Salts and buffers include those familiar to those skilled in
the art, including those comprising MgCl.sub.2, and Tris-HCl and
KCl, respectively. Buffers may contain additives such as
surfactants, dimethyl sulfoxide (DMSO), glycerol, bovine serum
albumin (BSA) and polyethylene glycol (PEG), as well as others
familiar to those skilled in the art. Nucleotides are generally
deoxyribonucleoside triphosphates, such as deoxyadenosine
triphosphate (dATP), deoxycytidine triphosphate (dCTP),
deoxyguanosine triphosphate (dGTP), and deoxythymidine triphosphate
(dTTP), and are also added to a reaction adequate amount for
amplification of the target nucleic acid.
[0234] Also provided herein are methods comprising (1) hybridizing
a complement strand of a primer duplex to a target nucleic acid,
thereby dissociating the complement strand from its protector
strand, and (2) extending the complement strand at its 3' end, in a
target-complementary manner, in the presence of a polymerase.
[0235] Also provided herein are methods comprising performing a
nucleic acid synthesis reaction in the presence of a target nucleic
acid, a polymerase, and one or more of the primer duplexes of any
one of the embodiments described herein.
[0236] A "nucleic acid synthesis reaction" refers to any reaction
in which a nucleic acid is synthesized. Examples include nucleic
acid amplification reactions such as polymerase chain reaction
(PCR) or a variation thereof (described elsewhere herein), a
transcription reaction, a reverse transcription reaction,
sequencing-by-synthesis, or other primer extension reactions (see
also, Lizardi et al. Nat. Genet. 19: 225-32 (1998), incorporated by
reference).
[0237] In some instances, a method is provided that comprises (1)
synthesizing a complement strand having a target-non-specific
balance region, a target-specific branch migration region, and a
target-specific toehold region; (2) synthesizing a protector strand
having a balance region complementary to the complement strand and
a branch migration region complementary to the complement strand;
and (3) hybridizing the complement strand to the protector strand
to form a primer duplex.
[0238] In some instances, a method is provided that comprises (1)
providing a complement strand having a target-non-specific balance
region, a target-specific branch migration region, and a
target-specific toehold region; (2) providing a protector strand
having a balance region complementary to the complement strand and
a branch migration region complementary to the complement strand;
and (3) combining the complement strand to the protector strand to
form a primer duplex.
[0239] In some instances, a method is provided that comprises (1)
providing a plurality of nucleic acid molecules comprising a target
nucleic acid; (2) providing at least one primer duplex having (i) a
balance region, (ii) a branch migration region complementary to the
target nucleic acid, and (iii) a toehold region; and (3) combining
in a single reaction the plurality of target nucleic acids, at
least one primer duplex, and a polymerase under conditions suitable
for nucleic acid hybridization.
[0240] Also provided herein are methods of amplifying at least one
target nucleic acid of interest, comprising (1) providing a
plurality of nucleic acid molecules comprising at least one target
nucleic acid, (2) providing at least one primer duplex having (i) a
balance region, (ii) a branch migration region, and (iii) a toehold
region; and (3) combining in a single reaction the plurality of
target nucleic acid molecules, at least one primer duplex, and a
polymerase under conditions suitable for amplification of the at
least one target nucleic acid. In certain embodiments multiple
unique target nucleic acids are amplified in a single reaction or
in multiple reactions, for example, in one or more multiplexed PCR
amplification reaction. In some embodiments, about 10 to 100, about
100 to about 1000, about 1000 to about 10,000, or about 10,000 to
about 100,000 nucleic acid targets are amplified. The number of
different primer duplexes in a reaction will depend on the number
of desired targets.
[0241] In some embodiments, provided herein are methods of
discriminating against spurious nucleic acid molecules having one
or more nucleotide changes relative to a target nucleic acid
molecule, comprising (1) providing a plurality of nucleic acid
molecules comprising at least one target nucleic acid, (2)
providing at least one primer duplex having (i) a balance region,
(ii) a branch migration region, and (iii) a toehold region; and (3)
combining in a single reaction the plurality of target nucleic acid
molecules, at least one primer duplex, and a polymerase under
conditions suitable for amplification of the at least one target
nucleic acid molecule.
[0242] Any one of the methods described herein may further comprise
providing or combining in a single reaction one or more of the
following reagents: buffer (e.g., KCl, MgCl.sub.2, Tris-HCl), dNTPs
(e.g., dATP, dCTP, dGTP, dTTP at concentrations of, e.g., about 50
to about 100 .mu.M), polymerase (e.g., at concentrations of about
0.5-2.0 units per 50 .mu.l reaction), and/or water. The
concentration of each strand of a primer duplex in a single
reaction varies depending on, for example, the concentration of
target nucleic acid. In some embodiments, about 5 to about 50 pg of
plasmid or viral target may be used, or about 50 ng to about 500 ng
of genomic target may be used. In such instances, the concentration
each primer (the first strand and the second strand) may be, for
example, about 0.05 .mu.M to about 1 .mu.M. In particular
embodiments, the concentration of each primer is about 1 nM to
about 1 .mu.M.
[0243] In any one of the embodiments described herein, a single
reaction may be subject to cyclic temperature changes such that a
dsDNA structure undergoes multiple rounds of denaturation,
subsequent primer annealing, and polymerase-based extension, for
example, similar to those conditions used for standard PCR. In some
embodiments, the temperature range for a denaturation step is about
90 to about 95.degree. C. In certain embodiments, an initial
denaturation step of about 1 to about 5 minutes is required prior
to cycling; the exact amount of time may depend on GC content of
the nucleic acid target of interest. In certain embodiments, the
denaturation step during a cycling reaction is about 15 to about 30
seconds. In some embodiments, the temperature range for an
annealing step is about 50.degree. C. to about 60.degree. C. In
some embodiments, the annealing step is about 20.degree. C. to
about 40.degree. C. in particular embodiments, the annealing step
is at room temperature (about 20.degree. C. or about 25.degree.
C.). In certain embodiments, the annealing step during a cycling
reaction is about 15 to about 30 seconds. In some embodiments, the
temperature range for an extension step is about 70.degree. C. to
about 75.degree. C. In certain embodiments, the extension step
during a cycling reaction is about 45 to about 60 seconds. The
temperature, time of each step, and number of cycles of a cycling
reaction may depend on the length of the nucleic acid target(s) of
interest as well as the polymerase being used. Longer target may
require, for example, longer extension times. One example of
cycling conditions for a 500 nucleotide target is set forth in
Table 2.
TABLE-US-00002 TABLE 2 1 cycle 98.degree. C. 2 minutes 25 cycles
98.degree. C. 15 seconds 30.degree. C., 35.degree. C., 40.degree.
C., 45.degree. C., 15 seconds 50.degree. C., 55.degree. C., or
60.degree. C. 72.degree. C. 45 seconds 1 cycle 72.degree. C. 5
minutes 1 cycle 4.degree. C. indefinite
[0244] In any one of the embodiments described herein, a single
reaction (e.g., nucleic acid amplification) may proceed at room
temperature (e.g., about 20.degree. C. or about 25.degree. C.). In
certain embodiments, a single reaction proceeds at room temperature
for about 1 hour.
[0245] In any one of the methods described herein, the second
protector strand of a primer duplex may be provided in excess of
the first complementary strand or in excess of the annealed primer
duplexes. For example, in some embodiments, the second strand is
provided at a concentration about 1.times. to about 10.times.
(e.g., 1.times., 2.times., 3.times., 4.times., 5.times., 6.times.,
7.times., 8.times., 9.times., or 10.times.) the concentration the
first strand, or about 1.times. to about 10.times. (e.g., 1.times.,
2.times., 3.times., 4.times., 5.times., 6.times., 7.times.,
8.times., 9.times., or 10.times.) the concentration of the annealed
primer duplex. In some embodiments, the first strand is provided at
a concentration of about 0.05 .mu.M to about 1 .mu.M, while the
second strand is provided at a concentration of about 0.10 .mu.M to
about 2 .mu.M, or about 0.15 .mu.M to about 3 .mu.M, about 0.2
.mu.M to about 4 .mu.M, or about 0.25 .mu.M to about 5 .mu.M.
[0246] Any one of the methods described herein may comprise a
method selected from: allele-specific PCR, assembly PCR, asymmetric
PCR, helicase-dependent amplification, intersequence-specific PCR
(ISSR), inverse PCR, ligation-mediated PCR, methylation-specific
PCR (MSP), miniprimer PCR, multiplex PCR, nested PCR,
overlap-extension PCR, quantitative PCR (Q-PCR), reverse
transcription PCR (RT-PCR), solid phase PCR, thermal asymmetric
interlaced PCR (TAIL-PCR), and touchdown PCR.
[0247] In any one of the methods described herein, the yield of
amplified nucleic acid target may be about 30% to about 100%. In
some embodiments, the yield is at least 30%, at least 35%, at least
40%, at least 45%, at least 50%, at least 55%, at least 60%, at
least 65%, at least 70%, at least 75%, at least 80%, at least 85%,
at least 90%, at least 95%, at least 99%, or at least 100%.
[0248] In any one of the methods described herein, the amplified
nucleic acid product may be purified. Nucleic acid purification
methods are well-known to those of skill in the art and include,
phenol extraction, guanidinium isothiocyanate, alcohol
precipitation, DEAE (ion exchange), size exclusion chromatography
(SEC), cesium chloride, extraction from agarose, silica, and other
column-based purification methods.
[0249] In any one of the methods described herein, a purified
amplified target nucleic acid may be about 30% to about 100% pure.
In some embodiments, the purity is at least 30%, at least 35%, at
least 40%, at least 45%, at least 50%, at least 55%, at least 60%,
at least 65%, at least 70%, at least 75%, at least 80%, at least
85%, at least 90%, at least 95%, at least 99%, or 100% pure.
[0250] Imaging
[0251] The primer duplexes and systems described herein can also be
used to improve the specificity of in situ imaging assays.
Nonspecific interactions between biological RNAs and
fluorophore-labeled primers are frequently a source of background
noise. Thus, as depicted in FIG. 14, the use fluorophore-labeled
nucleic acid primer systems described herein in the place of
conventional primers, in some embodiments, greatly improves the
performance of existing in situ imaging techniques. Notably, by
labeling the complement strand or domain with a fluorophore and the
protector strand or domain with a quencher, the primer duplex
system will only produce a detectable signal when it is bound to
the target.
[0252] Single Nucleotide Polymorphism (SNP) Detection
[0253] The accurate detection of the location and identity of
single nucleotide polymorphisms (SNPs) is of great interest for
both research and therapeutic purposes. The kinetic discrimination
methods described herein are therefore useful for the convenient
identification SNPs.
Kits
[0254] Provided herein are kits comprising (1) at least one
complement strand having a balance region, a branch migration
region, and a toehold region, and (2) at least one protector strand
having a balance region and a branch migration region.
[0255] Provided herein are kits comprising at least one primer
duplex comprising (1) at least one complement strand or region
having a balance region, a branch migration region, and a toehold
region, and (2) at least one protector strand or region having a
balance region and a branch migration region.
[0256] Any one of the kits described herein may further comprise a
polymerase. Any one of the kits provided herein may further
comprise one or more agent selected from buffer (e.g., KCl,
MgCl.sub.2, Tris-HCl), dNTPs (e.g., dATP, dCTP, dGTP, dTTP), and
water. Any one of the kits provided herein may comprise protector
strand is molar excess of the primer. Any one of the kits provided
herein may further comprise instructions or directions for
obtaining instructions (e.g., from a website) for using the
components of the kits. Any one of the kits provided herein may
further comprise at least one reaction tube, well, chamber, or the
like.
[0257] Any one of the primers or primer systems described herein
may be provided in the form of a kit or comprised within a kit.
EXAMPLES
[0258] In accordance with the invention, the above limitations of
PCR, transcription, and reverse transcription can be overcome
through the use of highly specific primer duplexes. The experiments
described herein demonstrate that primer duplexes can reliably
discriminate against targets with single-base changes (FIG. 16) for
both DNA and RNA targets and primers (FIG. 17). The correct target
hybridizes to the 7/5 primers with roughly 50% yield, but even a
large excess (200.times.) of targets with a single-base change is
insufficient to significantly hybridize. Primer duplexes were
designed and tested for multiple different targets, and each primer
duplex achieved high discrimination factors versus
single-nucleotide changes (FIG. 17). Quantitatively, the median
discrimination in hybridization yield to a spurious target with a
single-nucleotide change is 26.
[0259] The primer duplexes were used for PCR in a
proof-of-principle demonstration (FIGS. 18A and 18B). A
semi-repetitive target nucleic acid was designed, which is
difficult to amplify by traditional PCR (PCR without the use of the
instant primer duplexes). The yield of standard 21 nucleotide
primers and the primer duplexes were calculated. Many different
thermal cycling schedules were determined in order to investigate
the range of function. Based on the length and nucleotide content
of the primer duplexes, standard PCR condition would predict that
the annealing temperature of the primers would 55.degree. C.
Surprisingly, as an example, even under conditions most unfavorable
for primer duplex annealing (35.degree. C. and 40.degree. C.), the
fraction (50.2%) of correct-length product amplified using the
primer duplexes was higher than the fraction (31.0%) of
correct-length product amplified using standard primers under their
most favorable PCR conditions (45.degree. C.). Furthermore, in this
particular experiment, the primer duplexes were arbitrarily
designed (7 nucleotide toehold region and 5 nucleotide balance
region), and were not optimized for PCR yield performance. Thus, it
is likely that even higher PCR specificity can be achieved through
optimization of the instant primer duplexes.
[0260] FIG. 15 shows highly specific PCR using the primer duplexes
provided herein. In FIG. 5A, the primer "PC" is comprised of a
complement strand "C" and a protector strand "P". When PC binds to
the intended target at the correct position "X", the
single-stranded protector oligonucleotide "P" is released as an
inert waste product, and the primed target is elongated by the DNA
polymerase. In FIG. 5B, when the primer PC binds to an unintended
target or to the correct target at an incorrect position (in either
case, denoted "Y"), the displacement of the protector from the
complementary strand "C" is thermodynamically unfavorable, and
kinetically quick to reverse. Consequently, off-target
amplification (e.g., amplification of Y rather than X) is expected
to be significantly reduced.
[0261] FIG. 16. shows an experimental demonstration of primer
hybridization with single nucleotide discrimination. In FIG. 16A,
short synthetic DNA target "X" or spurious target "Y" is reacted
with the primer. (The poly-T tail on the protector strand "P"
serves to distinguish products from reactants on a gel.) Shown in
red boxes are the positions of single-base changes for spurious
target Y. FIG. 16B shows native polyacrylamide gel results. The
primer "PC" was prepared at a 2:1 ratio of protector P to
complement C, and annealed at 1 .mu.M concentration of PC. Either
the correct or spurious targets were added to achieve final
concentrations of 200 nM target (X or Y), 100 nM PC, and 100 nM P.
In some embodiments, a reaction may have an excess of the protector
(P) primer. For example, in some embodiments, the protector strand
is provided at a concentration of about 1.times. to about 10.times.
(e.g., 1.times., 2.times., 3.times., 4.times., 5.times., 6.times.,
7.times., 8.times., 9.times., or 10.times.) of the complement
strand. All reactions proceeded at room temperature (25.degree. C.)
for 1 hour. As an example, the designation "7/4" denotes a primer
that possesses 7 nucleotides of single-stranded nucleotides (as a
3' overhang) to initiate hybridization to the target, and the
protector spontaneously dissociates 4 nucleotides to be released.
FIG. 16C is a plot of hybridization yields inferred from data shown
in FIG. 16B. Shown as plot "X" is the hybridization of the primer
to the correct target X, while the remaining "dotted" plots show
the hybridization to the spurious targets Y. The 7/4, 7/5 and 7/6
primers all discriminate in their hybridization yields (.chi.)
between the correct and the spurious targets. The 7/0 target does
not. In FIG. 16D, the discrimination factor (Q) is a quantitative
measurement of the specificity of the primer, and is calculated as
the hybridization yield (.chi.) of the correct target divided by
the hybridization yield (.chi.) of the spurious target. In 16E,
there is little hybridization of the 7/5 primer to a spurious
target Y even when such target is present in large excess (i.e.,
200-fold).
[0262] FIG. 17. shows additional experimental results and
statistics on the single-base discrimination abilities of primer
duplexes. FIG. 17A shows that four additional targets and sets of
primers were constructed and tested: two based on naturally
occurring microRNA sequences, and two designed to intentionally
possess significant secondary structure. FIG. 17B shows a histogram
of the discrimination factors (Q) achieved by the 7/5 primers for
each target. Due to limitations of the gel scanner, it was not
possible to reliably measure discrimination factors above 100, and
these were all grouped as "100+." FIG. 17C show RNA target and
primer. The target sequence is a synthetic RNA oligonucleotide with
sequence identical to the human let7g microRNA. FIG. 17D shows
native PAGE results. The PC primer was prepared at a 2:1 ratio of
protector P to complement C, and annealed at 3 .mu.M concentration.
Either the correct or spurious targets were added to achieve final
concentrations of 2 .mu.M X or Y, 1 .mu.M PC, and 1 .mu.M P. The
correct target successfully binds to the primer; the hybridization
yield of targets with single-nucleotide mismatches is low.
[0263] FIG. 18 shows experimental results using duplex primers to
improve the PCR yield of a quasi-repetitive target. FIG. 18A shows
a quasi-repetitive PCR target (168 nt) that traditional PCR primers
struggle to amplify with high yield. Here, a* is the correct target
for X1. The remaining sites labeled a*m1 (which is X1-m17G), a*m2
(which is X1-m9T), and a*m3 (which is X1-m11G) are not the correct
targets. Similarly, b* is the correct target for X2, and b*m1
(which is X2-m3T), b*m2 (which is X2-m11C), and b*m3 (which is
X2-m18T) are not the correct targets. Thus, the outer-most binding
sites are the perfect binding sites for the primers, but there are
also 3 additional single-base mismatch primer binding sites between
the perfect sites. The primer duplexes bind by 7 nucleotides to the
target, and the protector must spontaneously dissociate 5
nucleotides to be released. The primer duplex was designed so that
its 3' end cannot be extended by the polymerase. The toehold region
of the complement strand was designed at the 3' end, instead of the
5' end as in previous designs. In FIG. 18B, primer duplexes show
significantly higher yield of correct length product, as compared
to standard primers. Each lane is labeled with the primers used as
well as the temperature cycling schedule (e.g., "98-40-72"
indicates denaturation at 98.degree. C., annealing at 40.degree.
C., and elongation at 72.degree. C.). The left-most lane shows the
synthetic oligonucleotide reference. The lower numbers labeled "%
Correct" indicate the relative intensity of band corresponding to
the correct length product compared to the integrated intensity of
all bands in the lane. The primer duplex PCR product appears as 10
nucleotides longer than the reference and the standard PCR product
because of the 5 nucleotide of overhangs (toehold region) on each
primer.
EQUIVALENTS
[0264] Those skilled in the art will recognize or be able to
ascertain using no more than routine experimentation many
equivalents to the specific embodiments described herein.
[0265] The articles "a" and "an" are used herein to refer to one or
to more than one (i.e., to at least one) of the grammatical object
of the article. By way of example, "an element" means one element
or more than one element.
[0266] Claims or descriptions that include "or" between one or more
members of a group are considered satisfied if one, more than one,
or all of the group members are present in, employed in, or
otherwise relevant to a given product or process unless indicated
to the contrary or otherwise evident from the context. The
invention includes embodiments in which exactly one member of the
group is present in, employed in, or otherwise relevant to a given
product or process. The invention includes embodiments in which
more than one, or all of the group members are present in, employed
in, or otherwise relevant to a given product or process.
Furthermore, it is to be understood that the invention encompasses
all variations, combinations, and permutations in which one or more
limitations, elements, clauses, descriptive terms, etc., from one
or more of the listed claims is introduced into another claim. For
example, any claim that is dependent on another claim can be
modified to include one or more limitations found in any other
claim that is dependent on the same base claim.
[0267] Where elements are presented as lists, e.g., in Markush
group format, it is to be understood that each subgroup of the
elements is also disclosed, and any element(s) can be removed from
the group. It should it be understood that, in general, where the
invention, or aspects of the invention, is/are referred to as
comprising particular elements, features, certain embodiments of
the invention or aspects of the invention consist, or consist
essentially of, such elements and/or features. For purposes of
simplicity those embodiments have not been specifically set forth
in haec verba herein. It is also noted that the term "comprising"
is intended to be open and permits the inclusion of additional
elements or steps.
[0268] Where ranges are given, endpoints are included. Furthermore,
it is to be understood that unless otherwise indicated or otherwise
evident from the context and understanding of one of ordinary skill
in the art, values that are expressed as ranges can assume any
specific value or sub-range within the stated ranges in different
embodiments of the invention, to the tenth of the unit of the lower
limit of the range, unless the context clearly dictates
otherwise.
[0269] As used herein, the term "about" generally may refer to any
value within a range of 10% of the recited value. In some instance,
however, "about" may encompasses a range of 20% of the recited
value.
[0270] In addition, it is to be understood that any particular
embodiment of the present invention that falls within the prior art
may be explicitly excluded from any one or more of the claims.
Because such embodiments are deemed to be known to one of ordinary
skill in the art, they may be excluded even if the exclusion is not
set forth explicitly herein. Any particular embodiment of the
methods of the invention can be excluded from any one or more
claims, for any reason, whether or not related to the existence of
prior art. This invention is not limited in its application to the
details of construction and the arrangement of components set forth
in the following description or illustrated in the drawings. The
invention is capable of other embodiments and of being practiced or
of being carried out in various ways. Also, the phraseology and
terminology used herein is for the purpose of description and
should not be regarded as limiting. The use of "including,"
"comprising," or "having," "containing," "involving," and
variations thereof herein, is meant to encompass the items listed
thereafter and equivalents thereof as well as additional items.
[0271] Each of the foregoing patents, patent applications and
references is hereby incorporated by reference, particularly for
the teaching referenced herein.
REFERENCES
[0272] [1] Petersen, M. & Wengel, J. LNA: a versatile tool for
therapeutics and genomics. Trends Biotechnol. 21, 74-81, (2003).
[0273] [2] Krueger, A. T. & Kool, E. T. Redesigning the
Architecture of the Base Pair: Toward Biochemical and Biological
Function of New Genetic Sets. Chem Biol. 16, 242-248 (2009). [0274]
[3] Lizardi, P. M. et al. Mutation detection and single-molecule
counting using isothermal rolling-circle amplification. Nat. Genet.
19, 225-232 (1998). [0275] [4] Saiki, R. K., Gelfand, D. H.,
Stoffel, S., Scharf, S. J., Higuchi, R., Horn, G. T., Mullis, K. B.
& Erlich, H. A. Primer-directed enzymatic amplification of DNA
with a thermostable DNA polymerase. Science 239, 487-491 (1988).
[0276] [5] Zhang, D. Y., Chen, X. & Yin, P. Optimizing Nucleic
Acid Hybridization Specificity. submitted (2011).
Sequence CWU 1
1
149129DNAArtificial SequenceSynthetic Oligonucleotide 1attcagacat
tcaataccct acgtctcca 29222DNAArtificial SequenceSynthetic
Oligonucleotide 2tcgtcaccat tcaataccct ac 22329DNAArtificial
SequenceSynthetic Oligonucleotide 3agcagtggta agttatggga tgcagaggt
29422DNAArtificial SequenceSynthetic Oligonucleotide 4cattcaatac
cctactagat gt 22529DNAArtificial SequenceSynthetic Oligonucleotide
5taagtcggta agttatggga tgatctaca 29649DNAArtificial
SequenceSynthetic Oligonucleotide 6tggagacgta gggtattgaa tgaggttttt
tcctcattca ataccctac 49749DNAArtificial SequenceSynthetic
Oligonucleotide 7cattcaatac cctactagtt ttttctagta gggtattgaa
tgtctgaat 49828DNAArtificial SequenceSynthetic Oligonucleotide
8ttacattcaa taccctacgt ctccatga 28961DNAArtificial
SequenceSynthetic Oligonucleotide 9cgtctccaac cttttttggt tggagacgta
gggtattgaa tgaggttttt tcctcattca 60a 611011DNAArtificial
SequenceSynthetic Oligonucleotide 10cgtctccaac c
111144DNAArtificial SequenceSynthetic Oligonucleotide 11ggttggagac
gtagggtatt gaatgaggtt ttttcctcat tcaa 441210DNAArtificial
SequenceSynthetic Oligonucleotide 12cctcattcaa 101346DNAArtificial
SequenceSynthetic Oligonucleotide 13cgtctccaac cttttttggt
tggaagacgt agggtattga atgagg 461428DNAArtificial SequenceSynthetic
Oligonucleotide 14ggagtaagtt atgggatgca gaggttgg
281549DNAArtificial SequenceSynthetic Oligonucleotide 15cgtctccatt
tttttggaga cgtagggtat tgaatgtttt ttcattcaa 491622DNAArtificial
SequenceSynthetic Oligonucleotide 16cattcaatac cctacgtctc ca
221722DNAArtificial SequenceSynthetic Oligonucleotide 17cattcaatac
cctacgtatc ca 221822DNAArtificial SequenceSynthetic Oligonucleotide
18tcgacaccat tcaataccct ac 221929DNAArtificial SequenceSynthetic
Oligonucleotide 19agctgtggta agttatggga tgcagaggt
292036DNAArtificial SequenceSynthetic Oligonucleotide 20gatcgacact
cattcaatac cctacgtctc acgatc 362122DNAArtificial SequenceSynthetic
Oligonucleotide 21ttgcatccac tcattcaata cc 222229DNAArtificial
SequenceSynthetic Oligonucleotide 22aacgtaggtg agtaagttat gggatgcag
292335DNAArtificial SequenceSynthetic Oligonucleotide 23aacgtaggtg
agtaagttat gggatgcaga gtgct 352436DNAArtificial SequenceSynthetic
Oligonucleotide 24gatcgacact cattcattac cctacgtcaa acttga
362522DNAArtificial SequenceSynthetic Oligonucleotide 25cactcattca
ataccctacg tc 222622DNAArtificial SequenceSynthetic Oligonucleotide
26tgaggtagta gtttgtacag tt 222722DNAArtificial SequenceSynthetic
Oligonucleotide 27ttaatgctaa tcgtgatagg gt 222822DNAArtificial
SequenceSynthetic Oligonucleotide 28cacttgatac aagcttacca tc
222922DNAArtificial SequenceSynthetic Oligonucleotide 29caagaacaga
tgtaccatca ca 223021RNAArtificial SequenceSynthetic Oligonucleotide
30ugagguagua guuugacagu u 213123RNAArtificial SequenceSynthetic
Oligonucleotide 31uuuaugauug agguaguagu uug 233227RNAArtificial
SequenceSynthetic Oligonucleotide 32uacuaacucc aucaucaaac augucaa
273321DNAArtificial SequenceSynthetic Oligonucleotide 33tgcatccact
cattcaatac c 213420DNAArtificial SequenceSynthetic Oligonucleotide
34gcatccactc attcaatacc 203519DNAArtificial SequenceSynthetic
Oligonucleotide 35catccactca ttcaatacc 193615DNAArtificial
SequenceSynthetic Oligonucleotide 36cactcattca atacc
153720DNAArtificial SequenceSynthetic Oligonucleotide 37atgattgagg
tagtagtttg 203819DNAArtificial SequenceSynthetic Oligonucleotide
38tgattgaggt agtagtttg 193920DNAArtificial SequenceSynthetic
Oligonucleotide 39aggatttaat gctaatcgtg 204019DNAArtificial
SequenceSynthetic Oligonucleotide 40ggatttaatg ctaatcgtg
194120DNAArtificial SequenceSynthetic Oligonucleotide 41ctcatcactt
gatacaagct 204219DNAArtificial SequenceSynthetic Oligonucleotide
42tcatcacttg atacaagct 194320DNAArtificial SequenceSynthetic
Oligonucleotide 43cgttccaaga acagatgtac 204419DNAArtificial
SequenceSynthetic Oligonucleotide 44gttccaagaa cagatgtac
194522DNAArtificial SequenceSynthetic Oligonucleotide 45gactcattca
ataccctacg tc 224622DNAArtificial SequenceSynthetic Oligonucleotide
46cactctttca ataccctacg tc 224722DNAArtificial SequenceSynthetic
Oligonucleotide 47cactcattca ttaccctacg tc 224822DNAArtificial
SequenceSynthetic Oligonucleotide 48cactcattca ctaccctacg tc
224922DNAArtificial SequenceSynthetic Oligonucleotide 49cactcattca
gtaccctacg tc 225021DNAArtificial SequenceSynthetic Oligonucleotide
50cactcattca taccctacgt c 215123DNAArtificial SequenceSynthetic
Oligonucleotide 51cactcattca aataccctac gtc 235223DNAArtificial
SequenceSynthetic Oligonucleotide 52cactcattca tataccctac gtc
235323DNAArtificial SequenceSynthetic Oligonucleotide 53cactcattca
cataccctac gtc 235423DNAArtificial SequenceSynthetic
Oligonucleotide 54cactcattca gataccctac gtc 235522DNAArtificial
SequenceSynthetic Oligonucleotide 55cactcattca ataccgtacg tc
225622DNAArtificial SequenceSynthetic Oligonucleotide 56tcaggtagta
gtttgtacag tt 225722DNAArtificial SequenceSynthetic Oligonucleotide
57tgaggttgta gtttgtacag tt 225822DNAArtificial SequenceSynthetic
Oligonucleotide 58tgaggtagta gattgtacag tt 225922DNAArtificial
SequenceSynthetic Oligonucleotide 59tgaggtagta gtttgttcag tt
226022DNAArtificial SequenceSynthetic Oligonucleotide 60tgaggtagta
gtttgtccag tt 226122DNAArtificial SequenceSynthetic Oligonucleotide
61tgaggtagta gtttgtgcag tt 226221DNAArtificial SequenceSynthetic
Oligonucleotide 62tgaggtagta gtttgtcagt t 216323DNAArtificial
SequenceSynthetic Oligonucleotide 63tgaggtagta gtttgtaaca gtt
236423DNAArtificial SequenceSynthetic Oligonucleotide 64tgaggtagta
gtttgttaca gtt 236523DNAArtificial SequenceSynthetic
Oligonucleotide 65tgaggtagta gtttgtgaca gtt 236623DNAArtificial
SequenceSynthetic Oligonucleotide 66tgaggtagta gtttgtcaca gtt
236722DNAArtificial SequenceSynthetic Oligonucleotide 67tttatgctaa
tcgtgatagg gt 226822DNAArtificial SequenceSynthetic Oligonucleotide
68ttaatgcaaa tcgtgatagg gt 226922DNAArtificial SequenceSynthetic
Oligonucleotide 69ttaatgcgaa tcgtgatagg gt 227022DNAArtificial
SequenceSynthetic Oligonucleotide 70ttaatgccaa tcgtgatagg gt
227121DNAArtificial SequenceSynthetic Oligonucleotide 71ttaatgcaat
cgtgataggg t 217223DNAArtificial SequenceSynthetic Oligonucleotide
72ttaatgcata atcgtgatag ggt 237323DNAArtificial SequenceSynthetic
Oligonucleotide 73ttaatgctta atcgtgatag ggt 237423DNAArtificial
SequenceSynthetic Oligonucleotide 74ttaatgcgta atcgtgatag ggt
237523DNAArtificial SequenceSynthetic Oligonucleotide 75ttaatgccta
atcgtgatag ggt 237622DNAArtificial SequenceSynthetic
Oligonucleotide 76ttaatgctaa tcctgatagg gt 227722DNAArtificial
SequenceSynthetic Oligonucleotide 77ttaatgctaa tcgtgattgg gt
227822DNAArtificial SequenceSynthetic Oligonucleotide 78cacatgatac
aagcttacca tc 227922DNAArtificial SequenceSynthetic Oligonucleotide
79cacttgattc aagcttacca tc 228022DNAArtificial SequenceSynthetic
Oligonucleotide 80cacttgatac aaggttacca tc 228122DNAArtificial
SequenceSynthetic Oligonucleotide 81cacttgatac aagcttacga tc
228222DNAArtificial SequenceSynthetic Oligonucleotide 82cacttgatac
aagcttacaa tc 228322DNAArtificial SequenceSynthetic Oligonucleotide
83cacttgatac aagcttacta tc 228421DNAArtificial SequenceSynthetic
Oligonucleotide 84cacttgatac aagcttacat c 218523DNAArtificial
SequenceSynthetic Oligonucleotide 85cacttgatac aagcttaccc atc
238623DNAArtificial SequenceSynthetic Oligonucleotide 86cacttgatac
aagcttacgc atc 238723DNAArtificial SequenceSynthetic
Oligonucleotide 87cacttgatac aagcttacac atc 238823DNAArtificial
SequenceSynthetic Oligonucleotide 88cacttgatac aagcttactc atc
238922DNAArtificial SequenceSynthetic Oligonucleotide 89catgaacaga
tgtaccatca ca 229022DNAArtificial SequenceSynthetic Oligonucleotide
90caagaactga tgtaccatca ca 229122DNAArtificial SequenceSynthetic
Oligonucleotide 91caagaacaga tgaaccatca ca 229222DNAArtificial
SequenceSynthetic Oligonucleotide 92caagaacaga tgcaccatca ca
229322DNAArtificial SequenceSynthetic Oligonucleotide 93caagaacaga
tggaccatca ca 229421DNAArtificial SequenceSynthetic Oligonucleotide
94caagaacaga tgaccatcac a 219523DNAArtificial SequenceSynthetic
Oligonucleotide 95caagaacaga tgataccatc aca 239623DNAArtificial
SequenceSynthetic Oligonucleotide 96caagaacaga tgttaccatc aca
239723DNAArtificial SequenceSynthetic Oligonucleotide 97caagaacaga
tgctaccatc aca 239823DNAArtificial SequenceSynthetic
Oligonucleotide 98caagaacaga tggtaccatc aca 239922DNAArtificial
SequenceSynthetic Oligonucleotide 99caagaacaga tgtaccaaca ca
2210026DNAArtificial SequenceSynthetic Oligonucleotide
100cactcattca ataccctacg tctttt 2610122DNAArtificial
SequenceSynthetic Oligonucleotide 101cactcattca ataccctacg tc
2210223DNAArtificial SequenceSynthetic Oligonucleotide
102tttatgattg aggtagtagt ttg 2310327DNAArtificial SequenceSynthetic
Oligonucleotide 103tactaactcc atcatcaaac atgtcaa
2710423DNAArtificial SequenceSynthetic Oligonucleotide
104tttaggattt aatgctaatc gtg 2310527DNAArtificial SequenceSynthetic
Oligonucleotide 105tcctaaatta cgattagcac tatccca
2710623DNAArtificial SequenceSynthetic Oligonucleotide
106tttcgtccca agaacagatg tac 2310727DNAArtificial SequenceSynthetic
Oligonucleotide 107gcagggttct tgtctacatg gtagtgt
2710824DNAArtificial SequenceSynthetic Oligonucleotide
108ttttgcatcc actcattcaa tacc 2410928DNAArtificial
SequenceSynthetic Oligonucleotide 109acgtaggtga gtaagttatg ggatgcag
2811023DNAArtificial SequenceSynthetic Oligonucleotide
110tttgcatcca ctcattcaat acc 2311127DNAArtificial SequenceSynthetic
Oligonucleotide 111cgtaggtgag taagttatgg gatgcag
2711236DNAArtificial SequenceSynthetic Oligonucleotide
112ggacgttgat atgggacgta gggtattgaa tgagtg 3611361DNAArtificial
SequenceSynthetic Oligonucleotide 113gcttaagcct tcactattca
atgtatgaga ataaaccctc ctctgttaga ggctaagtac 60t
6111460DNAArtificial SequenceSynthetic Oligonucleotide
114cgtcgctcgg tttgcttctg catggagggt gaggactttt atacacaggg
aagcgagctc 6011536DNAArtificial SequenceSynthetic Oligonucleotide
115tgaggtagta gtttgtacag ttgagaagtt aggttg 3611636DNAArtificial
SequenceSynthetic Oligonucleotide 116caacctaact tctcaactgt
acaaactact acctca 3611760DNAArtificial SequenceSynthetic
Oligonucleotide 117gagctcgctt ccctgtgtat aaaagtcctc accctccatg
cagaagcaaa ccgagcgacg 6011861DNAArtificial SequenceSynthetic
Oligonucleotide 118agtacttagc ctctaacaga ggagggttta ttctcataca
ttgaatagtg aaggcttaag 60c 6111936DNAArtificial SequenceSynthetic
Oligonucleotide 119cactcattca ataccctacg tcccatatca acgtcc
3612036DNAArtificial SequenceSynthetic Oligonucleotide
120ggacgttgat atgggacgtc gggtattgaa tgagtg 3612135DNAArtificial
SequenceSynthetic Oligonucleotide 121tgacctaatg ctaactcgtg
tcgctctgcg gcttc 3512235DNAArtificial SequenceSynthetic
Oligonucleotide 122tctacttgac gtatacggtc tagctgccac ggagg
3512336DNAArtificial SequenceSynthetic Oligonucleotide
123tgtggtagta gtttgtacag ttgagaagtt aggttg 3612436DNAArtificial
SequenceSynthetic Oligonucleotide 124caacctaact tctcaactgt
acaaactact accaca 3612535DNAArtificial SequenceSynthetic
Oligonucleotide 125cctccgtggc agctagaccg tatacgtcaa gtaga
3512635DNAArtificial SequenceSynthetic Oligonucleotide
126gaagccgcag agcgacacga gttagcatta ggtca 3512736DNAArtificial
SequenceSynthetic Oligonucleotide 127cactcattca atacccgacg
tcccatatca acgtcc 3612822DNAArtificial SequenceSynthetic
Oligonucleotide 128gacgtagggt attgaatgag tg 2212922DNAArtificial
SequenceSynthetic Oligonucleotide 129aactgtacaa actactacct ca
2213027DNAArtificial SequenceSynthetic Oligonucleotide
130ggtcagacgt agggtattga atgagtg 2713120DNAArtificial
SequenceSynthetic Oligonucleotide 131tcaataccct acgtctgacc
2013220DNAArtificial SequenceSynthetic Oligonucleotide
132tcaataccct acgtctgacc 2013320DNAArtificial SequenceSynthetic
Oligonucleotide 133gtagtttgta cagttagtcg 2013449DNAArtificial
SequenceSynthetic Oligonucleotide 134gagtggacgt agggtattga
atgagtggac gtcgggtatt gaatgagtg 4913544DNAArtificial
SequenceSynthetic Oligonucleotide 135gacgtagggt attaaatgag
tggacgtagg gtactgaatg agtg 4413644DNAArtificial SequenceSynthetic
Oligonucleotide 136tgaggtagta gtttgtatag tttgaggtag tactttgtac agtt
4413749DNAArtificial SequenceSynthetic Oligonucleotide
137tgtggtagta gtttgtacag tttgaggtag tagtttgtac agtttgagg
4913849DNAArtificial SequenceSynthetic Oligonucleotide
138cctcaaactg tacaaactac tacctcaaac tgtacaaact actaccaca
4913944DNAArtificial SequenceSynthetic Oligonucleotide
139aactgtacaa agtactacct caaactatac aaactactac ctca
4414044DNAArtificial SequenceSynthetic Oligonucleotide
140cactcattca gtaccctacg tccactcatt taatacccta cgtc
4414149DNAArtificial SequenceSynthetic Oligonucleotide
141cactcattca atacccgacg tccactcatt caatacccta cgtccactc
4914249DNAArtificial SequenceSynthetic Oligonucleotide
142gagtggacgt cgggtattga atgagtggac gtagggtatt gaatgagtg
4914344DNAArtificial SequenceSynthetic Oligonucleotide
143gacgtagggt attaaatgag tggacgtagg gtactgaatg agtg
4414444DNAArtificial SequenceSynthetic Oligonucleotide
144tgaggtagta gtttgtatag tttgaggtag tactttgtac agtt
4414549DNAArtificial SequenceSynthetic Oligonucleotide
145tgaggtagta gtttgtacag tttgtggtag tagtttgtac agtttgagg
4914649DNAArtificial SequenceSynthetic Oligonucleotide
146cctcaaactg tacaaactac tacctcaaac tgtacaaact actaccaca
4914744DNAArtificial SequenceSynthetic Oligonucleotide
147aactgtacaa agtactacct caaactatac aaactactac ctca
4414844DNAArtificial SequenceSynthetic Oligonucleotide
148cactcattca gtaccctacg tccactcatt taatacccta cgtc
4414949DNAArtificial SequenceSynthetic Oligonucleotide
149cactcattca atacccgacg tccactcatt caatacccta cgtccactc 49
* * * * *