Should I Use 8f or 1492r for Sequencing Both Diresction Read or One
Appl Environ Microbiol. 2008 April; 74(8): 2461–2470.
Critical Evaluation of Two Primers Commonly Used for Amplification of Bacterial 16S rRNA Genes▿
Jeremy A. Frank
Section of Microbiology,ane Host-Microbe Systems Theme, Constitute for Genomic Biology,two National Center for Supercomputing Applications,3 College of Medicine, Academy of Illinois at Urbana-Champaign, Urbana, Illinois 61801,4 Carle Foundation Infirmary, Urbana, Illinois 618015
Claudia I. Reich
Department of Microbiology,1 Host-Microbe Systems Theme, Found for Genomic Biological science,ii National Center for Supercomputing Applications,three College of Medicine, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801,4 Carle Foundation Infirmary, Urbana, Illinois 618015
Shobha Sharma
Department of Microbiology,one Host-Microbe Systems Theme, Institute for Genomic Biological science,2 National Heart for Supercomputing Applications,3 College of Medicine, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801,four Carle Foundation Hospital, Urbana, Illinois 618015
Jon S. Weisbaum
Section of Microbiology,1 Host-Microbe Systems Theme, Found for Genomic Biology,ii National Heart for Supercomputing Applications,3 College of Medicine, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801,4 Carle Foundation Hospital, Urbana, Illinois 618015
Brenda A. Wilson
Department of Microbiology,1 Host-Microbe Systems Theme, Plant for Genomic Biological science,ii National Center for Supercomputing Applications,3 College of Medicine, Academy of Illinois at Urbana-Champaign, Urbana, Illinois 61801,iv Carle Foundation Hospital, Urbana, Illinois 61801v
Gary J. Olsen
Department of Microbiology,1 Host-Microbe Systems Theme, Institute for Genomic Biological science,ii National Center for Supercomputing Applications,3 College of Medicine, Academy of Illinois at Urbana-Champaign, Urbana, Illinois 61801,4 Carle Foundation Hospital, Urbana, Illinois 618015
Received 2007 Oct 5; Accepted 2008 February 11.
Abstract
rRNA-based studies, which have get the most common method for assessing microbial communities, rely upon faithful amplification of the respective genes from the original Deoxyribonucleic acid sample. We written report here an analysis and reevaluation of normally used primers for amplifying the Dna between positions 27 and 1492 of bacterial 16S rRNA genes (numbered according to the Escherichia coli rRNA). We suggest a formulation for a forward primer (27f) that includes three sequences not commonly nowadays. We compare our proposed formulation to two mutual alternatives by using linear amplification—providing an assessment that is independent of a opposite primer—and in combination with the 1492 opposite primer (1492r) under the PCR conditions appropriate for making customs rRNA gene clone libraries. For analyses of DNA from human vaginal samples, our formulation was meliorate at maintaining the original rRNA gene ratio of Lactobacillus spp. to Gardnerella spp., particularly under stringent distension conditions. Because our 27f formulation remains relatively uncomplicated, having vii singled-out primer sequences, in that location is minimal loss of overall amplification efficiency and specificity.
The study of microbial communities is important on multiple levels, from describing food cycling and elucidating novel metabolisms to understanding how ecosystems are maintained and how mixtures of microbes can promote and/or upset the health status of their harboring host. Our ability to evaluate aspects of microbial ecology depends to a large extent on correctly identifying community members and their relative contributions to the overall makeup of the ecosystem.
The analysis of genes found in an environs as proxies for the organisms themselves has revolutionized our agreement of microbial communities (16). Studies of universal genes, especially the minor-subunit rRNA (SSU rRNA), provide phylogenetic portraits of the communities, including organisms that have not yet been cultivated (10, 16, 32). These data increment in value with time, equally newly cultivated species provide more anchor points that chronicle organismal phylogeny and physiology. Furthermore, communities are easily compared between locations and over fourth dimension.
An essential contribution to the utility of this approach is the interspersion of more- and less-conserved sequences inside the rRNA genes. The more varied portions distinguish the phylogenetic groups, while the conserved portions provide universal (or nearly universal) sequences for PCR primer binding. This allows specific amplification of the genes of interest out of total community genome Dna (the metagenome). Nearly all studies of bacterial SSU rRNA genes rely on primers designed over 15 years ago (32, 35). Although several groups have warned of the limitations of these primers, this has had lilliputian touch on common practice (reviewed in reference 30). This might exist of little result in studies that seek only a qualitative portrait of community variety, merely with the increasing awarding of rRNA cistron-based methods to analyze medically of import samples (run into, e.g., references 4, 5, vi, ix, 12, thirteen, 25, 29, and 36), overlooking some of the customs components due to inefficient primer binding could take great practical implications.
Motivated past analyses of vaginal microbial communities (five, 12, thirteen, 36), nosotros accept reexamined two of the nigh commonly used primers for bacterial 16S rRNA genes, 27f (spanning positions 8 to 27 in Escherichia coli rRNA coordinates) and 1492r (commonly spanning positions 1492 to 1507, though longer versions are sometimes used) (2, 11, 16, 21, 28, 32, 35), which amplify nearly the entire length of the factor. We assessed the sequence variability at the corresponding primer-binding sites in the near 200,000 sequences from the Ribosomal Database Project Two (RDP) (8, 17) and in the Sargasso Bounding main metagenomic information ready (27). In doing and then, we developed tools for extracting the relevant sequence regions and a heuristic algorithm for screening out sequences that are contributed past the inclusion of amplification primers in published sequences rather than by the original genomic Dna. Based on the observed sequence variation, nosotros propose a new conception of the 27f primer. We compare our formulation to two that are in current use for their influence on the relative abundances of Gardnerella and Lactobacillus rRNA genes in linear (unidirectional) and exponential (bidirectional) amplifications.
MATERIALS AND METHODS
Sequence data.
Sequences were from the RDP (release nine.36) (eight; http://rdp.cme.msu.edu/) and the Sargasso Body of water metagenome (27) (GenBank accretion numbers AACY01000001 to AACY01811372).
Construction of a representative set of bacterial SSU rRNA gene sequences.
The identification of rRNA gene sequences in metagenomic data and the locating of primer-binding sites in the rRNA cistron sequences were based on the similarity to sequences in a phylogenetically representative ready of full-length bacterial SSU rRNA sequences. In the interests of completeness and accurateness, to as bully an extent as feasible, these representative sequences were extracted from completed (or nearly completed) genome sequences available through the NCBI Entrez system (34) and The SEED (22). An initial set of sequences was handpicked for phylogenetic multifariousness and the familiarity of the taxonomic name. Other named sequences with less than 85% identity to those already chosen (as measured by BLASTN [3]) were added with the aid of Perl scripts to comprehend phylogenetic groups without consummate genome information. Several other sequences (particularly from actinomycetes) were added to embrace more of the variations in structure observed in the commencement 200 nucleotides of the 16S rRNA.
In order to project primer-bounden site coordinates from the E. coli 16S rRNA sequence to each of the other representative sequences, a multiple sequence alignment was produced using ClustalW (7). The initial alignment demonstrated that the endpoints of many of the 16S rRNA genes were misannotated. Where necessary, endpoints were manually adjusted. The resulting alignment was manually checked for accuracy at the primer locations.
Locating bacterial 16S rRNA genes in metagenomic information.
The Sargasso Sea metagenome sequence information from The SEED (22) were formatted as a nucleotide Smash database. To locate bacterial rRNA genes in the metagenomic information, each of the representative rRNA factor sequences (described above) was used as a BLASTN (3) query with the post-obit search parameters: a maximum expectation value of x−12, an identity score of 1, a nonidentity score of −ane, x,000 maximum alignments, and no low-complexity filtering. The matching Dna regions were extracted, oriented in the rRNA-like direction, and placed in a database of metagenomic rRNA genes. The resulting collection comprised one,137 rRNA gene sequence fragments. Because the commencement points and endpoints of the original clones are random, the sequence coverage of the SSU rRNA genes is relatively uniform from starting time to finish, with somewhat over 400 sequences roofing any given portion of the gene.
Identification of primer-binding sites in rRNA gene sequences.
The showtime task was to reliably identify and extract the rRNA factor sequences found at the primer-binding sites in a manner that is resistant to idiosyncratic variations, since characterizing these variations was our goal. Nosotros reasoned that the about reliable arroyo is to marshal each sequence with a related, total-length "representative" rRNA factor and employ that alignment to project the known locations of the primer sites in the representatives onto the database sequences. The representative rRNA factor sequences (described above) were formatted as a nucleotide Smash database. To identify a primer-binding site sequence in a given rRNA cistron, the rRNA gene was used as a BLASTN (3) query with the following search parameters: a maximum expectation value of ten−eight, an identity score of 1, a nonidentity score of −i, a maximum of 5 alignments, and no low-complexity filtering. The search results were scanned for matches overlapping the desired primer-binding site in one of the representative sequences, and the corresponding portion of the query was extracted. For consistency of data handling, all analyses were carried out in the RNA-like sequence orientation.
PCR primers.
Lyophilized oligonucleotides (Integrated Deoxyribonucleic acid Technologies) were resuspended in 1 mM EDTA containing 0.001% Triton X-100. Nearly full length 16S rRNA genes were amplified using the 1492r primer (5′-TACCTTGTTACGACTT) and i of the following three 27f primer formulations: twofold-degenerate primer 27f-CM (5′-AGAGTTTGATCMTGGCTCAG, where G is A or C), fourfold-degenerate primer 27f-YM (five′-AGAGTTTGATYMTGGCTCAG, where Y is C or T), or sevenfold-degenerate primer 27f-YM+three. The sevenfold-degenerate primer 27f-YM+3 is four parts 27f-YM, plus ane part each of primers specific for the amplification of Bifidobacteriaceae (27f-Bif, 5′-AGGGTTCGATTCTGGCTCAG), Borrelia (27f-Bor, v′-AGAGTTTGATCCTGGCTTAG), and Chlamydiales (27f-Chl, 5′-AGAATTTGATCTTGGTTCAG) sequences.
For the quantitative PCR (qPCR) of Lactobacillus 16S rRNA gene sequences, the primers used were LactoF (5′-TGGAAACAGRTGCTAATACCG, where R is A or G) and LactoR (5′-GYCCATTGTGGAAGATTCCC); these primers dilate a fragment of about 200 bp in length, approximately between positions 200 and 400 of Lactobacillus 16S rRNA genes. They match all Lactobacillus 16S rRNA factor clones isolated from the man vaginal microbial Dna samples that we have analyzed and no other taxonomic groups in the samples. For the qPCR of Gardnerella 16S rRNA cistron sequences, the primers used were GardnF (v′-GACTGAGATACGGCCCAGAC) and GardnR (5′-ATTCGAAAGGTACACTCACC); these primers amplify a fragment of about 180 bp in length, approximately between positions 180 and 360 of the Gardnerella vaginalis 16S rRNA cistron.
Sample collection and genomic Dna extraction.
Man vaginal samples were collected from eight healthy, premenopausal women between the ages of twenty and 45 years through a study approved by the Institutional Review Boards of the Academy of Illinois at Urbana-Champaign and the Carle Foundation Hospital. Genomic Dna was isolated from 0.five-ml aliquots of the vaginal samples. To set Dna from each sample, 125 μl of 0.5 G Na-EDTA, pH 8.0, containing 75 mg/ml lysozyme was added and samples were incubated at 37°C for 30 min. Post-obit the add-on of 70 μl of 10% sodium dodecyl sulfate and v μl of 10-mg/ml proteinase M, the samples were subjected to 3 freeze-thaw cycles consisting of 5 min in a dry water ice-ethanol bathroom followed by incubation at 37°C for 5 min, and finally they were held at 55°C for an additional 30 min. Post-obit the addition of 70 μl of 5 M NaCl to the mixture and incubation on ice for thirty min, the samples were centrifuged at 16,000 × yard for 20 min and the supernatants were extracted with phenol and phenol-chloroform-isoamyl alcohol, followed by ethanol atmospheric precipitation. The genomic DNA was resuspended in 100 μl of 10 mM Tris-Cl, pH 8.0, containing 1 mM EDTA and stored at −80°C until used.
Linear amplification of vaginal microbial 16S rRNA genes.
Human vaginal microbial Deoxyribonucleic acid samples were used as the substrate for the linear amplification of 16S rRNA genes using the following iii 27f primer variants: 27f-CM, 27f-YM, and 27f-YM+iii. Each primer variant was tested at 3 levels of stringency (the annealing temperatures were 48°C, 54°C, and 60°C). Reaction mixtures independent 0.1 ng of sample DNA in 25 μl PCR buffer (Invitrogen) containing 2 mM MgCltwo, a 0.2 mM concentration of a deoxynucleoside triphosphate mix, a 200 nM concentration of the appropriate 27f primer variant, and 0.25 units of Platinum Taq Deoxyribonucleic acid polymerase (Invitrogen). Reaction mixtures were incubated for 4 min at 94°C, followed past cycles of denaturation for ane min at 94°C, annealing for 30 southward at the appropriate temperature, and extension for 2 min at 72°C. A 10-μl aliquot was removed before amplification (zip-cycle sample). Later 5 cycles, the amplification programme was paused and 10-μl aliquots were removed from each reaction mixture (five-bike sample). After the amplification program was resumed, reaction mixtures were incubated for a further v cycles, at the end of which another 10-μl aliquot was removed (ten-cycle sample). Before existence used as the substrate for qPCR, linearly amplified samples were diluted eightfold in 1 mM EDTA containing 0.001% Triton X-100, making them 0.0005 ng/μl in the original sample Deoxyribonucleic acid.
PCR amplification of vaginal microbial 16S rRNA genes.
16S rRNA genes nowadays in the homo vaginal microbial Dna samples were amplified using the 1492r primer and one of the three formulations of the 27f primer. Distension atmospheric condition were the same as those for linear distension except for the addition of a 200 nM concentration of the reverse primer; all annealings were at 48°C, and the distension was for 23 cycles. Based on our empirical estimates of the amplification efficiency, nosotros calculated that 23 cycles of PCR increased the number of rRNA factor sequences by as much as 770,000-fold. To accurately compare the ratios of Lactobacillus to Gardnerella sequences nowadays in samples before and after distension, the 23-cycle PCR products were diluted 25,000-fold in one mM EDTA containing 0.001% Triton X-100. When 1 μl of the diluted PCR product was added to a 25-μl qPCR mixture, the overall dilution of amplified rRNA cistron sequences was 625,000-fold.
qPCR.
Each sample to be analyzed was used every bit a template in two 25-μl qPCR mixtures, 1 for Lactobacillus and 1 for Gardnerella. For the analysis of linear amplification products, ii μl of the diluted product (0.01 ng of original-sample DNA) was used for each reaction. For the analysis of PCR-amplified samples, the unamplified control contained 0.1 ng of original-sample DNA, while i μl of the 25,000-fold dilution was used for the exponentially amplified products. Each reaction mixture contained 1× iQ SYBR green supermix (Bio-Rad) and 100 nM concentrations of the forward and reverse primers. The reaction mixtures were incubated at 94°C for 4 min, followed by 33 cycles of denaturation at 94°C for thirty southward, annealing at 54°C for fifteen s, and extension at 72°C for 45 s. Fluorescence was quantified using a Bio-Rad iCycler during the extension step of each PCR wheel. Each qPCR experiment was conducted in triplicate for a given DNA sample.
RESULTS
Primer-binding sites in database sequences.
As described in more particular in Materials and Methods, nosotros devised and implemented a method to extract the bacterial SSU rRNA 27f and 1492r primer-bounden site sequences from the data in the RDP and the Sargasso Sea metagenomic information. A key signal of the method is that information technology assumes only a ≥50% sequence identity between the region containing the primer-bounden site in the sequence beingness analyzed and at least i fellow member of a various set of "reference" sequences. The method is general and can exist used to extract any portion of a sequence that is sufficiently conserved or at to the lowest degree is flanked past conserved sequences.
Removing primer sequence contamination from rRNA factor data.
The bacterial SSU rRNA gene sequence data collected in the RDP were trimmed of nonribosomal sequences and oriented in the rRNA-similar direction. Even so, it became clear during our extraction of primer-bounden sites that many were PCR products that had not been trimmed to just the amplified DNA (i.eastward., they however included PCR primer sequences). In locating primer-bounden sites, nosotros used each RDP sequence as a BLASTN query against the representative sequences. For each query, we recorded the regions of similarity to the rRNA gene (mapped to E. coli coordinates). In Fig. 1A, we plot the total number of RDP sequences whose lucifer covers a given nucleotide and the number of sequences whose similarity to rRNA starts at a given nucleotide (commencement point). The distribution is revealing. Out of 30,258 sequences with similarity extending to rRNA position 8, 24,340 start precisely at that place. Given that the 27f primer is most commonly a 20-mer spanning positions 8 to 27, this is the situation expected if the rRNA genes were amplified with a 27f primer and not trimmed of their primer sequence before database submission. The spot-checking of publications in which some of these sequences were reported supports this interpretation. Overall, out of 80,428 sequences that include position 28, 29,922 appear to be PCR products trimmed of the primer, 15,081 appear to be PCR products reported with part of the primer sequence, 24,340 appear to be PCR products reported with the full primer sequence, five,918 retain similarity to rRNA at to the lowest degree ane nucleotide past the primer, and iv,317 sequences (5.iv% of the total) include similarity to rRNA at least iii nucleotides past the primer.
The same assay was carried out in the region of the 1492r primer-binding site (Fig. 1B). The histogram of rRNA gene similarity endpoints clearly shows the 16-, 19-, and 22-nt-long versions of the primer. Overall, 7,139 sequences appear to be PCR products properly trimmed of the 1492r primer, eighteen,889 sequences appear to be PCR products reported with some or all of the 1492r primer sequence, and xviii,522 sequences extend beyond the primer site. The larger number of sequences that continue beyond this primer site is due primarily to the use of opposite-amplification primers downstream of the 1492r-binding site (east.thousand., 1522r, 1525r, or a primer in the 23S rRNA gene, which ordinarily follows the 16S rRNA gene).
To maximize the chance that the sequences analyzed below contain bona fide primer-binding site sequences (not PCR primer sequences), we demanded that the sequences included in our analysis extend at least 3 nucleotides beyond the primer-bounden site being analyzed and that the extension showed similarity to rRNA sequences. At the 27f primer-binding site, this excluded 80% of the sequences that might otherwise have been included. Requiring only 2 nucleotides across the primer-binding site did not significantly increase the data available for assay but decreased our confidence in their validity. The awarding of these criteria resulted in the extraction of 4,315 (4,152 later the removal of sequences containing ambiguous nucleotides) 27f-binding site sequences and 17,940 (17,696 afterwards the removal of sequences containing ambiguous nucleotides) 1492r-binding site sequences from the RDP database. Spot checks confirmed these to exist largely devoid of PCR primer contamination.
Assay of primer-binding site sequences.
The number of occurrences of the well-nigh usually observed sequence variants at the 27f-binding site are shown in Tabular array ane, as are the dominant phylogenetic groups in which each is observed. We likewise indicate which of several published 27f primer formulations precisely match the sequence. The two most mutual bounden site variants cover most of the bacterial phyla. It is interesting that the sequence observed most often in Sargasso Sea rRNA genes (Table one, second row) is not precisely matched by the common nondegenerate class of the primer, 27f-CC (AGAGTTTGATCCTGGCTCAG) (run into, e.yard., references 1, 4, vi, nine, and 25). The tertiary near oftentimes observed bounden site sequence is found in Actinobacteria and some Proteobacteria. It is accommodated by the 27f-YM primer simply requires a C-A mispairing when the more than common 27f-CC and 27f-CM formulations are used.
Tabular array 1.
Primer binding site sequence a | No. of occurrences in: | Phylogenetic group(s) containing the binding site sequence | Exactly matching primer(s) b | |
---|---|---|---|---|
RDP v9.36 | Sargasso Sea data | |||
AGAGTTTGATCCTGGCTCAG | 2,825 | 132 | Most Leaner | 1, 2, iii, 4, five, 6 |
...........A........ | 905 | 252 | Many Bacteria, especially enteric bacteria | 2, 3, 4, 5, 6 |
..........T......... | 59 | 2 | Actinobacteria, some Proteobacteria | 3, 4, half dozen |
...A.......T...T.... | 57 | 0 | Chlamydiales | half dozen |
......C............. | 54 | 0 | Atopobium and chloroplasts | |
.................T.. | 35 | 0 | Borrelia spp. | 6 |
..........TA........ | vii | 21 | Campylobacterales and Sphingomonadales | 3, 4, half-dozen |
..G...C...T......... | 21 | 0 | Bifidobacteriales | 6 |
..G................. | 17 | 0 | Thermotogales and Planctomycetales | 5 |
The quaternary-most-observed binding site sequence is constitute in Chlamydiales. As noted by previous authors (32), this sequence differs at iii positions from all of the common primer formulations.
The fifth-most-observed sequence is plant primarily in Atopobium spp. and chloroplasts (simply, interestingly, not in Blue-green alga, which include the chloroplast ancestor). Given that this variant requires only a unmarried T-Thousand mispairing to bind the 27f-CC primer (and that component of 27f-CM and 27f-YM) and the mispairing is far from the 3′ end of the primer, it is unlikely to have a substantial consequence on efficiency.
The sixth-most-observed sequence is that previously reported for Borrelia spp. (32). Although it requires but a single C-A mispairing with the 27f-CC primer (and the respective component of all other common formulations), the mismatch is in the third pair from the 3′ end, sufficiently close enough to raise concerns about efficiency in detecting members of this clinically important genus.
The 27f primer-binding site observed in Campylobacterales (which are clinically relevant) and Sphingomonadales (which are a major component of the Sargasso Body of water data) precisely matches a component of the 27f-YM primer merely requires 1 or two (consecutive) mispairings with the more common 27f-CM and 27f-CC primers.
The 8th-almost-observed bounden site variant in these data is of particular interest for our studies of vaginal microbiology. This sequence is observed throughout the Bifidobacteriales, including the genus Gardnerella (18) (GenBank accession numbers {"type":"entrez-nucleotide-range","attrs":{"text":"M58729 to M58744","start_term":"M58729","end_term":"M58744","start_term_id":"173865","end_term_id":"544574273"}}M58729 to M58744). This sequence requires three mispairings to bind the 27f-CC and 27f-CM primer formulations and two mispairings to bind the best-matching sequence in the 27f-YM formulation. Although all of the mispaired positions are nine or more nucleotides from the 3′ stop of the primer and this primer is routinely used with annealing temperatures far below its computed melting temperature, we were concerned that these mismatches still might introduce pregnant bias in detecting these organisms. This is analyzed in particular beneath.
The next-near-common binding site sequence has as well been previously noted (28) and is establish in Thermotogales and Planctomycetales. Interestingly, this sequence requires only a single A-C mispairing 17 nucleotides from the 3′ cease of the 27f-CC, 27f-CM, and 27f-YM primer formulations, yet information technology has been incorporated into a primer used by Kuske et al. (15).
Continuing down our list of observed 27f primer-bounden site sequences (data not shown), we could not assign additional sequences to phylogenetic groups or confidently conclude that they represented bona fide variants, as opposed to sequencing errors or other artifacts. Given this, we limited our nowadays analysis to the sequences shown in Table 1. As genome projects and metagenomic studies amend the phylogenetic sampling of full-length sequences, we expect that additional variants will be found to be important.
In the example of the 1492r primer-binding site, the picture is entirely different. Restricting our analysis to a 16-nt-long version, 17,029 of 17,696 available binding sites have the sequence AAGTCGTAACAAGGTA. The most common alternative at this site, AAGTCGTAACAAGGAG, was observed 34 times. The test of the surrounding sequences indicated that the last T in the canonical sequence is missing rather than that this represents two base of operations changes. The merely phylogenetically amassed subset of this variant was found in several Lactobacillus sequences reported in a single paper (23). Other sequences for rRNA genes from the aforementioned species include the missing T, suggesting that the variant is sometimes a sequencing error (the site is expected to suffer from "band compression," making sequencing particularly hard). The next-well-nigh-observed sequences were AAATCGTAACAAGGTA and AAGTCCTAACAAGGTA, each of which was seen 28 times. Except for the recurrence of the latter sequence in a grouping of the Comamonadaceae (33), there is no obvious phylogenetic clustering of these sequences. With this caveat regarding the latter sequence, we conclude that the current data support the use of TACCTTGTTACGACTT every bit a universal bacterial 1492r primer. This is mostly in keeping with the published 16-nt version of 1492r and the corresponding regions of longer versions.
27f primer design.
When using the primer-binding site data gathered from the sequence databases to design primers for rRNA gene amplification, it is necessary to distinguish the bodily sequence variation that we wish to accommodate from those due to errors or artifacts in the data (eastward.thousand., the contamination of the data with amplification primers, as discussed above). To determine whether a variant is existent or not, there are ii useful criteria: the outset is to require that a given variant occur a minimum number of times, and the second is to require the phylogenetic congruency of a variant (we look rare, but real, variants to be phylogenetically amassed). Each of the nine bounden site sequence variants discussed higher up satisfies both of these criteria. Four of the nine 27f primer site variants in Table 1 precisely friction match components of the 27f-YM primer. The bifidobacterial and chlamydial sequences each have two and 3 differences from the best-matching component of the 27f-YM primer, suggesting a need to modify the primer design to improve conform them. The Borrelia sequence is but a single base of operations change from a 27f-YM primer component, but the mismatch is sufficiently close to the iii′ end (the third base) to potentially decrease priming efficiency. The remaining two sequence variants require only a unmarried base mismatch (suggesting a relatively small destabilization) and are farther from the 3′ terminate of the primer, minimizing their deleterious influence on the priming efficiency of a spring oligonucleotide. For the current study, we accept called not to brand any adjustments for these last two sequences. Therefore, our goal is a primer design that will precisely match seven different sequences.
Given the analysis of the 27f primer-bounden sites tabulated higher up, we decided that the best way to accommodate sequence variants was to make an equimolar mixture of the seven desired sequences. In this case, that meant synthesizing primers specific for Chlamydiales, Borrelia, and Bifidobacteriales separately and mixing i part of each with four parts of the fourfold-degenerate 27f-YM primer. We refer to this mixture equally 27f-YM+iii. This primer pool, containing only vii sequence variants, is able to adjust the vast majority of the observed natural variation at the 27f primer-bounden site. Since each individual sequence is diluted only sevenfold in the mix and amplifications are carried out with an excess of primer, we predicted comparable levels of amplification across all potential templates.
Amplification efficiency using the 27f-YM+three primer.
Nosotros evaluated the amplification efficiency of our 27f-YM+3 primer mixture and compared it to the unremarkably used 27f-CM and 27f-YM primers. As a test system, we chose man vaginal microbiota DNA samples from clinically healthy women. Our analyses of these samples (data not shown), equally well as other studies of vaginal microbiota (5, 12, 13, 36), had determined that Lactobacillus and Gardnerella are two of the most frequently observed genera. Lactobacillus rRNA genes are predicted to perfectly lucifer and to be amplified efficiently with all three of the 27f primer formulations examined in this study (as well as with 27f-CC). However, whereas Gardnerella rRNA genes should precisely friction match a component of our proposed 27f-YM+3 primer, information technology would have three and ii mismatches with the 27f-CM and 27f-YM primers, respectively, and might exist less efficiently amplified in these last two cases. Therefore, nosotros chose to evaluate the relative merits of primer formulations by comparing the amplification efficiencies of Lactobacillus and Gardnerella in these clinical samples.
We used real-time qPCR with primer pairs specifically targeted to Lactobacillus and Gardnerella to measure the change in the levels of these rRNA gene sequences post-obit amplification with each of the three 27f primer formulations. In considering an experimental design, nosotros were concerned most possible bias in any contrary primer that we might choose. Therefore, nosotros performed an analysis with linear distension using but a 27f primer, rather than the more common PCR. An additional benefit of the linear amplification was that it allowed united states of america to explore the behavior of the 27f formulations at various temperatures without business organization for the length and stability of a reverse primer.
Using qPCR, we measured the relative amounts of target Dna (Lactobacillus or Gardnerella rRNA genes) before amplification, afterward five linear distension cycles, and after ten linear amplification cycles. The primer used for the linear distension was no primer, 27f-CM, 27f-YM, or 27f-YM+three. Our logic was that if linear amplification were perfect, for every 2 strands of target in the input DNA (i.e., each double-stranded gene), 5 cycles of linear distension would yield seven strands of target Dna (the original two plus five copies of the RNA-like strand) and 10 cycles would yield 12 strands of target Deoxyribonucleic acid (the original 2 plus 10 copies). Less-efficient amplification should upshot in lesser increases in the target DNA.
Figure 2 shows the influence of the three 27f primer formulations on the yield of Lactobacillus and Gardnerella sequences at three dissimilar annealing stringencies (temperatures). In the instance of Lactobacillus (left panels), all primers at all temperatures tested amplified the starting material efficiently, though at that place was slightly less amplification with our 27f-YM+3 primer, which we attribute to the slightly college dilution of exactly matching sequences in the primer population. In contrast, the results for the amplification of Gardnerella sequences are hit (right panels). At the highest stringency tested (60°C), the 27f-CM primer completely failed to amplify Gardnerella sequences and the 27f-YM primer was but marginally proficient, while our reformulated primer was able to amplify these sequences with loftier efficiency. As the annealing stringency decreased, there was an improvement in the ability of the 27f-CM and 27f-YM primers to amplify Gardnerella sequences. It is important to note that even at a depression stringency (48°C, a commonly used temperature in microbial environmental rRNA gene-amplifying protocols), there were still discrepancies in the efficiencies with which the dissimilar primer formulations were able to amplify the target sequences.
27f-YM+3 primer and clonal representation in rRNA factor libraries.
Having demonstrated that our reformulated 27f-YM+3 primer more efficiently amplified Gardnerella sequences than the usual 27f primers, we turned our attention to the effect of its use in rRNA gene library construction from clinical samples. The question is how does each of the 27f primer formulations alter the composition of the rRNA genes used in library construction relative to the makeup of the original material?
For PCR amplification, each of the three 27f primer formulations in this report was paired with a 16-nt-long version of the 1492r primer. Since this short version of the 1492r primer requires a relatively low annealing temperature for optimal distension (48°C under our conditions), it was possible that little or no bias would exist observed due to mispairings of the 27f primer. DNAs from three different human vaginal microbial samples were amplified with each of the three 27f primer formulations and the 1492r primer. We then used qPCR to evaluate the relative amounts of Lactobacillus and Gardnerella sequences before and after amplification. To eliminate the need for an accented calibration of the qPCR efficiency and thereby simplify analysis, before performing qPCR, nosotros diluted the amplified samples to approximately the rRNA gene levels in the unamplified samples. Control PCR amplifications without the 27f and 1492r primers generated no detectable production (data not shown). Disquisitional threshold cycles (CT ; the number of qPCR cycles needed to generate detectable production) were used as a mensurate of the amount of taxon-specific rRNA genes. To compare the ratios of sequences, we calculated the difference in critical thresholds for Lactobacillus and Gardnerella (i.e., ΔCT = CT Lactobacillus − CT Gardnerella ). No divergence in critical thresholds implies that the sequences are present in the sample in approximately equal amounts; a ΔCT of +1 connotes that at that place are roughly twice as many Gardnerella sequences as there are Lactobacillus sequences (a higher CT value ways that more cycles were required for the rRNA cistron to get detectable due to a lower starting concentration). Although absolute calibration of the Lactobacillus and Gardnerella qPCR distension efficiencies is required to decide the bodily ratio, our interest in the nowadays study is the extent to which amplification with the 27f and 1492r primers alters the ratio and hence the ΔCT ; that is, we are interested in the ΔΔCT due to the PCR distension step.
For each of the iii Dna samples, Table 2 presents the Lactobacillus and Gardnerella ΔCT values (the boilerplate and standard mistake from three replicates) for the sample DNA and the amplification products using alternative 27f primer formulations. Before amplification (the "Sample DNA" row of Table 2), the dissimilar ΔCT values indicated that the clinical samples accept unlike bacterial compositions (they were called for this property based on clone sequencing [data not shown]). For samples 1, ii, and 3, the Gardnerella/Lactobacillus rRNA cistron sequence ratios are 1.54:ane, 3.81:one, and 0.54:i, assuming that they have equal qPCR efficiencies. A ΔCT of +ane corresponds to a one:2 ratio. However, what matters in this experiment is the change in the ΔCT that occurs with PCR distension (ΔΔCT ). The shift is especially dramatic with the 27f-CM formulation, less so with the 27f-YM conception, and least with the 27f-YM+3 formulation. This is well-nigh hands seen graphically (Fig. 3). For the 27f-CM primer, the ΔΔCT is ca. −2, reflecting an approximately fourfold underrepresentation of Gardnerella rRNA in the PCR product. For the 27f-YM primer, the ΔΔCT is ca. −1, reflecting an approximately twofold underrepresentation of Gardnerella rRNA in the PCR product. In dissimilarity, for the 27f-YM+iii primer, the ΔΔCT is ca. 0.ane, conspicuously showing a much more faithful representation of the Gardnerella/Lactobacillus rRNA gene ratio in the original samples.
Tabular array 2.
27f primer conception | ΔCT for sample a : | ||
---|---|---|---|
ane | two | iii | |
Sample Dna | 0.63 ± 0.32 | 1.92 ± 0.23 | −0.93 ± 0.19 |
27f-CM | −1.70 ± 0.67 | 0.07 ± 0.38 | −3.17 ± 0.44 |
27f-YM | −0.73 ± 0.12 | one.13 ± 0.30 | −one.73 ± 0.23 |
27f-YM+three | 0.47 ± 0.20 | 2.30 ± 0.l | −0.eighty ± 0.31 |
Give-and-take
The introduction of rRNA gene-based methods for community analysis has revolutionized our view of the microbial world. Overcoming the limitations of culture-based protocols for environmental census, they have revealed a largely unanticipated variety in all communities probed. These studies vastly expanded our knowledge of the extant microbial types, identifying entire new groups, sometimes even at the highest taxonomic levels. This explosion in the discovery of microbial novelty, while reshaping our thinking about the complication of natural environments, left many questions unanswered.
In guild to sympathise microbial communities, their structure, and their fluctuations, surveys demand to be comprehensive, i.east., provide as vast a coverage of the present diversity as possible. Are our current methods up to the chore? Also, surveys ideally need to provide quantitative measures of the microbial composition. How well can the relative abundance of different taxa in a community be assessed with our current methods?
Our motivation for the present studies was therefore twofold: to appraise both the quantitative scope and comprehensiveness of the nearly commonly used method of rRNA gene distension to characterize microbial communities. To address the consequence of the quantitative analysis of individual components in microbial communities, nosotros designed and tested a protocol for rRNA cistron amplification that more faithfully maintained the relative abundances of the microbial types present in the clinical samples. To accost the issue of comprehensiveness, we tested how well the most commonly used rRNA gene amplification primers stand for the vastly expanded database of rRNA factor sequences and the subsequent effects on the surveys using them. We reformulated the usually used primers and tested them for their coverage in the analysis of clinical vaginal microbial samples.
Our analyses of the 27f and 1492r primer-binding sites in SSU rRNA gene databases revealed that the termini of many (perhaps most) rRNA cistron sequences in the databases are contaminated with sequences that exercise not originate from the organism that they are asserted to represent. This suggests a need for greater vigilance in sequence reporting and curation. We have proposed and implemented a benchmark for the automatic detection of, and thereby removal of, much of the contamination, with a possible loss of some valid sequences. Improvements to our procedure are possible by integrating the analyses of both the 5′ and 3′ ends of sequences (with few exceptions, if at that place is primer contagion at one finish, then it is nowadays at the other terminate as well). Similarly, all sequences first described in any item publication tend to have or not have primer contamination.
Our analysis of 27f primer-binding sites revealed several sequence variations representing cohesive phylogenetic groups that are non accommodated past the ordinarily used 27f primer formulations, even though these sequences have been known for many years (e.g., come across reference 32). This leads to the question of whether the mismatched nucleotides crusade a systematic underrepresentation of the corresponding phylogenetic groups in rRNA gene libraries. Implicit in much of the microbial ecology literature is the supposition that a low primer annealing temperature (at to the lowest degree for the first several PCR cycles) provides a good and expedient method for accommodating primer-binding site sequence diversity, while avoiding the pitfalls of highly degenerate mixtures. We combined the 20-nucleotide-long 27f primer with a shorter (xvi nucleotide) 1492r primer, necessitating a PCR annealing temperature well beneath that required for the longer 27f. Even under these depression-stringency weather, nosotros institute that the 27f-CM and 27f-YM primers discriminated confronting Gardnerella rRNA genes, with 27f-CM being less discriminatory than 27f-YM. Thus, we establish that even low-stringency amplification did not fully tolerate the mismatches.
These results emphasize the importance of primers that exactly match their template. At that place are multiple ways to conform sequence variation when designing amplification primers. The nearly common mode is to synthesize primers with two or more nucleotides at selected positions (degenerate primers). While this strategy allows the stringency of amplification to remain high, hence reducing spurious pairing, the sequences that are exact matches to each variant are diluted in the puddle, leading to lower overall efficiency of bounden. Every bit the number of variable sites increases, the complication of the pool increases exponentially, and some of the sequences in the pool may be superfluous, worsening the problems associated with the dilution of relevant sequences. In the present example, introducing degenerate positions to include the vii desired sequences would require the sequence AGRRTTYGATYHTGGYTYAG (where H is A or C or T), which is a mixture of 192 distinct sequences.
An alternative approach is to incorporate the noncanonical nucleotide inosine at variable positions (nineteen, 31). Inosine forms stable pairs with all four nucleotides, although the strength of the interaction varies (pairings with pyrimidines are more than stable) (twenty). Unlike the use of degenerate primers, an increase in the number of positions replaced with inosine can atomic number 82 to promiscuous pairings and the amplification of spurious products. In the present case, introducing inosine at seven sites would create a primer that matches 16,384 sequences. Nosotros instead chose to maintain high specificity and depression dilution past formulating a 27f primer mixture consisting just of the 7 unique sequence variants, 27f-YM+3.
Although we have added 3 sequences to the 27f-YM primer, nosotros take not been comprehensive in testing all components of our formulation or their effects on all phylogenetic groups for which the primer is intended to ameliorate results. Instead, we focused on the behavior of ii phylogenetic groups relevant to our electric current work on vaginal microbiota, though our results should be applicable to other groups. The literature suggests that most investigators either are unaware of the issue or await that the effects will be insignificant. Our survey of the literature suggests that 27f-CC and 27f-CM are the 2 nearly mutual 27f primer formulations, with 27f-YM being the third near common. Fifty-fifty in studies of vaginal microbiota, the 27f-CC (6, 9, 25) and 27f-CM (13) primers are routinely used, each with iii mismatches to the clinically relevant Gardnerella rRNA genes. One vaginal study used an even shorter version of 27f-CC (positions x to 27 in E. coli coordinates), resulting in three mismatches out of 18 base pairs with the Gardnerella rRNA genes, a fact best-selling past the authors (29). Our results advise that a minor modification of the primer design could yield substantially more representative results.
Fifty-fifty if nosotros are willing to have the quantitative bias introduced by mismatches when we rely on a lower stringency to tolerate them, there is a second reason to explicitly accommodate known sequence diversity in the primer. Relying on a low stringency leaves niggling or no buffer for additional, undiscovered diversity. Incorporating presently known diverseness in the primer design will provide a more solid footing for sampling the unknown.
Overall experimental blueprint.
Although the focus of our work is the comparison of primer formulations, nosotros have attempted to critically analyze other aspects of the protocols. Our survey of the literature revealed a wide range of template DNA used in the PCR (in those cases where information technology is specified). Similarly, there is a wide range in the number of PCR cycles used. Only rarely do we find discussion of how published protocols were designed. When we practice, the well-nigh of import theme seems to exist avoiding PCR saturation (eastward.g., references 14, 24, and 26), the point at which arable rRNA gene types stop amplifying because they reanneal before binding the primer and Deoxyribonucleic acid polymerase. This still leaves a merchandise off between the amount of template and the number of cycles. Here, nosotros chose to minimize the consumption of the (often express) template by calculating the number of distinct rRNA genes expected in a given amount of Deoxyribonucleic acid and using the smallest corporeality that volition requite few if any duplicates in the subsequent sequencing of random clones. Given that PCR amplification but increases the number of copies of the sequences originally present, the sequencing of random clones of the amplified DNA amounts to drawing random genes from this original genetic pool, with replacement. If the pool has N rRNA genes and n clones are sequenced, then the probability that a copy of a specific one of the N genes has not been sequenced is due east − μ, where μ is −n/N. The expected number of genes without a re-create sequenced is described in the equation Ne − μ. The expected number of distinct genes that take been sequenced is described in the equation Northward − Ne − μ = Northward(1 − e − μ). The number of instances of resequencing copies of the same gene is the difference between the clones sequenced and the genes sequenced, which is n − Due north(i − e − μ). When n is ≪Northward (which is truthful when there are many more genes in the sample than at that place are clones sequenced), this is approximately Nμ2/2. If due north is equal to N, the expected number of instances of resequencing a gene is approximately 1-half. If nosotros take this as our goal, then it would crave ∼x5 rRNA genes to sequence 300 clones with minimal duplicates. If we estimate an boilerplate of 1 rRNA cistron per 10half dozen bp of bacterial Dna, this gives 1 rRNA gene per one fg of Dna, or ten5 rRNA genes per 0.1 ng of bacterial DNA. This is one,000-fold less Deoxyribonucleic acid than that used in some studies. An important caveat is that all of the input DNA must exist bacterial. If 99% of the Deoxyribonucleic acid in a sample were from a eukaryal host, then 100-fold-more input Deoxyribonucleic acid would be needed to produce the aforementioned PCR distension production. To maintain less than ane duplicate when sequencing more clones, say 1,000 instead of 300, it would require about x times the input Dna. Even so, even if the input DNA were maintained at 0.1 ng, the predicted number of duplicates in 1,000 clone sequences would be merely 5, so trivial sequencing effort would be wasted.
Given a specified corporeality of input DNA, our next goal was to decide the number of PCR cycles that would non saturate the amplification of the about-abundant sequences. Depending on the template, the primers, and the DNA polymerase, with 0.ane ng of template in a 25-μl reaction mixture, we observed the showtime evidence of saturation at 24 to 26 cycles (data not shown). Again, this value is depression relative to the number of cycles of many (but not all) published protocols. It is incommunicable to say why so much Dna and so many cycles are common in the literature, merely unless the reaction conditions are tremendously inefficient, we suspect that many investigators accept driven their amplifications well into saturation, introducing additional bias into the resulting rRNA gene composition.
Carrying an before argument i pace further, it might be argued that there are so many problems with using clone sequences to characterize a microbial community that there is no point in trying at all. Our commencement response is that it is the clone-based analyses that reveal the taxa that require more-accurate analysis. Microarrays, qPCR, and many other methods require a predefined list of what is beingness sought, and this generally comes from clone-based analyses. Missing taxa at this early on step tin can lead to their exclusion from more than-quantitative studies. Our second response to "why minimize bias in clone-based studies?" is that some analyses require a degree of resolution for which a consummate (or near complete) sequence is necessary. For case, microbiologists have yet to understand the origins and implications of the pervasive heterogeneity of rRNA gene types in natural communities and how this diversity reflects the ecology and evolution of the communities. Microarrays and qPCR are poor methods for observing and measuring this diversity because i or two brusk fragments of the gene define their specificity. Although denaturing gradient gel electrophoresis and last restriction fragment length polymorphism fingerprinting tin can sample more of the rRNA genes, they still have much less resolving ability than sequencing.
During the xx years since the introduction of rRNA gene-based customs analyses, it has been relatively easy to brand new discoveries, explore new territories, sample new environments, and recognize novel microbial types. Hereafter progress in microbial ecology hinges on our ability to perform comprehensive and quantitative studies. If in no other context, the increasing use of these methods in medical studies deserves this attention. Toward this cease, it is vital that we regularly reevaluate the analytical methods that we use to study these systems.
Acknowledgments
This work was supported in part past the Research Board of the University of Illinois at Urbana-Champaign and by the Carle Foundation Infirmary.
We thank Michelle Hughes and Barbara Hall for their clinical and clerical aid with the sample collection. We thank Mengfei Ho, Affections Rivera, Noriko Nakamura, Abigail Salyers, Rex Gaskins, Lois Hoyer, and James Slauch for their helpful discussions. We as well thank the reviewers for their suggestions.
Footnotes
▿Published ahead of print on 22 February 2008.
REFERENCES
1. Abulencia, C. B., D. Fifty. Wyborski, J. A. Garcia, Thou. Podar, W. Chen, S. H. Chang, H. West. Chang, D. Watson, Due east. L. Brodie, T. C. Hazen, and M. Keller. 2006. Environmental whole-genome distension to admission microbial populations in contaminated sediments. Appl. Environ. Microbiol. 72 :3291-3301. [PMC free commodity] [PubMed] [Google Scholar]
two. Acinas, Due south. G., 5. Klepac-Ceraj, D. E. Hunt, C. Pharino, I. Ceraj, D. Fifty. Distel, and Chiliad. F. Polz. 2004. Fine-calibration phylogenetic architecture of a complex bacterial community. Nature 430 :551-554. [PubMed] [Google Scholar]
3. Altschul, South. F., Westward. Gish, W. Miller, E. W. Myers, and D. J. Lipman. 1990. Bones local alignment search tool. J. Mol. Biol. 215 :403-410. [PubMed] [Google Scholar]
4. Bik, E. Thou., P. B. Eckburg, S. R. Gill, Grand. E. Nelson, E. A. Purdom, F. Francois, G. Perez-Perez, Thousand. J. Blaser, and D. A. Relman. 2006. Molecular analysis of the bacterial microbiota in the human tummy. Proc. Natl. Acad. Sci. USA 103 :732-737. [PMC free article] [PubMed] [Google Scholar]
5. Burton, J. P., and G. Reid. 2002. Evaluation of the bacterial vaginal flora of xx postmenopausal women by straight (Nugent score) and molecular (polymerase chain reaction and denaturing gradient gel electrophoresis) techniques. J. Infect. Dis. 186 :1770-1780. [PubMed] [Google Scholar]
half dozen. Burton, J. P., East. Devillard, P. A. Cadieux, J.-A. Hammond, and Yard. Reid. 2004. Detection of Atopobium vaginae in postmenopausal women by cultivation-independent methods warrants further investigation. J. Clin. Microbiol. 42 :1829-1831. [PMC free article] [PubMed] [Google Scholar]
7. Chenna, R., H. Sugawara, T. Koike, R. Lopez, T. J. Gibson, D. Thousand. Higgins, and J. D. Thompson. 2003. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 31 :3497-3500. [PMC free article] [PubMed] [Google Scholar]
8. Cole, J. R., B. Chai, T. L. Marsh, R. J. Farris, Q. Wang, S. A. Kulam, S. Chandra, D. M. McGarrell, T. 1000. Schmidt, G. Yard. Garrity, and J. M. Tiedje. 2003. The Ribosomal Database Project (RDP-II): previewing a new autoaligner that allows regular updates and the new prokaryotic taxonomy. Nucleic Acids Res. 31 :442-443. [PMC free article] [PubMed] [Google Scholar]
9. Coolen, M. J. L., E. Mail, C. C. Davis, and L. J. Forney. 2005. Characterization of microbial communities plant in the human vagina by analysis of final restriction fragment length polymorphisms of 16S rRNA genes. Appl. Environ. Microbiol. 71 :8729-8737. [PMC costless article] [PubMed] [Google Scholar]
x. DeLong, E. F. 2004. Microbial population genomics and ecology: the route ahead. Environ. Microbiol. six :875-878. [PubMed] [Google Scholar]
xi. DeLong, East. F., C. M. Preston, T. Mincer, 5. Rich, S. J. Hallam, N. U. Frigaard, A. Martinez, M. B. Sullivan, R. Edwards, B. R. Brito, S. W. Chisholm, and D. Grand. Karl. 2006. Community genomics amongst stratified microbial assemblages in the body of water'south interior. Science 311 :496-503. [PubMed] [Google Scholar]
12. Fredricks, D. N., T. L. Fiedler, and J. M. Marrazzo. 2005. Molecular identification of bacteria associated with bacterial vaginosis. N. Engl. J. Med. 353 :1899-1911. [PubMed] [Google Scholar]
13. Hyman, R. W., Thou. Fukushima, 50. Diamond, J. Kumm, Fifty. C. Giudice, and R. Due west. Davis. 2005. Microbes on the human being vaginal epithelium. Proc. Natl. Acad. Sci. USA 102 :7952-7957. [PMC free article] [PubMed] [Google Scholar]
14. Kainz, P. 2000. The PCR plateau phase—towards an agreement of its limitations. Biochim. Biophys. Acta 1494 :23-27. [PubMed] [Google Scholar]
fifteen. Kuske, C. R., Southward. M. Barns, C. C. Grow, L. Merrill, and J. Dunbar. 2006. Environmental survey for four pathogenic leaner and closely related species using phylogenetic and functional genes. J. Forensic Sci. 51 :548-558. [PubMed] [Google Scholar]
sixteen. Lane, D. J., B. Pace, G. J. Olsen, D. A. Stahl, M. Fifty. Sogin, and N. R. Pace. 1985. Rapid determination of 16S ribosomal RNA sequences for phylogenetic analyses. Proc. Natl. Acad. Sci. USA 82 :6955-6959. [PMC costless article] [PubMed] [Google Scholar]
17. Larsen, North., G. J. Olsen, B. 50. Maidak, Thou. J. McCaughey, R. Overbeek, T. J. Macke, T. L. Marsh, and C. R. Woese. 1993. The Ribosomal Database Project. Nucleic Acids Res. 21 :3021-3023. [PMC free article] [PubMed] [Google Scholar]
18. Leblond-Bourget, N., H. Philippe, I. Mangin, and B. Decaris. 1996. 16S rRNA and 16S to 23S internal transcribed spacer sequence analyses reveal inter- and intraspecific Bifidobacterium phylogeny. Int. J. Syst. Bacteriol. 46 :102-111. [PubMed] [Google Scholar]
19. Lindh, J. M., O. Terenius, and I. Faye. 2005. 16S rRNA cistron-based identification of midgut bacteria from field-caught Anopheles gambiae sensu lato and A. funestus mosquitoes reveals new species related to known insect symbionts. Appl. Environ. Microbiol. 71 :7217-7223. [PMC costless article] [PubMed] [Google Scholar]
xx. Martin, F. H., and M. Thousand. Castro. 1985. Base pairing involving deoxyinosine: implications for probe design. Nucleic Acids Res. xiii :8927-8938. [PMC complimentary article] [PubMed] [Google Scholar]
21. Nercessian, O., Y. Fouquet, C. Pierre, D. Prieur, and C. Jeanthon. 2005. Diversity of Bacteria and Archaea associated with a carbonate-rich metalliferous sediment sample from the Rainbow vent field on the Mid-Atlantic Ridge. Environ. Microbiol. 7 :698-714. [PubMed] [Google Scholar]
22. Overbeek, R., T. Begley, R. M. Butler, J. Five. Choudhuri, H. Y. Chuang, One thousand. Cohoon, 5. de Crecy-Lagard, N. Diaz, T. Disz, R. Edwards, M. Fonstein, East. D. Frank, S. Gerdes, Eastward. M. Drinking glass, A. Goesmann, A. Hanson, D. Iwata-Reuyl, R. Jensen, N. Jamshidi, Fifty. Krause, One thousand. Kubal, Due north. Larsen, B. Linke, A. C. McHardy, F. Meyer, H. Neuweger, Grand. Olsen, R. Olson, A. Osterman, Five. Portnoy, 1000. D. Pusch, D. A. Rodionov, C. Ruckert, J. Steiner, R. Stevens, I. Thiele, O. Vassieva, Y. Ye, O. Zagnitko, and V. Vonstein. 2005. The subsystems approach to genome note and its use in the projection to annotate 1000 genomes. Nucleic Acids Res. 33 :5691-5702. [PMC complimentary article] [PubMed] [Google Scholar]
23. Pavlova, Southward. I., A. O. Kilic, S. S. Kilic, J. South. So, G. E. Nader-Macias, J. A. Simoes, and L. Tao. 2002. Genetic diversity of vaginal lactobacilli from women in different countries based on 16S rRNA gene sequences. J. Appl. Microbiol. 92 :451-459. [PubMed] [Google Scholar]
24. Suzuki, M. T., and S. J. Giovannoni. 1996. Bias acquired by template annealing in the distension of mixtures of 16S rRNA genes by PCR. Appl. Environ. Microbiol. 62 :625-630. [PMC complimentary article] [PubMed] [Google Scholar]
25. Thies, F. L., Due west. König, and B. König. 2007. Rapid label of the normal and disturbed vaginal microbiota by application of 16S rRNA gene terminal RFLP fingerprinting. J. Med. Microbiol. 56 :755-761. [PubMed] [Google Scholar]
26. Thompson, J. R., L. A. Marcelino, and M. F. Polz. 2002. Heteroduplexes in mixed-template amplifications: formation, result and emptying by 'reconditioning PCR'. Nucleic Acids Res. xxx :2083-2088. [PMC free article] [PubMed] [Google Scholar]
27. Venter, J. C., K. Remington, J. F. Heidelberg, A. L. Halpern, D. Rusch, J. A. Eisen, D. Wu, I. Paulsen, K. E. Nelson, W. Nelson, D. E. Fouts, S. Levy, A. H. Knap, M. Westward. Lomas, M. Nealson, O. White, J. Peterson, J. Hoffman, R. Parsons, H. Baden-Tillson, C. Pfannkoch, Y. H. Rogers, and H. O. Smith. 2004. Environmental genome shotgun sequencing of the Sargasso Sea. Science 304 :66-74. [PubMed] [Google Scholar]
28. Vergin, 1000. 50., Due east. Urbach, J. L. Stein, Eastward. F. DeLong, B. D. Lanoil, and S. J. Giovannoni. 1998. Screening of a fosmid library of marine ecology genomic DNA fragments reveals 4 clones related to members of the guild Planctomycetales. Appl. Environ. Microbiol. 64 :3075-3078. [PMC free article] [PubMed] [Google Scholar]
29. Verhelst, R., H. Verstraelen, One thousand. Claeys, 1000. Verschraegen, J. Delanghe, L. Van Simaey, C. De Ganck, M. Temmerman, and 1000. Vaneechoutte. 2004. Cloning of 16S rRNA genes amplified from normal and disturbed vaginal microflora suggests a strong clan between Atopobium vaginae, Gardnerella vaginalis and bacterial vaginosis. BMC Microbiol. 21 :sixteen. [PMC free commodity] [PubMed] [Google Scholar]
30. von Wintzingerode, F., U. B. Gobel, and E. Stackebrandt. 1997. Conclusion of microbial diversity in ecology samples: pitfalls of PCR-based rRNA analysis. FEMS Microbiol. Rev. 21 :213-229. [PubMed] [Google Scholar]
31. Watanabe, K., Y. Kodama, and South. Harayama. 2001. Design and evaluation of PCR primers to amplify bacterial 16S ribosomal DNA fragments used for community fingerprinting. J. Microbiol. Methods 44 :253-262. [PubMed] [Google Scholar]
32. Weisburg, W. Chiliad., S. M. Barns, D. A. Pelletier, and D. J. Lane. 1991. 16S ribosomal Dna amplification for phylogenetic study. J. Bacteriol. 173 :697-703. [PMC complimentary article] [PubMed] [Google Scholar]
33. Wen, A., 1000. Fegan, C. Hayward, S. Chakraborty, and 50. I. Sly. 1999. Phylogenetic relationships amidst members of the Comamonadaceae, and description of Delftia acidovorans (den Dooren de Jong 1926 and Tamaoka et al. 1987) gen. nov., comb. nov. Int. J. Syst. Bacteriol. 49 :567-576. [PubMed] [Google Scholar]
34. Wheeler, D. L., T. Barrett, D. A. Benson, S. H. Bryant, K. Canese, V. Chetvernin, D. Grand. Church, Chiliad. DiCuccio, R. Edgar, S. Federhen, L. Y. Geer, Y. Kapustin, O. Khovayko, D. Landsman, D. J. Lipman, T. 50. Madden, D. R. Maglott, J. Ostell, V. Miller, Yard. D. Pruitt, G. D. Schuler, East. Sequeira, S. T. Sherry, One thousand. Sirotkin, A. Souvorov, G. Starchenko, R. 50. Tatusov, T. A. Tatusova, Fifty. Wagner, and E. Yaschenko. 2007. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 35(Database issue):D5-D12. [PMC free article] [PubMed] [Google Scholar]
35. Wilson, Chiliad. H., R. B. Blitchington, and R. C. Greene. 1990. Amplification of bacterial 16S ribosomal DNA with polymerase chain reaction. J. Clin. Microbiol. 28 :1942-1946. [PMC gratis article] [PubMed] [Google Scholar]
36. Zhou, X., S. J. Bent, M. G. Schneider, C. C. Davis, Chiliad. R. Islam, and L. J. Forney. 2004. Characterization of vaginal microbial communities in adult salubrious women using cultivation-independent methods. Microbiology 150 :2565-2573. [PubMed] [Google Scholar]
Source: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2293150/
0 Response to "Should I Use 8f or 1492r for Sequencing Both Diresction Read or One"
ارسال یک نظر