Ind J Clin Biochem (Jan-Mar 2012) 27(1):28–33 DOI 10.1007/s12291-011-0154-y
ORIGINAL ARTICLE
Evaluation of Quality of DNA Extracted from Buccal Swabs for Microarray Based Genotyping
Alex Livy • Sayhean Lye • Chahil K. Jagdish • Nurul Hanis • Velapasamy Sharmila • Lian Wee Ler • Bagali Pramod
Received: 24 July 2011 / Accepted: 31 July 2011 / Published online: 23 August 2011 © Association of Clinical Biochemists of India 2011
Abstract
Buccal cells have been shown by many groups to be a cost-effective and safe source of DNA for various biological experiments, especially large epidemiological studies (Garcia-Closas et al. Cancer Epidemiol Biomarkers Prev 10:687–696, 2001). Non-invasive DNA collection methods are preferred over phlebotomy because they increase study participation and compliance in research centers and among sick patients in hospital settings. However, reports on the methodology and on the results obtained with buccal DNA conflict, and it is not clear whether phlebotomy can be confidently replaced by buccal cell collection; the decision is often left to the user. To address this issue, we compared the performance of buccal and blood DNA from the same subjects in a genotyping experiment. Buccal cells were scraped with cotton swabs from the inner cheeks of 16 subjects participating in a genotypic association study of a lipid disease, and blood was drawn from the same 16 subjects. DNA quality was assessed by agarose gel electrophoresis, by purity (A260/A280) and finally by microarray hybridization. The study showed that DNA degradation reduces the total yield and the performance of buccal DNA relative to blood DNA in microarray-based genotyping. Genotyping results can be seriously compromised if care is not taken to check the quality and yield of such specimens.
A. Livy (corresponding author) • S. Lye • C. K. Jagdish • N. Hanis • V. Sharmila • L. W. Ler • B. Pramod
INFOVALLEY® Life Sciences Sdn. Bhd., Unit 1.1, Level 1, Block B, MINES Waterfront Business Park, No. 3, Jalan Tasik, 43300 MINES Resort City, Selangor, Malaysia
e-mail:
[email protected]
Keywords Buccal cells • Blood • Quality • Microarray • Genotyping • DNA • SNP • Concordance
Introduction

With the advent of new techniques such as next-generation sequencing and microarray-based diagnostics, it is natural for clinicians to look for a safe and cost-effective sampling method that attracts all strata of clients and improves business. Customers are always looking for non-invasive methods of screening, diagnosis or hospital treatment, especially in the case of infants and elderly people. In most hospital settings, however, drawing blood is a common procedure, and many people shy away from routine tests because of the pain of the needle prick. Different methods of buccal cell collection have been in use, such as mouthwash, cytobrush and spit-type cards, but most of them are carried out by the subject with minimal or no supervision [1–4]. The first source of variability creeps in here: some subjects scrape hard enough and some hardly do, resulting in variable amounts of cells. The second issue with buccal cells is DNA contamination [5]. The procedures strictly require the subject to rinse his or her mouth thoroughly before collecting buccal cells and to refrain from eating for at least 2 hours before swabbing, but an overwhelming number of people do not follow the procedures exactly, which leads to contamination of buccal samples with non-human DNA and other noise-producing artifacts. Food particles often remain in the mouth, contaminate the buccal cells and lead to erroneous results [6]. Quantitative PCR using human primers can detect this issue, but the use of such DNA for large-scale genotyping is time consuming and challenging [7].
Research scientists are tempted to use buccal cell DNA for large-scale genotyping studies because it is easier to handle; for instance, an advantage of buccal cells or saliva over blood is that they can be dried on cards and sent by mail, which is especially useful when samples are collected for a multicentric study [3]. However, the results have to be evaluated carefully, locus by locus, to be accurate. This is often problematic because DNA derived from buccal cells is frequently degraded, which leads to poorer yields and inefficient primer-specific amplification and, in turn, to errors in genotype calls or to no calls at all [8, 9]. Very few reports are available on the quality of buccal DNA versus blood DNA in microarray experiments [10, 11]. One study discusses the lag time between collection and extraction and how it affects the quality of genomic DNA [6]. Our study looks at these issues from a practical angle, and we provide our opinion on the use of buccal cell DNA for important biological experiments.
Materials and Methods

Subjects

The subjects were enrolled from a larger study group on SNP association with a lipid disorder. Informed consent was obtained from all participants; the study complied with the Declaration of Helsinki, and the protocol was approved by the ethics committee of the University Malaya Medical Centre (UMMC) [Ref: 546.16]. Blood samples were collected from all participants in the larger study, but both buccal and blood samples were collected only from the group of 16 subjects used in this experiment. Blood was drawn by a hospital nurse into 10 ml purple EDTA vacutainer tubes. Each participant was asked to give one buccal swab from each cheek. They were given two cotton swabs and two test tubes labeled with the patient ID and were asked to scrape the inside of the cheek firmly six times with the swab. An SOP (standard operating procedure) for buccal cell extraction, prepared by us, was given to each subject to read and follow. The samples were brought back to the laboratory and air dried for 20 min before being processed.

DNA Extraction Method

Buccal

The buccal swab was placed in a 2 ml centrifuge tube and 400 µl of phosphate-buffered saline, pH 7.1, was added to the sample. The Qiagen protocol was followed with minor modifications (QIAamp DNA Blood Mini Kit, Cat. no. 51106): the incubation time at 56°C was increased from 10 min to 20 min, and the DNA was eluted by adding 150 µl of buffer AE, incubating at room temperature for 10 min and then centrifuging at 8,000 rpm for 1 min. The eluted DNA was stored at 4°C until used.

Blood

Blood DNA was extracted following the protocol provided by the manufacturer (QIAamp DNA Blood Mini Kit, Cat. no. 51106).

Agarose Gel Electrophoresis

6 µl of each DNA sample was electrophoresed on a 0.8% gel containing GelRed at 60 V in TBE buffer (0.089 M Tris base; 0.089 M boric acid; 0.002 M EDTA, disodium salt, dihydrate; final pH 8.3) at constant current for 40 min. A UVP imager (Model: ChemiDoc-It 410) coupled to an ultraviolet transilluminator was used to take a digital picture of the gel, and the quality of the DNA was evaluated (Fig. 1). DNA integrity was determined by visual inspection for degradation, which appears as fragmentation of the DNA relative to a known molecular weight marker (GeneRuler 1 kb DNA ladder, Fermentas Life Sciences, Inc.) with visible bands of 10,000, 8,000, 6,000, 5,000, 4,000, 3,500, 3,000, 2,500, 2,000, 1,500, 1,000, 750, 500 and 250 bp. Degradation was observed in 3–4 buccal samples, evident as a long smear in contrast to the sharp bands near the well seen for blood samples. Samples 6 and 10 showed the most degradation. The quantity of buccal DNA visible on the gel was low even though the spectrophotometric readings were high.

DNA Yield and Integrity

The yield of DNA was estimated using a BioRad spectrophotometer, and the results are tabulated in Table 1.

Microarray Based Genotyping

Genotyping was performed on the Illumina platform using the GGGT assay, which is capable of multiplexing up to 1,536 SNPs in a single reaction.
All assays were performed on 32-array Universal BeadChips according to the manufacturer's protocol and in compliance with MIAME guidelines [12, 13]. All raw intensity data from our custom GGGT™ microarray assays were fed to the Illumina GenomeStudio™ software
Fig. 1 Electrophoretic analysis of genomic DNA from blood and buccal cell samples. 6 µl of DNA was loaded on a 1% agarose gel stained with GelRed® and visualized using a GelDoc system. Top panel: blood DNA samples of the 16 subjects, identified by the numbers above the lanes. Bottom panel: buccal samples of the same 16 subjects. The DNA of subjects 6 and 10 is badly degraded. The DNA of the other subjects is fairly intact, but the amount is very low
Table 1 Comparison of buccal and blood DNA yields and quality

              Buccal                                      Blood
Subject ID    Concentration (ng/µl)  Purity (A260/280)    Concentration (ng/µl)  Purity (A260/280)
1             257.96                 1.28                 51.26                  2.0
2             296.86                 1.22                 30.36                  2.1
3             172.6                  1.3                  100.76                 1.9
4             26.03                  0.9                  98.03                  1.6
5             166.8                  1.3                  91.12                  2.0
6             106.23                 1.3                  61.3                   1.5
7             433.2                  1.9                  78                     1.7
8             166.8                  1.8                  51.4                   2.0
9             185.03                 1.4                  100.86                 1.6
10            129.16                 2.0                  100.66                 1.6
11            63.33                  1.2                  100.96                 1.6
12            98.36                  1.2                  87.6                   1.5
13            33.6                   1.1                  67.8                   1.6
14            18.95                  0.8                  93.05                  1.7
15            58.6                   1.2                  76.16                  1.6
16            29.8                   1.3                  65.9                   1.6

The purity of the buccal DNA is less than 1.7, with an average reading of 1.3, which indicates that there is protein contamination. However, blood DNA purity seems to be good, averaging 1.7
to decipher the true allele calls; the software performs automated genotype clustering and calling and allows the data to be visualized for further analysis. The overall genotype call rate was 70% or above, and allelic data were successfully generated for 84% of the SNP loci (1,292 out of 1,536). All genotyped SNPs were assessed for deviation from Hardy–Weinberg
equilibrium using GenomeStudio. Any sample with a call rate below 70% was discarded. The SNPs were evaluated by cluster separation score and then inspected visually for call integrity. Genotyping success was judged by the GC score, which ranges from 0 to 1 and reflects how close a genotype lies, within the cluster plot of intensities, to the centroid of the nearest cluster. All genotypes with a GC score below 0.25 were considered failures. The objective of the study was to compare the yield and quality of DNA obtained from matched buccal swab and blood samples and, in addition, to assess the performance of the samples in microarray genotyping.

Statistical Analysis

Pairwise statistical analysis was done for buccal and blood samples. Since our ultimate objective was to compare genotype calls as a function of DNA quality, the results were scored as a call or a no call. SNPs that gave no call for blood were excluded before the statistical analysis, since blood is considered the reference sample type here. Concordance of genotype calls between blood and buccal samples from the same individual was evaluated using percent concordance and the kappa statistic, which measures the agreement between methods beyond that expected by chance. Percent concordance and kappa statistics were calculated only among genotypes called in both samples being compared, excluding missing data, using an available online tool (Ref: http://faculty.vassar.edu/lowry/VassarStats.html).
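The scoring and agreement steps described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the VassarStats tool used in the study: the function names, the use of `None` as a no-call marker and the toy genotypes are all hypothetical. Kappa is computed in the usual unweighted form, (p_obs - p_exp) / (1 - p_exp), over SNPs called in both samples.

```python
from collections import Counter

NO_CALL = None  # assumed marker for a failed or missing genotype


def apply_gc_threshold(calls, gc_scores, threshold=0.25):
    """Convert every genotype whose GC score is below the threshold to a no-call."""
    return [c if (c is not NO_CALL and s >= threshold) else NO_CALL
            for c, s in zip(calls, gc_scores)]


def call_rate(calls):
    """Fraction of SNPs that received a genotype call."""
    return sum(c is not NO_CALL for c in calls) / len(calls)


def concordance_and_kappa(blood, buccal):
    """Percent concordance and unweighted Cohen's kappa.

    Only SNPs called in both samples are compared; missing data are excluded.
    """
    pairs = [(a, b) for a, b in zip(blood, buccal)
             if a is not NO_CALL and b is not NO_CALL]
    n = len(pairs)
    if n == 0:
        raise ValueError("no SNP was called in both samples")
    p_obs = sum(a == b for a, b in pairs) / n
    blood_freq = Counter(a for a, _ in pairs)
    buccal_freq = Counter(b for _, b in pairs)
    # Agreement expected by chance from the marginal genotype frequencies.
    p_exp = sum(blood_freq[g] * buccal_freq[g] for g in blood_freq) / (n * n)
    if p_exp == 1.0:
        return 100.0 * p_obs, float("nan")  # kappa is undefined in this case
    return 100.0 * p_obs, (p_obs - p_exp) / (1.0 - p_exp)


# Toy data for one subject (invented for illustration only).
blood = ["AA", "AB", "BB", "AA", "AB", "BB", "AA", NO_CALL]
buccal = ["AA", "AB", "BB", "AB", "AB", NO_CALL, "AA", "AA"]
percent, kappa = concordance_and_kappa(blood, buccal)
print(f"concordance = {percent:.1f}%, kappa = {kappa:.3f}")
```

Excluding no-calls before computing agreement means that kappa describes only the accuracy of the calls that were made; the loss of calls itself is captured separately by the call rate.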
Results

We sought to address the suitability of amplified buccal swab DNA for high-throughput genotyping through a blood-buccal comparison in the same subjects. DNA was successfully extracted from all blood and buccal samples provided by the volunteers. A total of 200 µl of blood was drawn from each subject and two buccal swabs were collected to extract DNA. The yield of DNA in each sample was estimated spectroscopically, which also gave the purity of the nucleic acid (Table 1). The concentration of blood DNA ranged from 30.36 to about 100 ng/µl (Table 1), which is as expected from the claims of the Qiagen kit manufacturer. However, the buccal DNA yields varied very widely, from as low as 18 ng/µl to as high as 433 ng/µl. High DNA yields in buccal samples could be due to the presence of exogenous DNA. It is therefore advisable to use more starting material for buccal samples to prevent inferior microarray results. The DNA was loaded on a 0.8% agarose gel to check integrity, degradation and RNA contamination (Fig. 1). The purity of blood DNA was good when compared with that of buccal DNA: the average purity of buccal DNA was only 1.3 (95% CI 0.67–1.95) (Table 1).
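The purity comparison can be reproduced with a few lines of Python. The sketch below is illustrative only: the A260/A280 values are transcribed from Table 1, the 1.7 cutoff is the one quoted in the table footnote, and the helper name `summarize_purity` is an assumption made for this example.

```python
from statistics import mean

# A260/A280 ratios transcribed from Table 1, subjects 1 to 16 in order.
BUCCAL_PURITY = [1.28, 1.22, 1.3, 0.9, 1.3, 1.3, 1.9, 1.8,
                 1.4, 2.0, 1.2, 1.2, 1.1, 0.8, 1.2, 1.3]
BLOOD_PURITY = [2.0, 2.1, 1.9, 1.6, 2.0, 1.5, 1.7, 2.0,
                1.6, 1.6, 1.6, 1.5, 1.6, 1.7, 1.6, 1.6]


def summarize_purity(ratios, cutoff=1.7):
    """Return the mean ratio and the 1-based subject IDs below the cutoff."""
    below = [i + 1 for i, r in enumerate(ratios) if r < cutoff]
    return mean(ratios), below


for label, ratios in (("buccal", BUCCAL_PURITY), ("blood", BLOOD_PURITY)):
    avg, below = summarize_purity(ratios)
    print(f"{label}: mean A260/A280 = {avg:.2f}, "
          f"{len(below)} of {len(ratios)} samples below 1.7")
```

A per-sample listing of this kind makes it easy to see which individual extractions fall below the cutoff rather than relying on the group average alone.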
Of the 16 buccal DNA samples genotyped, all except four were genotyped successfully (75% genotypes called). In the blood samples, an average of 71.22% of SNPs could be genotyped, whereas in the buccal samples the average was only 56.89%. Depending on the shapes of the clusters and their relative distance to each other, a statistical score is devised and combined with several penalty terms (for example, low intensity or a mismatch between existing and predicted clusters) to make up the training ("GenTrain") score. The GenTrain score, along with the cluster positions and shapes for each SNP, is saved for use by the calling algorithm. The theta value, $\theta = \frac{2}{\pi}\tan^{-1}(\mathrm{Cy5}/\mathrm{Cy3})$, indicates the allelic angle. Theta values near 0 (left side of the graph) are homozygotes for allele A, theta values near 1 (right side of the graph) are homozygotes for allele B, and heterozygotes fall between these two groups. R is the signal intensity. Genoplots of four SNPs from a matched buccal and blood sample with a p50 GC ratio of 0.50:0.53 are shown to demonstrate a failed SNP (Fig. 2; Table 2). To assess the reliability of the genotyping experiments, the percentage of agreement (i.e., genotype concordance) and the unweighted Cohen's kappa statistic (i.e., the agreement above and beyond chance alone) were
Fig. 2 Genoplots of four SNPs from a matched buccal and blood sample. For the buccal calls, the R values are very low, so a valid call could not be determined by the GenomeStudio software
Table 2 Quality metrics of four SNPs from buccal and blood samples from the same subject

              Buccal                                    Blood
SNP           GenTrain score  Theta    R        Call    GenTrain score  Theta    R        Call
IVPFH-2014    0.5636          0.0097   0.9893   AA      0.5636          0.6763   1.5710   AB
IVSFH-3002    0.5038          0.0639   0.0039   NC      0.5038          0.0000   0.1874   AA
IVSFH-3083    0.0003          0.0000   0.0088   NC      0.5875          0.0000   0.5622   AA
IVSFH-3093    0.0770          0.1191   0.0364   NC      0.5544          0.0000   0.3748   AA

The GenTrain scores, allele angle, signal intensity and call data for four SNPs generated by GenomeStudio software are shown
calculated. Genotypes of paired buccal and blood samples were compared with one another. Only called genotypes were used for the comparisons; all missing data were excluded. The values of the calculated metrics are listed in Table 3.
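The theta and R values reported per SNP in Table 2 can be illustrated with a short Python sketch of the allelic-angle formula given in the Results. The formula for theta follows the text; taking R as the sum of the two channel intensities is an assumption made here for illustration (the text only says that R is the signal intensity), and the function name `theta_and_r` is hypothetical.

```python
import math


def theta_and_r(cy3, cy5):
    """Return (theta, R) for one SNP from two non-negative channel intensities.

    theta = (2 / pi) * arctan(Cy5 / Cy3), so it runs from 0 (allele A
    homozygote) to 1 (allele B homozygote).
    R is taken as Cy3 + Cy5, which is an assumption of this sketch.
    """
    if cy3 < 0 or cy5 < 0:
        raise ValueError("intensities must be non-negative")
    # atan2 gives the same angle as atan(cy5 / cy3) but tolerates cy3 == 0.
    theta = (2.0 / math.pi) * math.atan2(cy5, cy3)
    return theta, cy3 + cy5


# Invented intensities: an A-dominant signal, a balanced signal, a weak signal.
for cy3, cy5 in ((1.2, 0.02), (0.8, 0.8), (0.004, 0.001)):
    theta, r = theta_and_r(cy3, cy5)
    print(f"Cy3={cy3}, Cy5={cy5}: theta={theta:.3f}, R={r:.3f}")
```

A weak signal yields a small R regardless of theta, which is the situation described for the buccal no-calls in Fig. 2.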
Discussion

This study shows that the quality of DNA derived from buccal cells is not consistent across individuals. Although laboratory personnel supervised the buccal swabbing procedure, it was hard to establish at that moment whether the swabbing pressure and technique were adequate. In some samples DNA degradation was more pronounced than in others. An adequate quality check must be done before microarray hybridization. To avoid such failures it would be more prudent to use blood DNA for expensive
Table 3 Kappa statistics for the matched blood and buccal DNA

Sample    Unweighted kappa    SE        CI (upper limit)
FH1       0.0081              0.0686    0.1425
FH2       –                   –         –
FH3       0.7069              0.0599    0.8243
FH4       0.6678              0.0628    0.7910
FH5       0.6076              0.0671    0.7391
FH6       0.0290              0.0835    0.1927
FH7       0.5667              0.0703    0.7044
FH8       –                   –         –
FH9       –                   –         –
FH10      0.009               0.0674    0.1912
FH11      –                   –         –
FH12      0.0107              0.0493    0.1074
FH13      0.0244              0.0813    0.1837
FH14      0.023               0.0788    0.1775
FH15      0.0447              0.0849    0.2110
FH16      0.2604              0.0750    0.4074

– Observed concordance is smaller than mean-chance concordance
Average kappa = 0.2465
methodologies like microarrays than unpredictable buccal DNA. Buccal cell collection was initially put forward as an efficient and cost-effective means of DNA collection. Apart from being non-invasive, the cells can easily be collected on FTA cards and sent by mail [2, 4]. Rinsing the mouth thoroughly before collecting buccal cells is very important, because the chance of contaminant DNA being present is very high if the mouth is inadequately rinsed. The quality of DNA from the mouth varies from person to person, and in some cases the DNA is more prone to early degradation. At least 10% of saline mouthwash samples were degraded in a high myopia study [14]. The cells recovered from the mouth are often superficial ones undergoing apoptosis; about 30% of the cells collected from healthy subjects with non-inflammatory mucosa are apoptotic [15]. Diet also plays an important role in defining individual oral flora. Lifestyle habits also lead to differences in desquamation of the oral mucosa [16]. The oral flora is extremely diverse and subject specific, depending on individual habits such as diet, eating, brushing and smoking [17, 18]. Smoking influences the oral mucosa in various ways. Cancer in the oral cavity usually begins with irritation caused by smoked cigarette products. These irritants cause white lesions. Smoking can also cause abnormalities of the tongue, gums, oral mucosa, teeth and palate in the form of nicotine stomatitis and fungal infections [19, 20]. All of these factors can cause DNA damage [21, 22]. DNA should be checked for contaminating non-human DNA by a suitable method. We recommend increasing the total amount of buccal DNA used as starting material to a much larger amount, as recommended by Illumina, of more than 250 ng. More buccal swabs can be taken from an individual.
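As a worked illustration of the input recommendation, the short Python sketch below converts a measured concentration into the eluate volume needed to reach a 250 ng input and into the total DNA available from one extraction. The 150 µl elution volume and the example concentrations come from the Methods and Table 1; the function names are hypothetical. Because spectrophotometric readings of buccal DNA may include non-human DNA, the volumes should be read as lower bounds on what is needed, not as a validated protocol.

```python
def volume_for_input(concentration_ng_per_ul, target_ng=250.0):
    """Volume of eluate (µl) that contains target_ng of DNA."""
    if concentration_ng_per_ul <= 0:
        raise ValueError("concentration must be positive")
    return target_ng / concentration_ng_per_ul


def total_yield_ng(concentration_ng_per_ul, elution_volume_ul=150.0):
    """Total DNA (ng) in one eluate of the given volume."""
    return concentration_ng_per_ul * elution_volume_ul


# Lowest and highest buccal concentrations listed in Table 1 (ng/µl).
for conc in (18.95, 433.2):
    print(f"{conc} ng/µl: {volume_for_input(conc):.1f} µl for 250 ng, "
          f"{total_yield_ng(conc):.0f} ng in a 150 µl eluate")
```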
Blood has the advantage that the yield of DNA is good and that, in case of a failed experiment, enough reserve sample is available to repeat the experiment, which is not possible with buccal samples, especially for cost-intensive and sensitive methods like microarrays. Our conclusion is that buccal cell DNA is not a suitable alternative to blood DNA for expensive genotyping experiments, and that it is not worth compromising the results to save a few dollars and obtain a larger sample size.
Acknowledgments We greatly appreciate the support of the study subjects and their families who participated in this project. This work was supported by the Ministry of Science, Technology and Innovation (MOSTI), the government of Malaysia. The authors thank the staff of UM, UMMC and INFOVALLEY® for their timely help during the subject recruitment and sample collection.
References
1. Garcia-Closas M, Egan KM, Abruzzo J, Newcomb PA, Titus-Ernstoff L, Franklin T, et al. Collection of genomic DNA from adults in epidemiological studies by buccal cytobrush and mouthwash. Cancer Epidemiol Biomarkers Prev. 2001;10(6):687–96.
2. Freeman B, Powell J, Ball D, Hill L, Craig I, Plomin R. DNA by mail: an inexpensive and noninvasive method for collecting DNA samples from widely dispersed populations. Behav Genet. 1997;27(3):251–7.
3. Harty LC, Garcia-Closas M, Rothman N, Reid YA, Tucker MA, Hartge P. Collection of buccal cell DNA using treated cards. Cancer Epidemiol Biomarkers Prev. 2000;9(5):501–6.
4. Le Marchand L, Lum-Jones A, Saltzman B, Visaya V, Nomura AM, Kolonel LN. Feasibility of collecting buccal cell DNA by mail in a cohort study. Cancer Epidemiol Biomarkers Prev. 2001;10(6):701–3.
5. Herraez DL, Stoneking M. High fractions of exogenous DNA in human buccal samples reduce the quality of large-scale genotyping. Anal Biochem. 2008;383(2):329–31.
6. Feigelson HS, Rodriguez C, Robertson AS, Jacobs EJ, Calle EE, Reid YA, et al. Determinants of DNA yield and quality from buccal cell samples collected with mouthwash. Cancer Epidemiol Biomarkers Prev. 2001;10(9):1005–8.
7. Quinque D, Kittler R, Kayser M, Stoneking M, Nasidze I. Evaluation of saliva as a source of human DNA for population and association studies. Anal Biochem. 2006;353(2):272–7.
8. Bergen AW, Haque KA, Qi Y, Beerman MB, Garcia-Closas M, Rothman N, et al. Comparison of yield and genotyping performance of multiple displacement amplification and OmniPlex whole genome amplified DNA generated from multiple DNA sources. Hum Mutat. 2005;26(3):262–70.
9. Kirov G, Nikolov I, Georgieva L, Moskvina V, Owen MJ, O'Donovan MC. Pooled DNA genotyping on Affymetrix SNP genotyping arrays. BMC Genomics. 2006;7:27.
10. Chang ML, Terrill RL, Bautista MM, Carlson EJ, Dyer DJ, Overall KL, et al. Large-scale SNP genotyping with canine buccal swab DNA. J Hered. 2007;98(5):428–37.
11. Woo JG, Sun G, Haverbusch M, Indugula S, Martin LJ, Broderick JP, et al. Quality assessment of buccal versus blood genomic DNA using the Affymetrix 500K GeneChip. BMC Genet. 2007;8:79.
12. Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, et al. Minimum information about a microarray experiment (MIAME)-toward standards for microarray data. Nat Genet. 2001;29(4):365–71.
13. Fan JB, Gunderson KL, Bibikova M, Yeakley JM, Chen J, Wickham Garcia E, et al. Illumina universal bead arrays. Methods Enzymol. 2006;410:57–73.
14. Zayats T, Young TL, Mackey DA, Malecaze F, Calvas P, Guggenheim JA. Quality of DNA extracted from mouthwashes. PLoS One. 2009;4(7):e6165.
15. Rudney JD, Chen R. The vital status of human buccal epithelial cells and the bacteria associated with them. Arch Oral Biol. 2006;51(4):291–8.
16. King IB, Satia-Abouta J, Thornquist MD, Bigler J, Patterson RE, Kristal AR, et al. Buccal cell DNA yield, quality, and collection costs: comparison of methods for large-scale studies. Cancer Epidemiol Biomarkers Prev. 2002;11(10 Pt 1):1130–3.
17. Aas JA, Paster BJ, Stokes LN, Olsen I, Dewhirst FE. Defining the normal bacterial flora of the oral cavity. J Clin Microbiol. 2005;43(11):5721–32.
18. Paster BJ, Boches SK, Galvin JL, Ericson RE, Lau CN, Levanos VA, et al. Bacterial diversity in human subgingival plaque. J Bacteriol. 2001;183(12):3770–83.
19. Haffajee AD, Socransky SS. Relationship of cigarette smoking to the subgingival microbiota. J Clin Periodontol. 2001;28(5):377–88.
20. Shiloah J, Patters MR, Waring MB. The prevalence of pathogenic periodontal microflora in healthy young adult smokers. J Periodontol. 2000;71(4):562–7.
21. Glei M, Habermann N, Osswald K, Seidel C, Persin C, Jahreis G, et al. Assessment of DNA damage and its modulation by dietary and genetic factors in smokers using the Comet assay: a biomarker model. Biomarkers. 2005;10(2–3):203–17.
22. Konig KG. Diet and oral health. Int Dent J. 2000;50(3):162–74.