Virology 485 (2015) 283–288
Contents lists available at ScienceDirect
Virology journal homepage: www.elsevier.com/locate/yviro
Does human papillomavirus-negative condylomata exist? Laila Sara Arroyo Mühr a, Davit Bzhalava a, Camilla Lagheden a, Carina Eklund a, Hanna Johansson b, Ola Forslund b, Joakim Dillner a,c,n, Emilie Hultin a a
Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden Department of Laboratory Medicine, Lund University, Malmö, Sweden c Department of Medical Epidemiology & Biostatistics, Karolinska Institutet, Stockholm, Sweden b
art ic l e i nf o
a b s t r a c t
Article history: Received 29 May 2015 Returned to author for revisions 3 July 2015 Accepted 30 July 2015 Available online 28 August 2015
Condylomata acuminata is caused by human papillomavirus (HPV). PCR with consensus primers will typically detect HPV in 496% of condylomata. Metagenomic sequencing has found that some “HPVnegative” condylomata do indeed contain HPV. We wished to perform a renewed evaluation of the “HPV-negative” condylomata using deeper metagenomics sequencing. Sequencing of whole genome amplified DNA from 40 apparently “HPV-negative” condylomata detected HPV in 37/40 specimens. We found 75 different HPV types, out of which 43 represented novel putative HPV types. Three types were cloned and established as HPV types 200, 201 and 202. Molluscum contagiosum virus was detected in 24 of the 40 samples. In summary, deep sequencing enables detection of HPV in almost all condylomata. “HPV-negative” condylomata might largely be explained by clinical misdiagnosis or the presence of viral variants, distantly related HPV types and/or low viral loads. & 2015 Elsevier Inc. All rights reserved.
Keywords: Human papillomavirus Metagenomic sequencing Condylomata
Introduction Human papillomaviruses (HPVs) are a large group of dsDNAviruses that infect keratinocytes of the skin and mucosa. About 200 different human papillomavirus (HPVs) types have been completely cloned, sequenced and given an official number by the International HPV Reference Center (www.hpvcenter.se, accessed on 2015-05-28), which moved to Karolinska Institutet from Heidelberg in 2013 (Bzhalava et al., 2015). In addition to the established types, a large number of subgenomic HPV sequences representing putatively novel HPV types have been discovered using the broad general primer PCR system FAP (FA-isolates) (Forslund, 2007), CUT PCR (GC-isolates) (Chouhy et al., 2010) and/or using deep sequencing (SE-isolates) (Ekstrom et al., 2011). Condylomata acuminata, anogenital warts, is a very common sexually transmitted disease caused by HPV infection, most commonly by HPV6 of the Alphapapillomavirus genus (Bernard et al., 2010; Gissmann et al., 1982). A small proportion of condylomata are reported to be “HPV-negative” and, besides true negativity, 3 possible explanations have been suggested (Walboomers and Meijer, 1997): (i) specimen inadequacy, (ii)
Abbreviations: HPV, Human papillomavirus; WGA, whole genome amplification n Correspondence to: Department of Laboratory Medicine, Karolinska Institutet, Stockholm SE-141 86, Sweden. Fax: þ 46 858587730. E-mail address:
[email protected] (J. Dillner). http://dx.doi.org/10.1016/j.virol.2015.07.023 0042-6822/& 2015 Elsevier Inc. All rights reserved.
detection method insensitivity, and (iii) the existence of novel HPV types that are not detectable by the method. We have previously pointed out that extended analysis of HPVassociated clinical lesions, such as cervical cancer or condyloma, that are seemingly “HPV-negative” is a promising strategy to discover new HPVs (Ekstrom et al., 2010). In southern Sweden, a systematic condyloma reporting system has performed HPV genotyping of the largest series of condylomata reported to date in the literature (Sturegard et al., 2013). Overall, 96.3% of condylomata were found to be HPV-positive, but a few were seemingly HPVnegative in analyses even with broad HPV-primer PCR systems (Soderlund-Strand et al., 2009). Metagenomic sequencing can be used for an increased sensitivity in HPV detection (Arroyo et al., 2013; Ekstrom et al., 2011) as well as to obtain a comprehensive and more unbiased identification of the DNA present in samples, as no prior PCR or any prior knowledge of the sequence is required (Bzhalava et al., 2014). The aim of this study was to apply modern deep sequencing technology, targeting individual “HPV-negative” condylomata, in order to obtain a sensitive virus detection, also of putatively novel HPVs.
Results The DNA from the 40 condylomata specimens together with 2 negative controls (lab-grade water) were amplified using WGA,
284
L.S. Arroyo Mühr et al. / Virology 485 (2015) 283–288
Table 1 The number of samples that were positive (at least 5 reads for a particular type) for the different HPV types and for each virus group. Disease Number of samples Virus group
Condyloma 40 Number of positive samples
HPV Established HPV types HPV6 HPV180 HPV153 HPV57 HPV202 (SE372) HPV165 HPV200 (SE370¼ SE92, SE94, SE95, SE100) HPV201 (SE371¼ SE107, SE106, SE116, SE183) HPV28 HPV37 HPV66 HPV124 HPV130 HPV148 HPV150 HPV31 HPV49 HPV50 HPV58 HPV101 HPV105 HPV118 HPV120 HPV137 HPV138 HPV140 HPV147 Known putative HPV types Novel putative HPV types Molluscum contagiosum Mimiviridae Baculoviridae Herpesviridae Iridoviridae Microviridae Parvoviridae unclassified
37a 35a 30 18 10 9 7 3 3 3 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 1 1 1 1 7a 30a 24 20 13 11 5 2 2 5
a The number of samples that were positive for at least 1 of the HPV types within the group.
labeled with index tags and sequenced using Illumina deep sequencing. Sequencing results from Illumina MiSeq were used to evaluate DNA library quality, before sequencing them on Illumina NextSeq500. MiSeq sequencing provided about 365 Mb sequencing depth per sample with a read length of 300 bp paired-end, whereas NextSeq sequencing provided about 3.33 Gb sequencing depth per sample with a read length of 150 bp paired-end. Sequencing data from NextSeq consisted of 753 923 432 reads whereof 363 182 were viral reads (0.05% of total sequences). The majority of the viral reads, 329 636 reads, were related to HPV sequences (91%). HPV sequences (with a cutoff of at least 5 reads per HPV type) were found in 37/40 specimens (92.5%), representing a total of 75 different HPV types or putative types within the genera Alpha, Beta and Gamma. The HPV sequences belonged to 27 established HPV types (including the 3 novel types that were found and established in this study) (Table 1). In addition, 5 known putative HPV types were detected in any of 7 samples and no less than 43 previously unknown HPV sequences, putatively representing novel types were detected in any of 30 samples (Table 1). The previously described known putative HPV types had partial sequences deposited in GenBank, but had not been designated an official HPV type number as they had not been cloned, submitted to the International HPV Reference Center and completely sequenced. A Bayesian phylogenetic tree of HPVs based on the L1 part of the 198 established HPV types as
well as the 3 novel putative HPV types (SE379, SE383, SE435) with complete genomes and complete L1 regions are shown in Fig. 1, where the different genera Alpha, Beta, Gamma, Mu and Nu are presented in red, green, blue, orange and purple colors, respectively. The 3 novel putative types cluster within the gamma genus. Sample positivity for any HPV as well as for established types, known and novel putative types as well as for molluscum contagiosum virus is shown in Table 1. HPV6 and HPV180 were the most frequent genotypes, detected in 30 and 18 samples respectively. 13 established HPV types were detected in between 2 to 10 samples, whereas 12 established HPV types were only present in single samples (Table 1). Only 3 samples were negative for any known or putative HPV type. Molluscum contagiosum virus was detected with a total of 27 538 reads, corresponding to 7.6% of all viral reads. Molluscum positivity were detected in 24/40 samples (Table 1). 22 out of those 24 were also positive for HPV sequences. The rest of the virus related reads (only corresponding to 1% of all viral reads) were related to Baculoviridae, Herpesviridae, Microviridae, Mimiviridae, Parvoviridae, Iridoviridae and unclassified viruses (Table 1). The number of these non-HPV, non-molluscum virus related reads were not correlated to the number of HPV, molluscum, as well as human, bacterial, unknown or total number of non-virus related reads in the samples. No viruses were detected in the 2 negative water controls. All HPV-positive samples had more than 10 HPV reads. In fact, 28/40 samples had more than 100 HPV reads and of those, 7 samples had more than 10,000 HPV reads (Table 2). Based only on established HPV types, 31/40 samples were positive for multiple HPV types, being positive for up to 8 different established types. DNA sequences of novel putative HPV types Overall, 46 novel HPV DNA sequences belonging to Beta and Gamma genera were detected from the deep sequencing of the 40 condyloma specimens in this study. Six of the novel putative HPV types represented complete HPV genomes (GenBank accession number KP692114 to KP692119). Three of these were cloned (isolates “SE370”, “SE371” and “SE372”). Clones and sequences were sent to the International HPV Reference Center where they were re-sequenced and established as new HPV types with the designation HPV200, HPV201 and HPV202 (for “SE370”, “SE371” and “SE372”, respectively) and reported in Table 1 as established HPV types. They belonged to the genus Gamma-papillomavirus, but with only 79%, 68% and 82% similarity to the most closely related type (Table 3). HPV200 did not have any ORF corresponding to the E4 gene. The sequences from the remaining 43 novel putative types are deposited in GenBank, accession numbers KP692120 to KP692178. Raw unassembled HPV related reads were deposited in the SRA database with BioProject ID: 278583. Comparison with previous 454 sequencing of the same specimens A previous study sequenced these 40 apparently HPV-negative condyloma specimen in pools of 4 samples with 454 pyrosequencing on a GS Junior instrument (Roche) (12). HPV sequences were detected in 5/10 pools from a total of 35 different HPV types: 13 now established HPV types (HPV 5, 6, 50, 57, 58, 66, 105, 124, 130, 150, 165 (former isolate KC7), 175 (former isolate SE87) and 180 (former isolate FA69)), 1 known putative HPV sequence (SE8) as well as 21 novel putative HPV sequences all belonging to the Gamma genus. Read lengths of singletons and contigs ranged from 89 bp to 7356 bp and samples were considered positive as long as 1 read was detected. Among the 13 established HPV types detected, only 5 (HPV6, 50, 165, 175 and 180) generated contigs 4700 bp. The deeper sequencing performed in this study confirmed positivity (with a cutoff at 45 reads per sample) for 11 of the 13 established
HPV63 HPV1 HPV41 HPV116 HPV HPV444 HPV713 HPVV91 HP 43 HPVPV7 H 40 HPVV54 HP 42 V HPPV327 H V17 3 7 HPHPV 34 V 1 HPPV335 H PV 16 8 H PV 5 3 H V 3 HP PV V67 2 H P V5 H P H
75 6 PV 7 9 H PV V4 115 7 H P V 0 H P V1 04 H P V1 98 H P 1 0 H PV 11 2 H PV 18 H PV 80 H PV 15 H PV 7 H PV1 7 H PV3 13 H PV1 1 H V11 HP V122 HP V9 HP V159 HP V145 HP 174 HPV 38 HPV 0 HPV10 HPV151 HPV22 HPV120 HPV23 HPV201 HPV130 HPV133 HPV14 HPV 2 HPV 191 HPV1180 HP 21 H P V194 HPVV163 18 H H PV17 3 H PV1 3 HPPV9558 H V4 H PV H PV 65 HPPV1190 H V 9 H P 17 7 H P V1 8 HPPVV12 44 H 1 8 H P V 5 H PV V1 137 3 PV 1 16 15 29 4
36 1 V1 14 2 HP V 20 0 HP PV V14 69 H P V1 71 H P V1 6 2 H HPPV11812 H PV 19 H V 35 HPPV1 79 H PV1 46 H V1 HPPV1671 H PV16 2 H V16 HP 166 HPVV200 HP V48 HP 31 HPV1188 HPV HPV50 HPV199 HPV127 HPV165 HPV132
H H P H P V1 HP PV V14 89 H V 1 9 H PV 19 55 HPPV1 108 3 H V 0 H PV 1013 HPPV1119 H V1 12 HPPV1 64 HP V16487 V S 17 HPVE3796 HPV187 HP 88 HPV V60 HPV1175 HPV1 56 HPV18742 SE435 SE383 HPV157 HPV148
H P H V HP PV 160 HP V 77 HP V 29 7 HP HP V28 8 V HP 12V3 H V 5 HP PV194 HPV1170 H V5 H PV 7 HPVPV272 HP 106 HP V90 HPVV71 HPV1 89 HPV 02 HPV883 HPV621 HPV72 HPV61 HPV86 HPV87 HPV114 HPV84 HPV85 HPV70 HPV689 HPV359 HPV 18 HPVV45 HP V97 HP V26 HP V69 HP V51 HP V826 HPPV6 6 H PV5 0 H PV353 H PV 70 H V1 86 HPPV1 123 4 H PV 13 09 H PV V1 38 H P V1 39 H P V1 H P H
285
HPV118 HPV195 HPV196 HPV124 HPV15 HPV 2 HPV120 HP 4 HP V21 HPVV19 25 H H PV47 H PV5 HPPV36 H V1 H PV 43 HPPV1105 H V 2 H PV 8 HPPV999 H H P V2 3 H P V9 4 H P V 8 HP PV V9 150 V9 18 6 2 5
L.S. Arroyo Mühr et al. / Virology 485 (2015) 283–288
Fig. 1. Bayesian phylogenetic tree based on the L1 part of the complete 198 established HPV types and 3 putative novel HPV types (SE-types.). Alpha, Beta, Gamma, Mu and Nu papillomaviruses are presented in red, green, blue, orange and purple colors, respectively.
Table 2 The number of samples within each range of HPV reads. Number of HPV reads
Number of samples
0 1r 10 10r 100 100 r1000 1000r 10,000 10,000r 100,000
1 0 11 12 9 7
HPV types detected in the previous study. The established types HPV5 and HPV175 as well as the known putative sequence SE8 were not detected in this study. 13 of the 21 novel putative HPVs (SE92-100, SE106, SE107, SE109 and SE116) were also detected in this study. As suggested in the previous study (12), the 9 phylogenetically related but non-overlapping fragments SE92-100 were found to belong to the same novel HPV type (HPV200) that was cloned and sequenced in the present study. Furthermore, the 3 non-overlapping sequence fragments SE106, SE107 and SE116 were all found to belong to HPV201, also cloned and sequenced in the present study.
Discussion Deep sequencing using Illumina has demonstrated an increased sensitivity for HPV detection, with all but 3 “HPV-negative”
condylomata (37/40) now being positive for at least one HPV type. Overall, we found 75 different HPV types in the 40 “HPV-negative” condyloma samples (27 established HPV types, 5 known putative HPV types and 43 novel putative types). Several samples were found to contain established HPV types known to cause condylomata, in particular HPV6, although the PCR methods used should have detected these viruses (Aubin et al., 2008; de Villiers et al., 2004; Sturegard et al., 2013). Possible explanations include the presence of viral variants with genomic substitutions in the sequences targeted by PCR primers or probes. Deep sequencing overcomes this problem by detecting HPV types without any prior sequence information, independent of PCR. In this study, we compared data from modern deep sequencing using Illumina with previous data using 454 sequencing. As expected, more HPV types were detected with the deeper sequencing. It also generated longer contigs of HPV sequences. A total of 46 putative novel types were detected, belonging to either Beta or Gamma genera. Beta- and gamma-HPVs are frequently detected on healthy skin (Antonsson et al., 2003; Ekstrom et al., 2013; Forslund et al., 2004; Foulongne et al., 2012) and therefore, presence of these types might be due to biological contamination from adjacent healthy genital skin. Interestingly, the Merkel cell polyomavirus which is known to be commonly present in healthy skin was not detected. Clinical distinction between condyloma and mollusca is not an easy task. Presence of molluscum contagiosum virus is also known to exist concomitantly with presence of HPV(Castronovo et al., 2012). In our study, molluscum contagiosum was detected in 24/40 condyloma samples and 22 out of those 24 also contained HPV
286
L.S. Arroyo Mühr et al. / Virology 485 (2015) 283–288
Table 3 The 3 novel established HPV types (HPV200, HPV201 and HPV202) genome organizations and their similarities of the open reading frames (ORFs) to the closest related HPV types. HPV200 (SE370)
Total
E6
E7
E1
E2
E4
L2
L1
7137 – – –
420 1–420 140 78 (72)
282 417–698 94 81 (84)
1797 685–2481 599 79 (80)
1206 2408–3613 402 82 (79)
x x x x
1509 3613–5121 503 77 (73)
1545 5132–6676 515 79 (86)
HPV201 (SE371)
Total
E6
E7
E1
E2
E4
L2
L1
Number of nucleotides (nt)a Position of ORF in genome Number of amino acids (aa) HPV201 vs HPV163 Gamma-20% identity on nt level (aa level)
7291 – – –
423 1–423 141 62 (47)
297 420–716 99 64 (50)
1827 700–2526 609 66 (51)
1257 2468–3724 419 67 (54)
549 2934–3482 183 –
1554 3726–5279 518 65 (55)
1527 5289–6815 509 68 (68)
HPV202 (SE372)
Total
E6
E7
E1
E2
E4
L2
L1
7344 – – –
432 1–432 144 87 (89)
300 434–733 100 86 (90)
1818 717–2534 606 87 (90)
1179 2470–3648 393 88 (86)
369 3041–3409 123 90 (86)
1497 3650–5146 499 77 (79)
1587 5155–6741 529 82 (92)
Number Position Number HPV200
Number Position Number HPV202 a
of nucleotides (nt)a of ORF in genome of amino acids (aa) vs HPV48 Gamma-2% identity on nt level (aa level)
of nucleotides (nt)a of ORF in genome of amino acids (aa) vs HPV140 Gamma-11% identity on nt level (aa level)
Including STOP codon.
sequences. This leaves only a single condyloma specimen where we did not find neither HPV, nor molluscum contagiousum virus. The detection of multiple viruses in the diseased tissue does not imply causality, as this study did not analyze matched controls. In summary, deep sequencing of apparently “HPV-negative” condylomata specimen greatly reduces the number of “HPVnegative” specimens. Sequencing of apparently HPV-negative specimens from diseases known to be HPV-associated continues to identify a large number of previously unknown HPV sequences.
Importance Conventional methods fail to detect papillomaviruses (HPV) in a small proportion of genital warts (condylomata), a disease established as caused by HPV. Modern deep sequencing is not based on prior knowledge of sequence information, can increase the analytic sensitivity of HPV detection as well as detect viral variants and distantly related viruses. We sequenced 40 apparently HPV-negative genital warts and observed that 37/40 samples did contain HPV, including 43 previously unknown apparently new HPV types. This suggests that seemingly HPV-negative condylomata might be explained by e.g., clinical misdiagnosis or presence of viral variants, low viral loads or presence of previously unknown HPV types.
Material and methods Patients 40 swab samples of apparently HPV-negative condylomata from 21 women and 19 men (Johansson et al., 2013) were analyzed with Illumina sequencing technology. Swab samples were collected with a pre-wetted (0.9% NaCl) cotton-tipped swab rolled over the condyloma and stored in 1 mL saline. Samples were then centrifuged (5 min, 3500 G) and 500 μL of the supernatant were left for pellet re-suspension. 200 μL were used for DNA extraction with MagNA Pure LC using the Total Nucleic Acid Kit (Roche) and DNA was eluted in 100 μL elution buffer. Informed consent was obtained from all participants, the study adhered to the declaration of Helsinki and was approved by the Ethical Review Committee of Lund University (Sweden).
Sample preparation Sample adequacy was assayed using beta-globin PCR, visualized by gel electrophoresis in 11 cases and by real-time PCR in 29 cases (Sturegard et al., 2013). In order to amplify the DNA in the sample, extracted DNA from 40 individual samples and 2 negative controls (water) were subjected to whole genome amplification (WGA) using GenomiPhi High Yield (GE Health Care, United Kingdom). WGA was performed according to manufacturer´s instructions with some modifications. 5 ml of sample were diluted with 20 ml sample buffer. The mix was incubated at 95 1C for 3 min and then cooled on ice. 22.5 μL of reaction buffer was mixed with 2.5 μL of enzyme mix and added to the diluted sample. Samples were incubated at 30 1C for 7 h and inactivated at 65 1C for 10 min. Products were dissolved by diluting samples 1:2 in PCR-Grade water. All amplification products were quantified with QuantiFluor-ST (Promega, US) and a quality analysis (Bioanalyzer, Agilent) was performed to check the DNA fragments size distribution prior library preparation. Deep sequencing For each sample, 50 ng of WGA DNA were tagmented and amplified using the Nextera DNA Sample Preparation kit according to the user guide revision B (Illumina). Two sample tags, a unique combination for each sample, were used in the short PCR amplification step to facilitate multiplex sequencing. All individual libraries were validated, normalized to 4 nM and pooled. Prior to sequencing, the 42 individual indexed libraries (40 specimens and 2 negative controls) were denatured and diluted, resulting in a 15 pM DNA solution. PhiX control was spiked at 1% and the pool was sequenced by paired-end 301 þ301 cycles using MiSeq version 3 reagent kit (Illumina, US). The sequencing preparations were made according to the user guides Preparing Libraries for Sequencing on the MiSeq revision C, Reagent Preparation Guide revision A and MiSeq System User Guide revision K. For an even deeper sequencing, the 42 individual indexed libraries were sequenced using the NextSeq500 instrument. The pool was denatured and diluted, resulting in a 2.6 pM DNA solution. PhiX control was spiked at 1% and the pool was sequenced by paired-end 151 þ151 cycles using NextSeq 500 High Output reagent kit (Illumina, US). The sequencing preparations were made according to the user guides Denaturing and Diluting
L.S. Arroyo Mühr et al. / Virology 485 (2015) 283–288
Libraries for the NextSeq 500 revision A and NextSeq 500 kit Reference Guide revision F. Bioinformatic analysis Sequences obtained from MiSeq and NextSeq platforms and the indexing tags, included in the Illumina primers, were used to assign the sequence reads to the originating samples. The bioinformatic analysis started with quality checking where sequences were trimmed according to their Phred quality scores (Ewing and Green, 1998). Quality checked reads were then screened against the human reference genome hg19, as well as bacterial, phage and vector sequences downloaded from GenBank using BWA-MEM (Li and Durbin, 2010) and reads with 495% identity over 75% of their length to human DNA were removed from further analysis. The rest of the sequences were normalized (http://ged.msu.edu/ papers/2012-diginorm) to discard redundant data and reduce sampling variation and sequencing errors. The normalized dataset was then processed for assembly using the Trinity (Haas et al., 2013), SOAPdenovo-Trans (http://soap.genomics.org.cn/) and IDBA-UD (Peng et al., 2012) assemblers into contiguous sequences (contigs). Reads before assembly were re-mapped to assembled contigs. The use of several assembly algorithms and re-mapping of all singleton reads to assembled contigs was used to validate assembly results (Bzhalava et al., 2013; Meiring et al., 2012). Assembled contigs were then subjected to taxonomic classification using GenBank and paracel blast (www.paracel.com) to classify them as i) previously known sequences, ii) related to previously known sequences, or iii) unrelated to any previously known sequences. All papillomavirus-related contigs and singletons were checked as described to identify possible artifactual “chimeric” sequences (contigs containing sequences originating from different viruses) (Bzhalava et al., 2013). Shortly, the sequence that aligned to its most closely related sequence in GenBank was divided into three equal segments. If at least one of the segments differed in similarity to the corresponding overlapping parts with more than 5% (for example if segment 1 was 88% similar and segment 2 was 94% similar) the sequence was considered as a “possible chimera” and not analyzed further. Papillomavirus-related contigs having Z 90% identity with each other were clustered. The total number of sequencing reads from each cluster was calculated and the longest contig was selected as a representative to remove redundancy. Clustering analysis was conducted separately for contigs longer and shorter than 700 bp length. Based on the percent similarity to the closest related type in GenBank, HPV related contigs were classified as known established, known putative and novel putative HPV types. For each of the HPV types, the total number of reads was calculated and specimens were considered positive if at least 5 reads from the same HPV type were detected in the same specimen. All analyses were performed using in-house R (www.R-project. org/), python (www.python.org) and bash (http://www.gnu.org/ software/bash/) scripts that ran on a high performance (40 core, 2 TB RAM) Linux server. Characterization of HPV200, 201 and 202 The complete genome sequence was obtained for 6 previously unknown HPV sequences, putatively representing new HPV types. 3 of the 6 complete putative novel HPV genomes, “SE370”, “SE371” and “SE372” isolates, were amplified and cloned from their respective specimens in 2 overlapping fragments each. The fragments were amplified using PrimeSTAR GXL DNA Polymerase from TAKARA, according to manufacturer's instructions. The first
287
fragment containing the L1 region was amplified using the primers SE370AF 50 -TGAGGAAGGAGACACTGTTACT-30 and SE370AR 50 -ACT TCCCGCCTCTGTTTGA-30 ; SE371AF 50 -TCCAGAGTCAGGCTTTGTGT-30 and SE371AR 50 -AGCATCTGACACCTGTTCCA-30 ; SE372AF 50 -ACATGAGCCCTATACCACGG-30 and SE372AR 50 -TTTGAGAGTTGAACAG CGCC-30 for “SE370”, “SE371” and “SE372” isolates, respectively. The second fragment was amplified using the primers SE370BF 50 GACAGCAGAGGAGGAGTTGT-30 and SE370BR 50 -CTACAGGAGCACTAGGAGGC-30 ; SE371BF 50 -GTTTTCGGTTGCAGGCATTG-30 and SE3 71BR 50 -CTAAAACCGGCACAGTAGGC-30 ; SE372BF 50 -TCCAGAGTCAGGCTTTGTGT-30 and SE372BR 50 -AAACGCCAAACCAACCTCTC-30 for “SE370”, “SE371” and “SE372” isolates respectively. The PCRprogram for all fragments consisted on 30 cycles at 98 1C for 10 s, 55 1C for 15 s and 68 1C for 8 min. The PCR products were gelpurified and cloned using the Zero Blunts TOPOs PCR Cloning kit (Invitrogen) and the pCRTM-Blunt II-TOPOs vector, according to manufacturer's instructions and incubated overnight. The complete cloned genomes for the 3 novel HPV types were also sequenced by Sanger sequencing to confirm the sequence obtained with the MiSeq instrument from each index sample, using designed primers for conventional primer walking. Clones were sent to the International HPV Reference Center (Karolinska Institutet, Sweden) that confirmed the DNA sequence and assigned the submitted clones the novel HPV type number 200, 201 and 202 for “SE370”, “SE371” and “SE372”, respectively.
Acknowledgments Supported by the Swedish Cancer Society, grant number 11 0569, and by the Swedish Research Council, grant number K201257X-21044-04-6. All authors declare no conflicts of interest. References Antonsson, A., Erfurt, C., Hazard, K., Holmgren, V., Simon, M., Kataoka, A., Hossain, S., Hakangard, C., Hansson, B.G., 2003. Prevalence and type spectrum of human papillomaviruses in healthy skin samples collected in three continents. J. Gen. Virol. 84, 1881–1886. Arroyo, L.S., Smelov, V., Bzhalava, D., Eklund, C., Hultin, E., Dillner, J., 2013. Next generation sequencing for human papillomavirus genotyping. J. Clin. Virol.: Off. Publ. Pan Am. Soc. Clin. Virol. 58, 437–442. Aubin, F., Pretet, J.L., Jacquard, A.C., Saunier, M., Carcopino, X., Jaroud, F., Pradat, P., Soubeyrand, B., Leocmach, Y., Mougin, C., Riethmuller, D., Group, E.D.S, 2008. Human papillomavirus genotype distribution in external acuminata condylomata: a Large French National Study (EDiTH IV). Clin. Infect. Dis.: Off. Publ. Infect. Dis. Soc. Am. 47, 610–615. Bernard, H.U., Burk, R.D., Chen, Z., van Doorslaer, K., zur Hausen, H., de Villiers, E.M., 2010. Classification of papillomaviruses (PVs) based on 189 PV types and proposal of taxonomic amendments. Virology 401, 70–79. Bzhalava, D., Eklund, C., Dillner, J., 2015. International standardization and classification of human papillomavirus types. Virology 476, 341–344. Bzhalava, D., Johansson, H., Ekstrom, J., Faust, H., Moller, B., Eklund, C., Nordin, P., Stenquist, B., Paoli, J., Persson, B., Forslund, O., Dillner, J., 2013. Unbiased approach for virus detection in skin lesions. PLoS One 8, e65953. Bzhalava, D., Muhr, L.S., Lagheden, C., Ekstrom, J., Forslund, O., Dillner, J., Hultin, E., 2014. Deep sequencing extends the diversity of human papillomaviruses in human skin. Sci. Rep. 4, 5807. Castronovo, C., Lebas, E., Nikkels-Tassoudji, N., Nikkels, A.F., 2012. Viral infections of the pubis. Int. J. STD AIDS 23, 48–50. Chouhy, D., Gorosito, M., Sanchez, A., Serra, E.C., Bergero, A., Fernandez Bussy, R., Giri, A.A., 2010. New generic primer system targeting mucosal/genital and cutaneous human papillomaviruses leads to the characterization of HPV 115, a novel Beta-papillomavirus species 3. Virology 397, 205–216. de Villiers, E.M., Fauquet, C., Broker, T.R., Bernard, H.U., zur Hausen, H., 2004. Classification of papillomaviruses. Virology 324, 17–27. Ekstrom, J., Bzhalava, D., Svenback, D., Forslund, O., Dillner, J., 2011. High throughput sequencing reveals diversity of Human Papillomaviruses in cutaneous lesions. Int. J. Cancer (Journal International du Cancer) 129, 2643–2650. Ekstrom, J., Forslund, O., Dillner, J., 2010. Three novel papillomaviruses (HPV109, HPV112 and HPV114) and their presence in cutaneous and mucosal samples. Virology 397, 331–336. Ekstrom, J., Muhr, L.S., Bzhalava, D., Soderlund-Strand, A., Hultin, E., Nordin, P., Stenquist, B., Paoli, J., Forslund, O., Dillner, J., 2013. Diversity of human papillomaviruses in skin lesions. Virology 447, 300–311.
288
L.S. Arroyo Mühr et al. / Virology 485 (2015) 283–288
Ewing, B., Green, P., 1998. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186–194. Forslund, O., 2007. Genetic diversity of cutaneous human papillomaviruses. J. Gen. Virol. 88, 2662–2669. Forslund, O., Lindelof, B., Hradil, E., Nordin, P., Stenquist, B., Kirnbauer, R., Slupetzky, K., Dillner, J., 2004. High prevalence of cutaneous human papillomavirus DNA on the top of skin tumors but not in “Stripped“ biopsies from the same tumors. J. Investig. Dermatol. 123, 388–394. Foulongne, V., Sauvage, V., Hebert, C., Dereure, O., Cheval, J., Gouilh, M.A., Pariente, K., Segondy, M., Burguiere, A., Manuguerra, J.C., Caro, V., Eloit, M., 2012. Human skin microbiota: high diversity of DNA viruses identified on the human skin by high throughput sequencing. PLoS One 7, e38499. Gissmann, L., deVilliers, E.M., zur Hausen, H., 1982. Analysis of human genital warts (condylomata acuminata) and other genital tumors for human papillomavirus type 6 DNA. Int. J. Cancer 29, 143–146. Haas, B.J., Papanicolaou, A., Yassour, M., Grabherr, M., Blood, P.D., Bowden, J., Couger, M.B., Eccles, D., Li, B., Lieber, M., Macmanes, M.D., Ott, M., Orvis, J., Pochet, N., Strozzi, F., Weeks, N., Westerman, R., William, T., Dewey, C.N., Henschel, R., Leduc, R.D., Friedman, N., Regev, A., 2013. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512.
Johansson, H., Bzhalava, D., Ekstrom, J., Hultin, E., Dillner, J., Forslund, O., 2013. Metagenomic sequencing of “HPV-negative” condylomas detects novel putative HPV types. Virology 440, 1–7. Li, H., Durbin, R., 2010. Fast and accurate long-read alignment with Burrows– Wheeler transform. Bioinformatics 26, 589–595. Meiring, T.L., Salimo, A.T., Coetzee, B., Maree, H.J., Moodley, J., HitzerothII, J., Freeborough, M.J., Rybicki, E.P., Williamson, A.L., 2012. Next-generation sequencing of cervical DNA detects human papillomavirus types not detected by commercial kits. Virol. J. 9, 164. Peng, Y., Leung, H.C., Yiu, S.M., Chin, F.Y., 2012. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28, 1420–1428. Soderlund-Strand, A., Carlson, J., Dillner, J., 2009. Modified general primer PCR system for sensitive detection of multiple types of oncogenic human papillomavirus. J. Clin. Microbiol. 47, 541–546. Sturegard, E., Johansson, H., Ekstrom, J., Hansson, B.G., Johnsson, A., Gustafsson, E., Dillner, J., Forslund, O., 2013. Human papillomavirus typing in reporting of condyloma. Sex. Transm. Dis. 40, 123–129. Walboomers, J.M., Meijer, C.J., 1997. Do HPV-negative cervical carcinomas exist? J. Pathol. 181, 253–254.