Supplementary Information
CRISPR-RT: A web application for designing CRISPRC2c2 crRNA with improved target specificity Houxiang Zhu1, Emily Richmond2 and Chun Liang1,2,* 1
Department of Biology and 2Department of Computer Science and Software Engineering,
Miami University, Oxford 45056, USA
HZ:
[email protected] ER:
[email protected] CL:
[email protected]
* To whom correspondence should be addressed.
Leptotrichia shahii CRISPR-C2c2 locus
C2c2
CRISPR array
Cas1 Cas2
Pre-crRNA
5’
C2c2
3’ C2c2
Mature crRNA
5’
PFS
Target RNA 5’
3’
3’
Seed
Supplementary Figure S1. The configuration and architecture of the CRISPR-C2c2 system. The CRISPR array of CRISPR-C2c2 is first transcribed into pre-crRNA. Then, the pre-crRNA is processed by C2c2 into mature crRNAs without attaching to trans-activating crRNAs. The mature crRNA binds to C2c2 and guides C2c2 to target a specific single-strand RNA at a proper target site.
Supplementary Figure S2. The parameter setting page of CRISPR-C2c2. First, users input an RNA sequence in FASTA format into the text area. Second, users select a reference transcriptome. Third, users set up the PFS and crRNA requirements for the CRISPR-C2c2 system. Fourth, users choose an off-target setting (either “Basic settings” or “Specific settings”).
A
B
C
D Supplementary Figure S3. The result web interfaces of CRISPR-RT. (A) The detailed information of all the target candidates in the user input RNA sequence. (B) User input RNA sequence viewer. (C) The detailed information of target sites in the transcriptome for each target candidate. (D) Visualization of on- and off-targets with gene and transcript annotations in JBrowse.
Supplementary Figure S4. The basic process of crRNA design based on the target candidate sequence (protospacer). The target complementarity sequence of crRNA is labelled in purple color. The stem-loop sequence of crRNA comes from previous research (Abudayyeh et al., 2016). The designed crRNA binds to C2c2 to form a crRNA-C2c2 complex for RNA targeting.
Supplementary Figure S5. The flowchart of CRISPR-RT backend pipeline that processes user input RNA sequence and displays results in the interfaces. All of input parameters, including the RNA sequence, are passed to the main Perl script. The main Perl script invokes other specific Perl scripts or commands to execute specific functions. Finally, the results will be displayed in DataTables and JBrowse in the web interfaces.
Supplementary Methods CRISPR-RT is essentially composed of many web interfaces and a backend pipeline. Web interfaces are implemented by PHP and JavaScript code, which are used to accept user inputs and display the results interactively. The backend pipeline is implemented by Perl code, which is used to process user input data and generate multiple result files. The same strategy was applied in our previously published web application CT-Finder (Zhu et al., 2016), which helps to design Cas9 gRNAs. After setting up proper parameters and clicking the “Find targets!” button in the parameter setting page of CRISPR-C2c2 (Supplementary Figure S2), all of the parameters stored in PHP code are passed to the main Perl script, which invokes other specific Perl scripts or commands to execute specific functions. The default parameters are set based on Abudayyeh et al.’s research. As shown in Supplementary Figure S5, first, the Perl script for target candidate search is called to find target candidates of specified length with the PFS in the input RNA sequence. Second, Bowtie2 (Langmead and Salzberg, 2012) is used to map each target candidate sequence to the reference transcriptome, which is extracted from the genome by RSEM (Li and Dewey, 2011) using Ensembl or Phytozome gene annotation, to search for on- and off-target sites within the transcriptome. Bowtie2 is particularly good at aligning short reads to long genomes or transcriptomes, which has been used for Cas9 gRNA design in previous study (Heigwer et al., 2014; Zhu et al., 2016). RSEM is a quite popular package for analyzing RNASeq data (Haas et al., 2013; Konermann et al., 2015; Shalek et al., 2013). Third, the Perl scripts for result filtration based on the input parameters are invoked to filter out the off targets that do not meet the requirements set by users. Since the seed region is more sensitive to mismatches than the non-seed region, they are handled separately. Next, RSEM is used to convert the
transcriptome mapping result to a genome mapping result, which can be displayed properly in JBrowse (Skinner et al., 2009). Then, the main Perl script separates the file storing target sites of all target candidates into many single files that store target sites for each target candidate respectively. The main Perl script also processes the file of target candidates and the files of target sites for each target candidate to generate Ajax format files, which are required by DataTables (a table plug-in for jQuery). After getting all of those files, the PHP and JavaScript code is used to display detailed information of target candidates and corresponding target sites in DataTables. The on- and off-target sites can be visualized in JBrowse.
References Abudayyeh,O.O. et al. (2016) C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science, 353, aaf5573. Haas,B.J. et al. (2013) De novo transcript sequence reconstruction from RNA-Seq: reference generation and analysis with Trinity. Nat Protoc, 8. Heigwer,F. et al. (2014) E-CRISP: fast CRISPR target site identification. Nat Meth, 11, 122–123. Konermann,S. et al. (2015) Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Nature, 517, 583–588. Langmead,B. and Salzberg,S.L. (2012) Fast gapped-read alignment with Bowtie 2. Nat Meth, 9, 357–359. Li,B. and Dewey,C.N. (2011) RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics, 12, 323. Shalek,A.K. et al. (2013) Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells. Nature, 498, 236–240. Skinner,M.E. et al. (2009) JBrowse: A next-generation genome browser. Genome Res., 19, 1630–1638. Zhu,H. et al. (2016) CT-Finder: A Web Service for CRISPR Optimal Target Prediction and Visualization. Scientific Reports, 6, srep25516.