Plant Physiology Preview. Published on June 18, 2018, as DOI:10.1104/pp.17.01621
1
Characterization of imprinted genes in rice reveals conservation of regulation and
2
imprinting with other plant species
3 4
Chen Chena,1,2, Tingting Lib,1, Shan Zhuc,1, Zehou Liud, Zhenyuan Shia, Xiaoming Zhenge,
5
Rui Chenf, Jianfeng Huangg, Yi Shenh, Shiyou Luoc, Lei Wangb, Qiao-Quan Liua, and Zhiguo
6
Eb,2
7 8
a
9
Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional
Jiangsu Key Laboratory of Crop Genetics and Physiology, Co-Innovation Center for
10
Genomics of the Ministry of Education, Yangzhou University, Yangzhou, China
11
b
12
c
13
d
14
e
15
Facilities for Crop Gene Resources and Genetic Improvement, Beijing, China
16
f
17
Academy of Agricultural Sciences, Tianjin, China
18
g
19
h
20
1
21
2
China National Rice Research Institute, Hangzhou, China
Rice Research Institute, Jiangxi Academy of Agricultural Sciences, Nanchang, China Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, National Key
Tianjin Institute of Agricultural Quality Standard and Testing Technology, Tianjin
Shanghai Biotechnology Corporation, Shanghai, China Institute of Industrial Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China
These authors contributed equally to this work Address correspondence to
[email protected],
[email protected]
22 23
Running title: Genomic Imprinting of Rice
24 25
One-sentence summary
26
Compared with other species, rice imprinted genes are less associated with transposable
27
elements, and the epigenetic regulation of imprinting occurs both pre- and 1
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
Copyright 2018 by the American Society of Plant Biologists
28
post-fertilization in rice.
29
Author Contributions
30
C.C. conceived the original screening and research plans; C.C., Z.E., and Q.Q.L. designed
31
the research; C.C., T.L., S.Z., Z.L., Z.S., and S.L. performed the experiments; X.Z., R.C., J.H.,
32
Y.S., and L.W. analyzed the data. C.C. wrote the article with contributions of all the
33
authors.
34
2
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
35
Abstract
36
Genomic imprinting is an epigenetic phenomenon by which certain genes display
37
differential expression in a parent-of-origin-dependent manner. Hundreds of imprinted
38
genes have been identified from several plant species. Here we identified, with a high
39
level of confidence, 208 imprinted gene candidates from rice (Oryza sativa). Imprinted
40
genes of rice showed limited association to the transposable elements, which contrasts
41
with findings from Arabidopsis thaliana. Generally, imprinting in rice is conserved within
42
a species, but intraspecific variation was also detected. The imprinted rice genes do not
43
show signatures of selection, which suggests that domestication has had a limited
44
evolutionary consequence on genomic imprinting. Though conservation of imprinting in
45
plants is limited, we show that some loci are imprinted in several different species.
46
Moreover, our results suggest that different types of epigenetic regulation can be
47
established either before or after fertilization. Imprinted 24-nt small RNAs and their
48
neighboring genes tend to express alleles from different parents. This association was
49
not observed between 21-nt small RNAs and their neighboring genes. Together, our
50
findings suggest that regulation of imprinting can be diverse, and genomic imprinting
51
has evolutionary and biological significance.
52
3
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
53
Introduction
54
Imprinted genes are expressed in only one of the parental alleles in a
55
parent-of-origin-dependent manner (Köhler et al., 2012; Gehring, 2013). Genes that
56
exclusively or preferentially express the maternal or paternal alleles are termed
57
maternally expressed genes (MEGs) or paternally expressed genes (PEGs), respectively.
58
Genomic imprinting has been observed in many species, from mammals to flowering
59
plants (Pires and Grossniklaus, 2014), and hundreds of putative imprinted loci, including
60
protein-coding genes and non-coding RNAs, have been discovered in plants (Gehring et
61
al., 2011; Hsieh et al., 2011; Luo et al., 2011; Waters et al., 2011; Wolff et al., 2011;
62
Zhang et al., 2011; Waters et al., 2013; Xin et al., 2013; Pignatta et al., 2014; Xu et al.,
63
2014; Florez-Rueda et al., 2016; Hatorangan et al., 2016; Klosinska et al., 2016; Zhang et
64
al., 2016). However, there is little conservation of imprinting among plant species
65
(Waters et al., 2013; Hatorangan et al., 2016), which suggests that the evolution of
66
genomic imprinting in plants is very rapid.
67
Genomic imprinting is regulated epigenetically by either DNA methylation or histone
68
modification, and in some circumstances, both mechanisms are involved (Huh et al.,
69
2008; Köhler and Weinhofer-Molisch, 2010). In plants, imprinting is predominantly
70
expressed in the endosperm (Luo et al., 2011; Raissig et al., 2013; Klosinska et al., 2016),
71
which is hypomethylated relative to the embryo and vegetative tissues (Gehring et al.,
72
2009; Zemach et al., 2010; Rodrigues et al., 2013; Xing et al., 2015; Klosinska et al., 2016).
73
The DNA glycosylase DEMETER (DME), which is responsible for DNA demethylation, is
74
expressed predominantly in the central cell prior to fertilization (Choi et al., 2004).
75
Imprinting disorders can be found in dme mutants (Hsieh et al., 2011; Wolff et al., 2011;
76
Vu et al., 2013), indicating that proper DNA methylation status is required for the
77
establishment of genetic imprinting in the endosperm. Several studies have revealed
78
that imprinted genes usually neighbor transposable elements (TEs) (Wolff et al., 2011;
79
Pignatta et al., 2014; Hatorangan et al., 2016). High expression of DME in the central cell 4
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
80
promotes demethylation of TEs and their adjacent imprinted genes (Gehring et al., 2009).
81
Therefore, maternal alleles of imprinted genes are activated in the central cell, but the
82
paternal alleles are hypermethylated and remain silenced in the sperm cell, which
83
eventually leads to expression of only the maternal allele after fertilization. Polycomb
84
Repressive Complex 2 (PRC2)-mediated Histone H3 Lysine 27 Trimethylation (H3K27me3)
85
is required for proper genomic imprinting (Huh et al., 2008; Köhler and
86
Weinhofer-Molisch, 2010). Mutation of fertilization-independent endosperm (FIE), a
87
member of the Fertilization Independent Seed (FIS)-PRC2 family, may result in disruption
88
of imprinting at some loci (Hsieh et al., 2011; Wolff et al., 2011).
89
Genetic evidence has revealed that several MEGs are likely important for seed and/or
90
endosperm development in Arabidopsis thaliana (Chaudhury et al., 1997; Luo et al.,
91
2000; Tiwari et al., 2008; Gerald et al., 2009; Costa et al., 2012; Liu et al., 2014). However,
92
many mutants of paternally expressed imprinted genes show no altered phenotype
93
(Köhler et al., 2005; Shirzadi et al., 2011; Bratzel et al., 2012; Vu et al., 2013; Wolff et al.,
94
2015). In addition, conservation of imprinting between closely related species is limited
95
(Waters et al., 2013; Hatorangan et al., 2016). These observations cast doubt on the
96
importance of genomic imprinting in plants. The kinship theory (Haig and Westoby, 1991)
97
hypothesizes that imprinting arose as a consequence of conflict between male and
98
female gametes. Specifically, this theory suggests that males gain fitness benefits from
99
greater transmission of resources from the mother to the offspring, but females benefit
100
by suppression of growth-related demands from the offspring that are driven by
101
paternally active genes. Numerous observations support the kinship hypothesis (Köhler
102
et al., 2012; Pires and Grossniklaus, 2014; Rodrigues and Zilberman, 2015). However, in
103
plants, this theory has been challenged (Rodrigues and Zilberman, 2015). Recent
104
findings in Arabidopsis revealed that some PEGs may be involved in hybrid
105
incompatibility (Wolff et al., 2015), implying that genomic imprinting may play a role in
106
speciation. However, the evolutionary significance of genomic imprinting requires 5
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
107
further investigation in different plant species.
108
Rice (Oryza sativa) is a model monocot and a vital agricultural crop. Consistent with
109
results from dicots, studies in maize (Zea mays) (another monocot) have revealed that
110
differential DNA methylation and H3K27me3 modification between parental genomes
111
are essential for imprinting of monocots (Waters et al., 2013; Zhang et al., 2014a).
112
However, our understanding of the epigenetic regulation of genomic imprinting in rice
113
remains limited. Through a genome-wide survey of imprinted genes from reciprocal
114
intraspecific crosses, Luo et al. (2011) and Yuan et al. (2017) identified more than three
115
hundred imprinted candidates in rice. However, there was little overlap in the identity of
116
these candidates between the studies (Yuan et al., 2017). The ability to identify
117
imprinted genes is dependent largely on the availability of informative single-nucleotide
118
polymorphisms (SNPs). Therefore, discovery of imprinted genes from distinct
119
intraspecific crosses is necessary to obtain greater certainty for identifying imprinted
120
candidates. Moreover, due to its rapid evolution, variation in imprinting has been found
121
among accessions in both Arabidopsis and maize (Pignatta et al., 2014; Waters et al.,
122
2013). In this study, we explored imprinted genes in rice by making two independent
123
reciprocal crosses between indica and japonica cultivars, as well as crosses between
124
cultivated and wild rice. Our results showed that the regulatory mechanisms of genomic
125
imprinting in rice are diverse. The findings will help us to better understand the
126
regulation and evolution of imprinting in plants.
127 128
6
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
129
Results
130
Identification of imprinted genes in cultivated rice
131
To explore the imprinted genes in rice, we used three-line hybrid rice strains to construct
132
reciprocal crosses. A cytoplasmic male sterile (CMS) line carries a sterility gene encoded
133
in the cytoplasmic genome, such that the pollen produced is sterile. A “maintainer” line
134
shares the same nuclear genome as its corresponding CMS line but has a distinct
135
cytoplasmic genome. Due to the absence of the cytoplasmic sterility gene, the
136
maintainer line is fertile. Using rice CMS and maintainer lines for crossing greatly
137
increases the efficiency of hybridization and enables us to rigorously control the timing
138
of fertilization and to avoid false hybridization events. Here, we used the japonica CMS
139
lines (Liuqianxin-A and Yu6-A) and their maintainer lines (Liuqianxin-B and Yu6-B), and
140
the indica CMS lines (Rongfeng-A and Wufeng-A) with their corresponding maintainer
141
lines (Rongfeng-B and Wufeng-B) to make two distinct reciprocal-cross sets, which we
142
refer to as Liuqianxin-A/Rongfeng-B (LR), Rongfeng-A/Liuqianxin-B (RL), Yu6-A/Wufeng-B
143
(YW), and Wufeng-A/Yu6-B (WY). The LR and RL reciprocal crosses, and YW and WY
144
reciprocal crosses are indicated as LR-RL and YW-WY hereafter for convenience.
145
Meanwhile, we made crosses of Wufeng-A/Wufeng-B (WW), Rongfeng-A/Rongfeng-B
146
(RR), Liuqianxin-A/Liuqianxin-B (LL), and Yu6-A/Yu6-B (YY), which resembled selfing of
147
inbred lines as controls to validate imprinting.
148
Morphological observations suggested that seeds produced by LR, RL, YW, and WY
149
developed normally (Figure 1A and B). The endosperm from these reciprocal crosses
150
was collected at five days after fertilization (DAF) for RNA-sequencing (RNA-seq). By
151
comparing LR-RL transcriptome data to whole genomic resequencing data from
152
Liuqianxin-A and Rongfeng-B, we identified 62,102 SNPs from 12,299 endosperm
153
expressed genes in the LR-RL cross that could be used to distinguish parental alleles.
154
Similarly, by comparing the RNA-seq data from YW-WY to genomic resequencing of
155
Yu6-A and Wufeng-B, we identified 59,107 SNPs from 11,044 genes. In total, parental 7
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
156
expression of 4,946 genes in LR-RL and 6,308 genes in YW-WY deviated from the
157
expected 2:1 maternal-to-paternal ratio [χ2 test; p < 0.05, false discovery rate (FDR) < 8
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
158
0.05].
159
Next, we surveyed parent-of-origin effects of these genes by examining expression levels
160
that were greater than 2-fold (>4:1 for moderate MEGs and 10:1 for strong MEGs and 5-fold bias) shared between LR-RL and YW-WY in the following
199
studies (Figure 1F).
200
Using Reverse Transcription PCR and sequencing, we confirmed that all 14 of the tested
201
candidates were imprinted genes (Supplemental Figure 1A). We also performed
202
single-colony sequencing and pyro-sequencing (Supplemental Figure 1B to D) to confirm
203
the imprinting status of two quantitative trait loci, Grain Weight 2 (GW2) and protein
204
phosphatase with Kelch-like repeat domain 2 (OsPPKL2), for rice seed development
205
(Song et al., 2007; Zhang et al., 2012b), suggesting that genome imprinting may function
206
directly for seed development in rice.
207
For MEGs, Gene Ontology (GO) analysis showed enrichment for the term “DNA binding”
208
(p = 0.00031, FDR = 0.017), whereas for PEGs, the terms “transferase activity” (p =
209
6.3e-05, FDR = 0.0014), “kinase activity” (p = 6.3e-05, FDR = 0.0014) and “protein
210
binding” (p = 0.001, FDR = 0.015) were overrepresented. This result suggested that PEGs
211
and MEGs might have different functions. Interestingly, we noticed that several of the 10
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
212
imprinted genes were involved in epigenetic regulation (Supplemental Table 2). A
213
previous study reported that OsFIE1 was the only imprinted PRC2 gene in rice (Luo et al.,
214
2009). Here we found that rice Embryonic Flower 2a (OsEMF2a), a homolog of the
215
Su(z)12 family, also was imprinted, which was validated by different approaches
216
(Supplemental Figure 2).
217 218
Comparison to previously identified imprinted genes in rice
219
Luo et al. (2011) conducted a genome-wide survey of imprinted genes in rice using the
220
japonica cultivar Nipponbare and the indica cultivar 9311 as the parents (N9-9N).
221
Approximately 74% (53/72) of the PEGs and 53% (49/93) of the MEGs identified
222
overlapped with the ones identified in the present study (Figure 1F). Similarly, about 80%
223
(76/97) of the PEGs and 44% (72/162) of the MEGs discovered from L0-0L (Yuan et al.,
224
2017) were included in our list of candidates (Figure 1F). However, there were fewer
225
overlaps between N9-9N and L0-0L (Yuan et al., 2017). In total, 300 MEGs and 289 PEGs
226
have been identified from rice (Figure 1F and Supplemental Table 1), with PEGs being
227
more conserved than MEGs. About 54.3% of the PEGs (157/289) could be found in at
228
least two sets of the four reciprocal crosses (RL-LR, WY-YW, N9-9N and L0-0L), while only
229
39% of the MEGs (117/300) coincided among the different combinations.
230
Most of the imprinted genes identified in one cross also tended to be imprinted in
231
other reciprocal crosses (Figure 2A to C and Supplemental Figure 3). The ones found in
232
only one set of reciprocals usually lacked informative SNPs or sufficient reads to identify
233
if they were imprinted in other crosses (Supplemental Figure 3). For example, among
234
the 156 imprinted genes identified in the other two studies but not in the present study
235
(Figure 1F), 62 (40%) lacked polymorphisms or expression in WY-YW and LR-RL.
236
Allele-specific imprinting has been found in maize and Arabidopsis (Waters et al.,
237
2013; Pignatta et al., 2014). To test for allele-specific imprinting in rice, we used a cut-off
238
criterion. If a strong imprinted gene identified in one reciprocal-cross set failed to show 11
Downloaded from on June 18, 2018 - Published by www.plantphysiol.org Copyright © 2018 American Society of Plant Biologists. All rights reserved.
239
moderate imprinting (