Summary
CpG islands, which are hypomethylated regions with clustered CpG dinucleotides resulting from a high G+C content and a lack of the normal CpG deficiency, are associated with a large number of vertebrate nuclear genes. There is some evidence that CpG islands may also be a feature of angiosperm genes. In mammalian and avian genes, candidate CpG islands can be readily identified by a simple set of sequence criteria. However, identification of candidate angiosperm CpG islands is more difficult due to the much higher ratio of observed/expected (O/E) CpG in the average angiosperm gene relative to the average vertebrate gene (see accompanying paper). We have developed sets of objective criteria that readily detect angiosperm DNA regions with a consistently high O/E CpG compared to the rest of the genome. These regions, which we call significant CpG-rich regions, always showed a CpG frequency equivalent to or greater than that expected from the base composition, suggesting that they may undergo positive selection for the presence of CpG dinucleotides. The significant CpG-rich regions were similar to vertebrate CpG islands in that they tended to be associated with the 5′ ends of genes, particularly of housekeeping genes, and in that they varied in location in tissue-specific genes. However, unlike the situation in mammalian and avian genes, DNA with a high O/E CpG did not surround the transcription start site of all angiosperm housekeeping genes. Significant CpG-rich regions in monocotyledonous species were G+C rich, like CpG islands. Significant CpG-rich regions in dicotyledonous species, on the other hand, differed from CpG islands but were similar to some CpG-rich regions in cold-blooded vertebrates (Xenopus and fish) in that they were A+T rich. We have argued that both the A+T-rich and the G+C-rich significant CpG-rich regions may fulfill the same function as the G+C-rich CpG islands of vertebrates.
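For orientation, the observed/expected (O/E) CpG ratio used above is conventionally computed as the number of CpG dinucleotides multiplied by the sequence length and divided by the product of the C and G counts. The sketch below is a minimal illustration of that calculation over sliding windows. The function names, the 200-bp window and the 50-bp step are assumptions chosen for illustration; the criteria actually used to define significant CpG-rich regions are not given in this summary and are not reproduced here.

```python
from typing import Iterator, Tuple


def oe_cpg(seq: str) -> float:
    """Observed/expected CpG ratio for one sequence.

    Expected CpG count from base composition is (#C * #G) / length,
    so O/E = (#CpG * length) / (#C * #G).
    Returns 0.0 when the sequence contains no C or no G.
    """
    seq = seq.upper()
    n = len(seq)
    c = seq.count("C")
    g = seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    observed = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    return observed * n / (c * g)


def window_profile(
    seq: str, window: int = 200, step: int = 50
) -> Iterator[Tuple[int, float, float]]:
    """Yield (start, O/E CpG, G+C fraction) for each sliding window.

    window and step are illustrative defaults, not values from the paper.
    A sequence shorter than one window is reported as a single window.
    """
    seq = seq.upper()
    if not seq:
        return
    last_start = max(len(seq) - window, 0)
    for start in range(0, last_start + 1, step):
        chunk = seq[start:start + window]
        gc = (chunk.count("G") + chunk.count("C")) / len(chunk)
        yield start, oe_cpg(chunk), gc


if __name__ == "__main__":
    # Toy A+T-rich sequence: low G+C fraction, yet CpG is not depleted.
    for start, ratio, gc in window_profile("ATACGTAT" * 50):
        print(start, round(ratio, 2), round(gc, 2))
```

Reporting the G+C fraction alongside O/E CpG keeps the two properties separate, which matters here because a window can be A+T rich and still have a CpG frequency at or above that expected from its base composition.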
Additional information
Offprint requests to: M. Gardiner-Garden
About this article
Cite this article
Gardiner-Garden, M., Frommer, M. Significant CpG-rich regions in angiosperm genes. J Mol Evol 34, 231–245 (1992). https://doi.org/10.1007/BF00162972
DOI: https://doi.org/10.1007/BF00162972