The CpG landscape of protein coding DNA in vertebrates

dc.contributor.author: Wilcox, Justin J.
dc.contributor.author: Ord, James
dc.contributor.author: Kappei, Dennis
dc.contributor.author: Gossmann, Toni I.
dc.date.accessioned: 2025-11-24T12:54:03Z
dc.date.available: 2025-11-24T12:54:03Z
dc.date.issued: 2025-05-04
dc.description.abstract: DNA methylation has fundamental implications for vertebrate genome evolution by influencing the mutational landscape, particularly at CpG dinucleotides. Methylation-induced mutations drive a genome-wide depletion of CpG sites, creating a dinucleotide composition bias across the genome. Examination of the standard genetic code reveals CpG to be the only facultative dinucleotide; it is however unclear what specific implications CpG bias has on protein coding DNA. Here, we use theoretical considerations of the genetic code combined with empirical genome-wide analyses in six vertebrate species (human, mouse, chicken, great tit, frog, and stickleback) to investigate how CpG content is shaped and maintained in protein-coding genes. We show that protein-coding sequences consistently exhibit significantly higher CpG content than noncoding regions and demonstrate that CpG sites are enriched in genes involved in regulatory functions and stress responses, suggesting selective maintenance of CpG content in specific loci. These findings have important implications for evolutionary applications in both natural and managed populations: CpG content could serve as a genetic marker for assessing adaptive potential, while the identification of CpG-free codons provides a framework for genome optimization in breeding and synthetic biology. Our results underscore the intricate interplay between mutational biases, selection, and epigenetic regulation, offering new insights into how vertebrate genomes evolve under varying ecological and selective pressures. [en]
dc.identifier.uri: http://hdl.handle.net/2003/44391
dc.language.iso: en
dc.relation.ispartofseries: Evolutionary applications; 18(5)
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: Base composition [en]
dc.subject: Dinucleotides [en]
dc.subject: DNA methylation [en]
dc.subject: Epigenetics [en]
dc.subject: Protein coding DNA [en]
dc.subject.ddc: 660
dc.title: The CpG landscape of protein coding DNA in vertebrates [en]
dc.type: Text
dc.type.publicationtype: Article
dcterms.accessRights: open access
eldorado.dnb.deposit: true
eldorado.doi.register: false
eldorado.secondarypublication: true
eldorado.secondarypublication.primarycitation: Wilcox, J., Ord, J., Kappei, D., & Goßmann, T. (2025). The CpG landscape of protein coding DNA in vertebrates. Evolutionary Applications, 18(5), Article e70101. https://doi.org/10.1111/eva.70101
eldorado.secondarypublication.primaryidentifier: https://doi.org/10.1111/eva.70101

Files

Original bundle
Name: Evolutionary Applications - 2025 - Wilcox - The CpG Landscape of Protein Coding DNA in Vertebrates.pdf
Size: 3.13 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 4.82 KB
Format: Item-specific license agreed upon to submission