Phylogenetic conservation of cis-regulatory regions using sequence alignability and cladistic motifs.
📄 Viewing lite version
Full site ›
Book Details
Author(s)David C King
ISBN / ASIN1243655275
ISBN-139781243655271
AvailabilityUsually ships in 1 to 3 weeks
MarketplaceUnited States 🇺🇸
Description ▲
A growing body of research shows that conservation of regulatory regions across wide phylogenetic spans (e.g. pan-vertebrate) is the exception rather than the rule. Here we study the conservation of regulatory regions without predisposition toward perfect alignment or deep conservation, allowing all possible observations to be interpreted a posteriori as conserved, lineage-specific, misalignment, or as ambiguous. In order to do this, we use the multi-species alignment as the measurement, and define the conservation of any given region as its "alignability" to various species. We also define a conservation-agnostic datatype, called a cladistic motif, which is produced by scanning each row of an alignment as a single sequence and then organizing the matches in terms of their placement within the alignment. Because no a priori assumptions are made about the strength of the alignment, cladistic motifs can describe all simple as well as irregular forms of conservation. We explore cladistic motifs as components of common regulatory regions, i.e. those that fall outside the conventional classification of a multi-species constrained sequence, and are the product of a variety of evolutionary source material for binding sites, such as those arisen by adaptation in human accelerated regions, motif turnover. These cases emphasize the spontaneity of cis-regulatory evolution, and may help explain why functional regulatory regions are a lesser-conserved fraction of the genome.