figshare
Browse

Drosophila 3ʹ UTRs are more complex than protein-coding sequences

Download (870.69 kB)
journal contribution
posted on 2025-05-11, 23:20 authored by Manjula Algama, Christopher OldmeadowChristopher Oldmeadow, Edward Tasker, Kerrie Mengersen, Jonathan M. Keith
The 3ʹ UTRs of eukaryotic genes participate in a variety of post-transcriptional (and some transcriptional) regulatory interactions. Some of these interactions are well characterised, but an undetermined number remain to be discovered. While some regulatory sequences in 3ʹ UTRs may be conserved over long evolutionary time scales, others may have only ephemeral functional significance as regulatory profiles respond to changing selective pressures. Here we propose a sensitive segmentation methodology for investigating patterns of composition and conservation in 3ʹ UTRs based on comparison of closely related species. We describe encodings of pairwise and three-way alignments integrating information about conservation, GC content and transition/transversion ratios and apply the method to three closely related Drosophila species: D. melanogaster, D. simulans and D. yakuba. Incorporating multiple data types greatly increased the number of segment classes identified compared to similar methods based on conservation or GC content alone. We propose that the number of segments and number of types of segment identified by the method can be used as proxies for functional complexity. Our main finding is that the number of segments and segment classes identified in 3ʹ UTRs is greater than in the same length of protein-coding sequence, suggesting greater functional complexity in 3ʹ UTRs. There is thus a need for sustained and extensive efforts by bioinformaticians to delineate functional elements in this important genomic fraction. C code, data and results are available upon request.

Funding

ARC

DP0556631

DP0879308

DP1095849

History

Journal title

PLoS One

Volume

9

Issue

5

Publisher

Public Library of Science

Place published

San Francisco, CA

Language

  • en, English

College/Research Centre

Faculty of Science and Information Technology

School

School of Mathematical and Physical Sciences