Considerations and complications of mapping small RNA high-throughput data to transposable elements
Authors | |
---|---|
Year of publication | 2017 |
Type | Article in Periodical |
Magazine / Source | Mobile DNA |
MU Faculty or unit | |
Citation | |
Web | https://mobilednajournal.biomedcentral.com/track/pdf/10.1186/s13100-017-0086-z?site=mobilednajournal.biomedcentral.com |
Doi | http://dx.doi.org/10.1186/s13100-017-0086-z |
Keywords | Transposable elements; Small RNAs; High-throughput sequencing; siRNAs; Genome mapping; Annotation; Bioinformatics; RNA-seq |
Description | Background: High-throughput sequencing (HTS) has revolutionized the way in which epigenetic research is conducted. When coupled with fully-sequenced genomes, millions of small RNA (sRNA) reads are mapped to regions of interest and the results scrutinized for clues about epigenetic mechanisms. However, this approach requires careful consideration in regards to experimental design, especially when one investigates repetitive parts of genomes such as transposable elements (TEs), or when such genomes are large, as is often the case in plants. Results: Here, in an attempt to shed light on complications of mapping sRNAs to TEs, we focus on the 2,300 Mb maize genome, 85% of which is derived from TEs, and scrutinize methodological strategies that are commonly employed in TE studies. These include choices for the reference dataset, the normalization of multiply mapping sRNAs, and the selection among sRNA metrics. We further examine how these choices influence the relationship between sRNAs and the critical feature of TE age, and contrast their effect on low copy genomic regions and other popular HTS data. Conclusions: Based on our analyses, we share a series of take-home messages that may help with the design, implementation, and interpretation of high-throughput TE epigenetic studies specifically, but our conclusions may also apply to any work that involves analysis of HTS data. |
Related projects: |