A computational survey of candidate exonic splicing enhancer motifs in the model plant Arabidopsis thaliana.

TitleA computational survey of candidate exonic splicing enhancer motifs in the model plant Arabidopsis thaliana.
Publication TypeJournal Articles
Year of Publication2007
AuthorsPertea M, Mount SM, Salzberg SL
JournalBMC Bioinformatics
Volume8
Pagination159
Date Published2007
ISSN1471-2105
KeywordsAlternative Splicing, Arabidopsis, Computational Biology, Enhancer Elements, Genetic, Exons, Genes, Plant, RNA, Plant
Abstract

BACKGROUND: Algorithmic approaches to splice site prediction have relied mainly on the consensus patterns found at the boundaries between protein coding and non-coding regions. However exonic splicing enhancers have been shown to enhance the utilization of nearby splice sites.

RESULTS: We have developed a new computational technique to identify significantly conserved motifs involved in splice site regulation. First, 84 putative exonic splicing enhancer hexamers are identified in Arabidopsis thaliana. Then a Gibbs sampling program called ELPH was used to locate conserved motifs represented by these hexamers in exonic regions near splice sites in confirmed genes. Oligomers containing 35 of these motifs have been shown experimentally to induce significant inclusion of A. thaliana exons. Second, integration of our regulatory motifs into two different splice site recognition programs significantly improved the ability of the software to correctly predict splice sites in a large database of confirmed genes. We have released GeneSplicerESE, the improved splice site recognition code, as open source software.

CONCLUSION: Our results show that the use of the ESE motifs consistently improves splice site prediction accuracy.

DOI10.1186/1471-2105-8-159
Alternate JournalBMC Bioinformatics
PubMed ID17517127
PubMed Central IDPMC1892810
Grant ListR01 LM006845 / LM / NLM NIH HHS / United States
R01 LM006845-07S1 / LM / NLM NIH HHS / United States
R01-LM007938 / LM / NLM NIH HHS / United States