The sequence and analysis of Trypanosoma brucei chromosome II.

TitleThe sequence and analysis of Trypanosoma brucei chromosome II.
Publication TypeJournal Articles
Year of Publication2003
Authorsel-Sayed NMA, Ghedin E, Song J, MacLeod A, Bringaud F, Larkin C, Wanless D, Peterson J, Hou L, Taylor S, Tweedie A, Biteau N, Khalak HG, Lin X, Mason T, Hannick L, Caler E, Blandin G, Bartholomeu D, Simpson AJ, Kaul S, Zhao H, Pai G, Van Aken S, Utterback T, Haas B, Koo HL, Umayam L, Suh B, Gerrard C, Leech V, Qi R, Zhou S, Schwartz D, Feldblyum T, Salzberg S, Tait A, C Turner MR, Ullu E, White O, Melville S, Adams MD, Fraser CM, Donelson JE
JournalNucleic Acids Res
Volume31
Issue16
Pagination4856-63
Date Published2003 Aug 15
ISSN1362-4962
KeywordsAnimals, Antigens, Protozoan, Chromosome mapping, Chromosomes, DNA, Protozoan, Gene Duplication, Genes, Protozoan, Molecular Sequence Data, Pseudogenes, Recombination, Genetic, Sequence Analysis, DNA, Trypanosoma brucei brucei
Abstract

We report here the sequence of chromosome II from Trypanosoma brucei, the causative agent of African sleeping sickness. The 1.2-Mb pairs encode about 470 predicted genes organised in 17 directional clusters on either strand, the largest cluster of which has 92 genes lined up over a 284-kb region. An analysis of the GC skew reveals strand compositional asymmetries that coincide with the distribution of protein-coding genes, suggesting these asymmetries may be the result of transcription-coupled repair on coding versus non-coding strand. A 5-cM genetic map of the chromosome reveals recombinational 'hot' and 'cold' regions, the latter of which is predicted to include the putative centromere. One end of the chromosome consists of a 250-kb region almost exclusively composed of RHS (pseudo)genes that belong to a newly characterised multigene family containing a hot spot of insertion for retroelements. Interspersed with the RHS genes are a few copies of truncated RNA polymerase pseudogenes as well as expression site associated (pseudo)genes (ESAGs) 3 and 4, and 76 bp repeats. These features are reminiscent of a vestigial variant surface glycoprotein (VSG) gene expression site. The other end of the chromosome contains a 30-kb array of VSG genes, the majority of which are pseudogenes, suggesting that this region may be a site for modular de novo construction of VSG gene diversity during transposition/gene conversion events.

Alternate JournalNucleic Acids Res.
PubMed ID12907728
PubMed Central IDPMC169936
Grant ListU01 AI43062 / AI / NIAID NIH HHS / United States