The sequence and analysis of Trypanosoma brucei chromosome II.
Title | The sequence and analysis of Trypanosoma brucei chromosome II. |
Publication Type | Journal Articles |
Year of Publication | 2003 |
Authors | el-Sayed NMA, Ghedin E, Song J, MacLeod A, Bringaud F, Larkin C, Wanless D, Peterson J, Hou L, Taylor S, Tweedie A, Biteau N, Khalak HG, Lin X, Mason T, Hannick L, Caler E, Blandin G, Bartholomeu D, Simpson AJ, Kaul S, Zhao H, Pai G, Van Aken S, Utterback T, Haas B, Koo HL, Umayam L, Suh B, Gerrard C, Leech V, Qi R, Zhou S, Schwartz D, Feldblyum T, Salzberg S, Tait A, C Turner MR, Ullu E, White O, Melville S, Adams MD, Fraser CM, Donelson JE |
Journal | Nucleic Acids Res |
Volume | 31 |
Issue | 16 |
Pagination | 4856-63 |
Date Published | 2003 Aug 15 |
ISSN | 1362-4962 |
Keywords | Animals, Antigens, Protozoan, Chromosome mapping, Chromosomes, DNA, Protozoan, Gene Duplication, Genes, Protozoan, Molecular Sequence Data, Pseudogenes, Recombination, Genetic, Sequence Analysis, DNA, Trypanosoma brucei brucei |
Abstract | We report here the sequence of chromosome II from Trypanosoma brucei, the causative agent of African sleeping sickness. The 1.2-Mb pairs encode about 470 predicted genes organised in 17 directional clusters on either strand, the largest cluster of which has 92 genes lined up over a 284-kb region. An analysis of the GC skew reveals strand compositional asymmetries that coincide with the distribution of protein-coding genes, suggesting these asymmetries may be the result of transcription-coupled repair on coding versus non-coding strand. A 5-cM genetic map of the chromosome reveals recombinational 'hot' and 'cold' regions, the latter of which is predicted to include the putative centromere. One end of the chromosome consists of a 250-kb region almost exclusively composed of RHS (pseudo)genes that belong to a newly characterised multigene family containing a hot spot of insertion for retroelements. Interspersed with the RHS genes are a few copies of truncated RNA polymerase pseudogenes as well as expression site associated (pseudo)genes (ESAGs) 3 and 4, and 76 bp repeats. These features are reminiscent of a vestigial variant surface glycoprotein (VSG) gene expression site. The other end of the chromosome contains a 30-kb array of VSG genes, the majority of which are pseudogenes, suggesting that this region may be a site for modular de novo construction of VSG gene diversity during transposition/gene conversion events. |
Alternate Journal | Nucleic Acids Res. |
PubMed ID | 12907728 |
PubMed Central ID | PMC169936 |
Grant List | U01 AI43062 / AI / NIAID NIH HHS / United States |