HomeScienceGenome-scale RNA interference profiling of Trypanosoma brucei cell cycle progression defects

Genome-scale RNA interference profiling of Trypanosoma brucei cell cycle progression defects

[ad_1]

A genome-wide conditional knockdown screen for cell cycle progression defects

Bloodstream form T. brucei are readily grown in cell culture, with exponential proliferation and a doubling time of approximately 6.5 h. The T. brucei nuclear genome is typically diploid such that G1 cells have a 2 C genome content; C represents the haploid DNA content. Cells progressing through nuclear S phase, and replicating their DNA, have a genome content between 2 C and 4 C, while cells that have completed DNA replication (G2M) have a 4 C DNA content (Fig. 1a). Mitosis and cytokinesis then produce two daughter cells with a 2 C DNA content. Some perturbations yield defects involving a ‘short-circuit’, whereby S phase, mitosis and/or cytokinesis are skipped, producing sub-2C cells or over-replicated, polyploid (>4 C) cells (Fig. 1a). Polyploid cells arise due to endoreduplication, additional rounds of DNA replication without cytokinesis, either with24 or without25,26 mitosis, yielding cells with multiple nuclei or with polyploid nuclei, respectively.

Fig. 1: A genome-wide conditional knockdown screen for cell cycle progression defects.
figure 1

a Schematic representation of the bloodstream form T. brucei cell cycle, also showing aberrant sub-2C and >4 C (multiple nuclei) phenotypes; anuclear zoids are not shown as they are undetectable by RIT-seq. b The schematic illustrates the RIT-seq screen; massive parallel induction of RNAi with tetracycline (Tet), followed by flow cytometry and RIT-seq, allowing for reconstruction of cell cycle profiles, using mapped reads from each knockdown. Each read-mapping profile encompasses the gene of interest and associated untranslated regions present in the cognate mRNA. The library data represents the uninduced and unsorted population. GeneIDs, Tb927.7.3160 for example, are indicated without the common ‘Tb927.’ component.

We devised a high-throughput RNA interference (RNAi) target sequencing (RIT-seq) screen to identify cell cycle controls and regulators at a genomic scale. Key features of RIT-seq screening include: first, use of a high-complexity T. brucei RNAi library comprising, in this case, approximately one million clones; second, massive parallel tetracycline-inducible expression of cognate dsRNA; and third, deep sequencing, mapping and counting of mapped reads derived from RNAi target fragments22. Each clone in the library has one of approximately 100,000 different RNAi target fragments (250–1500 bp) between head-to-head inducible T7-phage promoters. This is achieved by targeting each cassette to a specific, single chromosomal locus that supports robust and reproducible inducible expression22,27. Inducibly expressed long dsRNA is then processed to siRNA by the native RNAi machinery28. Complexity and depth of genome coverage in the library are critical, in that similar phenotypes produced by multiple clones with distinct RNAi target fragments against the same gene provide cross-validation. Improvements in reference genome annotation29, next generation sequencing technology and sequence data analysis tools (see Methods) have also greatly facilitated quantitative phenotypic analysis using short-read sequence data.

Briefly, we induced massive parallel knockdown in an asynchronous T. brucei bloodstream form RNAi library for 24 h, fixed the cells, stained their DNA with propidium iodide (PI) and then used fluorescence-activated cell sorting (FACS) to divide the perturbed cell population into; sub-diploid (<2 C), G1 (2 C), S (between 2 C and 4 C), G2M (4 C) and over-replicated (>4 C) pools (Supplementary Fig. 1). Fixation and staining with the fluorescent DNA intercalating dye were pre-optimised for high-throughput sorting (see Materials and Methods). Approximately 10 million cells were collected for each of the G1, S and G2M pools and samples from these pools were checked post-sorting to assess their purity (Fig. 1b, Supplementary Fig. 1). For the perturbed and less abundant <2 C and >4 C pools, less than one million cells were collected; these pools were retained in their entirety for RIT-seq analysis.

RIT-seq was carried out for both the uninduced and induced, unsorted library controls, and for each of the five induced and sorted pools of cells as described in the Methods section. Briefly, we extracted genomic DNA from each sample, amplified DNA fragments containing each RNAi target fragment in PCR reactions (Supplementary Fig. 1) and used the amplified products to generate Illumina sequencing libraries. Analysis of sequencing reads mapped to the reference genome yielded counts for both total reads as well as reads containing the barcode (GTGAGGCCTCGCGA) that flanks each RNAi target fragment; the presence of the barcode confirmed that reads were derived from a specific RNAi target fragment and not from elsewhere in the genome. We derived counts of reads mapped to each of >7200 non-redundant gene sequences in the uninduced and induced, unsorted library controls and in each of the five sorted samples. We selected the 24 h timepoint, equivalent to approximately 3.5 population doubling times, for the current analysis. We found that reads for 23.4% of genes were diminished by >3-fold following 72 h of knockdown in our prior RIT-seq study23, while reads for only 0.6% of genes dropped by >3-fold following 24 h of knockdown in the unsorted control samples analysed here (see Supplementary Fig. 2a, b). Thus, 24 h should have allowed sufficient time for the development of robust inducible phenotypes and also captured perturbed cells before they were critically diminished due to loss-of-fitness. An unanticipated feature that emerged from this analysis of prior RIT-seq data was that knockdown of proteins associated with DNA replication typically failed to register a major loss-of-fitness (Supplementary Fig. 2a, b). This suggested that a reduced rate of DNA replication can be tolerated, albeit extending S phase (see below) but having relatively little impact on viability. Each sorted sample library yielded between 23 and 37 million mapped read-pairs; <2 C = 37 M, G1 = 35 M, S = 30 M, G2M = 23 M, > 4 C = 25 M; this set of five samples yielded data for >7000 genes which equates to >35,000 RNAi data-points (Supplementary data 1).

The RIT-seq digital data for individual genes following knockdown provided a measure of abundance in each pool and were, therefore, used to digitally reconstruct cell cycle profiles for individual gene knockdowns (Fig. 1b). We expected to observe accumulation of particular knockdowns in specific cell cycle phase pools, thereby reflecting specific defects. This was indeed the case, and some examples are shown to illustrate; no major defect, G2M overrepresented or >4 C overrepresented, following knockdown (Fig. 1b). These outputs suggest that loss of a cytoplasmic dynein heavy chain (7.3160) does not perturb cell cycle distribution; that the proteasome is required to complete G2M (see below); and that knockdown of a flagellar axonemal dynein heavy chain (11.11220) results in endoreduplication in the absence of cytokinesis; dyneins are cytoskeletal motor proteins that either move along microtubules or drive microtubule sliding, to produce a flagellar beat, for example30.

Validation and identification of genes linked to cell cycle defects

The T. brucei core genome comprises a non-redundant set of over 7200 protein-coding sequences, for which we were now able to digitally reconstruct cell cycle profiles following knockdown. First, we examined knockdowns reporting an overrepresentation of >4 C cells, indicating endoreduplication, which yielded 284 genes (Fig. 2a, left-hand panel; Supplementary data 1). The >4 C phenotype was previously observed following α-tubulin knockdown in a landmark study that first described RNAi in T. brucei24 and, indeed, we observed pronounced overrepresentation of >4 C cells for both adjacent α-tubulin and β-tubulin gene knockdowns (Fig. 2a, middle and right-hand panel). We then examined knockdowns reporting an overrepresentation of <2 C cells, indicating a reduced DNA content, which yielded 10 hits (Fig. 2b, left-hand panel; Supplementary data 1). Haploid cells were previously observed following DOT1A knockdown31 and, consistent with the previous report, we observed pronounced overrepresentation of <2 C cells for the DOT1A gene knockdown (Fig. 2b, middle and right-hand panel); we are not aware of other knockdowns reported to yield a similar phenotype. Indeed, other ‘<2 C hits’ mostly encode small hypothetical proteins, seven of which are 73 ± 11% shorter than the average, consistent with low read-count and under-sampling for these hits (Supplementary Fig. 2c). The remaining two hits are a histone chaperone (ASF1B) and a glycolytic enzyme (PFK). Together, these results provided initial validation for the >4 C and <2 C components of the screen.

Fig. 2: Validation and identification of >1000 candidates linked to cell cycle defects.
figure 2

a The plot on the left shows knockdowns overrepresented in the >4 C experiment in red; those with reads in the >4 C pool that exceeded the mean fold-change value by >1.75 times the SD, equivalent to >1.117-fold the sum of reads in the G1, S phase and G2M samples combined. The read-mapping profile and read-counts for α/β-tubulin are shown to the right. b The plot on the left shows knockdowns overrepresented in the sub-2C experiment in orange; those with reads in the sub-2C pool that exceeded the mean fold-change value by >1.75 times the SD, equivalent to >4-fold the sum of reads in the G1, S phase and G2M samples combined. The read-mapping profile and read-counts for DOT1A are shown to the right. c The RadViz plot shows knockdowns that registered >25% overrepresented read-counts in the G1 (purple), S phase (green), or G2M (blue) categories. d Read-mapping profiles and relative read-counts for example hits. PCNA, proliferating cell nuclear antigen; PPL2, PrimPol-like 2. e The Venn diagram shows the distribution of knockdowns overrepresented in each arm of the screen.

Next, we turned our attention to knockdowns reporting an overrepresentation of G1, S phase or G2M cells. The pools of knockdowns that registered >25% overrepresented read counts in each of these categories are highlighted in the RadViz plot in Fig. 2c (also see Supplementary data 1) and data for an example from each category are shown in Fig. 2d; the glycolytic enzyme, aldolase, reported 104% increase in G1 cells (further details below); the proliferating cell nuclear antigen (PCNA), a DNA sliding clamp that is a central component of the replication machinery32, reported 25% increase in S phase cells and 13% increase in G2M cells, consistent with prior analysis33; and PrimPol-like 2 (PPL2), a post-replication translesion polymerase, reported 65% increase in G2M cells, also consistent with prior analysis34. These results provided initial validation for the G1, S phase and G2M components of the screen. The full dataset can be searched and browsed using an interactive, open access, online data visualization tool (see Supplementary Fig. 3; https://tryp-cycle.pages.dev/).

Overall, the five components of the screen yielded 1198 genes that registered a cell cycle defect, based on the thresholds applied above. This is 16.6% of the 7205 genes analysed, and the distributions of these genes among the five arms of the screen are shown in the Venn diagram in Fig. 2e. Since we predicted that knockdowns associated with a cell cycle defect were more likely to also register a growth defect, we compared these datasets to prior RIT-seq fitness profiling data23. All groups of genes that registered cell cycle defects, except for the small <2 C set, were significantly enriched for genes that previously registered a loss-of-fitness phenotype following knockdown in bloodstream form cells (χ2 test; <2 C, p = 0.93; G1, p = 0.04; S phase, p = 1.3−4; G2M, p = 1.3−23; >4 C, p = 9.4−213), consistent with loss-of-fitness as a common outcome following a cell cycle progression defect. Taken together, the analyses above provided validation for the RIT-seq based cell cycle phenotyping approach and yielded >1000 candidate proteins that impact progression through specific steps of the T. brucei cell cycle.

Cytokinesis defects associated with endoreduplication

In bloodstream form T. brucei, defective >4 C cells can arise due to endoreduplication without cytokinesis, either with24 or without25,26 mitosis. Endoreduplication defects were previously observed following knockdown of α-tubulin24 or flagellar proteins7,35; consistent with the view that flagellar beat is required for cytokinesis in bloodstream form T. brucei. As shown above, dynein heavy chain (see Fig. 1b), α-tubulin and β-tubulin (see Fig. 2a) knockdowns were amongst 284 knockdowns overrepresented in the endoreduplicated pool in our screen. Gene Ontology (GO) annotations, which provide structured descriptions of gene products in terms of functions, processes, and compartments, were assessed to further profile this cohort of knockdowns. Terms overrepresented in association with an endoreduplication defect included ‘dynein’, ‘intraflagellar transport’ (IFT), ‘axoneme’ and ‘cytoskeleton’, and also ‘chaperonin T-complex’, ‘cytokinesis’ and ‘cell cycle’ (Fig. 3a). The violin plot in Fig. 3b shows specific enrichment of IFT and dynein knockdowns in association with endoreduplication. Exocyst components, primarily involved in exocytosis36, were included as a control cohort since none of the exocyst components registered enrichment in the >4 C pool, nor in any other experimental pool analysed here (see below). Enrichment of individual chaperonin T-complex components, dyneins, and IFT factors in the >4 C pool is illustrated in Fig. 3c. The chaperonin T-complex is involved in tubulin and actin folding37 and, notably, actin knockdown was also associated with endoreduplication (Supplementary Fig. 4).

Fig. 3: Cytokinesis defects associated with endoreduplication.
figure 3

a The bar-graph shows enriched Gene Ontology terms associated with the >4 C hits, those that exceed the mean fold-change value in this set by >1.75 times the SD. P-values are shown on the right. b The violin plot shows relative >4 C read-counts for cohorts of genes and reflects data distribution. Open circles indicate median values and the vertical bars indicate 95% confidence intervals. Significantly overrepresented cohorts are indicated in red. c The plots show overrepresentation of T-complex, dynein and intraflagellar transport (IFT) factors in red in the >4 C experiment. d The heatmaps show relative representation in all five sorted pools for the above and additional cohorts of knockdowns; blue, most overrepresented. e Example read-mapping profiles for hits overrepresented in the >4 C pool. f Example read-mapping profiles for ciliopathy-associated hits overrepresented in the >4 C pool. CMF Component of Motile Flagella, CFAP Cilia and Flagella Associated Protein.

The heat-map in Fig. 3d shows the data for all five sorted pools for the cohorts described above and for additional cohorts of knockdowns enriched in the >4 C pool; these include additional dynein chains, radial spoke proteins, extra-axonemal paraflagellar rod (PFR) proteins, as well as nucleoporins. The gallery in Fig. 3e shows examples of RIT-seq read-mapping profiles for twenty-six individual genes that register >4 C enrichment following knockdown. In addition to the categories above, these include the inner arm dynein 5-138, FAZ proteins which mediate attachment of the flagellum to the cell body39; all four cytokinesis initiation factors CIF1-440, and chromosomal passenger complex components, including CPC1 and the aurora B kinase, AUK1. AUK1 and CPC1 are spindle-associated and regulate mitosis and cytokinesis26,41. Notably, endoreduplication was reported previously following AUK1 knockdown in bloodstream form T. brucei42 and this is the kinase with the most pronounced overrepresentation in our >4 C dataset. The next >4 C overrepresented kinase is the CMGC/RCK (Tb927.3.690), knockdown of which previously yielded a striking cytokinesis defect43.

Additional examples of genes registering >4 C overrepresentation include the centriole cartwheel protein SAS644, the cleavage furrow-localizing protein FRW145, the basal body—axoneme transition zone protein TZP12546 and the basal body protein BBP24847. One hundred additional examples are shown in Supplementary Fig. 4, including intermediate and light chain dyneins, other flagellum-associated factors, radial spoke proteins, components of motile flagella, flagellum attachment and transition zone proteins, kinesins48,49, nucleoporins50, and many previously uncharacterised hypothetical proteins. Some other notable examples include the microtubule-severing katanin KAT8051, the dynein regulatory factor trypanin52, the AIR9 microtubule associated protein53, CAP51V54 and importin, IMP155.

Orthologues of several T. brucei flagellar proteins have previously been linked to debilitating human ciliopathies, such that the trypanosome flagellum is exploited as a model for studies on these defects7. Defects in intraflagellar dynein transport are associated with respiratory infections, for example9. Orthologues of DNAH (10.5350 and 11.8160, Fig. 3e) are linked to male infertility, while additional ciliopathy-associated orthologues which register overrepresentation in the >4 C pool are shown in Fig. 3f and Supplementary Fig. 4. These include orthologues of proteins linked to primary ciliary dyskinesia (DNAH5, DNAH11, RSPH4 and DNAI1)11; male infertility (PF16, PACRGA, CFAP43 and CMF7/TbCFAP44)7,10; and cone-rod dystrophies, as well as other ocular defects (CMF17, CMF39 and CMF46)8.

From analysis of knockdowns overrepresented in the >4 C pool, we conclude that RIT-seq screening provided comprehensive genome-scale identification of cytokinesis defects in bloodstream form T. brucei. Endoreduplication appears to be a common outcome following a cytokinesis defect. Amongst hundreds of genes required for progression through cytokinesis, flagellar proteins featured prominently, including the majority of dynein chains and intraflagellar transport factors. Many of these factors are essential for viability and include potential druggable targets in trypanosomatids, as well as orthologues of proteins associated with ciliopathies.

Defects producing sub-diploid cells

A DNA replication or mitosis defect followed by cytokinesis may result in generation of cells that retain nuclear DNA with a sub-2C DNA content. We emphasise retention of nuclear DNA here because T. brucei cells lacking nuclear DNA, referred to as zoids, have been reported previously as a result of asymmetrical cell division. Zoids are typically observed when DNA replication or mitosis are perturbed in insect stage cells16,25,56. The zoid phenotype is typically either absent or less pronounced in the developmentally distinct bloodstream form cells57 that we analysed here. Nevertheless, any zoids present in the <2 C pool will not have been detected using RIT-seq, since detection relies upon the presence of a nuclear RNAi target fragment.

Ten knockdowns were overrepresented in the <2 C RIT-seq screening dataset (Supplementary data 1), including the previously identified histone methyltransferase, DOT1A (Fig. 2b). DOT1A is responsible for dimethylation of histone H3K76, and DOT1A knockdown results in mitosis and cytokinesis without DNA replication, generating cells with a haploid DNA content 31. Our data suggest that few additional knockdowns yield a similar phenotype in bloodstream form T. brucei.

A profile of G1, S phase and G2M defects

We next analysed knockdowns overrepresented in the G1, S phase or G2M pools. Several hundred knockdowns registered >25% overrepresented read counts in each of these categories (Fig. 2c, e). GO annotations within each cohort revealed a number of enriched terms (Fig. 4a). Overrepresented knockdowns were associated with glycolysis, mRNA binding and the mitochondrion in the G1 pool, with DNA replication in the S phase pool and with a broadly similar profile to that seen for the >4 C set in the G2M pool.

Fig. 4: A profile of G1, S phase and G2M defects.
figure 4

a The bar-graphs show enriched Gene Ontology terms associated with the G1, S phase or G2M hits, those that register >25% overrepresented read-count in each of these categories. P-values are shown on the right. b The violin plots show relative G1, S phase or G2M read-counts for cohorts of genes and reflect data distribution. Open circles indicate median values and the vertical bars indicate 95% confidence intervals. Overrepresented cohorts are indicated in purple, green and blue, respectively. IFT intraflagellar transport. c The heatmaps show relative representation in all five sorted pools for the above and additional cohorts of knockdowns; blue, most overrepresented. MCM minichromosome maintenance, PSP1 DNA polymerase suppressor 1.

The violin plots in Fig. 4b show specific enrichment of individual knockdowns for glycolytic enzymes and a subset of mRNA binding proteins in the G1 pool, for DNA replication factors in the S phase pool, and proteasome components and a subset of kinetochore components in the G2M pool (Fig. 4b). Overlap between knockdowns that accumulate in both the G2M and >4 C pools likely reflects cytokinesis defects with cells accumulating both before and after endoreduplication; compare G2M and >4 C data for IFT factors and dyneins in Fig. 4b and Fig. 3b, for example. Other mitosis or cytokinesis-perturbed phenotypes are likely not associated with substantial endoreduplication; see the kinetochore and proteasome cohorts in Fig. 4b, for example. Once again, the exocyst provided a control cohort with no components registering enrichment in the G1, S phase or G2M pools following knockdown (Fig. 4b).

The heat-map in Fig. 4c shows the data for all five sorted pools for the cohorts described above and for additional knockdowns enriched in the G1 or S phase (tRNA synthetases), S phase (core histones) or G2M pools (PSP1, DNA polymerase suppressor 1), or not enriched in any pool. These latter sets provide further controls that do not appear to have substantial impacts on cell cycle progression, including the mitochondrial RNA editing accessory complex MRB158 and the mitochondrial ATP synthase complex V59. Thus, we identify protein complexes, pathways and regulatory factors that are specifically required for progressive steps through the trypanosome cell cycle.

Pathways and protein complexes associated with G1, S phase and G2M defects

We next explored some of the cohorts of hits described above in more detail. Glycolytic enzymes are particularly prominent amongst knockdowns that accumulate in the G1 pool, and we illustrate the RIT-seq profiling data for these enzymes in Fig. 5a. Seven of eleven glycolytic enzyme knockdowns register >25% overrepresentation in the G1 pool; hexokinase, phosphofructokinase, aldolase (see Fig. 2c), triosephosphate isomerase, glyceraldehyde 3-phosphate dehydrogenase, phosphoglycerate kinase C and pyruvate kinase. Glycolysis operates in peroxisome-like organelles known as glycosomes in trypanosomes and is thought to be the single source of ATP in bloodstream form cells12. Glycolysis also provides metabolic intermediates that support nucleotide production. Notably, mammalian cell proliferation is accompanied by activation of glycolysis, and the Warburg effect relates to this phenomenon in oncology60,61. Indeed, hexokinase regulates the G1/S checkpoint in tumour cells62. The results are also consistent with the observation that T. brucei accumulate in G1 or G0 under growth-limiting conditions63 or during differentiation to the non-dividing stumpy form64, possibly reflecting a role for glucose sensing in differentiation65. Notably, glycolytic enzymes are downregulated 6.7 + /−5.2-fold in stumpy-form cells66. We conclude that, as in other organisms67, there is metabolic control of the cell cycle and a nutrient sensitive restriction point in T. brucei, with glycolysis playing a role in the G1 to S phase transition and possibly also the G1/G0 transition.

Fig. 5: Protein complexes and pathways associated with G1, S phase and G2M defects.
figure 5

a The RadViz plot shows glycolytic enzyme knockdowns. Those that registered >25% overrepresented read-counts in the G1 category are indicated. Black data-points indicate other genes from each cohort. Grey data-points indicate all other genes. The read-mapping profiles and relative read-counts in the lower panel show example hits. b As in a but for DNA replication initiation factor knockdowns that registered >25% overrepresented read-counts, primarily in the S phase category. c As in a but for proteasome component knockdowns that registered >25% overrepresented read-counts, primarily in the G2M category. d As in a but for kinetochore component knockdowns that registered >25% overrepresented read-counts, primarily in the G2M category.

DNA replication initiation factors are particularly prominent amongst knockdowns that accumulate in S phase and we illustrate the RIT-seq profiling data for these factors in Fig. 5b. Five knockdowns that register >25% overrepresentation in the S phase pool are components of the eukaryotic replicative helicase, the CMG (Cdc45-MCM-GINS) complex. At the core of this complex is the minichromosome maintenance complex (MCM2-7), a helicase that unwinds the duplex DNA ahead of the moving replication fork68. Identification of CMG complex components suggests that each of these subunits is required for timely progression through S phase.

Proteasome activity promotes mitosis in T. brucei69 and, consistent with this, proteasome components are particularly prominent amongst knockdowns that accumulate in G2M; we illustrate the RIT-seq profiling data for this protein complex in Fig. 5c. Sixteen of 28 proteasome component knockdowns register >25% overrepresentation in the G2M pool. This output is consistent with the view that the T. brucei proteasome is responsible for degrading cell cycle regulators, such as poly-ubiquitinated cyclins, some of which are known to control cell cycle checkpoints in T. brucei. Candidate targets in T. brucei include: CIF1, AUK170, cyclin 6 (CYC6), degradation of which is required for mitosis71; cyclin-like CFB2, required for cytokinesis72; and cyclin 2 (CYC2) or cyclin 3 (CYC3), which have short half-lives and a candidate destruction box motif in the case of CYC373.

Kinetochore components17 are also amongst knockdowns that accumulate in G2M and we illustrate the RIT-seq profiling data for this protein complex in Fig. 5d. Although knockdown of KKT2, a putative kinase, registered overrepresentation in the S phase pool, KKT1, KKT7 and KKT10/CLK1 knockdowns registered >25% overrepresentation in the G2M pool, suggesting that these particular kinetochore components, which all display temporal patterns of phosphorylation from S phase to G2M21, are required for progression through mitosis. Notably, KKT10 is a kinase responsible for phosphorylation of KKT7, which is required for the metaphase to anaphase transition74; as well as for the phosphorylation of KKT1 and KKT2, in turn required for kinetochore assembly75,76. These findings are consistent with the view that kinetochore components control a non-canonical spindle checkpoint in trypanosomes74.

RBPs, kinases and hypothetical proteins associated with G1, S phase and G2M defects

Widespread polycistronic transcription in trypanosomatids places great emphasis on post-transcriptional controls and, consistent with this, knockdowns overrepresented in the G1, S phase and G2M pools revealed many putative mRNA binding proteins (RBPs) and kinases. Indeed, RBPs are significantly enriched amongst knockdowns that registered G1, S phase or G2M cell cycle defects (χ2 test, p = 7−5). We show the RIT-seq profiling data for eleven RBP knockdowns that register >25% overrepresentation in these pools (Fig. 6a). These include knockdowns for RBP10 and RBP29 enriched in G1; RBP10, in particular, has been characterised in some detail and promotes the bloodstream form state77. ZC3H1178, ZC3H41 and ZC3H2879 knockdowns were enriched in G1, S phase and G2M, respectively, while knockdowns of CFB2, MKT1 or PBP1, all recently linked to variant surface glycoprotein expression control80,81, were enriched in G2M. Indeed, based on the outputs of the current screen, we prioritised these latter three RBPs for follow-up analysis in a separate study; all three were thereby validated as G2M hits80. Thus, the RIT-seq cell cycle screen implicated a number of specific RBPs in post-transcriptional control of cell cycle progression through modulation of mRNA stability and/or translation.

Fig. 6: RBPs, kinases and hypothetical proteins associated with gap and S phase defects.
figure 6

a The RadViz plot shows mRNA binding protein knockdowns (RBPs). Those that registered >25% overrepresented read-counts in the G1, S phase or G2M categories are indicated, in purple, green and blue, respectively. The read-mapping profiles and relative read-counts in the other panels show example hits. b As in a but for protein kinase knockdowns, with selected kinases indicated. c As in a but for hypothetical (conserved) protein knockdowns.

We show data for protein kinases above, linked to enriched >4 C (Fig. 3d), S phase or G2M (Fig. 5d) phenotypes, and now show the RIT-seq profiling data for five additional protein kinase knockdowns that register >25% overrepresentation in the G1, S phase or G2M pools (Fig. 6b). These include knockdowns for CRK7, linked to accumulation in G1; MAPK5, linked to accumulation in S phase and polo-like kinase (PLK) and cdc2-related kinase 3 (CRK3), linked to accumulation in G2M. PLK was previously shown to control cell morphology, furrow ingression and cytokinesis82,83,84, while CRK3 was shown to play a role in G2M progression in bloodstream form T. brucei43,85. Overall correspondence was also excellent with a prior kinome-wide RNAi screen43. For example, eight among nine kinases linked to a mitosis defect in that screen also reported an (21 ± 12%) increase in the G2M pool in the current screen.

Finally, we analysed genes encoding proteins annotated as hypothetical (conserved). Despite excellent progress in genome annotation, 35% of non-redundant genes in T. brucei retain this annotation, amounting to >2500 genes. We show data for several hypothetical protein knockdowns above, linked to the enriched >4 C phenotype (Supplementary Fig. 4), and we here identify >300 additional hypothetical protein knockdowns that register >25% overrepresentation in the G1, S phase or G2M pools. RIT-seq profiling data are shown for five examples in Fig. 6c and for several additional examples in Supplementary Fig. 5. Amongst other examples of knockdowns shown in Supplementary Fig. 5, are alternative oxidase86, linked to G1 enrichment; kinesins linked to G2M enrichment, including both chromosomal passenger complex kinesins (KIN-A and KIN-B)26 and KIN-G; CYC625,87, centrin 388 and, finally, both components of the histone chaperone FACT (facilitates chromatin transcription) complex89 Spt16 and Pob3, linked to G2M enrichment. Notably, the FACT complex has been linked to centromere function in human cells90.

Cell cycle regulated proteins linked to cell cycle progression defects

Factors required for cell cycle progression may themselves be cell cycle regulated. To identify some of these factors, we compared our current dataset with quantitative transcriptome19, proteome20 and phosphoproteome21 cell cycle profiling data (Supplementary data 1). An initial survey of all 1,198 genes that registered a cell cycle defect here (see Fig. 2e) revealed significant enrichment of cell cycle regulated mRNAs (overlap = 114 of 484, χ2 p = 3.2−4), and proteins displaying cell cycle regulated phosphorylation (overlap = 112 of 547, χ2 p = 0.025). This, despite the fact that the transcriptome and (phospho)proteome datasets were derived from insect stage T. brucei, such that regulation may differ in the bloodstream T. brucei cells used for RIT-seq analysis here.

In terms of specific cell cycle regulated proteins20 required for specific cell cycle progression steps, multiple glycolytic enzymes upregulated in G1 were linked to accumulation in the G1 pool following knockdown (χ2 p = 7.9−11). In addition, proteins upregulated in G2 and M were linked to accumulation in the G2M (χ2 p = 1.8−8) or >4 C pools (χ2 p = 8.9−9) following knockdown, including kinetochore and chromosomal passenger complex components, respectively. Some specific transcripts required for cell cycle progression may be upregulated prior to peak demand for the encoded protein, and we found evidence to support this view. For example, transcripts upregulated in late G1 or in S phase were enriched amongst those knockdowns linked to accumulation in the G2M pool (χ2 p = 3.3−3 and p = 0.011, respectively); both components of the FACT complex, upregulated in G1, for example (see Supplementary Fig. 5). Similarly, S phase and G2M upregulated transcripts, including those encoding multiple flagellum-associated proteins, were enriched amongst knockdowns linked to accumulation in the >4 C pool (χ2 p = 4.6−18 and p = 2.4−5 respectively).

Some proteins displayed both cell cycle regulated expression and phosphorylation patterns that were consistent with their roles in cell cycle progression. These included putative RBPs of the DNA polymerase suppressor 1 (PSP1) family, which display mRNA upregulation in G1, protein upregulation in S phase, cell cycle regulated phosphorylation and, following knockdown, accumulation in G2M (see Figs. 4c and 6a). The kinetochore components, KKT1 and KKT7, and also CRK3, all display mRNA upregulation in S phase, protein upregulation in G2 and M, cell cycle regulated phosphorylation and, following knockdown, accumulation in G2M (see Figs. 5d and 6b); KKT10 and CYC6 report a similar profile (see Fig. 5d), except for the mRNA regulation component. The cytokinesis initiation factors, CIF1 and CIF2, display mRNA upregulation in S phase, protein upregulation in G2 and M, cell cycle regulated phosphorylation and, following knockdown, accumulation in the endoreduplicated pool (see Fig. 3e). Finally, the chromosomal passenger complex components, CPC1 and AUK1, as well as furrow localized FRW1, report mRNA and protein upregulation in G2M and, following knockdown, accumulation in the endoreduplicated pool (see Fig. 3e). Thus, several regulators linked to specific cell cycle progression defects by RIT-seq profiling, are themselves cell cycle regulated.

A putative nucleoredoxin controls kinetoplast segregation and mitosis

The current RIT-seq screen identified many novel candidate cell cycle regulators, two of which, both associated with significant loss-of-fitness in our prior RIT-seq screen23, were investigated in more detail. First, Tb927.10.970 was associated with pronounced endoreduplication following knockdown (Fig. 7a). The predicted protein contains a tetratricopeptide repeat motif, a cluster of phosphorylation sites, one of which, T706, has been reported to be cell cycle regulated, peaking in late G2M21, and a string of putative calmodulin-binding IQ domains (Fig. 7b). Tb927.10.970 was shown to localise to the paraflagellar rod in insect stage T. brucei (www.tryptag.org91,92). To validate Tb927.10.970 as a >4 C hit in bloodstream-form trypanosomes, we assembled a pair of independent inducible RNAi knockdown strains. Analysis of cell growth revealed a severe loss-of-fitness following knockdown, confirmed by qRT-PCR (Fig. 7c). Flow cytometry then confirmed endoreduplication, with prominent peaks detected representing 8 C and 16 C cells following knockdown (Fig. 7d, left-hand panel), while examination of these cells by microscopy revealed multiple nuclei, indicating endoreduplication with continued mitosis (Fig. 7d, right-hand panel).

Fig. 7: Validation of endoreduplication following Tb927.10.970 knockdown.
figure 7

a Mapped reads and reconstructed cell cycle profile following 10.970 knockdown. b Schematic map of 10.970. TPR tetratricopeptide repeat motif, P phosphorylation, IQ motifs, putative calmodulin-binding domains. c The plot on the left reveals efficient tetracycline (tet)-induced knockdown of 10.970 after 24 h. Mean values and 3 technical replicates are indicated, with the p-value derived using an unpaired t-test. The growth curve on the right reveals a severe growth defect following knockdown. n = 2 biologically independent clones and the line indicates mean values. Source data are provided as a Source Data file. d Flow cytometry analysis revealed endoreduplication following 24 h of 10.970 knockdown, based on propidium iodide staining. The gates indicate the various cell cycle stages, based on uninduced (- tet) cells. The panel on the right shows representative cells with DNA stained with DAPI (4′,6-diamidino-2-phenylindole) before and after knockdown. N nucleus, K kinetoplast.

We next turned our attention to Tb927.10.3970, annotated ‘hypothetical protein, conserved’, and associated with increased DNA content following knockdown (Fig. 8a). The predicted Tb927.10.3970 protein contains three cell cycle regulated phosphorylation sites21 and a thioredoxin-like domain (Fig. 8b). This protein was shown to be cell cycle regulated based on proteomic analysis and to localise to the nucleus in insect stage T. brucei20. To explore the role of Tb927.10.3970 in bloodstream-form trypanosomes, we assembled a pair of independent inducible RNAi knockdown strains. Analysis of cell growth revealed a severe loss-of-fitness following knockdown, confirmed by monitoring the expression of epitope-tagged 10.3970 (Fig. 8c). Flow cytometry confirmed increased DNA content following knockdown, and also revealed increased cell size (Fig. 8d). Examination of these cells by microscopy allowed us to assess both the nuclei and kinetoplasts (mitochondrial genomes), revealing a major increase in the proportion of cells with a single nucleus and a single rounded kinetoplast following knockdown, an arrangement typically characteristic of G1 cells (Fig. 8e). Quantitative analysis of these compartments revealed a pronounced increase in DNA content in both genomes following knockdown (Fig. 8f), indicating that both kinetoplast segregation and mitosis failed, while genome replication was able to proceed to varying degrees, generating enlarged kinetoplasts and polyploid nuclei. Finally, we assessed the subcellular localisation of epitope-tagged 10.3970 during the bloodstream form cell cycle. Quantitative immunofluorescence microscopy revealed a pattern that was also observed in insect stage cells (www.tryptag.org20,91,92). 10.3970 displayed a nuclear localisation, which increased in intensity during the cell cycle, and then dropped precipitously in post-mitotic cells (Fig. 8g). We conclude that 10.3970 encodes a putative nucleoredoxin that is cell cycle regulated and required for both mitochondrial and nuclear genome segregation and cytokinesis.

Fig. 8: A putative nucleoredoxin controls kinetoplast segregation and mitosis.
figure 8

a Mapped reads and reconstructed cell cycle profile following 10.3970 knockdown. b Schematic map of the putative nucleoredoxin encoded by 10.3970. TRX thioredoxin-like domain. c The protein blots on the left reveal efficient tetracycline (tet)-induced knockdown of N-terminal myc-tagged 10.3970 for one representative clone after 24 h. EF1α serves as a loading control. The growth curve on the right reveals a severe growth defect following knockdown. n = 2 biologically independent clones and the line indicates mean values. d Flow cytometry analysis revealed increased DNA content following 24 h of 10.3970 knockdown, based on propidium iodide staining on the left, and increased cell size, based on the forward scatter profile on the right. The gates on the profile on the left indicate the various cell cycle stages, based on uninduced (- tet) cells. e Quantification of cells with a single nucleus and a single rounded kinetoplast (1 N: 1Kr) before and after 24 h of 10.3970 knockdown, based on DAPI-staining and microscopy. n = 2 biologically independent clones and the lines indicate mean values; n = 80 cells in each case. f The boxplots show quantification of DAPI intensity of nuclei (left) and kinetoplasts (right) in cells with a single nucleus and a single rounded kinetoplast, before and after 24 h of 10.3970 knockdown; n = 100 in each case. The data are from two independent clones and are shown as jittered dots, the box indicates the IQR, whiskers show the range of values within 1.5*IQR, the horizontal lines indicate the median and the notches represent 95% confidence intervals. The inset on the right shows representative cells before and after knockdown. N, nucleus; K, kinetoplast. g The representative immunofluorescence microscopy images show N-terminal GFP-tagged 10.3970 (green) at different cell cycle stages, as defined by the nucleus (N) and kinetoplast (K) configuration observed by staining DNA with DAPI. Intensity of the GFP10.3970 signal was quantified and is shown in the graph on the right. n = 26, 21, 5 and 6 cells for G1, S phase, G2M and post-M, respectively. The lines indicate the median. Source data for c and eg are provided as a Source Data file.


[ad_2]

Source link

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments

%d bloggers like this: