Papers by Robrecht Cannoodt
Nature Communications
We present dyngen, a multi-modal simulation engine for studying dynamic cellular processes at sin... more We present dyngen, a multi-modal simulation engine for studying dynamic cellular processes at single-cell resolution. dyngen is more flexible than current single-cell simulation engines, and allows better method development and benchmarking, thereby stimulating development and testing of computational methods. We demonstrate its potential for spearheading computational methods on three applications: aligning cell developmental trajectories, cell-specific regulatory network inference and estimation of RNA velocity.
Methods in Molecular Biology

Oncotarget
Background: Neuroblastoma is an aggressive childhood malignancy of the sympathetic nervous system... more Background: Neuroblastoma is an aggressive childhood malignancy of the sympathetic nervous system. Despite multi-modal therapy, survival of high-risk patients remains disappointingly low, underscoring the need for novel treatment strategies. The discovery of ALK activating mutations opened the way to precision treatment in a subset of these patients. Previously, we investigated the transcriptional effects of pharmacological ALK inhibition on neuroblastoma cell lines, six hours after TAE684 administration, resulting in the 77-gene ALK signature, which was shown to gradually decrease from 120 minutes after TAE684 treatment, to gain deeper insight into the molecular effects of oncogenic ALK signaling. Aim: Here, we further dissected the transcriptional dynamic profiles of neuroblastoma cells upon TAE684 treatment in a detailed timeframe of ten minutes up to six hours after inhibition, in order to identify additional early targets for combination treatment. Results: We observed an unexpected initial upregulation of positively regulated MYCN target genes following subsequent downregulation of overall MYCN activity. In addition, we identified adrenomedullin (ADM), previously shown to be implicated in sunitinib resistance, as the earliest response gene upon ALK inhibition. Conclusions: We describe the early and late effects of ALK inhibitor TAE684 treatment on the neuroblastoma transcriptome. The observed unexpected upregulation of ADM warrants further investigation in relation to putative ALK resistance in neuroblastoma patients currently undergoing ALK inhibitor treatment.

Bioinformatics
Motivation During the last decade, trajectory inference (TI) methods have emerged as a novel fram... more Motivation During the last decade, trajectory inference (TI) methods have emerged as a novel framework to model cell developmental dynamics, most notably in the area of single-cell transcriptomics. At present, more than 70 TI methods have been published, and recent benchmarks showed that even state-of-the-art methods only perform well for certain trajectory types but not others. Results In this work, we present TinGa, a new TI model that is fast and flexible, and that is based on Growing Neural Graphs. We performed an extensive comparison of TinGa to five state-of-the-art methods for TI on a set of 250 datasets, including both synthetic as well as real datasets. Overall, TinGa improves the state-of-the-art by producing accurate models (comparable to or an improvement on the state-of-the-art) on the whole spectrum of data complexity, from the simplest linear datasets to the most complex disconnected graphs. In addition, TinGa obtained the fastest execution times, showing that our met...

Nature Communications
Trajectory inference has radically enhanced single-cell RNA-seq research by enabling the study of... more Trajectory inference has radically enhanced single-cell RNA-seq research by enabling the study of dynamic changes in gene expression. Downstream of trajectory inference, it is vital to discover genes that are (i) associated with the lineages in the trajectory, or (ii) differentially expressed between lineages, to illuminate the underlying biological processes. Current data analysis procedures, however, either fail to exploit the continuous resolution provided by trajectory inference, or fail to pinpoint the exact types of differential expression. We introduce tradeSeq, a powerful generalized additive model framework based on the negative binomial distribution that allows flexible inference of both within-lineage and between-lineage differential expression. By incorporating observation-level weights, the model additionally allows to account for zero inflation. We evaluate the method on simulated datasets and on real datasets from droplet-based and full-length protocols, and show that...
PurposeWhen developing new types of tools for single-cell analyses, there is often a lack of data... more PurposeWhen developing new types of tools for single-cell analyses, there is often a lack of datasets on which to quantitatively assess the performance.ResultsWe developed dyngen, a multi-modality simulator of single cells. In dyngen, the biomolecular state of an in silico changes over time according to a predefined gene regulatory network. We used dyngen to benchmark three emerging ways of analysing single-cell data: RNA velocity, cell-specific network inference and trajectory alignment methods.Conclusiondyngen lays the foundations for benchmarking a wide variety of computational single-cell tools and can be used to help kick-start the development of future types of analyses.

The human transcriptome consists of various RNA biotypes including multiple types of non-coding R... more The human transcriptome consists of various RNA biotypes including multiple types of non-coding RNAs (ncRNAs). Current ncRNA compendia remain incomplete partially because they are almost exclusively derived from the interrogation of small- and polyadenylated RNAs. Here, we present a more comprehensive atlas of the human transcriptome that is derived from matching polyA-, total-, and small-RNA profiles of a heterogeneous collection of nearly 300 human tissues and cell lines. We report on thousands of novel RNA species across all major RNA biotypes, including a hitherto poorly-cataloged class of non-polyadenylated single-exon long non-coding RNAs. In addition, we exploit intron abundance estimates from total RNA-sequencing to test and verify functional regulation by novel non-coding RNAs. Our study represents a substantial expansion of the current catalogue of human ncRNAs and their regulatory interactions. All data, analyses, and results are available in the R2 web portal and serve a...
Genome Biology
In computational biology and other sciences, researchers are frequently faced with a choice betwe... more In computational biology and other sciences, researchers are frequently faced with a choice between several computational methods for performing data analyses. Benchmarking studies aim to rigorously compare the performance of different methods using well-characterized benchmark datasets, to determine the strengths of each method or to provide recommendations regarding suitable choices of methods for an analysis. However, benchmarking studies must be carefully designed and implemented to provide accurate, unbiased, and informative results. Here, we summarize key practical guidelines and recommendations for performing high-quality benchmarking analyses, based on our experiences in computational biology.

Trajectory inference has radically enhanced single-cell RNA-seq research by enabling the study of... more Trajectory inference has radically enhanced single-cell RNA-seq research by enabling the study of dynamic changes in gene expression levels during biological processes such as the cell cycle, cell type differentiation, and cellular activation. Downstream of trajectory inference, it is vital to discover genes that are associated with the lineages in the trajectory to illuminate the underlying biological processes. Furthermore, genes that are differentially expressed between developmental/activational lineages might be highly relevant to further unravel the system under study. Current data analysis procedures, however, typically cluster cells and assess differential expression between the clusters, which fails to exploit the continuous resolution provided by trajectory inference to its full potential. The few available non-cluster-based methods only assess broad differences in gene expression between lineages, hence failing to pinpoint the exact types of divergence. We introduce a pow...

Using single-cell -omics data, it is now possible to computationally order cells along trajectori... more Using single-cell -omics data, it is now possible to computationally order cells along trajectories, allowing the unbiased study of cellular dynamic processes. Since 2014, more than 50 trajectory inference methods have been developed, each with its own set of methodological characteristics. As a result, choosing a method to infer trajectories is often challenging, since a comprehensive assessment of the performance and robustness of each method is still lacking. In order to facilitate the comparison of the results of these methods to each other and to a gold standard, we developed a global framework to benchmark trajectory inference tools. Using this framework, we compared the trajectories from a total of 29 trajectory inference methods, on a large collection of real and synthetic datasets. We evaluate methods using several metrics, including accuracy of the inferred ordering, correctness of the network topology, code quality and user friendliness. We found that some methods, includ...

Immunity, Jan 21, 2018
Heterogeneity between different macrophage populations has become a defining feature of this line... more Heterogeneity between different macrophage populations has become a defining feature of this lineage. However, the conserved factors defining macrophages remain largely unknown. The transcription factor ZEB2 is best described for its role in epithelial to mesenchymal transition; however, its role within the immune system is only now being elucidated. We show here that Zeb2 expression is a conserved feature of macrophages. Using Clec4f-cre, Itgax-cre, and Fcgr1-cre mice to target five different macrophage populations, we found that loss of ZEB2 resulted in macrophage disappearance from the tissues, coupled with their subsequent replenishment from bone-marrow precursors in open niches. Mechanistically, we found that ZEB2 functioned to maintain the tissue-specific identities of macrophages. In Kupffer cells, ZEB2 achieved this by regulating expression of the transcription factor LXRα, removal of which recapitulated the loss of Kupffer cell identity and disappearance. Thus, ZEB2 express...

Journal of the National Cancer Institute, Jan 5, 2018
Neuroblastoma is characterized by substantial clinical heterogeneity. Despite intensive treatment... more Neuroblastoma is characterized by substantial clinical heterogeneity. Despite intensive treatment, the survival rates of high-risk neuroblastoma patients are still disappointingly low. Somatic chromosomal copy number aberrations have been shown to be associated with patient outcome, particularly in low- and intermediate-risk neuroblastoma patients. To improve outcome prediction in high-risk neuroblastoma, we aimed to design a prognostic classification method based on copy number aberrations. In an international collaboration, normalized high-resolution DNA copy number data (arrayCGH and SNP arrays) from 556 high-risk neuroblastomas obtained at diagnosis were collected from nine collaborative groups and segmented using the same method. We applied logistic and Cox proportional hazard regression to identify genomic aberrations associated with poor outcome. In this study, we identified two types of copy number aberrations that are associated with extremely poor outcome. Distal 6q losses...

European Journal of Immunology, 2016
Recent developments in single-cell transcriptomics have opened new opportunities for studying dyn... more Recent developments in single-cell transcriptomics have opened new opportunities for studying dynamic processes in immunology in a high throughput and unbiased manner. Starting from a mixture of cells in different stages of a developmental process, unsupervised trajectory inference algorithms aim to automatically reconstruct the underlying developmental path that cells are following. In this review, we break down the strategies used by this novel class of methods, and organize their components into a common framework, highlighting several practical advantages and disadvantages of the individual methods. We also give an overview of new insights these methods have already provided regarding the wiring and gene regulation of cell differentiation. As the trajectory inference field is still in its infancy, we propose several future developments that will ultimately lead to a global and data-driven way of studying immune cell differentiation.

Recent advances in RNA sequencing enable the generation of genome-wide expression data at the sin... more Recent advances in RNA sequencing enable the generation of genome-wide expression data at the single-cell level, opening up new avenues for transcriptomics and systems biology. A new application of single-cell whole-transcriptomics is the unbiased ordering of cells according to their progression along a dynamic process of interest. We introduce SCORPIUS, a method which can effectively reconstruct an ordering of individual cells without any prior information about the dynamic process. Comprehensive evaluation using ten scRNA-seq datasets shows that SCORPIUS consistently outperforms state-of-the-art techniques. We used SCORPIUS to generate novel hypotheses regarding dendritic cell development, which were subsequently validated in vivo. This work enables data-driven investigation and characterization of dynamic processes and lays the foundation for objective benchmarking of future trajectory inference methods.

Genetics in Medicine, 2016
Our goal was to design a customized microarray, arrEYE, for high-resolution copy number variant (... more Our goal was to design a customized microarray, arrEYE, for high-resolution copy number variant (CNV) analysis of known and candidate genes for inherited retinal dystrophy (iRD) and retinaexpressed noncoding RNAs (ncRNAs). Methods: arrEYE contains probes for the full genomic region of 106 known iRD genes, including those implicated in retinitis pigmentosa (RP) (the most frequent iRD), cone-rod dystrophies, macular dystrophies, and an additional 60 candidate iRD genes and 196 ncRNAs. Eight CNVs in iRD genes identified by other techniques were used as positive controls. The test cohort consisted of 57 patients with autosomal dominant, X-linked, or simplex RP. Results: In an RP patient, a novel heterozygous deletion of exons 7 and 8 of the HGSNAT gene was identified: c.634-408_820+338delins AGAATATG, p.(Glu212Glyfs*2). A known variant was found on the second allele: c.1843G>A, p.(Ala615Thr). Furthermore, we expanded the allelic spectrum of USH2A and RCBTB1 with novel CNVs. Conclusion: The arrEYE platform revealed subtle single-exon to larger CNVs in iRD genes that could be characterized at the nucleotide level, facilitated by the high resolution of the platform. We report the first CNV in HGSNAT that, combined with another mutation, leads to RP, further supporting its recently identified role in nonsyndromic iRD.
Uploads
Papers by Robrecht Cannoodt