A machine learning approach for the identification of key markers involved in brain development from single-cell transcriptomic dataReportar como inadecuado

A machine learning approach for the identification of key markers involved in brain development from single-cell transcriptomic data - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

BMC Genomics

, 17:1025

First Online: 22 December 2016DOI: 10.1186-s12864-016-3317-7

Cite this article as: Hu, Y., Hase, T., Li, H.P. et al. BMC Genomics 2016 17Suppl 13: 1025. doi:10.1186-s12864-016-3317-7


BackgroundThe ability to sequence the transcriptomes of single cells using single-cell RNA-seq sequencing technologies presents a shift in the scientific paradigm where scientists, now, are able to concurrently investigate the complex biology of a heterogeneous population of cells, one at a time. However, till date, there has not been a suitable computational methodology for the analysis of such intricate deluge of data, in particular techniques which will aid the identification of the unique transcriptomic profiles difference between the different cellular subtypes. In this paper, we describe the novel methodology for the analysis of single-cell RNA-seq data, obtained from neocortical cells and neural progenitor cells, using machine learning algorithms Support Vector machine SVM and Random Forest RF.

ResultsThirty-eight key transcripts were identified, using the SVM-based recursive feature elimination SVM-RFE method of feature selection, to best differentiate developing neocortical cells from neural progenitor cells in the SVM and RF classifiers built. Also, these genes possessed a higher discriminative power enhanced prediction accuracy as compared commonly used statistical techniques or geneset-based approaches. Further downstream network reconstruction analysis was carried out to unravel hidden general regulatory networks where novel interactions could be further validated in web-lab experimentation and be useful candidates to be targeted for the treatment of neuronal developmental diseases.

ConclusionThis novel approach reported for is able to identify transcripts, with reported neuronal involvement, which optimally differentiate neocortical cells and neural progenitor cells. It is believed to be extensible and applicable to other single-cell RNA-seq expression profiles like that of the study of the cancer progression and treatment within a highly heterogeneous tumour.

KeywordsSingle-cell RNA-seq Machine learning Network reconstruction Systems biology  Download fulltext PDF

Autor: Yongli Hu - Takeshi Hase - Hui Peng Li - Shyam Prabhakar - Hiroaki Kitano - See Kiong Ng - Samik Ghosh - Lawrence Jin K

Fuente: https://link.springer.com/

Documentos relacionados