Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq DataReport as inadecuate

Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq Data - Download this document for free, or read online. Document in PDF available to download.


Department of Statistics and Public Health Sciences, Penn State University, 514A Wartik Building, University Park, PA 16802, USA


Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Center for Comprehensive Informatics, Emory University, 1518 Clifton Rd., N.E., Atlanta, GA 30322, USA


Author to whom correspondence should be addressed.

Abstract The recent arrival of ultra-high throughput, next generation sequencing NGS technologies has revolutionized the genetics and genomics fields by allowing rapid and inexpensive sequencing of billions of bases. The rapid deployment of NGS in a variety of sequencing-based experiments has resulted in fast accumulation of massive amounts of sequencing data. To process this new type of data, a torrent of increasingly sophisticated algorithms and software tools are emerging to help the analysis stage of the NGS applications. In this article, we strive to comprehensively identify the critical challenges that arise from all stages of NGS data analysis and provide an objective overview of what has been achieved in existing works. At the same time, we highlight selected areas that need much further research to improve our current capabilities to delineate the most information possible from NGS data. The article focuses on applications dealing with ChIP-Seq and RNA-Seq. View Full-Text

Keywords: next generation sequencing; statistical analysis next generation sequencing; statistical analysis

Author: Debashis Ghosh 1,* and Zhaohui S. Qin 2



Related documents