From hybridization theory to microarray data analysis: performance evaluationReport as inadecuate

From hybridization theory to microarray data analysis: performance evaluation - Download this document for free, or read online. Document in PDF available to download.

BMC Bioinformatics

, 12:464

Transcriptome analysis


BackgroundSeveral preprocessing methods are available for the analysis of Affymetrix Genechips arrays. The most popular algorithms analyze the measured fluorescence intensities with statistical methods. Here we focus on a novel algorithm, AffyILM, available from Bioconductor, which relies on inputs from hybridization thermodynamics and uses an extended Langmuir isotherm model to compute transcript concentrations. These concentrations are then employed in the statistical analysis. We compared the performance of AffyILM and other traditional methods both in the old and in the newest generation of GeneChips.

ResultsTissue mixture and Latin Square datasets provided by Affymetrix were used to assess the performances of the differential expression analysis depending on the preprocessing strategy. A correlation analysis conducted on the tissue mixture data reveals that the median-polish algorithm allows to best summarize AffyILM concentrations computed at the probe-level. Those correlation results are equivalent to the best correlations observed using popular preprocessing methods relying on intensity values. The performances of each tested preprocessing algorithm were quantified using the Latin Square HG-U133A dataset, thanks to the comparison of differential analysis results with the list of spiked genes. The figures of merit generated illustrates that the performances associated to AffyILMmedianpolish, inferred from the present statistical analysis, are comparable to the best performing strategies previously reported.

ConclusionsConverting probe intensities to estimates of target concentrations prior to the statistical analysis, AffyILMmedianpolish is one of the best performing strategy currently available. Using hybridization theory, probe-level estimates of target concentrations should be identically distributed. In the future, a probe-level multivariate analysis of the concentrations should be compared to the univariate analysis of probe-set summarized expression data.

Electronic supplementary materialThe online version of this article doi:10.1186-1471-2105-12-464 contains supplementary material, which is available to authorized users.

Download fulltext PDF

Author: Fabrice Berger - Enrico Carlon


Related documents