Skip to main content

Clustering Time Series Gene Expression Data Based on Sum-of-Exponentials Fitting

Abstract

This paper presents a method based on fitting a sum-of-exponentials model to the nonuniformly sampled data, for clustering the time series of gene expression data. The structure of the model is estimated by using the minimum description length (MDL) principle for nonlinear regression, in a new form, incorporating a normalized maximum-likelihood (NML) model for a subset of the parameters. The performance of the structure estimation method is studied using simulated data, and the superiority of the new selection criterion over earlier criteria is demonstrated. The accuracy of the nonlinear estimates of the model parameters is analyzed with respect to the Cramér-Rao lower bounds. Clustering examples of gene expression data sets from a developmental biology application are presented, revealing gene grouping into clusters according to functional classes.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Ciprian Doru Giurcăneanu.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Giurcăneanu, C.D., Tăbuş, I. & Astola, J. Clustering Time Series Gene Expression Data Based on Sum-of-Exponentials Fitting. EURASIP J. Adv. Signal Process. 2005, 358568 (2005). https://doi.org/10.1155/ASP.2005.1159

Download citation

Keywords and phrases

  • nonuniformly sampled data
  • sum-of-exponentials model
  • normalized maximum likelihood
  • time series clustering
  • gene expression data
  • developmental biology