Skip to content

Advertisement

  • Research Article
  • Open Access

Clustering Time Series Gene Expression Data Based on Sum-of-Exponentials Fitting

  • Ciprian Doru Giurcăneanu1Email author,
  • Ioan Tăbuş1 and
  • Jaakko Astola1
EURASIP Journal on Advances in Signal Processing20052005:358568

https://doi.org/10.1155/ASP.2005.1159

Received: 8 June 2004

Published: 31 May 2005

Abstract

This paper presents a method based on fitting a sum-of-exponentials model to the nonuniformly sampled data, for clustering the time series of gene expression data. The structure of the model is estimated by using the minimum description length (MDL) principle for nonlinear regression, in a new form, incorporating a normalized maximum-likelihood (NML) model for a subset of the parameters. The performance of the structure estimation method is studied using simulated data, and the superiority of the new selection criterion over earlier criteria is demonstrated. The accuracy of the nonlinear estimates of the model parameters is analyzed with respect to the Cramér-Rao lower bounds. Clustering examples of gene expression data sets from a developmental biology application are presented, revealing gene grouping into clusters according to functional classes.

Keywords and phrases

nonuniformly sampled datasum-of-exponentials modelnormalized maximum likelihoodtime series clusteringgene expression datadevelopmental biology

Authors’ Affiliations

(1)
Institute of Signal Processing, Tampere University of Technology, Tampere, Finland

Copyright

© Giurcăneanu et al. 2005

Advertisement