Total frame potential and its applications in data clustering

dc.contributor.advisorStöckler, Joachim
dc.contributor.authorSpringer, Tobias
dc.contributor.refereeIckstadt, Katja
dc.date.accepted2013-11-26
dc.date.accessioned2013-12-06T08:24:28Z
dc.date.available2013-12-06T08:24:28Z
dc.date.issued2013-12-06
dc.description.abstractShort time series arise in a variety of fields such as biology or social sciences. For the statistical analysis of microarray gene expression data, the clustering of short time series is an important objective in order to identify subsets of genes sharing a temporal expression pattern. An established method, the Short Time Series Expression Miner (STEM) by Ernst et al. (2005), assigns time series data to the closest of suitably selected prototypes followed by the selection of significant clusters and eventual grouping. For the clustering of normalized d-dimensional data we propose to minimize the penalized frame potential by Springer et al. (2011). The functional contains the "Total Frame Potential" of Finite Unit Norm Tight Frames (FUNTFs), see Benedetto and Fickus (2003), and includes a data-driven component for the selection of prototypes. The idea of using the frame potential in combination with a data-dependent term for optimization was originally proposed by Benedetto, Czaja and Ehler (2010) for finding sparse representations. We show that the solution of the corresponding constrained optimization problem is naturally connected to the spherical Dirichlet cells of the given normalized data. Furthermore, the minimizers of the functional are located in the interior of the Dirichlet cells. The objective function is differentiable in the minimum and satisfies a matrix-valued extremal condition. The general problem is closely related to the search for point configurations on the unit sphere like in Tammes' (1930) or Thomson's Problem (1904). Moreover, the minimization contains connections to problems in matrix completion (see e.g. Candes and Tao (2009) or Mazumder, Hastie and Tibshirani (2010)). The work contains an exhaustive analysis of the proposed functional using methods from calculus, linear algebra and nonlinear programming. Numerical results from the application on real and artificial data are included.en
dc.identifier.urihttp://hdl.handle.net/2003/31251
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-7234
dc.language.isoende
dc.subjectData clusteringen
dc.subjectFinite unit norm tight framesen
dc.subjectFrame potentialen
dc.subjectGene expression dataen
dc.subjectShort time seriesen
dc.subject.ddc510
dc.titleTotal frame potential and its applications in data clusteringen
dc.typeTextde
dc.type.publicationtypedoctoralThesisde
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Dissertation.pdf
Size:
1.26 MB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.85 KB
Format:
Item-specific license agreed upon to submission
Description: