Model and Algorithm Selection in Statistical Learning and Optimization.

dc.contributor.advisorWeihs, Claus
dc.contributor.authorBischl, Bernd
dc.contributor.refereeRahnenführer, Jörg
dc.date.accepted2013-08-30
dc.date.accessioned2014-02-07T13:51:05Z
dc.date.available2014-02-07T13:51:05Z
dc.date.issued2014-02-07
dc.description.abstractModern data-driven statistical techniques, e.g., non-linear classification and regression machine learning methods, play an increasingly important role in applied data analysis and quantitative research. For real-world we do not know a priori which methods will work best. Furthermore, most of the available models depend on so called hyper- or control parameters, which can drastically influence their performance. This leads to a vast space of potential models, which cannot be explored exhaustively. Modern optimization techniques, often either evolutionary or model-based, are employed to speed up this process. A very similar problem occurs in continuous and discrete optimization and, in general, in many other areas where problem instances are solved by algorithmic approaches: Many competing techniques exist, some of them heavily parametrized. Again, not much knowledge exists, how, given a certain application, one makes the correct choice here. These general problems are called algorithm selection and algorithm configuration. Instead of relying on tedious, manual trial-and-error, one should rather employ available computational power in a methodical fashion to obtain an appropriate algorithmic choice, while supporting this process with machine-learning techniques to discover and exploit as much of the search space structure as possible. In this cumulative dissertation I summarize nine papers that deal with the problem of model and algorithm selection in the areas of machine learning and optimization. Issues in benchmarking, resampling, efficient model tuning, feature selection and automatic algorithm selection are addressed and solved using modern techniques. I apply these methods to tasks from engineering, music data analysis and black-box optimization. The dissertation concludes by summarizing my published R packages for such tasks and specifically discusses two packages for parallelization on high performance computing clusters and parallel statistical experiments.en
dc.identifier.urihttp://hdl.handle.net/2003/32861
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-7142
dc.language.isoende
dc.subjectmodel selectionen
dc.subjectalgorithm selectionen
dc.subjectalgorithm configurationen
dc.subjecttuningen
dc.subjectbenchmarkingen
dc.subjectmachine learningen
dc.subject.ddc310
dc.titleModel and Algorithm Selection in Statistical Learning and Optimization.en
dc.typeTextde
dc.type.publicationtypedoctoralThesisde
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
phd.pdf
Size:
1.07 MB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.85 KB
Format:
Item-specific license agreed upon to submission
Description: