Computing on High Performance Clusters with R: Packages BatchJobs and BatchExperiments

Bischl, Bernd; Lang, Michel; Mersmann, Olaf; Rahnenführer, Jörg; Weihs, Claus

Authors:	Bischl, Bernd Lang, Michel Mersmann, Olaf Rahnenführer, Jörg Weihs, Claus
Title:	Computing on High Performance Clusters with R: Packages BatchJobs and BatchExperiments
Language (ISO):	en
Abstract:	Empirical analysis of statistical algorithms often demands time-consuming experiments which are best performed on high performance computing clusters. We present two R packages which greatly simplify working in batch computing environments. The package BatchJobs implements the basic objects and procedures to control a batch cluster within R. It is structured around cluster versions of the well-known higher order functions Map, Reduce and Filter from functional programming. An important feature is that the state of computation is persistently available in a database. The user can query the status of jobs and then continue working with a desired subset. The second package, BatchExperiments, is tailored for the still very general scenario of analyzing arbitrary algorithms on problem instances. It extends BatchJobs by letting the user define an array of jobs of the kind “apply algorithm A to problem instance P and store results”. It is possible to associate statistical designs with parameters of algorithms and problems and therefore to systematically study their influence on the results. In general our main contributions are: (a) Portability : Both packages use a clear and well-defined interface to the batch system which makes them applicable in most high-performance computing environments. (b) Reproducibility: Every computational part has an associated seed that the user can control to ensure reproducibility even when the underlying batch system changes. (c) Efficiency: Efficiently use batch computing clusters completely within R. (d) Abstraction and good software design: The code layers for algorithms, experiment definitions and execution are cleanly separated and enable the writing of readable and maintainable code.
URI:	http://hdl.handle.net/2003/37185 http://dx.doi.org/10.17877/DE290R-19181
Issue Date:	2012-01
Appears in Collections:	Sonderforschungsbereich (SFB) 876

Files in This Item:

File	Description	Size	Format
bischl_etal_2012a.pdf	DNB	484.63 kB	Adobe PDF	View/Open

This item is protected by original copyright

View License

Show full item record

This item is protected by original copyright rightsstatements.org