Algebraic aggregation of random forests

Gossen, Frederik; Steffen, Bernhard

Files

Gossen-Steffen2021_Article_AlgebraicAggregationOfRandomFo.pdf (1.49 MB)

Date

2021-09-29

Authors

Gossen, Frederik

Steffen, Bernhard

Alternative Title(s)

towards explainability and rapid evaluation

Abstract

Random Forests are among the most popular classifiers in machine learning. The larger they are, the more precise their predictions. However, this comes at a cost: it becomes increasingly difficult to understand why a Random Forest made a specific choice, and its running time for classification grows linearly with its size (the number of trees). In this paper, we propose a method to aggregate large Random Forests into a single, semantically equivalent decision diagram, which has two effects: (1) minimal, sufficient explanations for Random Forest-based classifications can be obtained by means of a simple three-step reduction, and (2) the running time is radically improved. In fact, our experiments on various popular datasets show speed-ups of several orders of magnitude while also significantly reducing the size of the required data structure.
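
To make the idea of a semantically equivalent aggregate concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a toy tree encoding (nested tuples with threshold tests) and a majority vote with alphabetical tie-breaking, and every function name in it is hypothetical. It folds a small forest into one decision structure by case-splitting on predicates and pruning tests that an earlier test already decides. The result is an unshared tree that can grow exponentially in the worst case; the paper works with Algebraic Decision Diagrams, which share subgraphs.

```python
from collections import Counter

# Assumed toy encoding: a tree is either a class label (str) or a tuple
# (feature_index, threshold, low, high); follow `low` when
# x[feature_index] < threshold, otherwise follow `high`.

def eval_tree(tree, x):
    while not isinstance(tree, str):
        feature, threshold, low, high = tree
        tree = low if x[feature] < threshold else high
    return tree

def majority(labels):
    votes = Counter(labels)
    # max() keeps the first maximum, so ties go to the alphabetically first label.
    return max(sorted(votes), key=votes.get)

def eval_forest(forest, x):
    # One tree walk per tree: cost grows linearly with the number of trees.
    return majority(eval_tree(t, x) for t in forest)

def restrict(tree, feature, threshold, outcome):
    """Simplify `tree` assuming (x[feature] < threshold) == outcome."""
    if isinstance(tree, str):
        return tree
    f, t, low, high = tree
    if f == feature:
        if outcome and t >= threshold:        # x < threshold <= t, so x < t
            return restrict(low, feature, threshold, outcome)
        if not outcome and t <= threshold:    # x >= threshold >= t, so not x < t
            return restrict(high, feature, threshold, outcome)
    return (f, t,
            restrict(low, feature, threshold, outcome),
            restrict(high, feature, threshold, outcome))

def aggregate(forest):
    """Return a single structure that agrees with eval_forest on every input."""
    for tree in forest:
        if not isinstance(tree, str):
            feature, threshold = tree[0], tree[1]
            low = aggregate([restrict(t, feature, threshold, True) for t in forest])
            high = aggregate([restrict(t, feature, threshold, False) for t in forest])
            # Drop the test when both outcomes lead to the same result.
            return low if low == high else (feature, threshold, low, high)
    # Every tree has been reduced to a leaf: take the vote once, at build time.
    return majority(forest)

forest = [
    (0, 5, "a", "b"),
    (1, 3, "a", (0, 7, "a", "b")),
    (0, 2, "b", "a"),
]
diagram = aggregate(forest)
print(eval_forest(forest, (6, 4)), eval_tree(diagram, (6, 4)))
```

After aggregation, classifying an input is a single walk through `diagram`, independent of how many trees the forest had, which is the intuition behind the running-time claim in the abstract.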

Keywords

Random forest, Algebraic decision diagram, Aggregation, Explainability, Interpretability, Running time optimisation, Memory optimisation

Subjects based on RSWK

Decision graph, Aggregation, Running time, Explanation, Classifier <computer science>, Memory <computer science>, Optimisation

URI

http://hdl.handle.net/2003/40778
http://dx.doi.org/10.17877/DE290R-22635

Collections

LS 14 Software Engineering
