Comparing Knowledge-Based Sampling to Boosting

Scholz, Martin

Files

tr26-05.pdf (117.94 KB)

Date

2005-10-12T06:58:52Z

Authors

Scholz, Martin

Abstract

Boosting algorithms for classification are based on altering the initial distribution assumed to underlie a given example set. The idea of knowledge-based sampling (KBS) is to sample out prior knowledge and previously discovered patterns, so that subsequently applied data mining algorithms automatically focus on novel patterns without any need to adjust the base algorithm. This sampling strategy anticipates a user's expectation, based on a set of constraints on how to adjust the distribution. In the classified case KBS is similar to boosting. This article shows that a specific, very simple KBS algorithm is able to boost weak base classifiers. It discusses differences from AdaBoost.M1 and LogitBoost, and it compares the performance of these algorithms empirically in terms of predictive accuracy, the area under the ROC curve, and squared error.

Keywords

AdaBoost.M1, Boosting algorithm, Classification, Data mining, Knowledge-based sampling, LogitBoost, ROC curve measure, Sampling strategy

URI

http://hdl.handle.net/2003/21652
http://dx.doi.org/10.17877/DE290R-14491

Collections

Sonderforschungsbereich (SFB) 475
