Comparing Knowledge-Based Sampling to Boosting

dc.contributor.authorScholz, Martin
dc.date.accessioned2005-10-12T06:58:52Z
dc.date.available2005-10-12T06:58:52Z
dc.date.issued2005-10-12T06:58:52Z
dc.description.abstractBoosting algorithms for classifcation are based on altering the initial distribution assumed to underly a given example set. The idea of knowledge-based sampling (KBS) is to sample out prior knowledgeand previously discovered patterns to achieve that subsequently applied data mining algorithms automatically focus on novel patterns without any need to adjust the base algorithm. This sampling strategy anticipates a user's expectation based on a set of constraints how to adjust the distribution. In the classified case KBS is similar to boosting. This article shows that a specific, very simple KBS algorithm is able to boost weak base classifiers. It discusses differences to AdaBoost.M1 and LogitBoost, and it compares performances of these algorithms empirically in terms of predictive accuracy, the area under the ROC curve measure, and squared error.de
dc.format.extent120774 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/2003/21652
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-14491
dc.language.isoen
dc.subjectAdaboost.M1en
dc.subjectBoosting algorithmen
dc.subjectClassificationen
dc.subjectData miningen
dc.subjectKnowledge-based samplingen
dc.subjectLogitBoosten
dc.subjectROC curve measureen
dc.subjectSampling strategyen
dc.subject.ddc004
dc.titleComparing Knowledge-Based Sampling to Boostingen
dc.typeText
dc.type.publicationtypereporten
dcterms.accessRightsopen access
eldorado.dnb.deposittrue

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
tr26-05.pdf
Size:
117.94 KB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.91 KB
Format:
Item-specific license agreed upon to submission
Description: