Robust Learning from Bites for Data Mining

dc.contributor.authorChristmann, Andreas
dc.contributor.authorHubert, Mia
dc.contributor.authorSteinwart, Ingo
dc.date.accessioned2006-02-10T11:10:45Z
dc.date.available2006-02-10T11:10:45Z
dc.date.issued2006-02-10T11:10:45Z
dc.description.abstractSome methods from statistical machine learning and from robust statistics have two drawbacks. Firstly, they are computer-intensive such that they can hardly be used for massive data sets, say with millions of data points. Secondly, robust and non-parametric confidence intervals for the predictions according to the fitted models are often unknown. Here, we propose a simple but general method to overcome these problems in the context of huge data sets. The method is scalable to the memory of the computer, can be distributed on several processors if available, and can help to reduce the computation time substantially. Our main focus is on robust general support vector machines (SVM) based on minimizing regularized risks. The method offers distribution-free confidence intervals for the median of the predictions. The approach can also be helpful to fit robust estimators in parametric models for huge data sets.en
dc.format.extent300829 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/2003/22177
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-1956
dc.language.isoen
dc.subjectBreakdown pointen
dc.subjectConvex risk minimizationen
dc.subjectData miningen
dc.subjectDistributed computingen
dc.subjectInfluence functionen
dc.subjectLogistic regressionen
dc.subjectRobustnessen
dc.subjectScalabilityen
dc.subjectStatistical machine learningen
dc.subjectSupport vector machineen
dc.subject.ddc004
dc.titleRobust Learning from Bites for Data Miningen
dc.typeText
dc.type.publicationtypereporten
dcterms.accessRightsopen access
eldorado.dnb.deposittrue

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
tr03-06.pdf
Größe:
293.78 KB
Format:
Adobe Portable Document Format
Beschreibung:
DNB

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
1.92 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: