Learning from Label Proportions by Optimizing Cluster Model Selection

dc.contributor.authorMorik, Katharina
dc.contributor.authorStolpe, Marco
dc.contributor.editorGunopulos, D.
dc.date.accessioned2012-02-28T15:32:38Z
dc.date.available2012-02-28T15:32:38Z
dc.date.issued2012-02-28
dc.description.abstractIn a supervised learning scenario, we learn a mapping from input to output values, based on labeled examples. Can we learn such a mapping also from groups of unlabeled observations, only knowing, for each group, the proportion of observations with a particular label? Solutions have real world applications. Here, we consider groups of steel sticks as samples in quality control. Since the steel sticks cannot be marked individually, for each group of sticks it is only known how many sticks of high (low) quality it contains. We want to predict the achieved quality for each stick before it reaches the final production station and quality control, in order to save resources. We define the problem of learning from label proportions and present a solution based on clustering. Our method empirically shows a better prediction performance than recent approaches based on probabilistic SVMs, Kernel k-Means or conditional exponential models.en
dc.identifier.urihttp://hdl.handle.net/2003/29343
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-3964
dc.language.isoende
dc.relation.ispartofECML PKDD 2011, Part IIIen
dc.subject.ddc004
dc.titleLearning from Label Proportions by Optimizing Cluster Model Selectionen
dc.typeTextde
dc.type.publicationtypeconferenceObjectde
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
stolpe_morik_2011a.pdf
Size:
795.58 KB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.85 KB
Format:
Item-specific license agreed upon to submission
Description: