On the Complexity of Rule Discovery from Distributed Data

dc.contributor.authorScholz, Martin
dc.date.accessioned2005-10-12T06:57:36Z
dc.date.available2005-10-12T06:57:36Z
dc.date.issued2005-10-12T06:57:36Z
dc.description.abstractThis paper analyses the complexity of rule selection for supervised learning in distributed scenarios. The selection of rules is usually guided by a utility measure such as predictive accuracy or weighted relative accuracy. Other examples are support and confidence, known from association rule mining. A common strategy to tackle rule selection from distributed data is to evaluate rules locally on each dataset. While this works well for homogeneously distributed data, this work proves limitations of this strategy if distributions are allowed to deviate. To identify those subsets for which local and global distributions deviate may be regarded as an interesting learning task of its own, explicitly taking the locality of data into account. This task can be shown to be basically as complex as discovering the globally best rules from local data. Based on the theoretical results some guidelines for algorithm design are derived.en
dc.format.extent253791 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/2003/21647
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-14497
dc.language.isoen
dc.subjectRule selectionen
dc.subjectSupervised learningen
dc.subject.ddc004
dc.titleOn the Complexity of Rule Discovery from Distributed Dataen
dc.typeText
dc.type.publicationtypereporten
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
tr31-05.pdf
Size:
247.84 KB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.91 KB
Format:
Item-specific license agreed upon to submission
Description: