On the Complexity of Rule Discovery from  Distributed Data

Scholz, Martin

On the Complexity of Rule Discovery from Distributed Data

dc.contributor.author	Scholz, Martin
dc.date.accessioned	2005-10-12T06:57:36Z
dc.date.available	2005-10-12T06:57:36Z
dc.date.issued	2005-10-12T06:57:36Z
dc.description.abstract	This paper analyses the complexity of rule selection for supervised learning in distributed scenarios. The selection of rules is usually guided by a utility measure such as predictive accuracy or weighted relative accuracy. Other examples are support and confidence, known from association rule mining. A common strategy to tackle rule selection from distributed data is to evaluate rules locally on each dataset. While this works well for homogeneously distributed data, this work proves limitations of this strategy if distributions are allowed to deviate. To identify those subsets for which local and global distributions deviate may be regarded as an interesting learning task of its own, explicitly taking the locality of data into account. This task can be shown to be basically as complex as discovering the globally best rules from local data. Based on the theoretical results some guidelines for algorithm design are derived.	en
dc.format.extent	253791 bytes
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/2003/21647
dc.identifier.uri	http://dx.doi.org/10.17877/DE290R-14497
dc.language.iso	en
dc.subject	Rule selection	en
dc.subject	Supervised learning	en
dc.subject.ddc	004
dc.title	On the Complexity of Rule Discovery from Distributed Data	en
dc.type	Text
dc.type.publicationtype	report	en
dcterms.accessRights	open access
eldorado.dnb.deposit	true

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1

Name:: tr31-05.pdf
Größe:: 247.84 KB
Format:: Adobe Portable Document Format
Beschreibung:: DNB

Herunterladen

Lizenzbündel

Gerade angezeigt 1 - 1 von 1

Name:: license.txt
Größe:: 1.91 KB
Format:: Item-specific license agreed upon to submission
Beschreibung:

Herunterladen

Sammlungen

Sonderforschungsbereich (SFB) 475