Demixing empirical distribution functions

dc.contributor.author: Munteanu, Alexander
dc.contributor.author: Wornowizki, Max
dc.date.accessioned: 2018-10-12T08:28:28Z
dc.date.available: 2018-10-12T08:28:28Z
dc.date.issued: 2014-02
dc.description.abstract: We consider the two-sample homogeneity problem where the information contained in two samples is used to test the equality of the underlying distributions. For instance, in cases where one sample stems from a simulation procedure modelling the data generating process of the other sample consisting of observed data, a mere rejection of the null hypothesis is unsatisfactory. Instead, the data analyst would like to know how the simulation can be improved while changing it as little as possible. Based on the popular Kolmogorov-Smirnov test and a general nonparametric mixture model, we propose an algorithm which determines an appropriate correction distribution function describing how the simulation procedure can be corrected. It is constructed in such a way that complementing the simulation sample by a given proportion of observations sampled from the correction distribution does not lead to a rejection of the null hypothesis of equal distributions when the modified and the observed sample are compared. We prove our algorithm to run in linear time and evaluate it on simulated and real spectrometry data showing that it leads to intuitive results. We illustrate its practical performance considering runtime as well as accuracy in a real-world scenario. [en]
dc.identifier.uri: http://hdl.handle.net/2003/37171
dc.identifier.uri: http://dx.doi.org/10.17877/DE290R-19167
dc.language.iso: en [de]
dc.relation.ispartofseries: Technical report / Sonderforschungsbereich Verfügbarkeit von Information durch Analyse unter Ressourcenbeschränkung;2/2014
dc.subject.ddc: 004
dc.title: Demixing empirical distribution functions [en]
dc.type: Text [de]
dc.type.publicationtype: report [de]
dcterms.accessRights: open access
eldorado.secondarypublication: false [de]

Files

Original bundle
Name: munteanu_wornowizki_2014a.pdf
Size: 607.76 KB
Format: Adobe Portable Document Format
Description: DNB
License bundle
Name: license.txt
Size: 4.85 KB
Format: Item-specific license agreed upon to submission
