Full metadata record
DC FieldValueLanguage
dc.contributor.advisorMorik, Katharina-
dc.contributor.authorBunse, Mirko-
dc.date.accessioned2022-12-16T06:42:55Z-
dc.date.available2022-12-16T06:42:55Z-
dc.date.issued2022-
dc.identifier.urihttp://hdl.handle.net/2003/41174-
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-23021-
dc.description.abstractThis thesis explores the fundamental aspects of machine learning, which are involved with acquiring knowledge in the research field of astro-particle physics. This research field substantially relies on machine learning methods, which reconstruct the properties of astro-particles from the raw data that specialized telescopes record. These methods are typically trained from resource-intensive simulations, which reflect the existing knowledge about the particles—knowledge that physicists strive to expand. We study three fundamental machine learning tasks, which emerge from this goal. First, we address ordinal quantification, the task of estimating the prevalences of ordered classes in sets of unlabeled data. This task emerges from the need for testing the agreement of astro-physical theories with the class prevalences that a telescope observes. To this end, we unify existing methods on quantification, propose an alternative optimization process, and develop regularization techniques to address ordinality in quantification problems, both in and outside of astro-particle physics. These advancements provide more accurate reconstructions of the energy spectra of cosmic gamma ray sources and, hence, support physicists in drawing conclusions from their telescope data. Second, we address learning under class-conditional label noise. More particularly, we focus on a novel setting, in which one of the class-wise noise rates is known and one is not. This setting emerges from a data acquisition protocol, through which astro-particle telescopes simultaneously observe a region of interest and several background regions. We enable learning under this type of label noise with algorithms for consistent, noise-aware decision thresholding. These algorithms yield binary classifiers, which outperform the existing state-of-the-art in gamma hadron classification with the FACT telescope. Moreover, unlike the state-of-the-art, our classifiers are entirely trained from the real telescope data and thus do not require any resource-intensive simulation. Third, we address active class selection, the task of actively finding those proportions of classes which optimize the classification performance. In astro-particle physics, this task emerges from the simulation, which produces training data in any desired class proportions. We clarify the implications of this setting from two theoretical perspectives, one of which provides us with bounds of the resulting classification performance. We employ these bounds in a certificate of model robustness, which declares a set of class proportions for which the model is accurate with a high probability. We also employ these bounds in an active strategy for class-conditional data acquisition. Our strategy uniquely considers existing uncertainties about those class proportions that have to be handled during the deployment of the classifier, while being theoretically well-justified.en
dc.language.isoende
dc.subjectMachine learningen
dc.subjectSupervised learningen
dc.subjectOrdinal quantificationen
dc.subjectClass-conditional label noiseen
dc.subjectActive class selectionen
dc.subjectCherenkov telescopeen
dc.subjectGamma-ray astronomyen
dc.subjectSimulationen
dc.subjectData analysisen
dc.subjectBig dataen
dc.subject.ddc004-
dc.titleMachine learning for acquiring knowledge in astro-particle physicsen
dc.typeTextde
dc.contributor.refereeSebastiani, Fabrizio-
dc.date.accepted2022-11-17-
dc.type.publicationtypedoctoralThesisde
dc.subject.rswkMaschinelles Lernende
dc.subject.rswkÜberwachtes Lernende
dc.subject.rswkQuantifizierungde
dc.subject.rswkRauschende
dc.subject.rswkAuswahlverfahrende
dc.subject.rswkCherenkov Telescope Arrayde
dc.subject.rswkGammaastronomiede
dc.subject.rswkSimulationde
dc.subject.rswkDatenanalysede
dc.subject.rswkBig Datade
dcterms.accessRightsopen access-
eldorado.secondarypublicationfalsede
Appears in Collections:LS 08 Künstliche Intelligenz

Files in This Item:
File Description SizeFormat 
Dissertation.pdfDNB1.5 MBAdobe PDFView/Open


This item is protected by original copyright



This item is protected by original copyright rightsstatements.org