Smart access strategies for Data-­Centric processing

dc.contributor.advisorTeubner, Jens
dc.contributor.authorBerens, Maximilian
dc.contributor.refereeSattler, Kai-Uwe
dc.date.accepted2025-09-01
dc.date.accessioned2026-05-12T07:18:56Z
dc.date.issued2025
dc.description.abstractData movement in analytical database systems is a critical bottleneck, driving energy consumption and infrastructure costs. In the context of storage access, this thesis contributes techniques to mitigate these costs through Cooperative Refinement, the symbiotic interplay between indexing and data-centric processing. First, we address the intersection of these two fields: We by analyze probabilistic data structures, such as Bloom filters and binary sketches, as candidates for Processing-in-NAND (PiN) due to their high error tolerance. To expand the applicability of current PiN architectures, we propose a scheme for emulating inequality comparisons inside NAND. To maximize the potential of fine-granular index information, we present early work on Gravity Store, a data-centric in-storage materialization engine for declarative analytics. The second major contribution is Team-based indexing, a generalization of bitmap indexing for selective, high-dimensional range queries. By forming "Teams" of moderately-sized attribute subsets, this strategy improves runtime efficiency and reduces storage overhead compared to traditional indexing. We address the central challenges of efficient index intersection and Team composition. Finally, we introduce TeamBench, a benchmark generator specifically designed to evaluate these index intersection performances at scale.en
dc.identifier.urihttp://hdl.handle.net/2003/44868
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-26633
dc.language.isoen
dc.subjectLarge-scale databasesen
dc.subjectStorageen
dc.subjectIndexingen
dc.subjectSSDen
dc.subjectData-centric processingen
dc.subjectProcessing-in-memoryen
dc.subjectNANDen
dc.subject.ddc004
dc.subject.rswkDatenbankde
dc.subject.rswkSpeicher (Informatik)de
dc.subject.rswkAutomatische Indexierungde
dc.subject.rswkData-centric computingde
dc.subject.rswkIn-Memory-Datenbankde
dc.subject.rswkNAND-Gatterde
dc.titleSmart access strategies for Data-­Centric processingen
dc.typeText
dc.type.publicationtypePhDThesis
dcterms.accessRightsopen access
eldorado.dnb.deposittrue
eldorado.secondarypublicationfalse

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
Dissertation_Berens.pdf
Größe:
2 MB
Format:
Adobe Portable Document Format
Beschreibung:
DNB

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
4.82 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: