Smart access strategies for Data-Centric processing
| dc.contributor.advisor | Teubner, Jens | |
| dc.contributor.author | Berens, Maximilian | |
| dc.contributor.referee | Sattler, Kai-Uwe | |
| dc.date.accepted | 2025-09-01 | |
| dc.date.accessioned | 2026-05-12T07:18:56Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Data movement in analytical database systems is a critical bottleneck, driving energy consumption and infrastructure costs. In the context of storage access, this thesis contributes techniques to mitigate these costs through Cooperative Refinement, the symbiotic interplay between indexing and data-centric processing. First, we address the intersection of these two fields: We by analyze probabilistic data structures, such as Bloom filters and binary sketches, as candidates for Processing-in-NAND (PiN) due to their high error tolerance. To expand the applicability of current PiN architectures, we propose a scheme for emulating inequality comparisons inside NAND. To maximize the potential of fine-granular index information, we present early work on Gravity Store, a data-centric in-storage materialization engine for declarative analytics. The second major contribution is Team-based indexing, a generalization of bitmap indexing for selective, high-dimensional range queries. By forming "Teams" of moderately-sized attribute subsets, this strategy improves runtime efficiency and reduces storage overhead compared to traditional indexing. We address the central challenges of efficient index intersection and Team composition. Finally, we introduce TeamBench, a benchmark generator specifically designed to evaluate these index intersection performances at scale. | en |
| dc.identifier.uri | http://hdl.handle.net/2003/44868 | |
| dc.identifier.uri | http://dx.doi.org/10.17877/DE290R-26633 | |
| dc.language.iso | en | |
| dc.subject | Large-scale databases | en |
| dc.subject | Storage | en |
| dc.subject | Indexing | en |
| dc.subject | SSD | en |
| dc.subject | Data-centric processing | en |
| dc.subject | Processing-in-memory | en |
| dc.subject | NAND | en |
| dc.subject.ddc | 004 | |
| dc.subject.rswk | Datenbank | de |
| dc.subject.rswk | Speicher (Informatik) | de |
| dc.subject.rswk | Automatische Indexierung | de |
| dc.subject.rswk | Data-centric computing | de |
| dc.subject.rswk | In-Memory-Datenbank | de |
| dc.subject.rswk | NAND-Gatter | de |
| dc.title | Smart access strategies for Data-Centric processing | en |
| dc.type | Text | |
| dc.type.publicationtype | PhDThesis | |
| dcterms.accessRights | open access | |
| eldorado.dnb.deposit | true | |
| eldorado.secondarypublication | false |
Dateien
Originalbündel
1 - 1 von 1
Lade...
- Name:
- Dissertation_Berens.pdf
- Größe:
- 2 MB
- Format:
- Adobe Portable Document Format
- Beschreibung:
- DNB
Lizenzbündel
1 - 1 von 1
Lade...
- Name:
- license.txt
- Größe:
- 4.82 KB
- Format:
- Item-specific license agreed upon to submission
- Beschreibung:
