Seconds first!

dc.contributor.advisorMutzel, Petra
dc.contributor.authorBrinkjost, Tobias
dc.contributor.refereeRauh, Daniel
dc.date.accepted2019-12-18
dc.date.accessioned2020-02-18T07:23:06Z
dc.date.available2020-02-18T07:23:06Z
dc.date.issued2019
dc.description.abstractThe visualization of a protein is hardly imaginable without secondary structure elements (SSEs). But SSEs play a by far more important role in the field of chemical biology, apart from the creation of fancy protein images. They are essential in structure-based analyses due to their impact on secondary structure prediction and structural protein alignment. However, their proper classification is a challenging issue. There are more than 30 tools available that underline the subjective character of the classification of SSEs. But only two tools have dominated this field of research for decades. Why is that? What are the advantages of hydrogen bond-based methods, despite the fact that they are often unable to assign left-handed helices, PPI-helices, or bent structures? We have developed SCOT, a novel multipurpose software that incorporates the benefits of a multitude of approaches for the classification of helices, strands, and turns in proteins. To our knowledge, it is the very first method that not only captures a variety of rare and basic SSEs (rightand left-handed a-, 310-, 2.27-, plus right-handed p-helices, PPII helices, and b-sheets) in protein structures, but also their irregularities in a single step, and provides proper output and visualization options. SCOT combines the benefits of geometry-based and hydrogen bond-based methods by using hydrogen bond and geometric information to gain insights into the structural space of proteins. Its dual character enables robust classifications of SSEs without major influence on the geometric regularity of the assigned SSEs. In consequence, it is perfectly suited to automatically assign SSEs for subsequent helix- and strand-based protein alignments with methods such as LOCK2. This is especially supported by our elaborate kink detection. All of these benefits are clearly demonstrated by our results. Together with the easy to use visualization of assignments by the means of PyMOL scripts, SCOT enables a comprehensive analysis of regular backbone geometries in protein structures. The high number of available secondary structure assignment methods (SSAMs) hampers a straight forward selection of the most suitable one for a certain application. In addition, relying on the most frequently cited tool must not necessarily result in an optimal choice. Thus, we have developed SNOT to fill the gap of a tool that provides a multitude of objective and rational criteria for the comparison and evaluation of different SSAMs. It provides exhaustive information on geometrical parameters, residue statistics, the consistency with respect to protein flexibility and quality, SSE overlaps and sequence coverage, and the consent of two classifications. We used SNOT to compare SCOT to DSSP, STRIDE, ASSP, SEGNO, DISICL, and SHAFT. The results point toward SCOT’s unique features as a solitary multipurpose SSAM with optimal performance for numerous challenges: the support of commonly observed and rare SSEs, the comprehensive assignment of turn types, the elaborate kink detection, the geometric consistency of the SSEs, the robustness with respect to structure quality and protein flexibility, and its superior suitability for SSE-based protein structure alignments. Our analyses of alternative p- and PPII-helix assignments indicate challenges which we try to address with our methodology. There are also hints toward a correlation between the SSEs of a protein and its function. Koch and Waldmann proposed that similar arrangements of SSEs in the neighborhood of a ligand binding site (ligand-sensing cores) can recognize similar scaffolds in disregard of the overall fold. We have developed SLOT to discover these unrevealed similarities which are solely based on SSEs. Its graph-based methodology is able to mimic the geometry of SSEs by using a flexible multi-point representation instead of a straight vector. These points are used to capture the geometry of a protein, i.e., the arrangement of SSEs, in distance matrices. This unique representation enables the comparison of protein structures regardless of their SSEs’ directions or SSE sequence. An optimized algorithm to determine the MCS of two given graphs is used to calculate their structural similarity and to ensure fast runtimes. What sets SLOT further apart from its 40 competitors is that it can be used with any external secondary structure classification. Our exhaustive evaluation highlights the benefits of SCOT for the use with SLOT and also covers our optimizations of the comparison algorithm. It additionally questions the applicability of the concept of ligand-sensing cores. In the end, SCOT, SNOT, and SLOT were the beacons on our journey to answer the question: Are there similar ligand-sensing cores or undiscovered structural similarities solely based on SSEs?en
dc.identifier.urihttp://hdl.handle.net/2003/38692
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-20611
dc.language.isoende
dc.subjectSecondary structure elementsen
dc.subjectProteinen
dc.subjectProtein structureen
dc.subjectSCOTen
dc.subjectSNOTen
dc.subjectSLOTen
dc.subject.ddc004
dc.subject.rswkProteinede
dc.subject.rswkSekundärstrukturde
dc.titleSeconds first!en
dc.title.alternativeA thesis dedicated to secondary structure elementsen
dc.typeTextde
dc.type.publicationtypedoctoralThesisde
dcterms.accessRightsopen access
eldorado.secondarypublicationfalsede

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
main.pdf
Size:
33.56 MB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.85 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections