Dynamic Expressibility under
Complex Changes
Dissertation
zur Erlangung des Grades eines
Doktors der Naturwissenschaften
der Technischen Universität Dortmund
an der Fakultät für Informatik
von
Nils Vortmeier
Dortmund
2019
Tag der mündlichen Prüfung: 20. November 2019
Dekan: Prof. Dr.-Ing. Gernot Fink
Gutachter:
Prof. Dr. Thomas Schwentick
Prof. Dr. Victor Vianu
Abstract
Whenever a database is changed, any previously computed and stored result of a query
evaluation on that database becomes invalid and needs to be updated. This update can in
principle be conducted by re-evaluating the query from scratch. However, this approach
might be too inefficient, especially for large databases. Instead, it is often advantageous
to make use of the old query result and possibly further stored auxiliary information.
A descriptive framework for studying this dynamic setting of query evaluation was
introduced by Patnaik and Immerman in 1994, independently of a very similar formalisa-
tion by Dong, Su and Topor that was first published in 1992 and 1993. In this dynamic
descriptive complexity framework, auxiliary relations are stored that hold the result of
a fixed query as well as further auxiliary information. The update of these auxiliary
relations after a change to the input database is specified by first-order formulas that
may refer to the input database as well as to the stored auxiliary relations. The dynamic
complexity class DynFO contains all queries for which the result can be maintained this
way under some specified set of changes.
Most of the previous work in the DynFO framework focussed on changes that are
insertions and deletions of single tuples. In this thesis we consider also more complex
changes, in particular we study changes that insert or delete a set of tuples of bounded
size, and changes that are specified by first-order definable queries on the input.
The main contribution of this thesis is an investigation of the expressive power of
first-order update formulas in the DynFO framework with respect to these classes of
complex changes. We show strong DynFO maintainability results under complex changes
for queries such as the reachability query for directed and undirected graphs. We also
give non-maintainability results for certain restricted dynamic settings.
Some of our maintainability results rely on a new technique of dynamic query mainte-
nance. This technique is also used to show that all queries that are definable in monadic
second-order logic are in DynFO under single-edge changes of input graphs of bounded
treewidth.
We experimentally evaluate the dynamic approach and compare the performance of
query evaluation from scratch with the performance of different strategies of dynamic
query maintenance under complex changes.
iii

Acknowledgements
When I started my Bachelor studies in Computer Science, I thought I would end up
doing robotics. Then I took Thomas Schwentick’s logics course, afterwards basically
every other course he offered, and now I try to obtain a doctorate working on descriptive
complexity theory. How plans can change!
Thomas accompanied me during my whole studies, and his enthusiasm and dedication
for research and teaching impressed me deeply. I am indebted to him for his guidance,
encouragement and constant support. During my time as a PhD student I always felt his
trust, and that he is convinced I can actually finish this project, even when I was not
convinced myself. I thank him for helpful criticism, honest praise, and occasional visits
to the stadium. He really deserves the title Doktorvater.
This thesis would not exist without Thomas Zeume. Literally everything I did in
research is joint work with him, and our tea breaks were the place where many ideas
took shape. He is to me collaborator, advisor, critic, and a friend. Thank you very much
for introducing me to the fun side of research!
The visits in Chennai were one of the most productive times during my PhD studies.
I am thankful to Samir Datta for hosting Thomas and me, and for sharing his many
ideas. Many thanks to him and to Anish Mukherjee for a very successful collaboration!
I also thank my colleagues at Lehrstuhl 1 in Dortmund for the great teamwork, many
inspirations, a delightful time, and that they always waited for me at lunch. Special
thanks to Gaetano for being funny sometimes!
I am thankful and honoured that Victor Vianu agreed to review this thesis and to be
part of my examination committee. I acknowledge the financial support by Deutsche
Forschungsgemeinschaft under grant SCHW 678/6-2.
Many people distracted me from research and kept me kind of sane. I thank my friends
for the time together: Christopher, especially for the wonderful journeys, the members of
ProStuKo and Team Spieleabend, the inhabitants of Belgarien, the ASG-Helden, and all
those people from Frohlinde, the MAR, Frohlinder Hopfenkinder, and Callo-Theater.
Much of what we achieve in life depends on the roots we have. I thank my family,
especially my mother and my grandparents, for their love and support. Thank you for
enabling me to do all this.
v

Contents
Abstract iii
Acknowledgements v
1 Introduction 1
2 The dynamic setting 11
2.1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.2 The dynamic complexity framework . . . . . . . . . . . . . . . . . . . . . 16
2.3 Variants of the setting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.4 Outlook and bibliographic remarks . . . . . . . . . . . . . . . . . . . . . . 23
3 The Muddling technique: Bounding the number of changes 25
3.1 Notions used for the Muddling technique . . . . . . . . . . . . . . . . . . . 27
3.2 From short to arbitrary long change sequences . . . . . . . . . . . . . . . 28
3.3 Outlook and bibliographic remarks . . . . . . . . . . . . . . . . . . . . . . 32
4 Expressibility under single-tuple changes 35
4.1 A dynamic version of Courcelle’s Theorem . . . . . . . . . . . . . . . . . . 38
4.1.1 Treewidth and tree decompositions . . . . . . . . . . . . . . . . . . 39
4.1.2 Showcase 3-colourability for graphs of bounded treewidth . . . . . 42
4.1.3 MSO-definable queries on graphs of bounded treewidth . . . . . . . 45
A Feferman–Vaught-type composition theorem . . . . . . . . 47
Maintaining MSO-definable decision problems . . . . . . . . . 52
Extension to optimisation problems . . . . . . . . . . . . . . . 56
4.2 An arity hierarchy for DynProp . . . . . . . . . . . . . . . . . . . . . . . . 58
4.2.1 Expressibility results for the arity hierarchy . . . . . . . . . . . . . 62
4.2.2 Proving inexpressibility results for DynProp . . . . . . . . . . . . . 66
4.2.3 Inexpressibility results for the arity hierarchy . . . . . . . . . . . . 71
4.2.4 Obtaining the arity hierarchy . . . . . . . . . . . . . . . . . . . . . 73
4.2.5 Allowing quantification . . . . . . . . . . . . . . . . . . . . . . . . 74
4.3 Outlook and bibliographic remarks . . . . . . . . . . . . . . . . . . . . . . 76
5 Expressibility under definable changes 79
5.1 Logically definable change operations . . . . . . . . . . . . . . . . . . . . . 82
vii
5.2 Maintaining Reachability under first-order definable insertions . . . . . . . 83
5.2.1 Undirected Reachability . . . . . . . . . . . . . . . . . . . . . . . . 84
5.2.2 Towards efficient dynamic programs . . . . . . . . . . . . . . . . . 87
5.2.3 Reachability in acyclic graphs . . . . . . . . . . . . . . . . . . . . . 92
5.3 Inexpressibility results for Reachability . . . . . . . . . . . . . . . . . . . . 93
5.3.1 Inexpressibility of Reachability with Restricted Auxiliary Relations 93
5.3.2 Inexpressibility of Reachability in the Quantifier-Free Fragment . . 96
5.4 Maintaining formal languages . . . . . . . . . . . . . . . . . . . . . . . . . 100
5.5 AC1 queries under parameter-free changes . . . . . . . . . . . . . . . . . . 108
5.6 Outlook and bibliographic remarks . . . . . . . . . . . . . . . . . . . . . . 110
6 Expressibility under size-restricted bulk changes 113
6.1 Changing sets of tuples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
6.2 Utilising second-order logics with restricted quantifiers . . . . . . . . . . . 116
6.2.1 The expressive power of restricted second-order quantification . . . 117
6.2.2 Reachability, regular languages and changes of polylogarithmic size 120
6.3 The linear algebra approach for bulk changes . . . . . . . . . . . . . . . . 126
6.3.1 Basic concepts from linear algebra . . . . . . . . . . . . . . . . . . 129
6.3.2 Distances and update formulas that use majority . . . . . . . . . . 129
6.3.3 Towards distances in DynFO . . . . . . . . . . . . . . . . . . . . . . 133
6.4 Outlook and bibliographic remarks . . . . . . . . . . . . . . . . . . . . . . 138
7 Experimental evaluation of the dynamic approach 141
7.1 Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143
7.2 Experiments and discussions . . . . . . . . . . . . . . . . . . . . . . . . . . 144
7.3 Outlook and bibliographic remarks . . . . . . . . . . . . . . . . . . . . . . 149
8 Conclusion 151
Bibliography 157
CHAPTER 1
Introduction
In our usual understanding, an algorithm works in three (potentially intertwined) stages:
it reads its input, computes the corresponding result, which it then outputs. This way it
can process a series of various unrelated inputs one after the other.
This model is often not suitable for today’s data management systems, which continu-
ously need to solve some computational problems on evolving data sets. Social networks
analyse the community structure and display information based on privacy settings and
user relationships, while the users frequently change their connections to other users.
Web search engines and other recommender systems face a changing set of items as
well as new information on their users. Routing information, for private and public
transport, network traffic and other applications, needs to be updated if links are faulty,
fail altogether, or are newly established.
In principle, all these challenges can of course be handled by discarding previously
computed data and applying a standard algorithm after every modification of the input.
This re-computation from scratch might however not be possible due to performance
reasons, especially if the input size is huge; often it is also not necessary: small changes
in a traffic network will probably only trigger very local adaptations of the routing
information, and whether two users are “friends” in a social network might only affect
them and their other friends. It is therefore plausible that it is sufficient to update a
previously computed result locally to incorporate small changes of the input, and that
such an update is computationally cheaper than re-computing the result from scratch.
A routine that performs such an update naturally has access to the previously computed
result, the former input and its modified part. Additionally, it may use further stored
auxiliary information if this facilitates the update. Of course, this information then needs
to be updated as well. If such an update routine can continuously provide the correct
answer to some query on a changing input, we say that it maintains this query.
1
Chapter 1 Introduction
There are (at least) two approaches on how such an update routine can be specified.
Following the dynamic algorithms approach, specialised data structures are stored as
auxiliary information and the goal is to design algorithms that need less time to update
a result than to re-compute it from scratch. See [Demetrescu et al. 2010] for an overview
on upper bounds in this field, and for example [Henzinger and Fredman 1998; Miltersen
1999; Abboud and Williams 2014] for lower bounds in different settings.
In this thesis we study the descriptive approach, which is mainly motivated from
database applications, as discussed further below. Within this approach, the update
of a query result and of the auxiliary information is declaratively defined by a set of
logical formulas. We are particularly interested in updates that are specified via first-
order formulas and variants thereof. Consequently, the input, the query result and the
auxiliary information are encoded by relational structures, that is, plain relations over
some underlying domain. The complexity measure we are interested in is, above all, the
precise logic we need to express updates that are able to maintain a query result.
The dynamic descriptive complexity framework Very similar formal frameworks
for studying the descriptive approach, in particular with updates that are expressible
in first-order logic, were proposed independently of each other by Dong, Su and Topor
[Dong and Topor 1992; Dong and Su 1993; Dong et al. 1995] and Patnaik and Immerman
[Patnaik and Immerman 1994; 1997] under the names first-order incremental evaluation
system (FOIES) and Dyn-FO, respectively.
The formalisation we use in this thesis is based on the framework of Patnaik and
Immerman. In this dynamic descriptive complexity framework, the goal of a dynamic
program is to maintain a fixed query under changes of the input. Such a dynamic program
consists of a set of first-order update formulas, which define the update of a query relation
that holds the result of the query on the input, as well as of additionally stored auxiliary
relations. The queries that can be maintained this way constitute the dynamic complexity
class DynFO.
Example 1.1. We give an example to strengthen our intuition about the descriptive
approach. The reachability query Reach that maps a directed graph G = (V,E) with
node set V and edge set E to its transitive closure relation is a fundamental query that
cannot be expressed by a first-order formula, see for example [Immerman 1999]. Here we
present the well-known result that it can be maintained using first-order update formulas
upon insertions of single edges into the graph.
Assume that in addition to the graph G a relation T is stored that equals the transitive
closure of G, that is, the set of all pairs (u, v) of nodes such that there is a directed path
from u to v in G. When an edge (a, b) is inserted into the graph G, forming the graph
G′ = (V,E ∪{(a, b)}), the updated transitive closure relation T ′ can be defined as follows.
Firstly, all paths that exist in G continue to exist in G′, and therefore T ⊆ T ′ must hold.
Secondly, every path in G′ from some node u to some node v that does not already exist
in G needs to use the edge (a, b). We can assume without loss of generality that such a
path uses this edge only once, and conclude that the sub-paths from u to a and from b
to v already exist in G.
2
So, a pair (u, v) is contained in T ′ if and only if it satisfies the first-order formula
T (u, v) ∨ (T (u, a) ∧ T (b, v)). This formula is therefore an update formula for the query
relation T with respect to changes that insert a single edge into the input graph. /
We give two justifications for the choice of first-order logic as the means of expressing an
update. As already mentioned above, the arguably strongest motivation originates from
database theory. Databases are prime examples of objects that are subject to ongoing
sequences of changes. The DynFO framework is particularly relevant for relational
databases, which can me modelled as relational structures. As queries on relational
databases are usually expressed in terms of the declarative query language SQL, relational
database management systems support the evaluation of SQL queries and are heavily
optimised to process them efficiently. The “core” of SQL corresponds to the relational
algebra, which itself corresponds to first-order logic, see [Abiteboul et al. 1995]. So,
a relational database management system can naturally implement first-order update
formulas to update a query, and it can rely on its already existing optimiser for computing
the updates quickly.
We continue Example 1.1 to illustrate this connection.
Example 1.1 (continuing from p. 2). Assume that the transitive closure relation T of
the directed graph G is stored in a database as a binary table T with attributes u and v.
The update formula T (u, v)∨ (T (u, a)∧T (b, v)) can be translated into the following SQL
query, which defines the updated relation T ′ after the insertion of the edge (a, b) into G.
SELECT *
FROM T
UNION
SELECT T1.u, T2.v
FROM T as T1, T as T2
WHERE T1.v = a AND T2.u = b;
/
We see from Example 1.1 that the dynamic approach enhances the expressive power
of core-SQL in the dynamic setting: one can maintain the transitive closure of a binary
relation if single tuples are inserted into the base relation. This is even possible if also
deletions of single tuples are allowed [Datta et al. 2018a]. In the static setting, the
relational algebra cannot express the transitive closure, as it cannot be expressed in
first-order logic, see [Immerman 1999]. This inability actually carries over to extensions
of the relational algebra that allow for aggregation, grouping and arithmetic, the most
important features that distinguish SQL from the relational algebra [Libkin 2003].
Note that the SQL:1999 standard has introduced features that enable SQL to express the
transitive closure of a relation and other recursively defined queries [Eisenberg and Melton
1999]. These features need customised optimisation techniques and implementation details
differ among different database systems [Przymus et al. 2010; Shalygina and Novikov 2017].
The declarative dynamic approach provides an alternative way to express reachability
and other recursively and non-recursively defined queries, using the core functionalities
of a database management system.
3
Chapter 1 Introduction
We now discuss another motivation for the use of first-order definable updates. The
algorithmic approach as mentioned above asks for algorithms that quickly update a
previously computed query result. The algorithms usually considered there are sequential.
With the widespread use of multi- and many-core processors, parallel algorithms become
more and more important. In complexity theory, circuits are a widely used model for
parallel computation, and first-order logic (equipped with a linear order as well as addition
and multiplication relations) naturally corresponds to the circuit complexity class uniform
AC0 [Barrington et al. 1990]. This class is defined via constant-depth circuits that may
use polynomially many “not”-, “or”- and “and”-gates, where “or”- and “and”-gates may
have arbitrary fan-in. So, a first-order formula can be evaluated in constant time by a set
of polynomially many processors working in parallel. Although this model of computation
can hardly be considered practicable, we can conclude that first-order update formulas
can be processed in a highly parallel fashion, and may provide insights on how one can
efficiently update a query result in a parallel setting.
So far we have, rather informally, introduced the class DynFO and we have motivated
the use of first-order formulas for expressing an update. However, we have not discussed
in detail which kind of changes the update formulas need to be able to process. Most
prior publications on the dynamic complexity framework focus on the restricted case of
changes that either insert a single tuple into an input relation or delete a single tuple
from such a relation. A primary goal of this thesis is to study query maintainability
under classes of more complex changes that cannot be characterised by insertions and
deletions of single (or constantly many1) tuples.
There are several kinds of complex changes one can consider. Note that arbitrary
changes are not suitable for our dynamic setting, as routines that update a query result
basically need to re-compute it from scratch when input relations can be replaced
arbitrarily. We give some examples for more restricted complex changes. First, one can
demand that the set of changed tuples satisfies some structural property, as for example
that deleted edges are in different connected components of a graph2. Second, one can
consider changes that are declaratively definable, for example as the result of a first-order
definable query on the input. The study of these changes has been mentioned as an open
problem by several authors [Patnaik and Immerman 1997; Etessami 1998; Grädel and
Siebertz 2012; Zeume 2015]. Third, one can impose quantitative restrictions and demand
that the number of changed tuples is bounded by some non-constant function in (for
example) the input size.
Before we formulate the research goals of this thesis in detail and present its contribu-
tions, we first review previous results in dynamic complexity.
Previous work in dynamic descriptive complexity We give an overview on pre-
vious work concerning the DynFO framework. More detailed lists of references on specific
1If a constant number c of tuples is affected by a change, one can usually number them as the first,
second, . . . , c-th affected tuple. If the query at hand can be maintained by first-order formulas under
single-tuple changes, these formulas can then be composed into first-order update formulas that
maintain the query also under changes of c tuples at a time.
2A similar condition is considered in [Dong and Pang 1997]; we give more details below.
4
aspects of dynamic complexity will also be given in later chapters.
The query that is most studied in the dynamic complexity framework is the reachability
query. It has been shown early that this query can be maintained by first-order update
formulas for undirected [Patnaik and Immerman 1997; Dong and Su 1998; Grädel
and Siebertz 2012] and for acyclic directed graphs [Dong and Su 1995] under single-
edge changes. Quantifier-free update formulas are sufficient to maintain reachability
for deterministic graphs [Hesse 2003b]. A series of results has been obtained for the
reachability query in arbitrary directed graphs. This query can be maintained using
uniform TC0 update circuits [Hesse 2003a] and non-uniform AC0[2] circuits [Datta et al.
2014] under single-edge changes. These circuit models are defined similarly as AC0
circuits but additionally allow for “majority”-gates as well as “parity”-gates, respectively.
Finally, Datta, Kulkarni, Mukherjee, Schwentick, and Zeume [2018a] have proven that
the reachability query is in DynFO under insertions and deletions of single edges.
Other queries that can be maintained with first-order update formulas under changes
of single tuples include the question whether two trees are isomorphic [Etessami 1998]
and whether a string belongs to a fixed regular or context-free language [Patnaik and
Immerman 1997; Hesse 2003b; Gelade et al. 2012].
Some work has been done to prove inexpressibility results for DynFO. Unfortunately,
so far there is no general result that some query is not in DynFO under insertions and
deletions of single tuples—apart from those that follow from the easy observation that all
queries that are dynamically maintainable by first-order update formulas can be evaluated
statically in polynomial time. Prior publications have however obtained inexpressibility
results under single-tuple changes for restricted settings. Dong and Su [1998] restrict the
arity of additionally stored auxiliary relations and show amongst other things that the
reachability query cannot be maintained using only unary auxiliary relations (apart from
the binary transitive closure relation). This result holds even when update formulas may
use grouping and aggregation features from SQL that go beyond the relational algebra
[Dong et al. 2003]. The tree isomorphism query cannot be maintained in a setting that
demands that initial auxiliary relations need to be defined in some logic for arbitrary
initial input structures [Grädel and Siebertz 2012]. If update formulas may not use
quantifiers, one cannot maintain the alternating reachability query or any non-regular
language [Gelade et al. 2012], and if additionally the initialisation is restricted or auxiliary
relations are at most binary, neither can the reachability query [Zeume and Schwentick
2015]. If the auxiliary relations need to functionally depend on the input structure, then
one cannot maintain whether two unordered sets are of equal size [Dong and Su 1997].
Other aspects that are studied concern the relationship of dynamic complexity classes
[Zeume and Schwentick 2017], the question whether the expressive power of update
formulas increases when auxiliary relations of higher arity are allowed [Dong and Su
1998; Dong and Zhang 2000; Zeume 2017], and problems that are complete for dynamic
complexity classes [Hesse and Immerman 2002; Weber and Schwentick 2007].
Some work has been done on changes that go beyond single-tuple changes. Frameworks
that incorporate more general changes are introduced in [Hesse and Immerman 2002;
Weber and Schwentick 2007], although in these papers the actual results only concern
changes that each modify the input relations by some constant number of tuples. In
5
Chapter 1 Introduction
[Dong et al. 1995], a class of queries including Reach is studied under insertions of
cartesian-closed sets of tuples. A set D of binary tuples is called cartesian-closed if for
each pair (a1, b1), (a2, b2) of tuples in D it also includes the tuples (a1, b2) and (a2, b1).
The reachability query can be maintained in DynFO under deletions of sets of edges
(nodes, strongly connected components) under the condition that for no pair e1, e2 of
(not necessarily distinct) deleted edges (nodes, strongly connected components) there is
a non-empty path from e1 to e2 in the original graph [Dong and Pang 1997]. Multiple
simultaneous edge contractions are considered in [Siebertz 2011].
Research questions for this thesis We think that extensions of the DynFO frame-
work that allow changes beyond single-tuple changes deserve a more thorough investigation.
Such extensions are particularly motivated by practical applications: database manage-
ment systems provide features to modify several rows of a table at once, power outages
lead to the sudden unavailability of multiple network servers, and in large social networks
many users edit their relationships at the same time. While in principle these changes
can be processed as a series of insertions and deletions of single tuples, it is plausible to
assume that it is more efficient to handle them directly.
We therefore aim to study the ability of dynamic programs and in particular first-order
update formulas to maintain queries under classes of complex changes. A primary goal is
to establish maintainability results.
Goal 1.1. Identify classes of complex changes such that DynFO maintainability results
can be proven for these changes.
It is very hard to show that a query cannot be maintained dynamically, as witnessed by
the lack of non-trivial general inexpressibility results for DynFO. Following the approach
of previous research, we also aim for corresponding results in restricted dynamic settings.
Goal 1.2. Prove that specific queries cannot be maintained by (restricted) first-order
dynamic programs under certain complex changes.
Many conceivable classes of complex changes include insertions and deletions of sin-
gle tuples as special cases, and maintainability under those complex changes implies
maintainability under single-tuple changes. It is therefore a reasonable first step to
establish that some query can be maintained dynamically under single-tuple changes
before considering more general changes. We therefore aim at providing also results of
this kind, in particular for queries that have not been studied before in our dynamic
setting.
Goal 1.3. Show DynFO maintainability results under single-edge changes for specific
queries.
One motivation for studying complex changes mentioned above is that it might be
more efficient to process complex changes directly than to handle them as a series of
single-tuple changes. Of course, such a claim should be checked. We therefore set the
goal of evaluating the direct treatment of complex changes in contrast to processing them
as a sequence of single-tuple changes.
6
Goal 1.4. Experimentally evaluate the performance of first-order dynamic programs for
single-tuple and complex changes.
Contributions We now summarise the contributions of this work and link them to
the research goals we just stated.
This thesis is the first comprehensive study of complex changes within the dynamic
complexity framework. In particular, we extend the DynFO framework towards first-order
definable changes and bulk changes of a set of tuples whose number is bounded by some
function in the size of the input, and study to what extent queries can be maintained by
first-order update formulas under these classes of changes.
We prove surprisingly strong DynFO maintainability results under both of the major
classes of complex changes we consider, as aimed for by Goal 1.1: first-order update
formulas can maintain many interesting queries under non-trivial changes that modify
a non-constant number of tuples. Most of the results we prove concern variants of the
reachability query and the membership problem for regular and context-free languages,
that is, queries for which maintainability under single-tuple changes is well understood.
More precisely, we show that reachability in undirected graphs can be maintained
under changes that insert a set of edges that is defined by a first-order query on the graph.
Reachability in directed acyclic graphs is in DynFO under a syntactically restricted set
of first-order definable insertions of edges. We further analyse the dynamic programs
for undirected reachability under different classes of first-order insertions. We identify
natural and practically important fragments, as for example insertions that are defined
by conjunctive queries, such that the resulting update formulas are “small”. The undi-
rected reachability query can also be maintained under insertions and deletions of a
polylogarithmic number of edges with respect to the number of nodes.
A similar result is shown for the membership problem for regular languages: these
problems can be maintained under changes of a polylogarithmic number of position of a
string. If a change can be described by a first-order formula with restricted quantifier
prefix, then also context-free languages can be maintained.
We also consider queries that have not been studied before in the DynFO framework:
all queries from the class uniform AC1, which is defined via circuits of logarithmic depth,
can be maintained under first-order definable changes that do not use parameters. This
weak class of changes does not include single-tuple changes.
In summary, we find that first-order update formulas are still very expressive under
complex changes. Still, we are able to prove that some queries cannot be maintained
with restricted dynamic programs if certain complex changes are allowed, as aimed for
by Goal 1.2. For example, we show that the reachability query cannot be maintained
using quantifier-free update formulas under changes that are defined by quantifier-free
first-order formulas. The corresponding result is not known for single-edge changes, so
our result supports the intuition that non-maintainability results should be easier to
prove for stronger classes of changes.
We further show that unary auxiliary relations are not sufficient to maintain Reach
with full first-order update formulas even if only very simple first-order definable insertions
7
Chapter 1 Introduction
are allowed, although this is possible under single-edge insertions. We observe that using
first-order update formulas we cannot hope for meaningful maintainability result under
bulk changes of more than a polylogarithmic number of tuples with respect to the number
of elements.
Although many questions remain open, these results provide a good first overview on
how reachability and formal languages can be maintained under complex changes. We
also study further queries that have not been considered so far in the dynamic setting, and
start by providing maintainability results under single-tuple changes. Towards Goal 1.3
we show that all queries definable in monadic second-order logic MSO, the extension of
first-order logic by set quantification, are in DynFO under single-tuple changes, as long
as the input structure has bounded treewidth. This result is also extended to certain
MSO-definable optimisation problems. It follows that one can for example maintain a
minimal vertex cover for a graph with bounded treewidth, or whether such a graph is
3-colourable. To prove these results we introduce the so-called Muddling technique. This
technique allows to obtain dynamic programs that maintain a query under arbitrary
sequences of single-tuple changes from dynamic programs that do so only for a sequence
of changes of bounded length. Variants of this technique are also used to prove results
for complex changes.
The previously mentioned result shows maintainability in DynFO for a large class of
queries at once. However, so far we have not identified any natural static complexity
class above AC0 such that all queries from this class can be maintained dynamically in
DynFO under single-tuple changes. Here we present an example query with low static
complexity which we so far cannot maintain dynamically, and thus forms a road-block
for proving a result in that fashion. Variants of this query are used to show that dynamic
programs with quantifier-free update formulas exhibit an arity hierarchy for Boolean
graph queries: the expressive power of these dynamic programs increases with the arity
of the auxiliary relations they may use.
The results we obtain in this thesis underscore that the DynFO framework forms a
powerful theoretical approach to dynamic query maintenance. We also give an empirical
evaluation of this approach, using the example of the reachability query for undirected
graphs under insertions of first-order definable sets of edges. Towards Goal 1.4 we compare
prototypical implementations of dynamic programs for complex and single-edge insertions
as well as re-computation from scratch. We conduct three experiments and identify
strengths and weaknesses of the approaches in the different scenarios. The competitive
performance of the dynamic programs provides further motivation to study the DynFO
framework, in particular with respect to complex changes.
Further related work We have already mentioned above related work concerning the
DynFO (and FOIES) framework directly. We now discuss work in closely related domains.
In databases, views are derived relations that are defined as the result of a query on
the base relations. Database systems may materialise views to allow faster access to
their content. The goal of incremental view maintenance is to update such materialised
views after changes of a base relation, see [Gupta and Mumick 1999] for an overview
8
of this topic. Incremental view maintenance usually follows the algorithmic approach
in the sense that the goal is to optimise the update time and that the update routines
may be specified using general-purpose programming languages. Often, no additional
auxiliary relations are used, or the only auxiliary relations are results of sub-queries of
the query that defines the view. Besides recursively defined views, it also studies views
that are definable in relational algebra; due to the correspondence with first-order logic,
see [Abiteboul et al. 1995], these views can trivially be updated using first-order formulas.
Within the research area of incremental view maintenance, Blakeley et al. [1989]
investigate whether a materialised view actually needs to be updated after a change.
They give sufficient and necessary conditions for whether a change of the base relations
is relevant for a view that is defined by a restricted relational algebra expression.
An algebraic approach on incremental view maintenance is studied in [Koch 2010],
where a query language is presented that subsumes the relational algebra and allows for
aggregates. It is shown that low-complexity update routines can be computed from view
definitions from that language. Reports on implementations of this algebraic approach
and experiments using changes of single tuples and sets of tuples are published as [Koch
et al. 2014; Nikolic et al. 2016].
Recent work studies the question whether one can update in constant time data
structures that can be used to enumerate with constant delay all result tuples of a query
after single-tuple changes to the database. This was confirmed for several classes of
queries, in particular for restricted classes of conjunctive queries, see e.g. [Berkholz et al.
2017; 2018a;b; Idris et al. 2017; 2018]. Further, one can enumerate the answer tuples of
an MSO-definable query on trees with constant delay, while the stored data structure can
be updated in logarithmic time [Amarilli et al. 2019]. It is shown that no such algorithm
with constant update time exists. The number of triangles in a database can be updated
in amortised time O(√n), where n is the size of the database, after single-tuple changes
[Kara et al. 2019]. A matching conditional lower bound is provided in the latter work.
Incremental techniques have also been studied in the context of semi-structured
databases. Algorithms that incrementally maintain whether an XML document satisfies
a Boolean XPath query, for different XPath fragments, are studied in [Björklund et al.
2010]. Dynamic algorithms that continuously validate an XML document against a
specification are presented and evaluated in [Balmin et al. 2004; Barbosa et al. 2004].
A complexity theoretic framework for dynamic computations is introduced by Miltersen
et al. [1994]. They define dynamic complexity classes incr-C based on static classes C.
For example, a problem is in the class incr-POLYLOGTIME if initial data structures for
an input can be computed in polynomial time, and a RAM machine can update those
data structures in polylogarithmic time after single-bit changes of the input. Miltersen
et al. [1994] further introduce a concept of reductions that is also appropriate for dynamic
complexity classes. They show for many problems that are complete for static complexity
classes under LOGSPACE reductions that they are also complete under these reductions.
The reductions used in the DynFO framework, see [Patnaik and Immerman 1997], use
similar ideas as those introduced in [Miltersen et al. 1994].
The IES(SQL) framework of Libkin and Wong [1997] is an extension of the FOIES
framework and thus closely related to our setting. It differs primarily in two aspects.
9
Chapter 1 Introduction
First, it uses an SQL-like update formalism, extending first-order logic with aggregation
and grouping. Second, by means of aggregation and arithmetic expressions it can generate
large number constants that can then be stored in the auxiliary database, although these
constants are not included in the input database. Consequently, the size of the auxiliary
database is not polynomially bounded in the size of the input database. In fact, the result
of [Libkin and Wong 1997; Dong et al. 1999] that the reachability query for arbitrary
graphs can be maintained in IES(SQL) uses auxiliary relations of potentially exponential
size, as representations for all paths of the input graph are stored. Libkin and Wong
[1997] show that the expressive power of IES(SQL) stays the same when only at most
binary auxiliary relations are allowed. This is in contrast to the DynFO and FOIES
frameworks, which both exhibit an arity hierarchy: for each k > 0 there are queries that
can be maintained using k-ary auxiliary relations, but not using (k − 1)-ary auxiliary
relations [Dong and Su 1998].
We also refer to [Hesse 2003b] and [Zeume 2015] for further discussions of work that is
related to the DynFO framework.
Outline We fix our notation, repeat necessary background and formally introduce and
discuss the DynFO framework in Chapter 2. In Chapter 3 we present the Muddling
technique, which will be utilised in the following chapters. Our results regarding single-
tuple changes are given in Chapter 4, first-order definable and size-bounded bulk changes
are defined and the corresponding results are proven in Chapter 5 and Chapter 6,
respectively. We report on the experimental evaluation in Chapter 7. We conclude in
Chapter 8.
Apart from the conclusion, the chapters start with an introduction that lists motivation,
goals and contributions of the corresponding chapter. They close with a short outlook
and with bibliographic remarks on where the presented results were published before,
and who contributed to them.
10
CHAPTER 2
The dynamic setting
In this chapter we introduce the dynamic complexity framework as we use it in this
thesis, together with several examples. Before we can do that, we repeat the necessary
background, mainly from finite model theory.
Our definition of DynFO is a variation of the definitions given by Patnaik and Immerman
[1997], and we discuss the differences and further variants of the framework later in this
chapter. We conclude the chapter with a short outlook and further bibliographic remarks.
2.1 Preliminaries
We now fix our notation and present notions and fundamental results that are used
throughout this thesis. More standard definitions will be repeated in later chapters.
Nevertheless, we assume familiarity with the basic concepts of finite model theory and
database theory and refer to standard texts for details [Abiteboul et al. 1995; Libkin
2004].
Basic notation We write N for the set of natural numbers, including 0. We denote by
[n] the set {1, . . . , n}, for some n ∈ N, and let [n]0 def= {0, . . . , n}. For some set A, the set
Ak
def= {a¯ = (a1, . . . , ak) | a1, . . . , ak ∈ A} contains all k-tuples over A.
The logarithm of n to base 2 is denoted as logn. The proofs in this thesis often assume
that the logarithm is only applied to powers of 2, so, that logn always is a natural
number. They can easily be adapted for the general case, then considering the number
dlogne or blognc, depending on the context. We write logd n for the polylogarithmic
function (logn)d.
11
Chapter 2 The dynamic setting
Structures, databases and queries We exclusively consider finite relational struc-
tures over relational schemas σ = {R1, . . . , R`, c1, . . . , cm}, where each Ri is a relation
symbol with a corresponding arity Ar(Ri) ∈ N, and each cj is a constant symbol. A
σ-structure D = (D,RD1 , . . . , RD` , cD1 , . . . , cDm) consists of a finite domain D, also called
the universe of D, relations RDi ⊆ DAr(Ri) and constants cDj ∈ D, for each i ∈ {1, . . . , `}
and each j ∈ {1, . . . ,m}. We usually omit the superscripts and write, for example, R1
instead of RD1 .
We use the term (relational) σ-database synonymously with the term σ-structure, and
mainly in contexts that are more motivated by database theory applications.
The active domain adom(D) of a structure D contains all elements used in some tuple
or as some constant of D. For a set D′ ⊆ D and a relation R, the restriction R[D′] of
R to D′ is the relation R ∩D′Ar(R). The structure D[D′] induced by D′ is the structure
obtained from D by restricting the domain and all relations to D′, where we assume that
D′ contains all constants of D.
A k-ary query q on σ-structures, for some schema σ, maps each σ-structure with some
domain D to a subset of Dk, and commutes with isomorphisms. For a 0-ary query we
use the terms Boolean query, (decision) problem and language synonymously.
In this thesis we often consider the Boolean query Parity, which is a query on
structures with a unary relation U and asks whether |U | is odd.
The most important class of structures we consider are graphs, that is, structures over
the schema with a single binary relation symbol E. The domain V of a graph G = (V,E)
is its set of nodes, and the edge set E is an arbitrary subset of V ×V . Formally, all graphs
in this thesis are directed. The undirected graph induced by G is the graph with the
same set of nodes and edges {(u, v), (v, u) | (u, v) ∈ E}. A path from u ∈ V to v ∈ V is a
sequence u = u0, u1, . . . , un = v of pairwise distinct nodes from V such that (ui, ui+1) ∈ E
for all 0 ≤ i < n. An undirected path in G is a path in its induced undirected graph. A
(strongly) connected component of G is a maximal connected subgraph H, i.e., for all
nodes u and v of H, there is a path from u to v. A weakly connected component of G is
a maximal subgraph whose induced undirected graph is connected.
If the formal representation does not matter, we usually speak directly of undirected
graphs and their connected components.
The reachability query Reach selects, given a (directed) graph G, all pairs (u, v) such
that there is a path from u to v in G. Similarly, the undirected reachability query UReach
selects (u, v) if there is an undirected path from u to v.
Logics Formulas from first-order logic FO over some schema σ are inductively defined
as follows. Atomic formulas are of the form R(t1, . . . , tk) or t1 = t2, where R ∈ σ is
a k-ary relation symbol and each ti is either a constant symbol from σ or a variable.
Formulas can be composed using the Boolean connectives ¬,∧,∨ as well as first-order
quantification ∃x and ∀x. We use abbreviations →,↔ as usual.
The quantifier rank of an FO formula is the maximal nesting depth of quantifiers in its
syntax tree. An FO formula ϕ is a Q1 · · ·Q`FO formula, where each Qi is either ∃ or ∀,
if ϕ is of the form Q1x1 · · ·Q`x` ϕ′(x1, . . . , x`) with quantifier-free ϕ′. Similarly, ∃∗FO is
12
2.1 Preliminaries
the subset of FO with only existential quantifiers in the quantifier prefix; other subsets of
FO are defined analogously.
Let ϕ be any FO formula over some schema σ. We write ϕ(x¯) to denote that the free
variables of ϕ, so, those variables that are not bound by any quantifier, are all contained
in x¯. We write (D, a¯) |= ϕ or simply D |= ϕ(a¯) if ϕ holds in D when the free variables in x¯
are interpreted by a¯, where a¯ is a tuple of elements from the domain of the σ-structure D
and of the same length as x¯.
Every FO formula ϕ(x¯) naturally defines a query whose result for D is {a¯ | D |= ϕ(a¯)}.
A query q is definable (or expressible) in FO if there is an FO formula ϕ that has the
same result as q for every structure over the corresponding schema.
Sometimes we allow first-order formulas to access built-in numerical relations. An
FO(≤,+,×) formula may use a binary relation symbol ≤ and ternary relation symbols +
and ×, which in any structure are interpreted as a linear order on the domain and
compatible addition and multiplication relations. If the structure does not already
provide a linear order, we require that the result of the formula is invariant under the
concrete choice of ≤. If a linear order on a domain D is available, we often identify D
with the set {0, . . . , |D| − 1}, and do not distinguish between domain elements and the
numbers they represent.
We also allow an FO(≤,+,×) formula to use the binary BIT relation, which contains
a tuple (i, j) if the j-th bit in the binary representation of the number i is 1. In the
presence of ≤, a first-order formula can express BIT given + and ×, and vice versa
[Immerman 1999].
Monadic second-order logic MSO extends FO by existential and universal quantification
over sets, using quantifiers of the form ∃X and ∀X. The notions defined above naturally
translate to MSO.
While Reach cannot be expressed in FO, the following example provides an MSO
formula for Reach.
Example 2.1. The following MSO formula ϕReach(s, t) expresses that there is a path
from s to t, and thus expresses Reach. A path from s to t exists in a graph G = (V,E)
if all subsets X of V that (1) include s and (2) include all nodes that have an incoming
edge from a node in X, also include t.
ϕReach(s, t)
def= ∀X
[(
X(s) ∧ ∀x∀y [(X(x) ∧ E(x, y))→ X(y)] )→ X(t)] /
In later chapters we consider types of tuples and structures, and how types compose.
The rank-k (FO, MSO) type of a tuple a¯ = (a1, . . . , a`) of length ` with respect to a
structure D is the set of all (FO, MSO) formulas ϕ(x¯) of quantifier rank at most k with `
free variables such that D |= ϕ(a¯). The rank-0 type of a tuple is also called its atomic
type. The rank-k type of a structure D is the rank-k type of the empty tuple with respect
to D.
We denote the set of rank-k FO types of tuples of length ` by FO[k, `]. As there are
only finitely many semantically non-equivalent first-order formulas of quantifier-rank k
with ` free variables, the set FO[k, `] is finite for all k and `, although of non-elementary
size.
13
Chapter 2 The dynamic setting
A fundamental result in finite model theory [Feferman 1957; Feferman and Vaught
1959], often called Feferman-Vaught Theorem, states that the rank-k first-order type of
the disjoint union of two structures is determined by the rank-k first-order types of these
two structures. There are multiple extensions of this result, in particular for MSO, and
we use it here in the following form.
Theorem 2.2 (Theorem of Feferman and Vaught [Feferman 1957; Feferman and Vaught
1959], see also [Makowsky 2004, Theorem 1.5]). Let D and D′ be two structures. Then
the rank-k FO (MSO) types of a tuple a¯ of D and a tuple a¯′ of D′ uniquely determine the
rank-k FO (MSO) type of the tuple (a¯, a¯′) in the disjoint union of D and D′.
First-order formulas can be used to define a mapping from σ-structures to σ′-structures,
where σ and σ′ are relational schemata. A (one-dimensional) first-order interpretation Υ
from σ to σ′ consists of a first-order formula ϕU (x) and first-order formulas ϕR(x1, . . . , xr)
for each r-ary relation symbol R ∈ σ′, each over schema σ.
The first-order interpretation Υ interprets, in a σ-structure D, the σ′-structure Υ(D)
with universe UΥ(D) def= {a | D |= ϕU (a)} and relations
RΥ(D) def= {(a1, . . . , ar) | D |= ϕR(a1, . . . , ar), a1, . . . , ar ∈ UΥ(D)}
for each R ∈ σ′.
A first-order interpretation from σ to σ′ not only interprets a σ′-structure in a σ-
structure, it also translates σ′-formulas to σ-formulas.
Lemma 2.3 (see [Blumensath et al. 2008, Proposition 3.2]). Let Υ be a first-order
interpretation from σ to σ′. For every FO (MSO) formula ϕ(x1, . . . , x`) over σ′ there is
an FO (MSO) formula ϕΥ(x1, . . . , x`) over σ such that D |= ϕΥ(a1, . . . , a`) ⇔ Υ(D) |=
ϕ(a1, . . . , a`) for all σ-structures D and all elements ai ∈ UΥ(D).
Analogously to one-dimensional FO interpretations one can define d-dimensional first-
order interpretations, for a natural number d ≥ 1, which use d-tuples of elements of D to
represent the elements of the universe UΥ(D). Lemma 2.3 holds accordingly.
Let q and q′ be Boolean queries on structures over the schemas σ and σ′, respectively.
A first-order interpretation Υ from σ to σ′ is a first-order reduction from q to q′ if D ∈ q
holds if and only if Υ(D) ∈ q′, for each σ-structure D.
Example 2.4. The query Parity can be reduced to s-t-Reach, the Boolean query that
asks whether (s, t) ∈ Reach(G), given a directed graph G and two nodes s and t of G,
via a first-order reduction that has access to a linear order ≤ on the domain.
Given a structure D with a linearly ordered domain D of size n, which we identify
with the set {0, . . . , n− 1}, and a set U ⊆ D, this reduction produces a graph G(D) with
node set {0, . . . , n} × {0, 1}. There are edges from a node (i, 0) to a node (i+ 1, 0) and
from (i, 1) to (i+ 1, 1) if i /∈ U , and else there are edges from (i, 0) to (i+ 1, 1) and from
(i, 1) to (i+ 1, 0), for all i ∈ D. See Figure 2.1 for an example. There is path from (0, 0)
to (n, 1) if and only if U has odd size, and the graph G(D) can be interpreted in D by a
14
2.1 Preliminaries
0 1 2 3 4
(a) A structure D with n = 5. An element
i ∈ U is represented by a black circle, an
element i′ /∈ U is represented by a white
circle.
(0, 0)
(5, 1)
(b) The corresponding graph G(D).
Figure 2.1: An illustration of the construction in Example 2.4.
two-dimensional FO(≤) interpretation1. It follows that there is an FO(≤) reduction from
Parity to s-t-Reach. /
Circuit complexity The complexity class AC0 contains all languages that are accepted
by families of circuits that consist of “and”- and “or”-gates with unbounded fan-in as
well as “not”-gates, and have polynomial size and constant depth. We often abuse
notation and denote by AC0 (and analogously for other complexity classes) also the class
of functions that can be computed by circuits of this form.
In this thesis we only consider uniform families of circuits, and more specifically, first-
order uniform families of circuits. A family (Cn)n∈N of circuits is first-order uniform if the
mapping n 7→ Cn can be defined by an FO(≤,+,×) interpretation, see [Immerman 1999,
Definition 5.16] for details. It is well-known that a language is in (first-order uniform)
AC0 if and only if it can be expressed in FO(≤,+,×) [Barrington et al. 1990].
Analogously to AC0 one can define the class AC1 via circuits of depth O(logn), and
more generally the class AC[f(n)] that allows for circuits of depth O(f(n)), for any
function f : N→ N. The class AC1 is a superset of, for example, LOGSPACE and NL.
The class IND[f(n)] contains queries that can be defined by inductive applications
of first-order formulas. A query q is in IND[f(n)] if it can be defined by a first-order
formula ψq that may use a relation R obtained by O(f(n))-fold iteration of an FO(≤,+,×)
formula ϕR, where n is the size of the domain. We refer to [Immerman 1999, Definition
4.16] for a formal definition, and only provide an example.
Example 2.5. We show that Reach is in IND[logn]. Consider the formula ϕR(x, y) =
(x = y) ∨ E(x, y) ∨ ∃z(R(x, z) ∧R(z, y)). When applying the formula to a graph and an
empty relation R it defines the relation R1 of paths of length 1, applying it to R
def= R1
defines the paths of length 2; in general applying the formula to R def= Ri defines the
paths of length 2i. Thus logn-fold application of ϕR defines the transitive closure relation
of a graph with n nodes. The formula ψReach is just R(x, y). /
For polynomially bounded and first-order constructible functions f : N→ N, the classes
AC[f(n)] and IND[f(n)] are equal [Immerman 1999, Theorem 5.22], and in particular
AC1 = IND[logn]. Here, a function f : N→ N is first-order constructible, if there is an
1The latter statement is true under the condition that D contains at least three elements. Structures
with less than three elements can be dealt with separately.
15
Chapter 2 The dynamic setting
FO(≤,+,×) formula ψf (x¯) such that D |= ψf (a¯) if and only if a¯ is a base-n representation
of f(n), for any ordered structure D with domain {0, . . . , n− 1}.
2.2 The dynamic complexity framework
In the section we present the dynamic complexity framework formally. Before we define
what it means for a query to be maintained under changes to the input, we have to
discuss the kind of changes we want to consider.
In this work, we study three classes of changes.
a) As already mentioned, single-tuple changes are most studied in the dynamic com-
plexity literature. A single-tuple change modifies one input relation by inserting or
deleting one tuple.
b) Bulk changes generalise insertions and deletions of single tuples to insertions and
deletions of sets of tuples.
c) A definable change of a k-ary relation R is, in its simplest form, given by a logical
formula with k free variables, possibly using additional parameters. The formula
defines a k-ary query over the input structure; the change replaces R by the result
of the query.
Formal definitions for the different classes of changes will be given below and in later
chapters, respectively. Before, we develop an abstract, generalised notion that is also
used for the definition of the framework.
A change operation δ(P¯ , p¯) has two tuples of variables: the relation parameter tuple P¯ ,
where each variable Pi has an associated arity Ar(Pi), and the element parameter tuple p¯.
A change α = (δ, A¯, a¯) over a domain D consists of a change operation δ, a tuple A¯ of
relations over D, and a tuple a¯ of elements of D; the tuples A¯ and a¯ are of the same
length as P¯ and p¯, respectively, and each relation Ai has arity Ar(Pi). We often use the
notation α = δ(A¯, a¯).
The structure that results from applying α to a structure D is denoted by α(D). For
sequences β = α1 · · ·αm of changes, β(D) is defined as αm(· · · (α1(D)) · · · ). The actual
definition of α(D), that is, the effect of applying α to a structure, will be formally defined
separately for different change operations δ.
A concrete instance of the change operations we just defined are single-tuples changes.
Let σ be a schema. For a k-ary relation symbol R ∈ σ, we define single-tuple insertions
insert p¯ into R and deletions delete p¯ from R, where p¯ has length k. The change
operations are usually more concisely noted as insR(p¯) and delR(p¯), respectively. Given
a σ-structure D and a change α = insR(a¯) over its domain, the structure α(D) consists
of the relations
Sα(D) def=
{
SD S 6= R
SD ∪ {a¯} S = R
for each S ∈ σ. Symmetrically, for a change α′ = delR(a¯), the structure α′(D) has the
16
2.2 The dynamic complexity framework
relations
Sα
′(D) def=
{
SD S 6= R
SD \ {a¯} S = R .
For a schema σ we usually write ∆σ for the set of all single-tuple change operations
for σ-structures. For graphs (and similar for other structures over a singleton schema),
we write ∆E instead of ∆{E}.
We now introduce dynamic programs and dynamic complexity classes, based on the
framework proposed by Patnaik and Immerman [1997], closely following the expositions
of Schwentick and Zeume [2016] and Schwentick et al. [2018].
The goal of a dynamic program is to maintain the answer to a fixed query q when the
underlying structure changes, where the changes are based on a fixed set ∆ of change
operations. We call the pair (q,∆) a dynamic query and denote the schema of q by σin.
To maintain a dynamic query, a dynamic program stores a set of auxiliary relations over
a schema σaux. We demand that σaux always contains a relation symbol Ans with the
same arity as q. The pair (σin, σaux) is called dynamic schema.
The update of an auxiliary relation S after a change based on an operation δ(P¯ , p¯)
is specified by an update rule on change δ(P¯ , p¯) update S as ϕSδ (P¯ , p¯; x¯), where the
length of x¯ equals the arity of S. The formula ϕSδ is called the update formula.
Definition 2.6 (Dynamic L-program). Let L be a logic. A dynamic L-program P for a
dynamic query (q,∆) over dynamic schema (σin, σaux) contains, for every δ(P¯ , p¯) ∈ ∆
and every S ∈ σaux, an update rule on change δ(P¯ , p¯) update S as ϕSδ (P¯ , p¯; x¯), where
ϕSδ is a formula from L over schema σin ∪ σaux.
Usually, the logic L is first-order logic FO or some other logic clear from the context.
We then do not mention L explicitly and just speak of dynamic programs.
We now define the semantics of a dynamic program. Let P be a dynamic program
for a dynamic query (q,∆) over dynamic schema (σin, σaux). A state S = (D, I, Aux)
of P consists of an input structure I and an auxiliary structure Aux over a common
domain D. The schemata of I and Aux are σin and σaux, respectively. We consider S as
one relational structure. For a change operation δ ∈ ∆ and a change α = δ(A¯, a¯) over
domain D, the application of α to S yields the state Pα(S) = (D,α(I), Aux′), where for
each S ∈ σaux the relation SAux′ in Aux′ is equal to {b¯ | S |= ϕSδ (A¯, a¯; b¯)}. For sequences
β = α1 · · ·αm of changes, Pβ(S) is the state Pαm(· · · (Pα1(S)) · · · ).
The dynamic program P maintains (q,∆), if for every sequence of changes over ∆ the
auxiliary relation Ans of P coincides with the result of q on the current input structure.
More formally, we demand that, for every non-empty domain D and every non-empty
sequence β of changes over ∆ and D, the relation Ans in Pβ(S0) equals q(β(I0)). Here,
S0 def= (D, I0, Aux0) and I0 and Aux0 are the empty input and auxiliary structures
over D, respectively. Because the relation Ans contains the query result, we also call it
the query relation of P.
Dynamic complexity classes consist of dynamic queries that can be maintained by
(restricted) dynamic programs. The most important class for this thesis is the class
DynFO which allows for dynamic programs with full first-order logic update formulas.
17
Chapter 2 The dynamic setting
Definition 2.7 (DynFO). The class DynFO is the class of dynamic queries that can be
maintained by dynamic FO-programs.
Instead of (q,∆) ∈ DynFO we also often say that (q,∆) can be maintained in DynFO,
or that q is in DynFO under ∆ changes, and similar for other dynamic complexity classes.
Example 2.8. As already noted in the introduction, the reachability query Reach can
be maintained dynamically under edge insertions. We can now formalise this insight
as statement (Reach, {insE}) ∈ DynFO. The dynamic program that maintains the
dynamic query (Reach, {insE}) stores and updates a single auxiliary relation Ans that
always contains the transitive closure of the edge relation E. Its update formula is given
by ϕAnsinsE (p1, p2;x, y)
def= Ans(x, y) ∨ (Ans(x, p1) ∧Ans(p2, y)). /
For some dynamic queries (q,∆) we are only interested in maintaining (q,∆) as
long as the input structure is from a certain class C of structures. Examples for these
restrictions are acyclic graphs or structures with bounded treewidth (which will be
studied in Section 4.1). We say that a dynamic program P maintains (q,∆) for C if P
maintains (q,∆) for every change sequence β = α1 · · ·αm over ∆ such that every structure
Ii def= α1 · · ·αi(I∅) is a structure from C. So, we demand that every prefix of β transforms
an empty structure into a structure from C. This definition just disallows any change
sequence that results in a structure outside of C. In some cases, a dynamic L-program
can also set up a guard: an L formula that detects, using the auxiliary relations, when a
change would lead to a structure outside of C.
We say that (q,∆) is in DynFO for C if there is a dynamic FO-program that maintains
(q,∆) for C.
Example 2.9 ([Patnaik and Immerman 1997]). We show that the reachability query
can be maintained under edge insertions and deletions for acyclic graphs, so, that
(Reach,∆E) is in DynFO for acyclic graphs. As in Example 2.8, the dynamic program
that maintains (Reach,∆E) for acyclic graphs only needs a single auxiliary relation Ans
that contains the query result. The update formula for insertions insE is already given
in Example 2.8.
The update formula ϕAnsdelE (p1, p2;x, y) for deletions delE(p¯) is constructed using the
following observations. Let G be an acyclic graph and let G′ be the graph that is obtained
from G by deleting an edge (a, b). There is a path from a node u to a node v in G′ if
and only if (1) such a path exists in G and (2a) no path from u to v in G uses the edge
(a, b) or (2b) there is a path ρ in G that uses an edge (c, c′) 6= (a, b) with the property
that a is reachable from c but not from c′.
We do not formally prove that claim here, but give an informal correctness argument.
Consider a graph G such that there is a path from u to v via (a, b) in G, no path ρ as
in Condition (2b) exists, and v is reachable from u in G′ via a path ρ′. As ρ′ does not
satisfy Condition (2b), there is in particular a path from v to a in G. As there also is a
path from a to v in G, this graph is not acyclic.
18
2.2 The dynamic complexity framework
We obtain the update formula as follows:
ϕAnsdelE (p1, p2;x, y)
def= Ans(x, y) ∧
((¬Ans(x, p1) ∨ ¬Ans(p2, y))
∨ ∃z∃z′(Ans(x, z) ∧ E(z, z′) ∧Ans(z′, y) ∧ (z 6= p1 ∨ z′ 6= p2)
∧Ans(z, p1) ∧ ¬Ans(z′, p1)
))
Observe that there is a first-order guard that checks whether an insertion insE(a, b)
leads to a cyclic graph: this is the case if and only if there already is a path from b to a
in the graph at hand, so, if (b, a) ∈ Ans holds. /
We already see from these examples that dynamic FO-programs are surprisingly
expressive: reachability in acyclic graphs is an NL-complete problem, and it is one of the
prime examples of a query that cannot be expressed in first-order logic statically.
To understand better the expressive power of dynamic programs, we study restricted
dynamic programs that only allow for syntactically restricted update formulas. As we
see in Example 2.8, even update formulas without quantifiers can be used to maintain
non-trivial queries. This insight leads to the definition of the class DynProp.
Definition 2.10 (DynProp). The class DynProp is the class of dynamic queries that can
be maintained by dynamic FO-programs whose update formulas are quantifier-free.
Example 2.11. The query Parity is not in AC0 and therefore not expressible even
in full FO [Ajtai 1983; Furst et al. 1984]. Dynamically, under single-tuple changes, the
query (Parity,∆U ) can be maintained easily. Each change transforms a relation with
odd parity into a relation with even parity, and vice versa, provided that the change
actually alters U . The update formulas for the only auxiliary relation Ans are as follows.
ϕAnsinsU (p)
def=
(
Ans ∧ U(p)) ∨ (¬Ans ∧ ¬U(p))
ϕAnsdelU (p)
def=
(
Ans ∧ ¬U(p)) ∨ (¬Ans ∧ U(p)) /
In the beginning of Chapter 4 we review in more detail previous result on which
dynamic queries can be maintained in DynFO and in DynProp, respectively.
Further restrictions of DynFO are studied in the literature [Hesse 2003b; Zeume and
Schwentick 2017]. In general, we denote by DynL the class of dynamic queries that can
be maintained by dynamic L-programs, for any logic L.
The class DynFO, as well as all other dynamic complexity classes we consider, is not
known to be closed under first-order reductions, which are often considered in static
complexity theory. For the dynamic complexity framework, Patnaik and Immerman
[1997] introduced bounded first-order reductions. Similar notions of reductions are defined
by other authors [Hesse and Immerman 2002; Grädel and Siebertz 2012; Datta et al.
2018a]. They all have in common that whenever a reduction maps an instance I of some
dynamic query to an instance I ′ of some other dynamic query, then every change of I
induces at most c changes in the instance I ′, for some global constant c.
More formally, for Boolean dynamic queries (q,∆) and (q′,∆′) we say that a first-order
reduction Υ is a bounded first-order reduction, if there is a c ∈ N such that for all input
19
Chapter 2 The dynamic setting
structures I for q and all changes α of I over ∆ there is a first-order definable sequence β
of changes that consists of at most c changes over ∆′ such that Υ(α(I)) = β(Υ(I)). We
refer to [Datta et al. 2018a] for a more general notion of reductions that also can be used
for non-Boolean dynamic queries.
Note that the reduction from Example 2.4 is a bounded first-order reduction, as each
change in D induces the insertion and deletion of two edges in G(D), respectively.
2.3 Variants of the setting
The definitions of the previous section are not without alternatives. In this section we
discuss some of the choices made for the definitions and point to articles that further
research the alternative settings. This in particular concerns the choice of the domain
and the initial state of a dynamic program. Already Zeume [2015] and Schwentick and
Zeume [2016] compare the different variants of dynamic complexity thoroughly, therefore
we here keep the discussion short and refer to the mentioned work for more details.
Domain One of the possibly unintuitive properties of the dynamic complexity frame-
work, as defined in the previous section, is that the domain of the database is fixed
throughout the dynamic process: changes cannot introduce new elements, and no element
can be removed from the domain. Note that although the domain is fixed, dynamic
programs need to work uniformly for all domains of any size.
The framework of first order incremental evaluation systems (FOIES) [Dong et al. 1995;
Dong and Su 1995] has its roots in database theory and allows the domain to grow and
shrink depending on the changes. Apart from this feature, it is basically equivalent to the
DynFO setting. In the FOIES framework, databases have an infinite background domain,
and the update formulas are evaluated on its active domain. Changes can add elements
to the active domain by tuple insertions, or remove them if all tuples that contain them
are deleted. Sometimes, for example in [Dong and Su 1997], elements may not be used
any more for auxiliary relations if they are removed from the active domain.
The setting of DynFO as we use it follows the formalisation by Patnaik and Immerman
[1997], which originates from (descriptive) complexity theory and finite model theory,
and therefore only considers fixed and finite domains2. There are several advantages in
using this arguably simpler framework. At first, many interesting phenomena of dynamic
computations already arise for fixed domains, dealing with a varying domain might
only impose technical difficulties that we are not primarily interested in. Secondly, the
connections between logics and circuit complexity allow us to re-use results obtained from
the latter research field. A disadvantage might be that the FOIES setting is more inclined
to practice. However, result on whether a dynamic query can be maintained usually carry
over from DynFO to FOIES. This is immediate for lower bounds: if a dynamic query
cannot be maintained in DynFO, it also cannot be maintained under additional changes
of the domain. Even for upper bounds this is true, at least for single-tuple changes:
2In [Patnaik and Immerman 1997], an input structure has a unary relation that holds its active domain,
which seems similar to the FOIES framework. It does not seem to be actually used.
20
2.3 Variants of the setting
for a large class of natural σ-queries q, (q,∆σ) ∈ DynFO implies that (q,∆σ) can also
be maintained in the FOIES setting (see [Datta et al. 2018a, Theorem 17] for a precise
statement).
We exclusively prove results for the DynFO framework in this thesis. However, most of
our results transfer to a FOIES-like extension of the DynFO framework that allows for
change operations add(x) and remove(x) that add an element x to the domain or remove
it, respectively. Whenever results do not transfer immediately, we discuss issues and
possible adaptations towards the end of the forthcoming chapters.
Initial input and auxiliary structures In our definition, a dynamic programs main-
tains a query starting from a state S0 = (D, I0, Aux0) with empty initial and auxiliary
relations. This is also the standard setting in [Gelade et al. 2012; Datta et al. 2018a] and
is called DynFO from scratch in [Zeume 2015]. Several different choices are made in the
literature. Our setting is equivalent to the Dyn-FO setting of Patnaik and Immerman
[1997], which is re-used by Etessami [1998] and Hesse [2003b], that considers empty initial
input structures and first-order definable initial auxiliary relations (the equivalence is
clear, see also [Gelade et al. 2012, Lemma 2.2]). The extension of this setting that allows
polynomial time computable initialisation of auxiliary relations is called Dyn-FO+ in
[Patnaik and Immerman 1997].
Alternative settings allow arbitrary initial input structures. In this case, an initialisation
of the auxiliary relations becomes necessary: if only initially empty auxiliary relations
were allows, all queries maintainable by dynamic FO-programs with arbitrary initial input
structure are already expressible in FO. In its strongest form, the auxiliary relations
can be initialised arbitrarily [Zeume 2015; 2017; Zeume and Schwentick 2015; 2017]. In
[Datta et al. 2018a], this variant is called non-uniform DynFO. A dynamic query (q,∆) is
in non-uniform DynFO, if for each input structure I with domain D there is an auxiliary
structure Aux such that (q,∆) can be maintained in DynFO starting from the initial
state (D, I, Aux).
A more fine-grained framework is introduced in [Weber and Schwentick 2007]. For
any complexity class C, the class Dyn(C,FO) contains the dynamic queries that can be
maintained in DynFO after a C-computable initialisation. The setting of [Grädel and
Siebertz 2012] looks similar at first sight. The authors consider classes Dyn(L,FO), for
any logic L, that contain dynamic queries maintainable in DynFO after an L-expressible
initialisation. As a striking difference to the variants mentioned before, these initialisations
are permutation-invariant in the sense of [Zeume and Schwentick 2015]. Let Init be
the mapping from input structures to auxiliary structures defined by the L-expressible
initialisation. For every input structure I and every permutation pi of the domain of I,
Init satisfies pi(Init(I)) = Init(pi(I)). A permutation-invariant initialisation cannot
define a linear order on the domain, amongst others. This is the main reason why there
are general lower bound results for the setting of [Grädel and Siebertz 2012]: for example
the query TreeIsomorphism is not in Dyn(L,FO) for any logic L, but it is known to be
in DynFO under single-edge changes [Etessami 1998].
In the FOIES framework, similar choices are made regarding initialisation, although
21
Chapter 2 The dynamic setting
less emphasis is put on this aspect. For example, [Dong and Su 1998] does not formally
specify the initialisation, but only mentions that arbitrary initialisation as well as empty
initialisation with empty initial input databases are possible choices. Other articles, for
example [Dong and Su 1995], opt for arbitrary initialisation. In [Dong et al. 1995], the
initial auxiliary relations are computed by Datalog programs.
How can we motivate the different settings? Much like fixing the domain, allowing
arbitrary initialisation lets us focus on the maintenance aspect of dynamic queries
(cf. [Zeume and Schwentick 2017]): it is a useful first step to prove that a dynamic
query can be maintained “at all”, without worrying about initialisation. Results that
certain dynamic queries cannot be maintained are strongest if they still hold with
arbitrary initialisation. On the other hand, maintainability results are stronger, the less
initialisation they require. Especially for practical applications it is important that the
initial auxiliary relations can actually be computed, preferably with low complexity.
In this thesis we follow a pragmatic approach. The class DynFO does not allow
initialisation, so results that a dynamic query is in DynFO hold for all standard variants,
with the exception of the setting of [Grädel and Siebertz 2012]. If a result requires
some form of initialisation, we explicitly mention this and say that a dynamic query is
in DynFO with suitable initialisation, and specify the precise initialisation we use. For
results that certain dynamic queries cannot be maintained in DynFO, we also specify
whether this holds even when certain classes of initialisations are allowed.
Linear order and arithmetic relations An important special case of initialisations
is whether certain built-in numeral relations are assumed to be present. A numerical
relation only depends on the domain and does not change in the dynamic process.
Examples for such relations are a linear order ≤ and compatible relations + and ×
expressing addition and multiplication. These three relations are particularly important,
as they allow to connect first-order logics with circuit complexity classes: FO(≤,+,×) is
equal to AC0 [Barrington et al. 1990].
Whether built-in relations are allowed differs among the various settings. The setting
of extended dynamic programs of [Gelade et al. 2012] allows in principle arbitrary built-in
relations. In [Hesse 2003b; Patnaik and Immerman 1997], all of ≤,+,× are assumed to
be present, whereas the conference version [Patnaik and Immerman 1994] only assumes ≤.
The update programs in [Pang et al. 2005] use ≤ and +.
In this thesis we are transparent and write DynFO(≤,+,×) when we allow update
formulas to use relations ≤, + and ×. In the realm of single-tuple changes, the difference
between DynFO and DynFO(≤,+,×) is rather small: even when ≤, + and × are not
present from the beginning, the restriction of these relations to the active domain3 can
be maintained [Patnaik and Immerman 1997; Etessami 1998]. As a result, for a large
class of natural queries, membership in DynFO(≤,+,×) implies membership in DynFO
under single-tuple changes [Datta et al. 2018a, Proposition 7], see also Proposition 3.2.
This is a major difference to the unordered setting of [Grädel and Siebertz 2012].
3More precisely, the restriction to the activated domain can be maintained. An element is called
activated, if it was part of the active domain at some point during the dynamic process.
22
2.4 Outlook and bibliographic remarks
The difference between DynFO and DynFO(≤,+,×) becomes more visible under com-
plex changes. A linear order on the active domain can be maintained under single-tuple
changes because elements enter the active domain “one at a time”: each change uses only
constantly many elements as parameters, and the parameter tuple orders them linearly.
New elements are appended in that order to the end of the existing linear order.
Under complex changes, a non-constant number of elements can enter the active domain
in one step, and in general FO cannot define a linear order on them. We will discuss
in Chapter 6 the differences between DynFO(≤,+,×) and DynFO for certain complex
changes.
2.4 Outlook and bibliographic remarks
In this chapter we formalised the dynamic complexity framework and discussed several
variants. The precise relationship between these variants often stays an open problem.
Thanks to [Datta et al. 2018a, Proposition 7 and Theorem 17], we have a good under-
standing on the relationship between DynFO, DynFO(≤,+,×) and FOIES for a large
natural class of queries under single-tuple changes. For some specific dynamic queries we
will discuss the differences between these frameworks towards the end of Chapter 6, but
a general result for complex changes in the spirit of [Datta et al. 2018a, Proposition 7
and Theorem 17] is still missing.
The DynFO framework and the setting of Grädel and Siebertz [2012] differ considerably,
which can be seen by means of the TreeIsomorphism query. Although Grädel and
Siebertz [2012] relate subclasses from their framework to subclasses of FOIES, no general
result is known that, for example, allows to transfer maintainability result for classes of
dynamic queries from DynFO to the framework of [Grädel and Siebertz 2012].
Bibliographic remarks
The setting of dynamic complexity as introduced in this chapter follows the exposition
of Schwentick and Zeume [2016], which is based on the setting of Dyn-FO by Patnaik
and Immerman [1994; 1997]. The class DynProp is introduced in [Hesse 2003b]. The
definition of change operations is a generalisation of the definition in [Schwentick et al.
2018]. Note that terms vary in the literature: what we call “change operation” and
“change” is called “operation symbol” and “operation” in [Weber and Schwentick 2007],
and “abstract modification” and “concrete modification” in [Zeume and Schwentick 2015],
respectively. In [Schwentick et al. 2018], “change operation” is used as a synonym for
“change”, and the term “replacement query” is used instead.
The presented examples are standard in the dynamic complexity literature. Example 2.8
appeared first in (a conference version of) [Dong et al. 1995], Example 2.9 is from [Patnaik
and Immerman 1997], which in turn is a simplification of a result of [Dong and Su 1995].
Example 2.11 appears amongst others in [Patnaik and Immerman 1997; Etessami 1998;
Weber and Schwentick 2007]. The examples from Section 2.1 are folklore.
23

CHAPTER 3
The Muddling technique: Bounding the number
of changes
Whenever a long-standing open problem is solved, for example when for some problem a
new algorithm is presented that beats all previously known algorithms in terms of worst-
case complexity, or when a complexity theoretic lower bound is established, researchers in
computer science not only care about the result, but are also interested in the algorithmic
techniques or proof techniques used. Can these techniques be applied to other problems,
similar ones or in different domains? Can they be strengthened?
This also applies for proofs showing that some dynamic query can be maintained in
DynFO. The constructed dynamic programs often maintain particular data structures,
which are encoded within the auxiliary database, or show that logical primitives can be
simulated. For example, Gelade et al. [2012] show how to maintain a certain query in
DynProp using a list structure, a technique which is re-used by Zeume and Schwentick
[2015] (and will be used later in Section 4.2). Datta et al. [2018a] show that for maintaining
queries with some particular property, one can assume that a linear order on the domain
as well as arithmetic relations +,× are available.
One goal in dynamic complexity research, and of this chapter in particular, is to expand
the “tool box” of techniques to maintain a query dynamically.
Goal 3.1. Identify high-level strategies of dynamic query maintenance.
The purpose of this chapter is to introduce a technique that bounds the number of
consecutive changes a dynamic program needs to support.
In Section 2.3 we discussed several details of the definitions of the dynamic complexity
framework. However, we did not question our understanding that a dynamic program
only maintains a dynamic query if it can update the query result under change sequences
25
Chapter 3 The Muddling technique: Bounding the number of changes
of arbitrary length. Of course, this requirement might be hard or even impossible to meet.
In practical scenarios of dynamic query maintenance it is conceivable that a query can
be maintained for some time but not indefinitely, as the quality of the updated answer
decreases after each update. For example, the update might yield only approximate
solutions after a change, or numerical errors in arithmetic computations might sum up.
In our dynamic complexity setting, intuition tells us that it should be easier to construct
a dynamic program that maintains a query for few change steps than to find a dynamic
program that maintains it “forever”.
In contrast to this intuition, we will prove in this chapter that we can actually transform
dynamic programs for maintenance under short change sequences into dynamic programs
for maintenance under change sequences of unbounded length. For this result we introduce
a restricted notion of query maintenance, that in particular only asks for maintaining a
query under change sequences of bounded length. When the specified number of changes
have occurred (and intuitively the quality of the auxiliary relations is too poor for them
to be useful any more), the query result and the auxiliary relations have to be recomputed
from scratch, using an algorithm of higher complexity than the updates. Afterwards, the
query result can again be maintained for some time, and so on. We will then show that
dynamic queries that can be maintained in (some instances of) this restricted sense can
also be maintained for arbitrary long change sequences.
Contributions We introduce the notion of (C, f)-maintainability, for a complexity
class C and a function f : N→ N, as the ability to maintain a dynamic query for f(n) many
changes, starting from an arbitrary initial input structure and with the help of auxiliary
relations initialised by a C-algorithm. Here, n is the size of the input structure’s domain.
We prove that in the context of single-tuple changes, (C, f)-maintainable dynamic queries
that meet a natural technical restriction are in DynFO, for some specific classes C and
functions f . Consequently, we obtain a new strategy for proving membership in DynFO,
which we call the Muddling technique: show that a dynamic query is (C, f)-maintainable
and by the results of this chapter conclude that it is also in DynFO.
This chapter presents joint work with Samir Datta, Anish Mukherjee, Thomas
Schwentick and Thomas Zeume. Detailed bibliographic references are given at the
end of the chapter.
Outline We formally introduce the notion of (C, f)-maintainability in Section 3.1,
together with a natural property of queries that we need to assume for our main
contribution. We also repeat a result from [Datta et al. 2018a] that allows dynamic
programs to use a linear order and arithmetic relations on the whole domain of the
input structure. The main result of this chapter, that allows to transform dynamic
programs maintaining a dynamic query for a bounded number of single-tuple changes
into dynamic programs that maintain the dynamic query for arbitrarily many changes,
is stated and proven in Section 3.2. We close with a short outlook and bibliographic
remarks in Section 3.3.
26
3.1 Notions used for the Muddling technique
3.1 Notions used for the Muddling technique
We formalise the notion of query maintenance we sketched in the introduction of this
chapter. We say that a dynamic query (q,∆) is (C, f)-maintainable, for some complexity
class C and some function f : N→ N, if there is a dynamic program P and a C-algorithm A
such that for each input structure I over a domain of size n, each linear order ≤ on
the domain, and each change sequence β over ∆ of length |β| ≤ f(n), the relation Ans
in Pβ(S) and q(β(I)) coincide, where S = (I,A(I,≤)). So, a dynamic program needs
to maintain (q,∆) for f(n) changes after a C-computable initialisation. We refer to
Subsection 4.1.2 for an example for this notion.
In this thesis we do not study (C, f)-maintainability in its own right, but use it as a
tool to prove that dynamic queries are actually maintainable in DynFO and its variants.
For the corresponding proof we need the technical assumption that the query under
consideration does not depend “considerably” on the size of the underlying domain. We
motivate this restriction with the example of the Boolean EvenDomain query that
asks whether the underlying domain of the input structure is of even size. Standard
arguments show that first-order logic cannot express this query, and this inability extends
to DynFO: without additional initialisation, no first-order update formula can give
the query result after the first change. On the other hand, EvenDomain is trivially
(AC0[2], f)-maintainable under all finite sets of change operations and for all functions f ,
as the underlying domain is immutable and the fixed query answer can easily be computed
by an AC0[2] initialisation1. So, (C, f)-maintainability alone cannot imply membership
in DynFO, for any interesting choice of C and f .
In the following, we restrict ourselves to queries that have some natural property. A
query q is called domain independent in [Datta et al. 2015], if the addition of isolated
elements to the domain does not change the query result, that is, if q(D) = q(D[adom(D)])
for all structures D. A weakly domain independent query q just satisfies q(D)[adom(D)] =
q(D[adom(D)]) for all structures D, so, the query result restricted to the original domain
stays the same when isolated elements are added. These two notions are equivalent for
Boolean queries.
Here we introduce a slight generalisation of this notion and call a query q almost domain
independent if there is a c ∈ N such that, for every structure D with domain D and every
setD′ ⊆ D\adom(D) with |D′| ≥ c it holds q(D)[(adom(D)∪D′)] = q(D[(adom(D)∪D′)]).
Informally this means that there is a constant c such that if a structure already has at
least c “non-active” elements, adding more “non-active” elements does not change the
query result with respect to the original elements. A query is weakly domain independent
if and only if it is almost domain independent with c = 0.
Example 3.1. We give some examples for the notions we just defined.
1. The Parity query is clearly domain independent.
2. The binary reachability query Reach is weakly domain independent: adding any
set V ′ of isolated nodes to a graph does not create or destroy paths in the original
1An AC0[2]-circuit is defined similarly to an AC0-circuit, but additionally may use “modulo 2”-gates
with unbounded fan-in that determine whether an odd number of its inputs is evaluated to 1.
27
Chapter 3 The Muddling technique: Bounding the number of changes
graph. Note that for each node v ∈ V the tuple (v, v) is part of the query result
Reach(G), so Reach(G) 6= Reach(G[adom(G)]) in general and therefore Reach
is not domain independent.
3. The FO-definable Boolean queryTwoIsolatedElements, which is true if and only
if exactly two elements are not in the active domain, is almost domain independent
with c = 3.
4. The Boolean query EvenDomain is not almost domain independent. /
We call a dynamic query (q,∆) (weakly, almost) domain independent, if q is (weakly,
almost) domain independent.
For weakly domain independent queries under single-tuple changes, maintainability
in DynFO and DynFO(≤,+,×) is equivalent [Datta et al. 2018a]. This is a remarkable
result, as FO is strictly weaker than FO(≤,+,×) in the static setting. One witnessing
query is of course EvenDomain, which is easily expressible in FO(≤,+,×) by the
formula ϕ def= ∃i∃j(i + i = j ∧ ∀j′ j′ ≤ j), but no equivalent FO formula exists. As
mentioned above, EvenDomain is not almost domain independent and separates DynFO
and DynFO(≤,+,×). However, there is also an almost domain independent query that
statically separates FO and FO(≤,+,×) [Otto 2000].
In this thesis we use a slightly more general formulation of the result by Datta et al.
[2018a].
Proposition 3.2 (Adaptation of [Datta et al. 2018a, Proposition 7]). Let q be an almost
domain independent query and let ∆ be a set of single-tuple change operations. If
(q,∆) ∈ DynFO(≤,+,×) then also (q,∆) ∈ DynFO.
The proof of this proposition is very similar to the proof of [Datta et al. 2018a,
Proposition 7], the necessary adaptation is discussed in [Datta et al. 2019].
3.2 From short to arbitrary long change sequences
We now present and prove the main result of this section, that for certain combinations
of complexity classes C and functions f : N→ N, all almost domain independent (C, f)-
maintainable dynamic queries are in DynFO. After the title of the first publication
using this result [Datta et al. 2017], we call it the Muddling Theorem, and similar call
the technique of employing this theorem to show membership in DynFO the Muddling
technique.
Theorem 3.3 (Muddling Theorem for single-tuple changes). Let f : N→ N be a first-
order constructible function with f ∈ O(n) and ∆ a set of single-tuple change operations.
If an almost domain independent dynamic query (q,∆) is (AC[f(n)], f(n))-maintainable
then also (q,∆) ∈ DynFO.
In this thesis we mainly use a special case of Theorem 3.3, which we state as the
following corollary.
Corollary 3.4. Let ∆ be a set of single-tuple change operations. If an almost domain
independent dynamic query (q,∆) is (AC1, logn)-maintainable then also (q,∆) ∈ DynFO.
28
3.2 From short to arbitrary long change sequences
Before we give a proof, we outline its main ideas, for ease of presentation based on
the statement of Corollary 3.4. Suppose we have a dynamic program P that maintains
some dynamic query (q,∆) for logn changes, starting from an arbitrary input instance
and auxiliary relations that are computed by an AC1 algorithm. How can we construct a
dynamic program P ′ that maintains this dynamic query for change sequences of arbitrary
length, and without initialisation (but from initially empty input structures)?
In a first step we can construct from P a dynamic program P∗ that does not use any
initialisation, but is only able to answer the query after the last of the logn changes
happened. Observe that the AC1 initialisation can be computed by applying a first-order
formula logn times, thanks to the equality AC1 = IND[logn]. Also, any update formula
of P is applied logn times while processing the change sequence of length logn. The
idea is now to distribute the work of the initialisation among the logn updates.
The program P∗ proceeds as follows. During the first half of the changes, it simulates
the initialisation by applying a corresponding first-order formula twice at each step.
During the second half of the changes, it simulates P to process both the changes of the
first and of the second half, two in each step. After logn changes, it simulated P for all
changes after it computed the initial auxiliary data, so it can return the correct query
answer.
The dynamic program P ′ now simulates multiple instances of P∗, similarly as it
was already done for [Datta et al. 2018a, Proposition 7] and [Schwentick et al. 2018,
Theorem 8.1]. We call such a simulation a thread. After each change, P ′ starts a new
thread. After logn changes occurred, and P ′ started logn more threads, the first thread
finishes its computation and terminates, and P ′ can output the thread’s query result. Like
this, P ′ can answer the query from the time the very first thread finishes its computation,
which is after the change number logn happened, and then arbitrarily long.
It remains to explain how P ′ gives the query result for the first logn changes. At this
point we use that the initial input structure is empty, and that the underlying query
is weakly domain independent. While processing the first logn single-tuple changes,
the active domain of the input structure is very small. In other words, it contains
many isolated elements, and these are not relevant for the query result. So, P ′ can
actually simulate P∗ on a much smaller input instance, for which the initialisation can
be computed in far less time than for the full instance, and hence P∗ can give the query
answer way earlier than after its logn-th change.
With these explanations in mind, we turn to the formal proof of Theorem 3.3.
Proof (of Theorem 3.3). Let f : N → N be first-order constructible with f ∈ O(n)
and assume that an AC[f(n)] algorithm A and a dynamic program P witness that
an almost domain independent dynamic query (q,∆) is (AC[f(n)], f(n))-maintainable.
Using Proposition 3.2, it is sufficient to construct a dynamic program P ′ that witnesses
(q,∆) ∈ DynFO(≤,+,×). For simplicity we restrict ourselves to graph queries and assume
that ∆ = ∆E . The extension for the general case is straightforward.
The overall idea is to use a simulation technique similar to the ones used in [Datta et al.
2018a, Proposition 7] and in [Schwentick et al. 2018, Theorem 8.1]. We first present the
computations performed by P ′ intuitively and later explain how they can be expressed
29
Chapter 3 The Muddling technique: Bounding the number of changes
in first-order logic. We consider each application of one change as a time step and refer
to the graph after time step t as Gt = (V,Et). After each time step t, P ′ starts a thread
that is in charge of answering the query at time point t+ f(n). Each thread works in
two phases, each lasting f(n)2 time steps. Roughly speaking, the first phase is in charge
of simulating A and in the second phase P is used to apply all changes that occur from
time step t+ 1 to time step t+ f(n). Using f(n) many threads, P ′ is able to answer the
query from time point f(n) onwards.
We now give more details on the two phases and describe afterwards how to deal
with time points earlier than f(n). For the first phase, we make use of the equality
AC[f(n)] = IND[f(n)] [Immerman 1999]. Let ψ be an inductive formula that is applied
df(n) times, for some d, to get the auxiliary relations A(G,≤) for a given graph G and
the given order ≤. The program P ′ simply applies ψ to Gt, 2d times during each time
step, and thus the result of A on (Gt,≤) is obtained after f(n)2 steps. The changes that
occur during these steps are not applied to Gt directly but rather stored in additional
relations. We also store the order in which the changes occur.2
During the second phase the f(n)2 already stored changes and the
f(n)
2 changes that
happen during the next f(n)2 steps are applied to the state after phase 1, in the order of
occurrence. To this end, it suffices for P ′ to apply two changes during each time step by
simulating two update steps of P . Since P can maintain q for f(n) changes, at the end of
phase 2, at time point t+ f(n), P ′ can give the correct query answer for q about Gt+f(n).
The following auxiliary relations are used by thread i:
• a binary relation Eˆi that contains the edges currently considered by the thread,
• relations Dinsi and Ddeli that cache edge insertions and deletions, respectively, in
the order they occur,
• a relation Rˆi for each auxiliary relation R of P with the same arity,
• and a relation Ci that is used as a counter: it contains exactly one tuple which is
interpreted as the counter value, according to its position in the lexicographic order
induced by ≤.
When thread i starts its first phase at time point t, it sets Eˆi to Et and the counter Ci
to 0; its other auxiliary relations are empty in the beginning. Whenever a change insE(a¯)
(or delE(a¯)) occurs, the thread updates its auxiliary relations as follows. The change is
not applied directly, so Eˆi is not changed. Instead, the thread needs to store the change
and remember the time step it occurred. To do so, the tuple (c¯, a¯) is inserted into Dinsi
(or Ddeli ), where c¯ ∈ Ci is the current counter value. Additionally, the counter Ci is
incremented by one and the relations Rˆi are replaced by the result of applying their
defining first-order formulas 2d times, as explained above.
When the counter value is at least f(n)2 , the thread is in its second phase and proceeds as
follows. When a change occurs, it is stored exactly as in the first phase. Additionally, the
two oldest stored changes δ1(a¯1) and δ2(a¯2) that are not already processed are applied, in
that order. These changes are identified by the two lexicographically smallest tuples c¯1, c¯2
2This seems to be a bit exaggerated here, as it would be sufficient to store the difference between the
actual and the considered edge relation. In later chapters we generalise this proof to “non-commutating”
change operations where the order needs to be preserved, so we already proceed this way here.
30
3.2 From short to arbitrary long change sequences
such that tuples (c¯1, a¯1) and (c¯2, a¯2) exist in Dinsi or Ddeli . To apply these changes, the
thread simulates P on the edge set Eˆi and auxiliary relations Rˆi, replaces the auxiliary
relations accordingly and adjusts Eˆi. Also, the tuples containing (c¯1, a¯1) and (c¯2, a¯2) are
deleted from Dinsi and Ddeli . Again, the counter Ci is incremented. If the counter value
is f(n), the thread’s query result is used as the query result of P ′, and the thread stops.
All steps are easily seen to be first-order expressible.
So far we have seen how P ′ can give the query answer from time step f(n) onwards.
For time steps earlier than f(n) the approach needs to be slightly adapted as the program
does not have enough time to simulate A. The idea is that for time steps t < f(n) the
active domain is small and, exploiting the almost domain independence of q, it suffices
to compute the query result with respect to this small domain extended by c isolated
elements, where c is the constant from almost domain independence. The result on this
restricted domain can afterwards be used to define the result for the whole domain.
Towards making this idea more precise, let n0, b be such that bn ≥ f(n) for all n ≥ n0.
We focus on explaining how P ′ handles structures with n ≥ n0, as small graphs with less
than n0 nodes can be dealt with separately.
The program P ′ starts a new thread at time t2 for the graph G t2 with at most
t
2
edges. Such a thread is responsible for providing the query result after t time steps, and
works in two phases that are similar to the phases described above. It computes relative
to a domain Dt of size min{2t + c, n}, where c is the constant from (almost) domain
independence. The size of Dt is large enough to account for possible new nodes used in
edge insertions in the following t2 change steps. The domain Dt is chosen as the first |Dt|
elements of the full domain (with respect to ≤). The program P ′ maintains a bijection pi
between the active domain DG of the current graph G and the first |DG| elements of the
domain to allow a translation between DG and Dt.
The first phase of the thread for t starts at time point t2 + 1 and applies ψ for 8bcd
times during each of the next time steps. This simulation of A is finished after at most
(2t+c)b
8bc ≤ t4 time steps, and therefore the auxiliary relations are properly initialised at
time point 3t4 .
In the second phase, starting at time step 3t4 + 1 and ending at time step t, the changes
that occurred in the first phase are applied, two at a time. The thread is then ready to
answer q at time point t. Since at time t at most 2t elements are used by edges, the almost
domain independence of q guarantees that the result computed by the thread relative
to Dt coincides with the Dt-restriction of the query result for pi(Gt). The query result
for Gt is obtained by translating the obtained result according to pi−1, and extending
it to the full domain. More precisely, a tuple t¯ is included in the query result, if it can
be generated from a tuple t¯′ of the restricted query result by replacing elements from
pi−1(Dt) \ adom(Gt) by elements from V \ adom(Gt) (under consideration of equality
constraints among these elements). Again, all steps are easily seen to be first-order
definable using the auxiliary relations from above.
The above presentation assumes a separate thread for each time point and each thread
uses its own relations. These threads can be combined into one dynamic program as
follows. Since at each time point at most f(n) threads are active, we can number them in
31
Chapter 3 The Muddling technique: Bounding the number of changes
a round robin fashion with numbers 1, . . . , f(n) that we can encode by tuples of constant
arity. The arity of all auxiliary relations is incremented accordingly and the additional
dimensions are used to indicate the number of the thread to which a tuple belongs.
3.3 Outlook and bibliographic remarks
The main contribution of this chapter is the introduction of the Muddling technique: in
order to show membership in DynFO for almost domain independent dynamic queries, it
is sufficient to show (AC[f(n)], f(n))-maintainability for some at most linear function f ,
so, that the dynamic query can be maintained for f(n) many changes after an AC[f(n)]
initialisation. This technique will be used in Section 4.1 to show that all MSO-definable
queries are in DynFO for graphs with bounded treewidth, and adapted to different classes
of complex changes in Section 5.5 and Section 6.3. The results of [Datta et al. 2018b],
which are partly presented in Section 6.3, rely on the insight that the dynamic program
from [Datta et al. 2018a], proving that Reach is in DynFO under single-edge changes,
can be conceptually simplified by utilising the Muddling technique.
We believe that the technique will find further applications, both for the class DynFO
but also in similar settings. The main result of Section 4.1, that MSO-definable queries
can be maintained in DynFO for graphs with bounded treewidth, is used in preliminary
work together with Samir Datta, Siddharth Iyer, Anish Mukherjee and Thomas Zeume
to show that approximate solutions to certain MSO-definable optimisation problems can
be maintained for planar graphs and extensions thereof. Other preliminary work together
with Samir Datta, Anish Mukherjee, Shreyas Srinivas and Thomas Zeume suggests that
the Muddling Theorem can be adapted for the class DynQF, introduced by Hesse [2003b],
which is defined via dynamic programs with quantifier-free update formulas that may
use functions. A variant of the Muddling Theorem is used in [Schmidt et al. 2020] to
obtain maintainability results in a dynamic setting of parameterised complexity.
Whenever one studies a particular setting within or extending the dynamic complexity
framework, it might be worthwhile to check whether a variation of the Muddling Theorem
holds in this setting. In particular, the Muddling Theorem can be adapted for the FOIES
framework that allows changes that modify the domain by one element, using the result
of Datta et al. [2018a] that also Proposition 3.2 holds in this setting.
We sketch the necessary adaptations towards the proof of Theorem 3.3. At first, the
threads need to simulate the dynamic program on a slightly larger domain to be prepared
for a growing domain. They can use a unary relation to differentiate between their own
domain and the domain of the simulation. Secondly, observe that a thread might need to
give the query result for more than one time step. If at time point i the input structure
has a domain of size n and at time point i+ 1 the domain is of size n+ 1, then the thread
started at step i gives the answer after f(n) further steps, while the thread from step
i+ 1 needs f(n+ 1) steps. As f ∈ O(n), the difference between these numbers is just
some constant d. One can therefore run d+ 1 copies of each thread such that the j-th
copy compresses the first j steps of the original thread into one step and by this achieves
the necessary speed-up.
32
3.3 Outlook and bibliographic remarks
The impact of the Muddling Theorem as well as of Proposition 3.2, originally from
[Datta et al. 2018a], stating that DynFO(≤,+,×) = DynFO when only considering almost
domain independent queries, suggests that we need more results of the form “all dynamic
queries in some dynamic complexity class C are already in a seemingly weaker class C′”.
As other examples of such results, Zeume and Schwentick [2017] show for DynFO and
some of its subclasses that restricting the syntax of update formulas, like disallowing
disjunction or negation, often does not change the expressive power of the corresponding
dynamic programs.
An important open problem with respect to high-level maintenance strategies is to
determine how dynamic programs can be composed. There is a dynamic program that
maintains Reach; if another dynamic program maintains some binary relation R, under
which conditions is there a dynamic program that maintains the transitive closure of R?
Bibliographic remarks
The Muddling Theorem was first published, in the form of Corollary 3.4, in [Datta
et al. 2017] as joint work with Samir Datta, Anish Mukherjee, Thomas Schwentick and
Thomas Zeume. It was developed from a strategy to prove that all AC1 queries are in
DynFO under change operations that are defined by parameter-free first-order formulas
[Schwentick et al. 2017a]; in this thesis, that result is included as Corollary 5.20 and
proved using the (appropriately adapted) Muddling technique. The idea of simulating
multiple instances of a dynamic program in parallel originates from [Datta et al. 2015;
2018a].
A more general version of Corollary 3.4 was published in [Datta et al. 2018b], in the
present form of Theorem 3.3 it is published in [Datta et al. 2019].
33

CHAPTER 4
Expressibility under single-tuple changes
If we asked the user of a database management system how he or she would like to be
able to modify a table of a database, a first, maybe baffled, answer would probably be:
“I need to add tuples. And later on I need to remove some of them.”
The insertion or deletion of a single tuple is the smallest possible change to a database,
but these change operations come, at least in principle, with universal expressibility:
having a database schema fixed, every database instance can be transformed into every
other instance by a series of insertions and deletions of single tuples.1
So, single-tuple changes are a natural starting point for studying maintenance of query
results under modifications of the input database, and consequently the vast majority of
research articles in the field of dynamic complexity concentrates on this model. Sometimes
the possible changes are even further restricted to only single-tuple insertions, or, less
frequently, to only single-tuple deletions.
We first summarise known results on the expressive power of DynFO and its subclasses
under single-tuple changes. Then we formulate the goals and the contributions of this
chapter, and present its outline.
The prototypical queries that cannot be expressed in first-order logic are Parity and
Reach. These queries are therefore natural candidates to study in a dynamic setting.
Maintenance of Parity under single-tuple changes is rather trivial (cf. Example 2.11),
as observed already by Patnaik and Immerman [1997]. For the reachability query it took
a longer time until maintainability was established.
One line of research studied maintainability of restrictions of the reachability query,
1Of course, a database system that only supported insertions and deletions of single tuples would neither
be efficient nor comfortable to use, and therefore a typical database management system supports
more succinct means of data manipulation. Some of them will be captured by the change operations
we consider in Chapter 5 and Chapter 6.
35
Chapter 4 Expressibility under single-tuple changes
with the result that reachability in directed acyclic graphs [Dong and Su 1995; Patnaik
and Immerman 1997] as well as undirected graphs [Patnaik and Immerman 1997; Dong
and Su 1998; Hesse 2003b; Grädel and Siebertz 2012] is in DynFO or even weaker dynamic
settings, under single-edge changes. Reachability in deterministic graphs can even be
maintained with quantifier-free dynamic programs [Hesse 2003b]. Orthogonally, it was
shown that the general reachability query for arbitrary directed graphs can be maintained
in extensions of DynFO [Hesse 2003a; Datta et al. 2014]. Finally, Datta et al. [2015;
2018a] proved that (Reach,∆E) is in DynFO.
Several other queries have been studied in the dynamic setting. One notable result
concerns the query TreeIsomorphism that asks whether two trees given as input are
isomorphic. Etessami [1998] showed that this query can be maintained in DynFO under
single-edge changes. Further results, often obtained by reductions to other queries or
variations of already known dynamic programs, can be found for example in [Patnaik
and Immerman 1997; Grädel and Siebertz 2012].
In addition to the NL-complete query Reach, some queries that are complete for
(presumably) larger complexity classes can be maintained in DynFO under single-edge
changes. One example is the query D2LReach on acyclic graphs [Weber and Schwentick
2007], which asks whether in an acyclic, labelled graph with edge labels from {(, ), [, ]} there
is a (not necessarily simple) path between two given nodes such that the concatenated
labels on the edges of this path spell a word from the Dyck language D2, that is, the set
of well-parenthesized strings with two types of parentheses. This query is complete for
the class LOGCFL, which lies between NL and AC1. Another example is a padded variant
of the P-complete query AlternatingReach [Patnaik and Immerman 1997].
Such results do not directly imply that all queries from complexity classes as NL,
LOGCFL or even P can be maintained dynamically under single-tuple changes, because
DynFO is not known to be closed under the usually used reductions. However, there are
some results that whole classes of queries can be maintained in DynFO under single-tuple
changes. Examples on strings include all regular languages [Patnaik and Immerman 1997],
which can already be maintained using only unary auxiliary relations [Hesse 2003b] or
with quantifier-free dynamic programs [Gelade et al. 2012], and all context-free languages
[Gelade et al. 2012].
A number of results were obtained for path queries on labelled graphs. These queries
ask, similar as the query D2LReach mentioned above, whether there is a path between
two nodes such that its edge labels form a word from a given target language. If the
target language is regular, we call such a query a regular path query. Regular path
queries can be maintained under insertions [Dong et al. 1995], and also under deletions
of labelled edges [Datta et al. 2018a]. Some extensions of regular path queries that can
be maintained in DynFO, at least for restricted classes of graphs, are identified by Zeume
[2015] and Muñoz et al. [2016].
The first goal of this chapter is to extend these maintainability results.
Goal 4.1. Show that specific (classes of) queries can be maintained dynamically under
single-tuple changes.
To fully understand the expressive power of DynFO we are interested in proving that
36
some dynamic query cannot be maintained in DynFO. Unfortunately, results of this
kind are rare. Of course, every query q that is in DynFO under single-tuple insertions
is contained in P, as from a dynamic FO-program P one can obtain a polynomial-time
algorithm that for any input instance I simulates P under the change sequence that inserts
all tuples from I. Therefore, no EXPTIME-hard query is in DynFO under single-tuple
changes.
Besides this trivial lower bound, no inexpressibility results for the full class of DynFO
are known. Grädel and Siebertz [2012] prove that in their unordered setting of DynFO,
where the initial input database is non-empty and the initial auxiliary relations are
defined in some logic, TreeIsomorphism cannot be maintained, and neither whether
two unary relations have the same size, but these results rely on the weakness of the
initialisation. This is underscored by the fact that (TreeIsomorphism,∆E) is in DynFO
as to our definition that only allows initially empty databases [Etessami 1998].
There are two further prominent restrictions of DynFO that allow for lower bound
proofs: restricting the arity of the auxiliary relations and restricting the update formulas
syntactically.
When only unary auxiliary relations are allowed, several queries cannot be maintained
in DynFO under single-tuple changes, including Reach [Dong and Su 1998; Dong et al.
2003] and some non-regular languages [Zeume 2015; Vortmeier 2013].
The syntactically restricted class DynProp, containing dynamic queries that can be
maintained by dynamic programs with quantifier-free update formulas, was introduced by
Hesse [2003b], who also shows that such programs are able to maintain the reachability
query on deterministic graphs under single-edge changes. We believe that (Reach,∆E)
is not in DynProp, but so far this is only known for the special case of at most binary
auxiliary relations, and in the setting with non-empty initial graphs and logically defined
initialisation [Zeume and Schwentick 2015]. The alternating reachability query cannot be
maintained, not even for layered graphs2 with a constant number of layers [Gelade et al.
2012], which is a first-order definable query.
On strings, the expressive power of DynProp is characterised completely: exactly the
regular languages can be maintained [Gelade et al. 2012]. While several proof techniques
and inexpressibility results for DynProp are already available [Gelade et al. 2012; Zeume
and Schwentick 2015; Zeume 2017], there is so far no characterisation of the dynamic
graph queries in DynProp.
Goal 4.2. Understand the expressive power of DynProp on graphs.
Contributions Towards Goal 4.1 we prove that the class of MSO-definable queries
can be maintained under single-edge changes on graphs of bounded treewidth, that is,
we give a dynamic version of Courcelle’s Theorem which states that these queries can
statically be evaluated in linear time. This result is obtained independently and exceeds
the result of Bouyer-Decitre et al. [2017], who show that MSO-definable queries can be
2This means that the vertex set V is partitioned into layers V1, . . . , Vd. Edges (u, v) ∈ E are only
between adjacent layers, so if u ∈ Vi for some i < d, then v ∈ Vi+1.
37
Chapter 4 Expressibility under single-tuple changes
maintained under the condition that an initially computed tree decomposition stays valid
throughout the dynamic process.
Our result depends on the Muddling technique as introduced in Chapter 3, as well as
on a constructive Feferman-Vaught-type composition theorem for relational structures
that have an intersection of logarithmic size. This composition theorem might also be
useful for other applications.
Later on, we recapitulate the proof technique of Zeume [2017] for proving inexpressibility
results for DynProp and encapsulate its essence in a technical lemma. This is then used,
together with a variation of the list technique for DynProp from Gelade et al. [2012], to
show that DynProp has a strict arity hierarchy for Boolean graph queries: for each k
there is a Boolean graph query that cannot be maintained under single-edge changes
in DynProp with k-ary auxiliary relations, but that can be maintained with (k + 1)-ary
auxiliary relations. This is a step towards Goal 4.2. As a corollary, we identify a AC0[2]
query that cannot be maintained in DynProp under single-edge changes.
The results presented in this chapter are joint work with Thomas Schwentick, Thomas
Zeume, Samir Datta and Anish Mukherjee. Detailed bibliographic remarks are given at
the end of this chapter.
Outline This chapter consists of two main parts. In Section 4.1 we prove the dynamic
version of Courcelle’s Theorem: every MSO-definable query can be maintained in DynFO
under single-edge changes on graphs of bounded treewidth. The arity hierarchy for
DynProp is shown in Section 4.2. We conclude with an outlook and bibliographic
comments in Section 4.3.
4.1 A dynamic version of Courcelle’s Theorem
In the introduction of this chapter we have seen that a wide range of queries can be
maintained in DynFO under single-tuple changes, including Reach and TreeIsomor-
phism. These dynamic queries serve as showcase examples for the expressive power of
DynFO and of the dynamic approach in general: in the static setting, these problems
are complete for classes that are much stronger than FO or AC0, namely NL for Reach
and NC1 or LOGSPACE for TreeIsomorphism, depending on the input representation
[Jenner et al. 1998]. In the dynamic setting, the complexity to update a result drops
to FO.
For these impressive maintainability results the corresponding dynamic programs are
constructed in rather complicated, single-purpose proofs. Ideally, one would have a kind
of compiler that automatically generates a dynamic program given a formal definition of a
query, without the need of problem-specific “manual” work. This goal calls for proofs that
whole classes of dynamic programs can be maintained in DynFO. In algorithmic research,
results of this form are also called meta-theorems, see [Grohe 2014] for some examples.
The result that all regular languages can be maintained in DynProp [Gelade et al. 2012]
can be seen as such a meta-theorem. The class of regular languages is exactly the class of
languages expressible in MSO, so, for every Boolean query on strings that is defined by
38
4.1 A dynamic version of Courcelle’s Theorem
an MSO sentence we can automatically deduce a quantifier-free dynamic program that
maintains this query under changes of single positions of a string. A similar result holds
for MSO-definable decision problems on trees, because all regular tree languages are in
DynFO [Gelade et al. 2012], and MSO on trees coincides with regular tree languages.
The “archetypal” [Grohe 2014, p. 16] example of a meta-theorem is Courcelle’s Theorem
[Courcelle 1990], stating that all MSO-definable decision problems for graphs of bounded
treewidth can be decided in linear time. The treewidth of a graph will be formally
introduced next, in Subsection 4.1.1, for now it suffices to think of graphs that are “close”
to being a tree. An algorithm for an MSO-definable problem P for graphs of bounded
treewidth basically decomposes the input graph G into a tree T whose nodes are labelled
with small subgraphs of G, called a tree decomposition, and then applies an appropriately
extended version of the algorithm for P for trees.
There is also a parallel version of Courcelle’s Theorem, proven by Elberfeld et al. [2010],
that states that all MSO-definable problems on graphs of bounded treewidth can be
solved with logarithmic space.
The main contribution of this section is a dynamic version of Courcelle’s Theorem: we
will prove that all MSO-definable queries can be maintained under single-tuple changes
in DynFO, for graphs of bounded treewidth. The generalisation from trees to graphs of
bounded treewidth is more complex in the dynamic setting than in the static setting, and
in particular this result is not an easy extension of the result by Gelade et al. [2012] that
MSO properties on trees can be maintained in DynFO under single-edge changes. This is
because a tree decomposition might change drastically under edge insertions, and it is
unclear how (1) one can maintain a tree decomposition, and (2) how auxiliary information
regarding an old tree decomposition can be used for a heavily changed new decomposition.
To deal with this problem, we show that a once computed tree decomposition can be
used for a certain period of time, even if it is not a valid tree decomposition any more for
the current input graph, and then apply the Muddling technique presented in Chapter 3.
The outline of this section is as follows. We start by defining the notion of treewidth
as well as tree decompositions in Subsection 4.1.1. Then we introduce the main ideas
for maintaining MSO-definable queries by means of the example problem 3Col in
Subsection 4.1.2. Afterwards, in Subsection 4.1.3, we give the proof for all MSO-definable
queries. This subsection is again divided into three parts. At first we give a Feferman-
Vaught-like composition theorem that allows us to infer the MSO type of a graph given
the MSO types of a set of its subgraphs that have a logarithmic number of nodes in
common. Afterwards we first show the result for MSO-definable decision problems, then
we adapt the result towards optimisation problems.
4.1.1 Treewidth and tree decompositions
Many problems that are NP-complete for arbitrary graphs, like 3Col and VertexCover,
are very easy when the input graphs are restricted to be trees. Moreover, it was observed
that the corresponding algorithms can be generalised for classes of graphs that are “close”
to being trees (cf. Arnborg et al. [1991]). The treewidth of a graph, introduced by
Robertson and Seymour [1986], is a parameter that formalises how “tree-like” a certain
39
Chapter 4 Expressibility under single-tuple changes
1 23
45
6
7 8
9
1011
12
13
(a) The graph G.
5
6
7
45
7
2
4
3
45
1 3
5
7 8
9
7 8
10
8
10
12
1011
7
13
(b) The tree decomposition (T,B).
Figure 4.1: A graph G and a tree decomposition (T,B) of G of width 2.
graph is. Intuitively, a graph G has low treewidth if one can decompose G into subgraphs
G1, . . . , Gm of small size and arrange these subgraphs as a tree T such that if a node of
G appears in two subgraphs Gi, Gj , then it also appears in every subgraph on the unique
path in T that connects Gi and Gj .
We make this more formal now. Let G = (V,E) be a graph. A tree decomposition
(T,B) of G consists of a (rooted, directed) tree T = (I, F, r), with (tree) nodes I, (tree)
edges F , a distinguished root node r ∈ I, and a function B : I → 2V such that
(1) the set {i ∈ I | v ∈ B(i)} is non-empty for each node v ∈ V ,
(2) there is an i ∈ I with {u, v} ⊆ B(i) for each edge (u, v) ∈ E, and
(3) the subgraph T [{i ∈ I | v ∈ B(i)}] is connected for each node v ∈ V .
We refer to the number of children of a node i of T as its degree, and to the set B(i)
as its bag. The width of a tree decomposition is defined as the maximal size of a bag
minus 1. The treewidth of a graph G is the minimal width among all tree decompositions
of G. An example of a tree decomposition is depicted in Figure 4.1.
It is in general NP-complete to determine whether some graph G has treewidth k, given
both G and k as input [Arnborg et al. 1987]. However, if k is a fixed constant, on input G
one can compute in linear time a tree decomposition of width k for G (or report that no
such decomposition exists) [Bodlaender 1996], and then answer any problem definable
in (certain extensions of) MSO in linear time as well [Courcelle 1990]. The latter result,
known as Courcelle’s Theorem, is a prime example of an algorithmic meta-theorem, and
the main purpose of this section is to prove its dynamic counterpart in the setting of
DynFO.
For our purposes we will need tree decompositions of a certain shape. We say that a
tree decomposition (T,B) of a graph G with n nodes is d-nice, for some d ∈ N, if
(1) T has depth at most d logn,
(2) the degree of the nodes is at most 2, and
40
4.1 A dynamic version of Courcelle’s Theorem
(3) all bags are distinct.
Often we do not make the constant d explicit and just speak of nice tree decompositions.
We will use that every graph of treewidth k has a nice tree decomposition of width
slightly more than k. This is formalised in the following lemma, which builds on [Elberfeld
et al. 2010, Lemma 3.1].
Lemma 4.1. For every k ∈ N there is a constant d ∈ N such that for every graph of
treewidth k, a d-nice tree decomposition of width 4k + 5 can be computed in logarithmic
space.
Proof. Let G be a graph of treewidth k. By [Elberfeld et al. 2010, Lemma 3.1] a tree
decomposition (T,B) of width 4k + 3 can be computed in logarithmic space, such that
each non-leaf node has degree 2 and the depth is at most d logn, for a constant d that
only depends on k. To obtain a tree decomposition with distinct bags, we compose
this algorithm with three further algorithms, each reading a tree decomposition (T,B)
and transforming it into a tree decomposition (T ′, B′) with a particular property. Since
each of the four algorithms requires only logarithmic space, the same holds for their
composition.
The first transformation algorithm removes unnecessary leaf nodes, that is, produces a
tree decomposition in which for each leaf node i and its parent p(i) it holds B(i) 6⊆ B(p(i)).
In particular, after this transformation, each bag of a leaf node i contains some graph
node, denoted u(i), that does not appear in any other bag. This transformation inspects
each node i separately in a bottom-up fashion, and removes it if (1) B(i) ⊆ B(p(i)) and
(2) i is a leaf or all descendants of i are to be removed as well. Clearly, logarithmic space
suffices for this.
The first transformation might introduce inner nodes of degree 1. The second transfor-
mation (inductively) removes such nodes i with parent node p(i) and child i′ and inserts
an edge between B(p(i)) and B(i′), whenever B(i) ⊆ B(p(i)) or B(i) ⊆ B(i′) holds. For
this transformation, only one linear chain of nodes in T has to be considered at any time
and therefore logarithmic space suffices again. Clearly, the connectivity property is not
affected by these contractions.
The third transformation adds to every bag of an inner node i the nodes u(i1) and
u(i2), guaranteed to exist by the first transformation, of the leftmost and rightmost leaf
nodes i1 and i2 of the subtree rooted at i, respectively. Here, we assume the children of
every node to be ordered by the representation of T as input to the algorithm. After this
transformation, each node of degree 2 has a different bag than its two children thanks
to the addition of u(i1) and u(i2). Each node of degree 1 has a different bag than its
child, since this was already the case before (and to both of them the same two nodes
might have been added). Altogether, all bags are pairwise distinct and the bag sizes have
increased by at most 2.
We emphasise that, whenever a leftmost graph node u(i1) is added to B(i), it is also
added to all bags of nodes on the path from i to i1 and therefore the connectivity property
is not corrupted. It is easy to see that the third transformation can also be carried out
in logarithmic space.
41
Chapter 4 Expressibility under single-tuple changes
r
i0
i1 i2
(a) A proper triangle.
r
i0
i1
(b) A unary triangle.
r
i0
(c) An open triangle.
Figure 4.2: Illustration of (a) a proper triangle (i0, i1, i2), (b) a unary triangle (i0, i1, i1),
and (c) an open triangle (i0, i0, i0). The blue shaded area is the part of the
tree contained in the triangle.
The three presented algorithms never increase the depth of a tree decomposition, so
the final result is a d-nice tree decomposition for G of width 4k + 5.
We only consider nice tree decompositions, and due to property (3) of these de-
compositions we can identify bags with nodes from I. So, formally, a nice width-k
tree decomposition can be encoded as a (2k + 2)-ary relation T that contains a tuple
(b11, . . . , bk+11 , b12, . . . , bk+12 ) if and only if there are nodes i1, i2 ∈ I with (i1, i2) ∈ F such
that B(i1) = {b11, . . . , bk+11 } and B(i2) = {b12, . . . , bk2}.
For two nodes i, i′ of I, we write i  i′ if i′ is in the subtree of T rooted at i and i ≺ i′
if, in addition, i′ 6= i. We will often consider subtrees of a tree decomposition that are
determined by up to three tree nodes. A triangle δ of T is a triple (i0, i1, i2) of nodes
from I such that i0  i1, i0  i2, and (1) i1 = i2 or (2) neither i1  i2 nor i2  i1. In
case of (2) we call the triangle proper, in case of (1) unary, unless i0 = i1 = i2 in which
we call it open (see Figure 4.2 for an illustration). The subtree T (δ) induced by a triangle
consists of all nodes j of T for which the following holds:
(i) i0  j,
(ii) if i0 ≺ i1 then i1 6≺ j, and
(iii) if i0 ≺ i2 then i2 6≺ j.
That is, for a proper or unary triangle, T (δ) contains all nodes of the subtree rooted at i0
which are not below i1 or i2. For an open triangle δ = (i0, i0, i0), T (δ) is just the subtree
rooted at i0.
Each triangle δ induces a subgraph G(δ) of G as follows: V (δ) is the union of all bags
of T (δ). By B(δ) we denote the set B(i0) ∪B(i1) ∪B(i2) of interface nodes of V (δ). All
other nodes in V (δ) are called inner nodes. The edge set of G(δ) consists of all edges of
G that involve at least one inner node of V (δ).
4.1.2 Showcase 3-colourability for graphs of bounded treewidth
Before we present our main result of this section, we introduce the main proof ideas by
means of a specific problem and show that the 3-colourability problem 3Col for graphs
of bounded treewidth can be maintained in DynFO under single-edge changes. Given an
undirected graph, 3Col asks whether its nodes can be coloured with three colours such
42
4.1 A dynamic version of Courcelle’s Theorem
that adjacent nodes have different colours. More precisely, we show the following result.
Theorem 4.2. For every k, (3Col,∆E) is in DynFO for graphs with treewidth at most k.
To prove this theorem, thanks to Corollary 3.4 and the fact that 3Col is almost
domain independent, it suffices to show that (3Col,∆E) is (AC1, logn)-maintainable
for graphs with treewidth at most k. In a nutshell, our approach can be summarised as
follows.
The AC1 initialisation computes a nice tree decomposition (T,B) of width at most
4k+5 and maximum bag size ` def= 4k+6, as well as information about the 3-colourability
of certain subgraphs of G. More precisely, it computes, for each triangle δ of T and each
3-colouring C of the interface nodes B(δ), whether there exists a colouring C ′ of the
inner nodes of G(δ) such that all edges involving at least one inner node are consistent
with C ∪ C ′.
To determine whether the input graph is 3-colourable for the next logn changes, the
dynamic program only maintains a set S of special bags: for each affected graph node v
that is an endpoint of a changed (i.e. deleted or inserted) edge, S contains one bag in
which v occurs. Also, if two bags are special, their least common ancestor is considered
special and is included in S. After logn changes there are at most 4 logn special bags.
With the help of the auxiliary information computed by the initialisation, a first-order
formula ϕ can test whether G is 3-colourable as follows. By existentially quantifying
8` variables and interpreting the valuation as a bit string of length 8` logn using BIT,
the formula can choose two bits of information for each of the at most 4` logn nodes
in special bags. For each such node, these two bits are interpreted as encoding of one
of three colours. The formula ϕ can check that this colouring of the special bags can
be extended to a colouring of G, because G is essentially a union of subgraphs induced
by triangles defined by special bags, and the auxiliary relations provide the information
whether a colouring of the interface nodes can be extended to the graph induced by any
triangle.
Before we give a detailed proof, we need some more notation. Let G = (V,E) be a
graph and (T,B) with T = (I, F, r) a nice tree decomposition with bags of size at most `.
A colouring of a set U of nodes is a mapping from U to {1, 2, 3}. An edge (u, v) is
properly coloured if u and v are mapped to different colours. For a triangle δ, we say that
a colouring C of the set B(δ) of interface nodes is consistent, if there exists a colouring C ′
of the inner vertices of G(δ) such that all edges of G(δ) are properly coloured by C ∪ C ′.
Recall that G(δ) only contains edges that involve at least one inner vertex.
We say that a tuple v¯(i) = (v1, . . . , v`) represents a tree node i ∈ I (or, the bag
B(i)) if B(i) = {v1, . . . , v`}. A tuple v¯(δ) = (v¯(i0), v¯(i1), v¯(i2)) represents the triangle
δ = (i0, i1, i2). If v¯(δ) = (v1, . . . , v3`) represents a triangle δ and c¯ is a tuple from
{1, 2, 3}3` such that cj = cj′ whenever vj = vj′ for j, j′ ∈ {1, . . . , 3`}, we write Cc¯,v¯ for
the colouring of B(δ) defined by Cc¯,v¯(vj) = cj , for every j ∈ {1, . . . , 3`}.
Proof (of Theorem 4.2). Let G = (V,E) be a graph of treewidth at most k. The AC1
initialisation first computes a d-nice tree decomposition (T,B) with bags of size at most
`
def= 4k + 6, for the constant d guaranteed to exist by Lemma 4.1, and the predecessor
43
Chapter 4 Expressibility under single-tuple changes
r
i0
i1 i2
i′1 i
′
2
Figure 4.3: Illustration of the inductive step in the computation of colourability informa-
tion for triangles in the proof of Theorem 4.2.
relation  of T . Also, it initialises the relations ≤ and BIT. Next, it computes the
following auxiliary relations in a bottom-up fashion with respect to T = (I, F, r). For
each tuple c¯ ∈ {1, 2, 3}3 the auxiliary relation Rc¯ contains all tuples v¯(δ) from V 3 that
represent some triangle δ such that Cc¯,v¯ is a consistent colouring of B(δ).
The auxiliary relations are computed inductively and bottom-up, that is, the auxiliary
information for a tuple representing a triangle (i0, i1, i2) is computed by using the
information for the triangles rooted at the two children of i0. It will be easy to see that
each inductive step can be defined by a first-order formula and, since T has depth d logn,
the induction reaches a fixpoint after d logn iterations. Therefore the initialisation is in
IND[logn] = AC1. We recall that triangles can be open, unary or proper, depending on
whether they are induced by a single bag, by two, or by three bags.
For the base case, a tuple v¯ representing an open triangle corresponding to a leaf of T
is in Rc¯ if and only if, for each j ∈ {1, . . . }, cj = c+j = c2+j , since the graph induced
by this triangle has no inner nodes and therefore contains no edges.
The inductive cases are straightforward. We only describe in detail the case of a proper
triangle δ = (i0, i1, i2) where i1 and i2 are in different subtrees of i0; the other cases are
similar. Let i′1 and i′2 be the two children of i0 such that i′1  i1 and i′2  i2. Figure 4.3
illustrates this situation. By the induction hypothesis, the auxiliary information for
all triangles rooted at i′1 and i′2 has already been computed. A tuple v¯
def= v¯(δ) is
in some Rc¯, if there are tuples d¯, e¯ ∈ {1, 2, 3}3 such that u¯ def= v¯((i′1, i1, i1)) ∈ Rd¯,
w¯
def= v¯((i′2, i2, i2)) ∈ Re¯ and it holds that
• Cc¯,v¯ and Cd¯,u¯ coincide on B(i0) ∩ B(i′1) and on B(i1),
• Cc¯,v¯ and Ce¯,w¯ coincide on B(i0) ∩ B(i′2) and on B(i2), and
• all edges over B(i0) ∪ B(i′1) ∪ B(i′2) ∪ B(i1) ∪ B(i2) of which at least one node is
not in B(i0) ∪ B(i1) ∪ B(i2) are properly coloured by Cc¯,v¯ ∪ Cd¯,u¯ ∪ Ce¯,w¯.
We next describe how a dynamic program can maintain 3-colourability for logn change
steps, starting from the above initial auxiliary relations, with the help of an additional
-ary relation S and another binary relation N . Whenever an edge (u, v) is inserted into
or deleted from E, we consider both u and v as affected. With each affected graph node v
44
4.1 A dynamic version of Courcelle’s Theorem
we associate a bag B(i) of the tree decomposition such that v ∈ B(i). We call such a
bag special, as well as all graph nodes it contains. Furthermore, if the tree node i is the
least common ancestor of two nodes i1, i2 such that B(i1) and B(i2) are special, then the
bag B(i) becomes special as well. We call a triangle δ = (i0, i1, i2) of T clean if no tree
node i in T (δ) is associated with a special bag B(i), apart from i0, i1, i2. The dynamic
program keeps track of all special bags using the relation S which contains all tuples v¯
that represent some special bag. Additionally, it maintains a bijection between the first
`|S| nodes of V with respect to the linear order ≤ and the graph nodes in S, encoded in
the relation N . The updates of the auxiliary relations S and N are clearly first-order
expressible.
It remains to describe how a first-order formula can check 3-colourability of G given the
relation S,N and Rc¯. Within logn changes at most 2 logn graph nodes can be affected,
resulting in at most 4 logn special bags altogether, since for each new special bag at
most one other bag can become special as the least common ancestor of special bags.
That means that the set Z of graph nodes occurring in some tuple of S contains at most
4` logn nodes.
A colouring of Z can be represented by 8` logn bits and can thus be “guessed” by a first-
order formula, by existentially quantifying over 8` variables x1, . . . , x4`, y1, . . . , y4`. More
precisely, the j-th bits of xr and yr together represent the colour of the special node at
position (r−1) logn+j with respect to the linear order represented by N . The first-order
formula can easily check that the colouring C of S represented by x1, . . . , x4`, y1, . . . , y4`
is consistent for edges between special nodes and that for each clean triangle of T induced
by special bags it can be extended to a consistent colouring of the inner nodes. The
latter information is available in the relations Rc¯.
4.1.3 MSO-definable queries on graphs of bounded treewidth
In this subsection we generalise the proof strategy from Theorem 4.2 and prove the
main result of this section, a dynamic version of Courcelle’s Theorem: all MSO-definable
properties can be maintained in DynFO for graphs with bounded treewidth. More
precisely, we start with the model checking problem MCϕ that asks whether a given
graph G satisfies some fixed MSO sentence ϕ, that is, whether G |= ϕ holds. So, we
show that MSO-definable decision problems can be maintained in DynFO, for graphs
with bounded treewidth.
Theorem 4.3. For every MSO sentence ϕ and every k, (MCϕ,∆E) is in DynFO for
graphs with treewidth at most k.
In addition to decision problems, we also prove that MSO-definable optimisation
problems can be maintained in DynFO for graphs with bounded treewidth. An MSO-
definable optimisation problem OPTϕ is induced by an MSO formula ϕ(X1, . . . , Xm)
with free set variables X1, . . . , Xm. Given a graph G with node set V , it asks for sets
A1, . . . , Am ⊆ V of minimal3 size ∑mi=1 |Ai| such that G |= ϕ(A1, . . . , Am). Examples
3We only deal with minimisation problems here. The adaptation to maximisation problems is straight-
forward.
45
Chapter 4 Expressibility under single-tuple changes
(with m = 1) for such problems are the vertex cover problem and the dominating set
problem.
We require from a dynamic program for such a problem that it maintains unary query
relations Ans1, . . . ,Ansm that store, at any time, an optimal solution for the current
graph.
Theorem 4.4. For every MSO formula ϕ(X1, . . . , Xm) and every k, (OPTϕ,∆E) is in
DynFO for graphs with treewidth at most k.
Although we, for simplicity, state and prove these theorems only for graphs, they can be
generalised for arbitrary input schemas. They can also be proven for certain extensions of
MSO logic. On particular extension of MSO is the logic guarded second-order logic (GSO),
which extends MSO by guarded second-order quantification. This logic syntactically
allows to quantify over non-unary relation variables, which, however, is semantically
restricted: a tuple t¯ = (a1, . . . , am) can only occur in a quantified relation, if all elements
from {a1, . . . , am} occur together in some tuple of the structure in which the formula is
evaluated. In the context of graphs, this just means that GSO allows quantification over
edge sets in addition to quantification over node sets. In this context, GSO is sometimes
also called MSO2; then, “regular” MSO is often denoted MSO1.
In general, GSO is more expressive than MSO. For example, the property that a
directed graph contains a Hamiltonian cycle can be expressed by a GSO formula that (1)
existentially quantifies a set C of edges and (2) checks that the edges in C constitute a
Hamiltonian cycle, by checking that (2a) every node has exactly on incoming and one
outgoing edge in C, and that (2b) the graph with edge set C is connected. The condition
(2a) is easily seen to be expressible, the latter condition (2b) can be expressed using the
MSO formula for graph reachability, see Example 2.1. On the other hand, Hamiltonicity
cannot be expressed in MSO [Courcelle and Engelfriet 2012, Proposition 5.13].
The situation is different when we only consider graphs of bounded treewidth. For
every k, GSO has the same expressive power as MSO on graphs with treewidth at most k
[Courcelle 1994, Theorem 2.2]. So, for every GSO formula ϕ and every k, there is an
MSO formula ψk that is equivalent to ϕ on graphs with treewidth k. Moreover, if
ϕ = ∃X1 · · · ∃Xm ϕ′, then ψk is of the form ∃X11 · · · ∃X`1 · · · ∃X1m · · · ∃X`mψ′, for some
natural number `. So, the next corollaries follow immediately.
Corollary 4.5. For every GSO sentence ϕ and every k, (MCϕ,∆E) is in DynFO for
graphs with treewidth at most k.
Corollary 4.6. For every GSO formula ϕ(X1, . . . , Xm) and every k, (OPTϕ,∆E) is in
DynFO for graphs with treewidth at most k.
In the remainder of this section, we work out the proofs of Theorem 4.3 and Theorem
4.4. To give an overview of the outline, we sketch the main proof ideas for Theorem 4.3.
Let ϕ be a fixed MSO formula of quantifier rank d and k a treewidth. We show
that (MCϕ,∆E) is (AC1, logn)-maintainable for graphs with treewidth at most k. The
construction of a dynamic program that maintains MCϕ is similar to the one in the
proof for 3Col (Theorem 4.2). At each point, the program needs to evaluate ϕ on a
46
4.1 A dynamic version of Courcelle’s Theorem
graph G′ = (V, (E \ E−) ∪ E+) with n nodes, where |E− ∪ E+| = O(logn), using a nice
tree decomposition (T,B) of width 4k + 5 for the initial graph G = (V,E) and auxiliary
information on the rank-d MSO type for each triangle of T .
The graph G′ can be viewed as having a center C ⊆ V of logarithmic size that contains
the nodes with edges in E− ∪ E+ and, additionally, for each of these nodes v all nodes
of one bag that contains v. Furthermore, there are node sets D1, . . . , D` that together
contain all nodes from V , such that the sets Di−C are pairwise disjoint and disconnected
in G′, and each set Di∩C has size O(1) (see already Figure 4.4 for an illustration). From
the type information for the triangles, the dynamic program can infer the rank-d MSO
type of each G′[Di]. The main task now is to determine the rank-d MSO type of G′ from
this information.
Elberfeld, Grohe and Tantau prove a composition theorem for a similar setting [Elberfeld
et al. 2016, Theorem 3.8]. In their setting, the center C is of constant size and there
might be arbitrarily many, even infinitely many, sets Di. They show that for every MSO
formula ϕ one can compute a first-order formula that, when evaluated on an expansion
of G[C] by information on the types of the G[Di], yields the same result as ϕ on G.
We show for our setting that one can construct an MSO formula ψ such that G′ |= ϕ
if and only if B |= ψ, where B is a structure that extends G′[C] by type information
about the G′[Di]. This construction is detailed next. It uses in particular a composition
theorem by Shelah (cf. Theorem 4.9).
Afterwards we explain how a dynamic program makes use of this formula ψ. The main
idea is that from ψ on can obtain a first-order formula that also determines whether
G′ satisfies ϕ by replacing the second-order quantification over C in ψ by first-order
quantification over V . This is possible, since sets of size O(logn) over C can be encoded
by constantly many elements from V . Altogether, we obtain a proof for Theorem 4.3.
In the last part of this subsection we extend this proof towards optimisation problems,
and prove Theorem 4.4.
A Feferman–Vaught-type composition theorem
We now present a Feferman–Vaught-type composition theorem for MSO, similar to the
composition theorem by Elberfeld et al. [2016]. We prove this theorem for arbitrary input
schemas, but first introduce the main concept using graphs.
We consider graphs G = (V,E) with a center C ⊆ V , such that for some setsD1, . . . , D`
and some w > 0 the following conditions hold.
• ⋃`i=1Di = V .
• For all i 6= j, Di ∩Dj ⊆ C.
• All edges in E have both end nodes in C or in some Di.
• For every i, |Di ∩ C| ≤ w.
• For each i there is some element vi ∈ Di ∩ C that is not contained in any Dj , for
j 6= i.
In this case, we say that C has connection width w in G. See Figure 4.4 for an illustration.
We refer to the sets D1, . . . , D` as petals and the nodes v1, . . . , v` as identifiers of
47
Chapter 4 Expressibility under single-tuple changes
C D1v1
D2
v2
D3
v3
D4 v4
D5
v5
D6
v6
Figure 4.4: Sketch of a graph with center C of connection width 2, highlighted in blue,
and petals D1, . . . , D6.
their respective petals. We emphasise that ` is bounded by |C|, but not assumed to be
bounded by a constant. Readers who have read the proof of Theorem 4.2 can roughly
think of C as the set of vertices from special bags.
Our goal is to show the following. If a graph G has a center C of connection width w,
for come constant w, then G[C] can be extended by the information about the MSO
types of its petals in a suitable way, resulting in a structure B with universe C, such that
MSO formulas over G have equivalent MSO formulas over B.
In the following, we work out the above plan in more detail. We fix some relational
schema σ and assume that it contains a unary relation symbol C.
The definition of the connection width of sets C easily carries over to σ-structures. In
particular, tuples need to be entirely in C or in some petal Di, and all constants of the
structure need to be included in C. For every i, we call the set Ii
def= Di ∩C the interface
of Di and the nodes of Di − C inner elements of Di.
Let A be a σ-structure, and C a center of connection width w with petals D1, . . . , D`.
For every i, let u¯i = (ui1, . . . , uiw) be a tuple of elements from the interface Ii of Di such
that ui1 is an identifier of its petal Di and every node from Ii occurs in u¯i. By (Ai, u¯i)
we denote the substructure of A induced by Di with ui1, . . . , uiw as constants but without
all tuples over C, i.e., (Ai, u¯i) only contains tuples with at least one inner element of Di.
Let d > 0. The rank d, width w MSO indicator structure of A relative to C and
tuples u¯i is the unique structure B which expands A[C] by the following relations:
• a w-ary relation J that contains all tuples u¯i, and
• for every rank-d MSO type τ over σ ∪ {c1, . . . , cw}, a unary relation Rτ containing
those identifier nodes ui1 for which the rank-d MSO type of (Ai, u¯i) is τ .
We note that different choices of the tuples u¯i result in different indicator structures
and we denote the set of all indicator structures of A relative to C by S(A, C, w, d).
We are now ready to formulate the desired composition theorem.
Theorem 4.7. For each d > 0, every MSO sentence ϕ with depth d, and each w, there
is a number d′ and a MSO sentence ψ such that for every σ-structure A, every center C
48
4.1 A dynamic version of Courcelle’s Theorem
of A with connection width w and every B ∈ S(A, C, w, d′) it holds A |= ϕ if and only if
B |= ψ.
The proof of Theorem 4.7 uses Shelah’s generalised sums [Shelah 1975]. We follow
the exposition from Blumensath et al. [2008]. In a nutshell, a generalised sum is a
composition of several disjoint component structures along an index structure. Shelah’s
composition theorem states that MSO sentences on a generalised sum can be translated
to MSO sentences on the index structure enriched by MSO type information on the
components.
We apply the composition theorem on the basis of the following ideas, illustrated in
Figure 4.5. From the center C and the petals Di we define an index structure I and
component structures Di, respectively. In the generalised sum, these disjoint structures
are again composed into a structure that is very similar to A. More precisely, A can
be defined in the generalised sum by a first-order interpretation, and thus, thanks to
Lemma 2.3, we can translate the formula ϕ for A into a formula ϕ′ on the generalised sum.
Shelah’s composition theorem then provides a translation of ϕ′ into an MSO formula ψ′ on
the structure I enriched with type information on the structures Di. This enriched index
structure is again very similar to an MSO indicator structure B: there is a first-order
interpretation that defines the enriched index structure in B. As a consequence, by
Lemma 2.3 again, the formula ψ′ can be translated into a formula ϕ on B.
Before we proceed to the proof of Theorem 4.7, we formally introduce the notion of a
generalised sum and state the corresponding result on translations of MSO formulas.
Definition 4.8 (Shelah [1975], formulation following Blumensath et al. [2008]). Let I =
(I, S1, . . . , Sr) be a structure and (Di)i∈I a sequence of structures Di = (Di, Ri1, . . . , Rit)
indexed by elements i of I. The generalised sum of (Di)i∈I is the structure∑
i∈I
Di def= (U,∼, R′1, . . . , R′t, S′1, . . . , S′r)
with universe U def= {〈i, a〉 | i ∈ I, a ∈ Di} and relations
• 〈i, a〉 ∼ 〈i′, a′〉 if and only if i = i′
• R′j def= {(〈i, a1〉, . . . , 〈i, a`〉) | i ∈ I, (a1, . . . , a`) ∈ Rij}
• S′j def= {(〈i1, a1〉, . . . , 〈i`, a`〉) | (i1, . . . , i`) ∈ Sj , ak ∈ Dik for all k ∈ {1, . . . , `}}
The structures I and Di in this definition are also referred to as index structure and
component structures, respectively.
Theorem 4.9 (Shelah [1975], formulation from [Blumensath et al. 2008, Theorem 3.16]).
From every MSO sentence ϕ, a finite sequence χ1, . . . , χs of MSO formulas and an MSO
formula ψ can be constructed such that∑
i∈I
Di |= ϕ ⇔ (I, Jχ1K, . . . , JχsK) |= ψ
for all index structures I and component structures Di, where JχK def= {i ∈ I | Di |= χ}.
49
Chapter 4 Expressibility under single-tuple changes
I, D1, . . . ,D|I| ∑v∈I Dv A
B (I, Jχ1K, . . . , JχsK)
ϕϕ′ψ′
χ1, . . . , χs
ψ
|= |=
|= |=
generalised sum FO interpretation
Υ
FO interpretation
Υ′
Lemma 2.3Theorem 4.9Lemma 2.3
Figure 4.5: Overview of the proof strategy of Theorem 4.7.
Intuitively, the formulas χi encode the MSO type information on the component
structures of the generalised sum.
With the necessary notions in place, we can now prove Theorem 4.7.
Proof (of Theorem 4.7). Suppose that A is a σ-structure with center C of connection
width w, tuples u¯i collecting the interface nodes for each petal Di as described above,
and let ϕ be an MSO formula over schema σ. The proof is in three steps, depicted in
Figure 4.5:
(A) We present an index structure I and component structures Di, and show that
there is an FO-interpretation Υ that interprets the structure A in the generalised
sum ∑i∈I Di. Thus there is an MSO formula ϕ′ such that A |= ϕ if and only if∑
i∈I Di |= ϕ′ by Lemma 2.3.
(B) From Theorem 4.9 we obtain formulas ψ′ and χ1, . . . , χs such that
∑
i∈I Di |= ϕ′ if
and only if (I, Jχ1K, . . . , JχsK) |= ψ′.
(C) We show that there is an FO-interpretation Υ′ that interprets (I, Jχ1K, . . . , JχsK) in
each MSO indicator structure B ∈ S(A, C, w, d′) with appropriate d′. Thus there is
an MSO formula ψ that satisfies B |= ψ if and only if (I, Jχ1K, . . . , JχsK) |= ψ′ by
Lemma 2.3.
Combining these three steps allows us to conclude
A |= ϕ (A)⇐⇒
∑
i∈I
Di |= ϕ′ (B)⇐⇒ (I, Jχ1K, . . . , JχsK) |= ψ′ (C)⇐⇒ B |= ψ
for any MSO indicator structure B ∈ S(A, C, w, d′).
It remains to prove (A) and (C), since (B) is a direct application of Theorem 4.9.
Towards proving (A) we construct structures I and Di which are closely related to
the substructure A[C] of A and the structures (Ai, u¯i), respectively. Let R1, . . . , Rq be
the relation symbols of σ and let σ′ = {S1, . . . , Sq}. The structure I has universe C, for
each j ∈ {1, . . . , q} the σ′-relation Sj , defined as the restriction Rj [C] of the respective
σ-relation Rj of A to C, and the relation J as described above. Each structure Dv for
v ∈ C is over schema σ ∪ {U1 . . . , Uw} and defined as follows. If v is an identifier ui1
50
4.1 A dynamic version of Courcelle’s Theorem
of a petal, then Dv = (Ai, {ui1}, . . . , {uiw}), that is, the restriction of A to the elements
from Di, without any tuples consisting only of elements from C ∩Di, and with additional
unary, singleton relations U1, . . . , Uw such that Uj includes only the j-th interface node uij .
If v is no identifier node then Dv is the structure with universe {v} and empty relations.
In the generalised sum S def= ∑v∈I Dv = (U,∼, R′1, . . . , R′q, U ′1, . . . U ′w, S′1, . . . , S′q, J ′),
the universe U consists of elements of the form 〈ui1, w〉, where w ∈ Di, and of the form
〈u, u〉 where u ∈ C is not an identifier of any petal. We emphasise that the formulas of
the interpretation that defines (a copy of) A in S can not access the components u and
v of an element 〈u, v〉 ∈ U . However, U can be partitioned into four kinds of elements,
each of which can easily be distinguished from the others in a first-order fashion:
(i) Elements of the form 〈u, u〉, for which u ∈ C is not an identifier of any petal. They
can be identified since they form an equivalence class of size 1 with respect to ∼;
(ii) elements of the form 〈ui1, ui1〉, for an identifier ui1 ∈ C. These are precisely the
elements in U ′1;
(iii) elements of the form 〈ui1, w〉, where w ∈ C, w 6= ui1 and w is in the petal Di. These
elements occur in some U ′j , for j > 1 (but not in U ′1);
(iv) elements of the form 〈ui1, w〉, where w 6∈ C is in the petal Di. They do not occur in
any U ′j and are not of type (i).
Elements of type (iv) are in one-to-one correspondence with the inner elements of petalsDi.
Elements from C might have several copies in U , but only one of the types (i) or (ii).
Thus, the formula that defines the universe for the first-order interpretation Υ of A in∑
i∈I Di simply drops all elements of type (iii). Tuples of A of a relation Rj that entirely
consist of nodes from C (that is, elements of type (i) or (ii) in S) are directly induced by
the corresponding relation S′j .
Tuples of a relation Rj in A that contain at least one inner node, that is, at least one
node of type (iv), are defined on the basis of the relation R′j of the generalised sum. Note
however that these relations R′j in S contain elements of type (iii) which are not in the
universe of the interpreted structure. In a corresponding tuple of the relation Rj , these
elements are substituted by their copy of type (i).
We make this more precise. Observe that it can be expressed in a first-order fashion
whether for a type (iii) element 〈v1, u1〉 and a type (i) element 〈v2, v2〉 it holds u1 = v2,
i.e., that, intuitively, 〈v2, v2〉 is the copy representing u1 in S that survives in the universe
of the interpretation. We claim that this condition holds, if and only if 〈v2, v2〉 occurs as
the i-th entry in some tuple of J ′ with first entry 〈v1, u1〉, where i is the unique number
such that 〈v1, u1〉 ∈ U ′i . From this claim, first-order expressibility follows instantly. The
“only if”-part of the claim is straightforward. For the “if”-part, it follows from the latter
condition that there is a tuple with first entry v1 and i-th entry v2 in J , by the definition
of J ′. Since 〈v1, u1〉 ∈ U ′i , there is also a tuple in J with v1 as first entry and u1 as i-th
entry. However, since J has at most one tuple with any given value as first entry, u1 = v2
follows, as claimed.
A tuple with some element 〈v, u〉 of type (iv) is now in a relation Rj of the interpretation,
if it can be transformed into a tuple of R′j by replacing some elements 〈w,w〉 of type (i)
with 〈v, w〉.
51
Chapter 4 Expressibility under single-tuple changes
It follows from the construction that Υ(S) is isomorphic to A and therefore A |= ϕ if
and only if Υ(S) |= ϕ. We obtain the formulas ϕ′, χ1, . . . , χs and ψ′ as explained above.
Let d′ be the maximal quantifier rank of any formula χj . This concludes step (A) of the
proof.
Towards proving (C), recall that we need to show that there is a first-order interpre-
tation Υ′ which interprets (I, Jχ1K, . . . , JχsK) in B, for any B ∈ S(A, C, w, d′). Let B be
such a structure. The universe of I is C, that is, the same as the universe of B. Thus
the formula of Υ′ that defines the universe is trivial.
For the definitions of the relations JχjK, the idea is as follows. If v is not an identifier ui1
of a petal, then v ∈ JχjK if and only if χj holds in the structure consisting of only one
element and with empty relations, which can be hard-coded in the defining formula.
Otherwise, if v = ui1 for some i, we need to determine whether χj holds in the structure
Dv = (Ai, {ui1}, . . . , {uiw}), which is a structure over schema σ ∪ {U1, . . . , Uw}. The
structure B contains information about the MSO types of the structures (Ai, u¯i), but
(Ai, u¯i) is a structure over schema σ ∪ {c1, . . . , cw}. Yet it is easy to see that for the
formula χ′j that is obtained from χj by replacing every atom Uk(x) by x = ck it holds
(Ai, u¯i) |= χ′j if and only if Dv |= χj . So, in this case v ∈ JχjK if and only if v ∈ RBτ
for a rank-d′ MSO type τ with χ′j ∈ τ . All these conditions can even be expressed by
quantifier-free first-order formulas for fixed formulas χj . As a result, by Lemma 2.3 we
obtain from Υ′ and ψ′ a formula ψ with B |= ψ ⇔ A |= ϕ.
Maintaining MSO-definable decision problems
We proceed to show that every MSO-definable decision problem can be maintained in
DynFO, and thus prove Theorem 4.3. As we want to apply Corollary 3.4, we need to
assure that these queries are almost domain independent. This is easy to see and given
by the following proposition.
Proposition 4.10 ([Datta et al. 2019, Proposition 3.2]). Every MSO-definable query is
almost domain independent.
For the proof of Theorem 4.3 it consequently suffices to show that (MCϕ,∆E) is
(AC1, logn)-maintainable for graphs G with treewidth at most k.
The idea for our dynamic program is similar to the idea for maintaining 3Col in
Theorem 4.2: during its initialisation the program constructs a tree decomposition and
computes the MSO type of appropriate rank for all triangles (instead of partial colourings
as in the proof of Theorem 4.2). During the change sequence, a set C of nodes is defined
that contains, for each graph node v affected by a change, all nodes of at least one
special bag containing v. The set C has connection width w for some constant w and the
dynamic program basically maintains an MSO indicator structure B for G relative to C.
As there are only logn many change steps, the size of C is bounded by O(logn).
By Theorem 4.7 there is an MSO formula ψ with the property that G |= ϕ if and only
if B |= ψ. Although the dynamic program maintains B, it cannot directly evaluate ψ, as
it is restricted to use first-order formulas. For this reason we first show that second-order
52
4.1 A dynamic version of Courcelle’s Theorem
quantification over sets of size O(logn) can be simulated in first-order logic, if a particular
relation is present. Afterwards we present the details of the dynamic program.
We call an MSO formula C-restricted, if all its quantified subformulas are of one of the
following forms.
• ∃x (C(x) ∧ ϕ) or ∀x (C(x)→ϕ),
• ∃X (∀x(X(x)→C(x)) ∧ ϕ) or ∀X (∀x(X(x)→C(x))→ϕ).
Let A be a structure with a unary relation C and a (k+1)-ary relation Sub, for some k.
We say that Sub encodes subsets of C if, for each subset C ′ ⊆ C, there is a k-tuple t¯ such
that, for every element c ∈ C it holds c ∈ C ′ if and only if (t¯, c) ∈ Sub. Clearly, such an
encoding of subsets only exists if |V |k ≥ 2|C| and thus if |C| ≤ k log |V |.
Proposition 4.11. For each C-restricted MSO sentence ψ over a schema σ (containing
C) and every k there is a first-order sentence χ over σ ∪ {S} where S is a (k + 1)-ary
relation symbol such that, for every σ-structure A and (k + 1)-ary relation Sub that
encodes subsets of C (of A), it holds A |= ψ if and only if (A,Sub) |= χ.
Proof. The proof is straightforward. Formulas ∃X (∀x(X(x)→C(x)) ∧ ϕ) are trans-
lated into formulas ∃x¯ ϕ′, where x¯ is a tuple of k variables and ϕ′ results from ϕ by
simply replacing every atomic formula X(y) by Sub(x¯, y). Universal set quantification is
translated analogously.
We now turn to the proof of Theorem 4.3.
Proof (of Theorem 4.3). Thanks to Corollary 3.4 and Proposition 4.10 it suffices to show
that (MCϕ,∆E) is (AC1, logn)-maintainable in DynFO for graphs with treewidth at
most k. Let d be the quantifier rank of ϕ and let d′ and ψ be the number and the MSO
sentence guaranteed to exist by Theorem 4.7.
Given a graph G = (V,E), the AC1 initialisation first ensures that relations ≤,+,×
and BIT are available. Then it computes a dtree-nice tree decomposition (T,B) with
T = (I, F, r) with bags of size at most ` def= 4k + 6, for the constant dtree guaranteed to
exist by Lemma 4.1, together with the predecessor order  on I.
With each node i ∈ I, we associate a tuple v¯(i) = (v1, . . . , vm, v1, . . . , v1) of length `,
where B(i) = {v1, . . . , vm} and v1 < · · · < vm. That is, if the bag size of i is `,
this tuple just contains all graph nodes of the bag in increasing order. If the bag
size is smaller, the smallest graph node is repeated. Similarly, with each triangle
δ = (i0, i1, i2) such that the subgraph G(δ) has at least one inner node, we associate a
tuple v¯(δ) = (v(δ), v¯(i0), v¯(i1), v¯(i2)), where v(δ) denotes the smallest inner node of G(δ)
with respect to ≤.
The dynamic program further uses auxiliary relations S,C,N , and Dτ , for each rank-d′
MSO type τ over the schema that consists of the binary relation symbol E and 3`+ 1
constant symbols c1, . . . , c3`+1. The intended meaning is that C is a center of G with
connection width 3` + 1 and that from these relations an MSO indicator structure B
relative to C can be defined in first-order.
The relation S stores tuples v¯(i) representing special bags, as in the proof of Theorem 4.2.
The relations Dτ provide MSO type information for all triangles. More precisely, for
53
Chapter 4 Expressibility under single-tuple changes
Figure 4.6: Tree of a tree decomposition. Tree nodes representing special bags are coloured
blue, the corresponding maximal clean triangles are indicated as coloured
areas. The union of all special bags are a center of the graph, with the graphs
induced by the maximal clean triangles as petals.
each triangle δ = (i0, i1, i2) for which the subgraph G(δ) has at least one inner node, Dτ
contains the tuple v¯(δ) if and only if the rank-d′ MSO type of (G(δ), v¯(δ)) is τ .
The set C always contains all graph nodes that occur in special bags (and thus in S).
Additionally, for each maximal clean triangle δ with at least one inner node, C contains
the inner node v(δ). The relation N defines a bijection between C and an initial segment
of ≤.
We observe that C is a center of G and that the petals induced by C correspond to
the maximal clean triangles, with respect to the special nodes stored in S, with at least
two inner nodes. The interface I(δ) of a petal corresponding to a maximal clean triangle
δ = (i0, i1, i2) contains the nodes from B(i0), B(i1), and B(i2) as well as the node v(δ),
so C has connection width w def= 3 + 1. Figure 4.6 gives an illustration.
Now, an indicator structure B ∈ S(G,C,w, d′) can be first-order defined as follows.
Clearly, maximal clean triangles can be easily first-order defined from the relation S.
For each maximal clean triangle δ = (i0, i1, i2) with at least two inner nodes, the
relation J contains the tuple v¯(δ), and the relation Rτ contains v(δ) if and only if is
v¯(δ) ∈ Dτ . We translate the MSO formula ψ to a C-restricted MSO formula χ′ such that
B |= ψ ⇔ (G,Aux) |= χ′, where Aux is the auxiliary database stored by the dynamic
program. This translation is basically as described by Lemma 2.3. The formula χ′ results
from ψ by
• C-restricting every quantified subformula, so, for example, replacing every quantified
subformula ∃X θ by ∃X (∀x(X(x)→C(x)) ∧ θ) and every quantified subformula
∀X θ by ∀X (∀x(X(x)→C(x))→ θ), and
• replacing every atom A(x¯) by θA(x¯), where θA is the first-order formula that defines
A in (G,Aux).
It clearly holds that (G,Aux) |= χ′ ⇔ B |= ψ, and by Theorem 4.7 also (G,Aux) |= χ′ ⇔
G |= ϕ.
54
4.1 A dynamic version of Courcelle’s Theorem
We now define a relation Sub that encodes subsets of C. We observe that C is of
size at most b logn for some b ∈ N. Thus a subset C ′ of C can be represented by a
tuple (a1, . . . , ab) of nodes, where an element c ∈ C is in C ′ if and only if c is the m-th
element of C with respect to the mapping defined by N , m = (` − 1) logn + j and
the j-th bit of a` is one. By Proposition 4.11 we finally obtain a first-order formula χ
such that (G,Aux,Sub) |= χ⇔ (G,Aux) |= χ′ ⇔ G |= ϕ. That means that a dynamic
program that maintains the auxiliary relations as intended can maintain the dynamic
query (MCϕ,∆E).
It thus remains to describe how the auxiliary relations can be initialised and updated.
The set C is initially the bag B(r) of the root of T plus one inner node v((r, r, r)), and S
contains the tuple v¯(r).
The relations Dτ are computed in dtree logn inductive steps, each of which can be
defined in first-order logic, and therefore this computation can be carried out in AC1,
thanks to IND[logn] = AC1. More precisely, the computation of the relations Dτ proceeds
inductively in a bottom-up fashion. It starts with triangles δ = (i0, i1, i2) for which T (δ)
has exactly one or two inner tree nodes (i.e., nodes different from i0, i1, i2). Since such
graphs G(δ) have at most 5` nodes, their type can be determined by a first-order formula.
Basically, all isomorphism types of such graphs and their respective MSO types can be
directly encoded into first-order formulas.
For larger triangles, several cases need to be distinguished. Here we explain the case of a
triangle δ = (i0, i1, i2), for which i0 has child nodes i′1 and i′2 such that i′1  i1 and i′2  i2
(cf., Figure 4.3). In this case, the type τ of (G(δ), v¯(δ)) can be determined from the types
τ1 of (G(δ1), v¯(δ1)) and τ2 of (G(δ2), v¯(δ2)), where δ1 = (i′1, i1, i1) and δ2 = (i′2, i2, i2),
and the type τ0 of the graph G0 that includes all edges of G(δ) that are not already in
G(δ1) or G(δ2). More precisely, τ0 is the type of (G0, v¯(i0), v¯(i1), v¯(i2), v¯(i′1), v¯(i′2)) and
G0 is the subgraph of G with node set V0 =
⋃{B(i0), B(i1), B(i2), B(i′1), B(i′2)} and all
edges from G[V0] that have at least one endpoint in B(i′1)∪B(i′2). These types are either
already computed or the graphs are of size at most 5` and their type can therefore be
determined by a first-order formula as before.
We make this more precise. We observe that (G(δ), v¯(δ)) can be composed from the
graphs (G0, v¯(i0), v¯(i1), v¯(i2), v¯(i′1), v¯(i′2)), (G(δ1), v¯(δ1)) and (G(δ2), v¯(δ2)) by first taking
the disjoint union of these graphs and afterwards fusing nodes according to the identities
induced by the additional constants. For both operations, the rank-d MSO type of the
resulting structure only depends on the rank-d MSO type(s) of the original structure(s),
see Theorem 2.2 and [Makowsky 2004, Proposition 3.6]. The type τ of (G(δ), v¯(δ)) is
therefore determined by a finite function f as τ = f(τ0, τ1, τ2), which can be directly
encoded into first-order formulas.
Finally, we describe how a dynamic program can maintain S,C, and N for logn many
changes. The relation Dτ is not adapted during the changes. Whenever an edge (u, v)
is inserted to or deleted from G, the nodes u and v are viewed as affected. For every
affected node u that is not yet in a bag stored in S, a special tree node is selected (in
some canonical way, e.g. always the smallest node with respect to ≤ is selected) such
that u ∈ B(j). Furthermore, if node i is the least common ancestor of j and another
special node it becomes special, as well. It is easy to see that when selecting j as a special
55
Chapter 4 Expressibility under single-tuple changes
node, at most one further node becomes special. The tuples v¯(j) and v¯(i) (if i exists) are
added to S, their elements are added to C, the identifier nodes in C for maximal clean
triangles are corrected, and N is updated accordingly.
Extension to optimisation problems
We build on the techniques introduced for the proof of Theorem 4.3 to show that also
MSO-definable optimisation problems can be maintained for graphs of bounded treewidth,
that is, to show Theorem 4.4. Again, it suffices to show (AC1, logn)-maintainability of
such a problem under single-edge changes. As before, a dynamic program that maintains a
dynamic optimisation problem (OPTϕ,∆E) defines a center C of the graph that includes
one special bag for each node affected by the logn changes. For each petal Di associated
with the center, the dynamic program not only stores whether the subgraph induced
by the petal has some MSO type τ , but it maintains for each such type τ a collection
(B1, . . . , Bm) of subsets of Di \ C such that (1) the subgraph induced by the petal and
expanded by the relations B1, . . . , Bm has type τ , and (2) this collection is minimal with
this property with respect to the sum ∑mi=1 |Bi|.
A first-order formula can then compute, given a target MSO type for each petal that
together imply that the expanded graph satisfies ϕ, the minimum achievable overall sum
of the sizes |Bi|, and give the corresponding answer for the choice of MSO types that
gives the optimal result.
Proof (of Theorem 4.4). We only prove the special case of m = 1, the extension to the
general case is straightforward. Let ϕ(X) be an MSO formula of quantifier rank d. The
proof of Theorem 4.3 shows how one can obtain a dynamic program that (AC1, logn)-
maintains under single-edge changes the model checking problem MCψ for ψ
def= ∃X ϕ.
We adapt this proof, and reuse its notation, in order to obtain such a dynamic program
for (OPTϕ,∆E), using almost the same auxiliary relations. Together with Corollary 3.4
and Proposition 4.10, the result follows.
In the following, we sketch the proof idea. We consider ϕ to be an MSO sentence over
the signature {E,X}. Let G+X = (V,E,X) be an arbitrary expansion of a graph G
with a center C of connection width w, for some constant w. By Theorem 4.7 there
is a number d′ and an MSO sentence ψ such that for every B+X ∈ S(G+X , C, w, d′)
it holds that G+X |= ϕ if and only if B+X |= ψ. So, the formula ψ uses the type
information on the petals provided by B+X as well as G+X [C] directly to check whether
the relation X represents a feasible solution of the problem OPTϕ. In the proof of
Theorem 4.3 we explained how to obtain a first-order formula χ from ψ such that
(G,Aux,Sub) |= χ⇔ B |= ψ, for the auxiliary database Aux maintained by the dynamic
program constructed in the proof of Theorem 4.3 and a relation Sub encoding subsets
of C.
Let B ∈ S(G,C,w, d′+1) be an MSO indicator structure for G. Our goal is to maintain
some relations that augment the type information provided by B such that a formula χ′
similar to χ can “guess” a relation X, check that it is a feasible solution, compute its
size and verify that no feasible solution of smaller size exists. Of course, a relation X of
56
4.1 A dynamic version of Courcelle’s Theorem
unrestricted size cannot be quantified in first-order logic, even in the presence of Sub,
but we will see that the restriction of X to C and the type information on the petals can
be quantified, which is sufficient for our purpose.
We now give the details of the construction. The structure B contains relations Rτ
such that ui1 ∈ Rτ if and only the rank-(d′ + 1) MSO type of (Ai, u¯i) over schema
σ = {E, c1, . . . , c3`+1} is τ , where the subgraph Ai over universe Di and the tuple u¯i
are defined as before in this subsection. We say that a rank-d′ type τ ′ over schema
σ+X
def= σ ∪ {X} can be realised in (Ai, u¯i) by a set Ai ⊆ Di, if τ ′ is the rank-d′ MSO
type of (Ai, u¯i, Ai). If (Ai, u¯i) has rank-(d′ + 1) MSO type τ , the existence of such a set
is equivalent to the statement ∃X ατ ′ ∈ τ . We note that τ ′ already determines whether
uij ∈ Ai shall hold, for each constant uij from the tuple u¯i.
The dynamic program maintains relations #Rτ ′ and Qτ ′ , for each rank-d′ MSO type
over σ+X . The relations #Rτ ′ give the minimal size of a set that realises the type τ ′.
So, if τ ′ can be realised in (Ai, u¯i) by some set A, and s is the minimal size of such a
set, then #Rτ ′ shall contain the tuple (ui1, vs), where vs is the (s+ 1)-th element4 with
respect to ≤. Furthermore, for the lexicographically minimal set A of this kind and size s,
Qτ ′ shall contain all tuples (ui1, a), where a ∈ A.
We construct a first-order formula χ′ from χ that is able to define an optimal solution X
forOPTϕ from (G,Aux,Sub) expanded by the relations #Rτ ′ andQτ ′ . First, this formula
quantifies for each rank-d′ MSO type τ ′ the set of identifiers ui1 such that X realises τ ′ in
(Ai, u¯i) and checks consistency: as for each node v ∈ C that appears in u¯i the type τ ′
already determines whether v ∈ X shall hold, the respective types need to agree for nodes
that appear in multiple petals. For each ui1 the assigned type also needs to be realisable
in the respective substructure (Ai, u¯i), which can be checked using the relations Rτ of B.
Using this information, χ′ can apply χ to check that the implied set X is a feasible
solution. With the help of #Rτ it can compute the size of X, as FO(≤,+,×) is able to
add up logarithmically many numbers [Vollmer 1999, Theorem 1.21] and C is only of
logarithmic size in |V |. Also χ′ checks that no other assignment of types τ ′ to identifier
nodes results in feasible solutions of smaller size. Finally, χ′ uses the relations Qτ ′ to
actually return an optimal solution X.
Building on the proof of Theorem 4.3, it remains to show that the additional auxiliary
relations #Rτ ′ and Qτ ′ can be initialised and maintained. Actually, we maintain similarly
defined relations #Dτ ′ and Fτ ′ , the relations #Rτ ′ and Qτ ′ are then first-order definable
by the dynamic program using these relations.
Let δ be a triangle such thatG(δ) has at least one inner node. Similar to the relationsDτ
used in the proof of Theorem 4.3, here a relation #Dτ ′ contains the tuple (v¯(δ), u) if
and only if (1) the rank-d′ MSO type τ ′ is realisable in (G(δ), v¯(δ)), and (2) u is the
(s+ 1)-th element with respect to ≤, where s is the minimal size of a set that realises
this type. Furthermore, for the lexicographically minimal set A of this kind and size s,
Fτ ′ contains all tuples (v¯(δ), a), where a ∈ A. It is clear that these relations suffice to
define the relations #Rτ ′ and Qτ ′ , given the other relations of the proof of Theorem 4.3.
4We ignore the case that the size could be as large as |V |, which can be handled by some additional
encoding.
57
Chapter 4 Expressibility under single-tuple changes
The proof of Theorem 4.3 can be extended to show that the initial versions of these
auxiliary relations can be computed in AC1. For the inductive step of this computation,
a type τ ′ realisable in a structure (G(δ), v¯(δ)) might be achievable by a finite number of
combinations of types of its substructures. Here, the overall size of the realising set for
X needs to be computed and the minimal solution needs to be picked. This is possible
by an FO(≤,+,×)-formula since the number of possible combinations is bounded by a
constant depending only on d and k.
The updates of the auxiliary relations are exactly as in the proof of Theorem 4.3. Since
Dτ needs no updates there, neither #Dτ ′ nor Fτ ′ do, here.
From the proof it is easy to see that a dynamic program can also maintain the size s
of an optimal solution, either implicitly as ∑mj=1 |Qj | for distinguished relations Qj , or
as {vs}. Additionally, it can easily be adapted for optimisation problems on weighted
graphs, where nodes and edges have polynomial weights in n.
4.2 An arity hierarchy for DynProp
Dynamic programs with quantifier-free first-order update formulas are surprisingly ex-
pressive, as we have already noted in the introduction of this chapter: the query Parity
is already in DynProp under changes of single bits, and many first-order expressible
queries can be maintained dynamically without the need of quantification. We start
this section with an example of such a query, partly because we want to construct a
non-trivial quantifier-free dynamic program explicitly and by this improve our intuition
on the mechanics of DynProp, and partly because we will need this particular result later
on.
Example 4.12. For fixed k ∈ N let in-deg-k be the unary query that, given a graph G,
returns the set of nodes with in-degree k. This query is easily definable in FO for
each k. We show here that in-deg-k can be maintained by a DynProp-program P under
single-edge changes of G.
The dynamic program we construct uses k-lists, a slight extension of the list technique
introduced in [Gelade et al. 2012]. The list technique was used in [Zeume and Schwentick
2015] to maintain emptiness of a unary relation U under insertions and deletions of
single elements with quantifier-free formulas. To this end a binary relation List which
encodes a linked list of the elements in U in the order of their insertion is maintained.
Additionally, two unary relations mark the first and the last element of the list. The key
insight is that a quantifier-free formula can figure out whether the relation U becomes
empty when an element a is deleted by checking whether a is both the first and the last
element of the list.
To maintain in-deg-k, the quantifier-free dynamic program P stores, for every
node v ∈ V , a list of all nodes u with (u, v) ∈ E, using a ternary relation List1.
More precisely, if u1, . . . , um are the in-neighbours of v then List1 contains the tu-
ples (v, uij , uij+1) where j1, . . . , jm is some permutation of {1, . . . ,m}. Additionally,
the program uses ternary relations List2, . . . ,Listk such that Listi describes paths of
58
4.2 An arity hierarchy for DynProp
length i in the linked list List1. For example, if (v, u1, u2), (v, u2, u3) and (v, u3, u4)
are tuples in List1, then (v, u1, u4) ∈ List3. The list List1 comes with 2k binary
relations First1, . . . ,Firstk,Last1, . . . ,Lastk that mark, for each v ∈ V , the first and
the last k elements of the list of in-neighbours of v, as well as with k + 2 unary relations
Is0, . . . , Isk, Is>k that count the number of in-neighbours for each v ∈ V up to k. We
call nodes u with (v, u) ∈ Firsti or (v, u) ∈ Lasti the i-first or the i-last element for v,
respectively.
Using these relations, the query can be answered easily: the result is the set of nodes v
with v ∈ Isk. We show how to maintain the auxiliary relations under insertions and
deletions of single edges, and assume for ease of presentation of the update formulas that
if a change insE(u, v) occurs then (u, v) /∈ E before the change, and a change delE(u, v)
only happens if (u, v) ∈ E before the change.
Insertions of edges When an edge (u, v) is inserted, then the node u needs to be
inserted into the list of v. This node u also becomes the last element of the list (encoded
by a tuple (v, u) ∈ Last1), and the i-last node u′ for v becomes the (i+ 1)-last one, for
each i < k. If only i elements are in the list for v before the change, for some i < k, then
u becomes the (i + 1)-first element for v. The number of elements in the list for v is
updated appropriately. The update formulas are as follows:
ϕListiinsE (u, v;x, y, z)
def= Listi(x, y, z) ∨
(
v = x ∧ Lasti(x, y) ∧ u = z
)
for i ∈ {1, . . . , k}
ϕLast1insE (u, v;x, y)
def=
(
v 6= x ∧ Last1(x, y)
) ∨ (v = x ∧ u = y)
ϕLastiinsE (u, v;x, y)
def=
(
v 6= x ∧ Lasti(x, y)
) ∨ (v = x ∧ Lasti−1(y)) for i ∈ {2, . . . , k}
ϕFirstiinsE (u, v;x, y)
def= Firsti(x, y) ∨
(
v = x ∧ u = y ∧ Isi−1(x)
)
for i ∈ {1, . . . , k}
ϕIs0insE (u, v;x)
def=
(
v 6= x ∧ Is0(x)
)
ϕIsiinsE (u, v;x)
def=
(
v 6= x ∧ Isi(x)
) ∨ (v = x ∧ Isi−1(x)) for i ∈ {1, . . . , k}
ϕ
Is>k
insE (u, v;x)
def= Is>k(x) ∨
(
v = x ∧ Isk(x)
)
Deletions of edges When an edge (u, v) is deleted, the hardest task for quantifier-free
update formulas is to determine whether, if the in-degree of v was at least k + 1 before
the change, the in-degree of v is now exactly k. To decide whether this is the case we use
that if an element u is the j-first and at the same time the j′-last element for v, then the
list for v contains exactly j + j′ − 1 elements. If u is removed from the list, j + j′ − 2
elements remain. So, using the relations Firstj and Lastj′ , the exact number m of
elements after the change can be determined, if m ≤ 2k − 2.
The relations Firsti (and, symmetrically, the relations Lasti) can be maintained
using the relations Listj : if the i′-first element u is removed from the list for v, u′
becomes the i-first element for i′ ≤ i ≤ k if (v, u, u′) ∈ Listi−i′+1. The update formulas
exploit these insights:
59
Chapter 4 Expressibility under single-tuple changes
ϕListidelE (u, v;x, y, z)
def= (v 6= x) ∧ Listi(x, y, z)
∨ (v = x ∧ u 6= y ∧ ∧
i′≤i
¬Listi′(x, y, u) ∧ Listi(x, y, z)
)
∨ (v = x ∧ ∨
j,j′
j+j′=i+1
Listj(x, y, u) ∧ Listj′(x, u, z)
)
for i ∈ {1, . . . , k}
ϕLastidelE (u, v;x, y)
def=
(
v 6= x ∧ Lasti(x, y)
)
∨
(
v = x ∧
∧
i′≤i
¬Lasti′(u) ∧ Lasti(y)
)
∨
(
v = x ∧
∨
i′≤i
(
Lasti′(u) ∧ Listi−i′+1(x, y, u)
))
for i ∈ {1, . . . , k}
ϕFirstidelE (u, v;x, y)
def=
(
v 6= x ∧ Firsti(x, y)
)
∨
(
v = x ∧
∧
i′≤i
¬Firsti′(u) ∧ Firsti(y)
)
∨
(
v = x ∧
∨
i′≤i
(
Firsti′(u) ∧ Listi−i′+1(x, u, y)
))
for i ∈ {1, . . . , k}
ϕIsidelE (u, v;x)
def=
(
v 6= x ∧ Isi(x)
)
∨ (v = x ∧ ∨
j,j′
j+j′−2=i
Firstj(x, u) ∧ Lastj′(x, u)
)
for i ∈ {0, . . . , k}
ϕ
Is>k
delE (u, v;x)
def=
(
v 6= x ∧ Is>k(x)
)
∨
(
v = x ∧ Is>k(x) ∧
∧
j,j′
j+j′−2=k
(¬Firstj(x, u) ∨ ¬Lastj′(x, u))) /
While DynProp includes many interesting dynamic queries, it still allows for general
inexpressibility results. With a result that combines both aspects, Gelade et al. [2012]
were able to characterise completely the expressive power of DynProp with respect to
Boolean queries on strings, that is, formal languages, under single-tuple changes: a
language L can be maintained in DynProp under changes of single positions precisely if
L is regular. It follows from the construction that quantifier-free dynamic programs only
need binary auxiliary relations to maintain a Boolean query on strings, simply because
every regular language can be maintained in binary DynProp under single-tuple changes.
In this section we show that a similar result for DynProp on graphs is not possible
and identify for every k ≥ 1 a Boolean graph query that can be maintained with k-ary
auxiliary relations in DynProp under single-tuple changes, but not with (k − 1)-ary
auxiliary relations. We say that DynProp has a strict arity hierarchy for this class of
dynamic queries.
Theorem 4.13. DynProp has a strict arity hierarchy with respect to Boolean graph
queries under single-tuple changes.
60
4.2 An arity hierarchy for DynProp
This theorem generalises multiple earlier results. Already Dong and Su [1998] prove
that DynFO has an arity hierarchy, and this result translates to DynProp, but the query
that separates k-ary DynProp from (k − 1)-ary DynProp is a k-ary query on structures
with (6k + 1)-ary input relations.5 Dong and Zhang [2000] reduce the arity of the input
relations to 3k + 1. An arity hierarchy for Boolean graph queries up to k = 3 was
observed by Zeume and Schwentick [2015], and we re-use this result for the proof of
Theorem 4.13. Zeume [2017] proves an arity hierarchy for DynProp if only insertions are
allowed, based on the query k-Clique, for different values of k. However, it is unknown
whether k-Clique can be maintained in DynProp if also deletions are allowed, for any
k ≥ 3.
The family of queries that witnesses the hierarchy for large arities consists of queries
that are evaluated over coloured graphs with schema σ def= {E,R}, that is, directed graphs
(V,E,R) with an additional unary relation R that encodes a set of (red-)coloured nodes.6
A node w of such a graph is said to be covered if there is a coloured node v ∈ R with
(v, w) ∈ E. The query OddCoveredNodes asks, given a coloured graph, whether the
number of covered nodes is odd.
We show that OddCoveredNodes cannot be maintained with quantifier-free up-
date formulas under single-tuple changes. A closer examination reveals the connection
between variants of this query and the arity structure of DynProp. Let k be a natural
number. The variant OddCoveredNodesdeg≤k of OddCoveredNodes asks whether
the number of covered nodes that additionally have in-degree at most k is odd. Note that
OddCoveredNodesdeg≤k is a query on general coloured graphs, not only on graphs
with bounded degree.
In the following subsections we show that (OddCoveredNodesdeg≤k,∆σ)7 witnesses
a strict arity hierarchy for DynProp, at least for arity at least 3. For this we first show in
Subsection 4.2.1 that OddCoveredNodesdeg≤k can actually be maintained with k-ary
auxiliary relations, for k ≥ 3. Then, in Subsection 4.2.2 we introduce the technique
that we will use in Subsection 4.2.3 to prove that OddCoveredNodesdeg≤k cannot be
maintained in (k − 1)-ary DynProp. In Subsection 4.2.4, we put the parts together.
We discuss in Subsection 4.2.5 how OddCoveredNodesdeg≤k can be maintained by
dynamic programs that are allowed to use quantifiers. This is in particular interesting
because OddCoveredNodes seems to be a minor generalisation of the Parity query,
and statically both queries can be expressed by first-order formulas that are allowed to
use a single parity quantifier. In the dynamic setting, OddCoveredNodes is provably
5In the FOIES setting of Dong and Su [1998] the query relation is not considered to be an auxiliary
relation and its arity is ignored for determining the maximal arity of a dynamic program. In our
setting of DynFO, it is a trivial statement that a k-ary query cannot be maintained in (k − 1)-ary
DynFO, because the k-ary query relation needs to be stored as an auxiliary relation. However, Zeume
[2015, Theorem 4.3.7] shows that a Boolean query that is very similar to the k-ary query used by
Dong and Su [1998], and in particular is also over a (6k+ 1)-ary input schema, separates k-ary DynFO
(DynProp) from (k − 1)-ary DynFO (DynProp).
6We note that the additional relation R is for convenience of exposition. All our results are also valid
for pure graphs: instead of using the relation R one could consider a node v coloured if it has a
self-loop (v, v) ∈ E.
7Remember that ∆σ denotes the set of all single-tuple changes over the schema σ = {E,R}.
61
Chapter 4 Expressibility under single-tuple changes
harder than Parity, and it remains open whether it can be maintained in DynFO.
4.2.1 Expressibility results for the arity hierarchy
We start by proving that OddCoveredNodesdeg≤k can be maintained in DynProp using
k-ary auxiliary relations. In Subsection 4.2.3 we show that this arity is optimal.
Proposition 4.14. For every k ≥ 3, (OddCoveredNodesdeg≤k,∆σ) is in k-ary
DynProp.
Proof. Let k ≥ 3 be some fixed natural number. We show how a DynProp-program P
can maintain (OddCoveredNodesdeg≤k,∆σ) using at most k-ary auxiliary relations.
The idea is as follows. Whenever a formerly uncoloured node v gets coloured, a certain
number c(v) of nodes become covered: v has edges to all these nodes, but no other
coloured node has. Because the number c(v) can be arbitrary, the program P necessarily
has to store for each uncoloured node v the parity of c(v) to update the query result.
But this is not sufficient. Suppose that another node v′ is coloured by a change and that,
as a result, a number c(v′) of nodes become covered, because they have an edge from v′
and so far no incoming edge from another coloured neighbour. Some of these nodes, say,
c(v, v′) many, also have an incoming edge from v. Of course these nodes do not become
covered any more when afterwards v is coloured, because they are already covered then.
So, whenever a node v′ gets coloured, the program P needs to update the (parity of
the) number c(v), based on the (parity of the) number c(v, v′). In turn, the (parity of
the) latter number needs to be updated whenever another node v′′ is coloured, using the
(parity of the) analogously defined number c(v, v′, v′′), and so on.
It seems that this reasoning does not lead to a construction idea for a dynamic program,
as information for more and more nodes needs to be stored, but observe that only those
covered nodes are relevant for the query that have in-degree at most k. So, a number
c(v1, . . . , vk) does not need to be updated when some other node vk+1 gets coloured,
because no relevant node has edges from all nodes v1, . . . , vk+1.
We now present the construction in more detail. A node w is called active if its
in-degree in-deg(w) is at most k. Let A = {a1, . . . , a`} be a set of coloured nodes and
let B = {b1, . . . , bm} be a set of uncoloured nodes, with ` + m ≤ k. By N •◦G (A,B) we
denote the set of active nodes w of the coloured graph G whose coloured (in-)neighbours
are exactly the nodes in A and that have (possibly amongst others) the nodes in B as
uncoloured (in-)neighbours. So, w ∈ N •◦G (A,B) if (1) in-deg(w) ≤ k, (2) (v, w) ∈ E for
all v ∈ A ∪ B, and (3) there is no edge (v′, w) ∈ E from a coloured node v′ ∈ R with
v′ /∈ A. We omit the subscript G and just write N •◦(A,B) if the graph G is clear from
the context. The dynamic program P maintains the parity of |N •◦G (A,B)| for all such
sets A,B.
Whenever a change α = insR(v) colours a node v of G, the update is as follows. We
distinguish the three cases (1) v ∈ A, (2) v ∈ B and (3) v /∈ A ∪B. In case (1), the set
N •◦α(G)(A,B) equals the set N •◦G (A \ {v}, B ∪ {v}), and the existing auxiliary information
can be copied. In case (2), actually N •◦α(G)(A,B) = ∅, as B contains a coloured node.
The parity of the cardinality 0 of ∅ is even. For case (3) we distinguish two further cases.
62
4.2 An arity hierarchy for DynProp
If |A∪B| = k, no active node w can have incoming edges from every node in A∪B ∪{v}
as w has in-degree at most k, so N •◦α(G)(A,B) = N •◦G (A,B) and the existing auxiliary
information is taken over. If |A∪B| < k, then N •◦α(G)(A,B) = N •◦G (A,B)\N •◦G (A,B∪{v})
and P can combine the existing auxiliary information.
When a change α = delR(v) uncolours a node v of G, the necessary updates are
symmetrical. The case v ∈ A is similar to case (2) above: N •◦α(G)(A,B) = ∅, because
A contains an uncoloured node. The case v ∈ B is handled similarly as case (1)
above, as we have N •◦α(G)(A,B) = N •◦G (A ∪ {v}, B \ {v}). The third case v /∈ A ∪ B is
treated analogously as case (3) above, but in the sub-case |A ∪ B| < k we have that
N •◦α(G)(A,B) = N •◦G (A,B) ∪N •◦G (A ∪ {v}, B).
Edge insertions and deletions are conceptionally easy to handle, as they change the
sets N •◦(A,B) by at most one element. Given all nodes of A and B and the endpoints
of the changed edge as parameters, quantifier-free formulas can easily determine whether
this is the case for specific sets A,B.
We now present P formally. For every ` ≤ k+ 1 the program maintains unary relations
N` and N•` with the indented meaning that for a node w it holds w ∈ N` if in-deg(w) = `
and w ∈ N•` if w has exactly ` coloured in-neighbours. These relations can be maintained
as presented in Example 4.12, requiring some additional, ternary auxiliary relations. We
also use a relation Active def= N1 ∪ · · · ∪Nk that contains all active nodes with at least
one edge.
For every `,m ≥ 0 with 1 ≤ `+m ≤ k the programs maintains (`+m)-ary auxiliary
relations P`,m with the intended meaning that a tuple (a1, . . . , a`, b1, . . . , bm) is contained
in P`,m if and only if
• the nodes a1, . . . , a`, b1, . . . , bm are pairwise distinct,
• ai ∈ R and bj /∈ R for i ∈ [`], j ∈ [m], and
• the set N •◦(A,B) has an odd number of elements, where A = {a1, . . . , a`} and
B = {b1, . . . , bm}.
The following formula θ`,m checks the first two conditions:
θ`,m(x1, . . . , x`, y1, . . . , ym)
def=
∧
i6=j∈[`]
xi 6= xj ∧
∧
i6=j∈[m]
yi 6= yj ∧
∧
i∈[`]
R(xi) ∧
∧
i∈[m]
¬R(yi)
Of course, P also maintains the Boolean query relation Ans.
We now describe the update formulas of P for the relations P`,m and Ans, assuming
that each change actually alters the input graph, so, for example, no changes insE(v, w)
occur such that the edge (v, w) already exists.
Let ϕ⊕ ψ def= (ϕ ∧ ¬ψ) ∨ (¬ϕ ∧ ψ) denote the Boolean exclusive-or connector.
Colouring a node v A change insR(v) increases the total number of active, covered
nodes by the number of active nodes that have so far no coloured in-neighbour, but an
edge from v. That is, this number is increased by |N •◦(∅, {v})|. The update formula for
Ans is therefore
ϕAnsinsR(v)
def= Ans⊕ P0,1(v).
63
Chapter 4 Expressibility under single-tuple changes
We only spell out the more interesting update formulas for the relations P`,m, for
different values of `,m. These formulas list the conditions for tuples a¯ = a1, . . . , a` and
b¯ = b1, . . . , bm that N •◦({a1, . . . , a`}, {b1, . . . , bm}) is of odd size after a change. The
other update formulas are simple variants.
ϕ
P`,m
insR (v;x1, . . . , x`, y1, . . . , ym)
def=∨
i∈[`]
(
v = xi ∧ P`−1,m+1(x1, . . . , xi−1, xi+1, . . . , x`, y¯, v)
)
∨
( ∧
i∈[`]
v 6= xi ∧
∧
i∈[m]
v 6= yi ∧
(
P`,m(x¯, y¯)⊕ P`,m+1(x¯, y¯, v)
))
for ` ≥ 1, `+m < k
ϕ
P`,m
insR (v;x1, . . . , x`, y1, . . . , ym)
def=∨
i∈[`]
(
v = xi ∧ P`−1,m+1(x1, . . . , xi−1, xi+1, . . . , x`, y¯, v)
)
∨
( ∧
i∈[`]
v 6= xi ∧
∧
i∈[m]
v 6= yi ∧ P`,m(x¯, y¯)
)
for ` ≥ 1, `+m = k
Uncolouring a node v The update formulas for a change delR(v) are analogous to
the update formulas for a change insR(v) as seen above. Again we only present a subset
of the update formulas, the others are again easy variants.
ϕAnsdelR(v)
def= Ans⊕ P1,0(v)
ϕ
P`,m
delR(v;x1, . . . , x`, y1, . . . , ym)
def=∨
i∈[m]
(
v = yi ∧ P`+1,m−1(x¯, v, y1, . . . , yi−1, yi+1, . . . , ym)
)
∨
( ∧
i∈[`]
v 6= xi ∧
∧
i∈[m]
v 6= yi ∧
(
P`,m(x¯, y¯)⊕ P`+1,m(x¯, v, y¯)
))
for m ≥ 1, `+m < k
ϕ
P`,m
delR(v;x1, . . . , x`, y1, . . . , ym)
def=∨
i∈[m]
(
v = yi ∧ P`+1,m−1(x¯, v, y1, . . . , yi−1, yi+1, . . . , ym)
)
∨
( ∧
i∈[`]
v 6= xi ∧
∧
i∈[m]
v 6= yi ∧ P`,m(x¯, y¯)
)
for m ≥ 1, `+m = k
Inserting an edge (v, w) When an edge (v, w) is inserted, the number of active,
covered nodes can change at most by one. At first, a covered node w might become
inactive. This happens when w had in-degree k before the insertion. Or, an active node w
becomes covered. This happens if v is coloured and w had no coloured in-neighbour and
64
4.2 An arity hierarchy for DynProp
in-degree at most k − 1 before the change. The update formula for Ans is accordingly
ϕAnsinsE (v, w)
def= Ans⊕
((
Nk(w) ∧
∨
i∈[k]
N•i (w)
) ∨ (R(v) ∧N•0 (w) ∧ ∨
i∈[k]
Ni−1(w)
))
.
The necessary updated for relations P`,m are conceptionally very similar. We list the
conditions that characterise whether the membership of w in N •◦(A,B) changes, for a
set A = {x1, . . . , x`} of coloured nodes and a set B = {y1, . . . , ym} of uncoloured nodes.
• Before the change, w ∈ N •◦(A,B) holds, but not afterwards. This is either because
w becomes inactive or because the new edge (v, w) connects w with another coloured
node v. This case is expressed by the formula
ψ1
def=
∧
i∈[`]
E(xi, w) ∧N•` (w) ∧
∧
i∈[m]
E(yi, w) ∧
(
Nk(w) ∨R(v)
)
.
• Before the change, w ∈ N •◦(A,B) does not hold, but it does afterwards. Then w
needs to be active and to have an incoming edge from all but one node from A∪B,
and v is that one node. Additionally, w has no other coloured in-neighbours. The
following formulas ψ2, ψ3 express these conditions for the cases v ∈ A and v ∈ B,
respectively.
ψ2
def=
∨
i∈[`]
(
v = xi ∧
∧
j∈[`]\{i}
E(xj , w) ∧
∧
j∈[m]
E(yj , w) ∧N•`−1(w) ∧
∨
j∈[k]
Nj−1(w)
)
ψ3
def=
∨
i∈[m]
(
v = yi ∧
∧
j∈[`]\{i}
E(yj , w) ∧
∧
j∈[`]
E(xj , w) ∧N•` (w) ∧
∨
j∈[k]
Nj−1(w)
)
The update formula for P`,m is then
ϕ
P`,m
insE (v, w;x1, . . . , x`, y1, . . . , ym)
def= θ`,m(x¯, y¯) ∧
(
P`,m(x¯, y¯)⊕ (ψ1 ∨ ψ2 ∨ ψ3)
)
.
Deleting an edge (v, w) The ideas to construct the update formulas for changes
delE(v, w) are symmetrical to the constructions for changes insE(v, w). When an edge
(v, w) is deleted, the node w becomes active if its in-degree before the change was k + 1.
It is (still) covered, and then is a new active and covered node, if it has coloured in-
neighbours other than v. This is the case if w has at least two coloured in-neighbours
before the change, or if it has at least one coloured in-neighbour and v is not coloured.
On the other hand, if v was the only coloured in-neighbour of an active node w, this
node is not covered any more. The update formula for the query relation Ans is therefore
ϕAnsdelE (v, w)
def= Ans⊕
((
Nk+1(w) ∧
∨
i∈[k+1]
N•i (w) ∧ (¬R(v) ∨ ¬N•1 (w))
)
∨ (R(v) ∧N•1 (w) ∧ ∨
i∈[k]
Ni(w)
))
Regarding the update of relations P`,m, we distinguish the same cases as for insertions
insE(v, w) for a set A = {x1, . . . , x`} of coloured nodes and a set B = {y1, . . . , ym} of
uncoloured nodes.
65
Chapter 4 Expressibility under single-tuple changes
• Before the change, w ∈ N •◦(A,B) holds, but not afterwards. That means the
active node w has incoming edges from all nodes from A ∪ B, has no coloured
in-neighbours apart from the nodes in A, and v ∈ A ∪B. This is expressed by the
formula
ψ′1
def=
∧
i∈[`]
E(xi, w) ∧N•` (w) ∧
∧
i∈[m]
E(yi, w) ∧
∨
i∈[k]
Ni(w)
∧ ( ∨
i∈[`]
xi = v ∨
∨
i∈[m]
yi = v
)
.
• Before the change, w ∈ N •◦(A,B) does not hold, but it does afterwards. Then w
already is and stays connected to all nodes from A ∪ B, and either have degree
k + 1 and become active and/or loose an additional coloured in-neighbour v. The
following formula ψ′2 lists the possible combinations.
ψ′2
def=
∧
i∈[`]
(v 6= xi ∧ E(xi, w)) ∧
∧
i∈[m]
(v 6= yi ∧ E(yi, w))
∧
((
Nk+1(w) ∧ ¬R(v) ∧N•j (w)
)
∨ (Nk+1(w) ∧R(v) ∧N•j+1(w))
∨ ( ∨
i∈[k]
Ni(w) ∧R(v) ∧N•j+1(w)
))
Finally, the update formula for P`,m is
ϕ
P`,m
delE (v, w;x1, . . . , x`, y1, . . . , ym)
def= θ`,m(x¯, y¯) ∧
(
P`,m(x¯, y¯)⊕ (ψ′1 ∨ ψ′2)
)
.
4.2.2 Proving inexpressibility results for DynProp
Before we prove the inexpressibility results for the claimed arity hierarchy, we introduce
the technique we use in this thesis to prove that some query cannot be maintained in
DynProp using only k-ary auxiliary relations, for some k. This technique is a reformulation
of the proof technique of Zeume [2017], which combines techniques from Gelade et al.
[2012] and Zeume and Schwentick [2015] with insights regarding upper and lower bounds
for Ramsey numbers.
We state a sufficient condition under which a query q cannot be maintained in k-ary
DynProp under single-tuple changes. In Chapter 5, this condition will be generalised for
certain more complex change operations.
For a Boolean query q, the condition basically requires that for each collection B
of subsets of size k + 1 of a set {1, . . . , n} (for an arbitrary n larger than some n0),
there is a structure D and a change sequence β(x1, . . . , xk+1) such that (1) the elements
1, . . . , n cannot be distinguished by quantifier-free formulas, and (2) β(i1, . . . , ik+1)(D) is
a positive instance for q exactly if {i1, . . . , ik+1} ∈ B.
The actual statement of the condition is more general and allows non-Boolean queries,
tuples p¯1, . . . , p¯n instead of elements 1, . . . , n, and additional parameters for the change
sequence β.
66
4.2 An arity hierarchy for DynProp
The statement of the condition is also quite technical. Before we actually formulate the
general statement in Lemma 4.18, we introduce the proof technique of Zeume [2017] with
a rather simple example and re-prove that AlternatingReach cannot be maintained
in DynProp, even when an arbitrary initialisation is allowed. This result appears already
in [Gelade et al. 2012], although with a different construction. AlternatingReach is a
query on alternating graphs G = (V,X,U,E), that is, graphs (V,E) such that the node
set V = X unionmulti U is partitioned into a set X of existential and a set U of universal nodes.
The set AltReach(s) of nodes reachable from a node s ∈ V is the smallest set that
contains (1) the node s, (2) an existential node u ∈ X if there is a node v ∈ AltReach(s)
with (u, v) ∈ E, and (3) a universal node v ∈ U if v ∈ AltReach(s) holds for all nodes
v with (u, v) ∈ E. The query result AlternatingReach(G) contains all tuples (s, t)
such that t ∈ AltReach(s). We re-prove the following result, using the proof technique
of Zeume [2017, Theorem 5.3].
Proposition 4.15 (cf. [Gelade et al. 2012, Proposition 6.2]). Let k be any natural number.
(AlternatingReach,∆E) is not in k-ary DynProp, even with arbitrary initialisation.
In the following, we denote the set of all subsets of size k of a set A by
(A
k
)
. The
proof technique of Zeume [2017] uses the following combinatorial result which combines
Ramsey upper and lower bounds.
Lemma 4.16 ([Zeume 2017, Lemma 5.2]). Let k ∈ N be arbitrary and σ a k-ary schema.
Then there is a monotone and unbounded function f : N 7→ N and an n0 ∈ N such that
for every domain A larger than n0 the following conditions are satisfied:
(S1) For every σ-structure S over A and every linear order ≺ on A there is a subset A′
of A of size |A′| ≥ f(|A|) such that all ≺-ordered k-tuples over A′ have the same
quantifier-free type in S.
(S2) The set
( A
k+1
)
can be partitioned into two subsets B and C such that for every set
A′ ⊆ A of size |A′| ≥ f(|A|) there are B,C ∈ ( A′k+1) with B ∈ B and C ∈ C.
Intuitively speaking, this lemma says that for every large enough domain A there
is a 2-colouring of the (k + 1)-element subsets of the domain such that (1) in every
structure S over A one can find a “large” sub-domain A′ such that all (appropriately
ordered) k-tuples over A′ cannot be distinguished in S by quantifier-free formulas, but
(2) for each colour there is a (k + 1)-element subset of that colour that only contains
elements from A′. So, while k-tuples over A′ cannot be distinguished using the whole
structure, subsets of A′ of size k + 1 can be distinguished using a pre-defined 2-colouring.
We remark that the unboundedness of f was not explicitly stated in [Zeume 2017]
but it readily follows from its proof in [Zeume 2017], where f is chosen in Ω(log(k−1)(n)).
Monotonicity can be easily achieved.
Additionally, the proof technique utilises the Substructure Lemma as introduced by
Gelade et al. [2012] and further developed by Zeume and Schwentick [2015]. This
lemma formalises the insight that for DynProp programs, whether a tuple c¯ is included
in an auxiliary relation after a (single-tuple) change of a tuple d¯ only depends on the
elements contained in these two tuples. Suppose, we have two isomorphic states S and
67
Chapter 4 Expressibility under single-tuple changes
S ′ of a DynProp program, so, input and auxiliary structures are isomorphic via some
isomorphism pi. A consequence of this insight is that after applying some changes δ(d¯)
and δ(pi(d¯)), respectively, the updated states are still isomorphic vie pi.
We make this more formal now. Let pi be an isomorphism from a structure D to a
structure D′. Two changes α = δ(a¯) on D and α′ = δ′(b¯) on D′ are said to be pi-respecting
if δ = δ′ and b¯ = pi(a¯). Two sequences β = α1 · · ·αm and β′ = α′1 · · ·α′m of changes are
pi-respecting if αi and α′i are, for every i ≤ m.
Lemma 4.17 (Substructure Lemma, [Zeume and Schwentick 2015]). Let P be a DynProp-
program and let S and S ′ be states of P with domains S and S′. Further let A ⊆ S and
A′ ⊆ S′ such that S[A] and S ′[A′] are isomorphic via pi. Then Pβ(S)[A] and Pβ′(S ′)[A′]
are isomorphic via pi for all pi-respecting sequences β, β′ of single-tuple changes on A
and A′.
Having the prerequisites in place, we now prove Proposition 4.15, using a construction
very similar to the proof of [Zeume 2015, Proposition 4.1.19].
Proof (of Proposition 4.15). Assume, towards a contradiction, that there is a DynProp
program P with k-ary auxiliary relations over schema σaux that maintains Alternating-
Reach. Let σ = σaux ∪ {c1, c2}, where c1, c2 are additional constant symbols, and let f
and n0 be as guaranteed by Lemma 4.16 applied to k and σ. Let n be a number with
n ≥ n0, let A = {1, . . . , n, s, t} and let ≺ be the order 1 ≺ · · · ≺ n ≺ s ≺ t on A. Let
B, C be the partition of ( Ak+1) guaranteed to exist by Property (S2) of Lemma 4.16.
We consider an alternating graph G = (V,X,U,E) with node set V = A∪( Ak+1)∪{s, t}.
The set U of universal nodes consists of the elements from
( A
k+1
)
, the remaining nodes
constitute the set X of existential nodes. Let S = (D, Aux) be the state of P that is
obtained by inserting the edges
• (S, i) for each set S ∈ ( Ak+1) and each i ∈ S and
• (t, B) for each B ∈ B
in some arbitrary order, starting from an empty input structure and an arbitrarily
initialised auxiliary structure. See Figure 4.7 for a sketch.
Let Aux′ be the σ-expansion of Aux where the constant symbols c1, c2 are interpreted
by s, t, respectively. By Condition (S1) of Lemma 4.16 there is a subset A′ ⊆ A of size
|A′| ≥ f(|A|) such that all ≺-ordered k-tuples over A′ have the same type in Aux′. It
must hold s, t /∈ A′, as tuples involving constants cannot have the same type as tuples
without constants. By Condition (S2), there are B,C ∈ ( A′k+1) with B ∈ B and C ∈ C.
Let B = {i1, . . . , ik+1} and C = {i′1, . . . , i′k+1}, with ij ≺ ij+1 and i′j ≺ i′j+1 for all j ≤ k,
and let β and β′ be the change sequences that insert the edges (i1, s), . . . , (ik+1, s) and
(i′1, s), . . . , (i′k+1, s) in that order, respectively.
Observe that the tuple (s, t) is in the query result for β(D), but not in the query
result for β′(D). However, P gives the same answer in both cases by Lemma 4.17,
as the substructures induced by (i1, . . . , ik+1, s, t) and by (i′1, . . . , i′k+1, s, t) are clearly
isomorphic and the change sequences β and β′ respect this isomorphism. So, contradicting
the assumption, P does not maintain AlternatingReach.
68
4.2 An arity hierarchy for DynProp
t
s
(
A
k+1
) B
CB = {i1, i2, i3, . . . , ik+1}
C = {i′1, i′2, i′3, . . . , i′k+1}
A i1 i2 i3 ik+1· · · · · ·i′1 i′2 i′3 i′k+1
Figure 4.7: A sketch of the graph from the proof of Proposition 4.15. Existential nodes
are filled gray, universal nodes are white. The edges inserted by the change
sequence β are dashed.
We now state and prove a lemma that serves as an abstraction for proofs like the proof
of Proposition 4.15. It encapsulates the applications of Lemma 4.16 and Lemma 4.17,
which is the same for every proof of this kind, from the problem specific construction of
the input structure D and the change sequences β, β′ as in the proof of Proposition 4.15
and sketched in Figure 4.7.
In the following, we denote the set {1, . . . , n} by [n] and write (D, a¯) ≡0 (D, b¯) if a¯ and
b¯ have the same length and agree on their quantifier-free type in D.
Lemma 4.18. Let q be an m-ary σ-query. Then (q,∆σ) is not in k-ary DynProp, even
with arbitrary initialisation, if there are natural numbers `, r and, for each n0 ∈ N, a
natural number n ≥ n0, such that for all subsets B ⊆
( [n]
k+1
)
there exist
• a σ-structure D, a set P = {p11, . . . , p`1, . . . , p1n, . . . , p`n} of distinct elements, and
tuples a¯ = a1, . . . , am and u¯ = u1, . . . , ur, such that
– P and {a1, . . . , am, u1, . . . , ur} are disjoint and contained in the domain of D,
– (D, a¯, u¯, p¯i1 , . . . , p¯ik+1) ≡0 (D, a¯, u¯, p¯j1 , . . . , p¯jk+1) for all strictly increasing se-
quences i1, . . . , ik+1 and j1, . . . , jk+1 over [n], where p¯i = (p1i , . . . , p`i), and
• a sequence β(x11, . . . , x`1, . . . , x1k+1, . . . , x`k+1, y1, . . . , yr) = α1(x¯i1 , y¯) · · ·αs(x¯is , y¯) of
∆σ-changes, where no change αj uses variables xtij and x
t′
ij′
with ij 6= ij′ as
parameters,
such that for all strictly increasing sequences i1, . . . , ik+1 over [n] it holds that
a¯ ∈ q(β(p¯i1 , . . . , p¯ik+1 , u¯)(D)) ⇐⇒ {i1, . . . , ik+1} ∈ B.
It should be noted that, although D can contain further elements, the parameters in
the considered change sequences are only instantiated by elements from P ∪ {u1, . . . , ur}.
69
Chapter 4 Expressibility under single-tuple changes
Proof (of Lemma 4.18). Let (q,∆σ) be a dynamic query where q is an m-ary σ-query.
Suppose q satisfies the antecedent of the lemma, and let ` and r be numbers such that
the conditions can be satisfied. Towards a contradiction, let us assume that there is a
DynProp program P over a k-ary auxiliary schema σaux that maintains (q,∆σ).
Let σ be the schema {Rw | R ∈ σaux, w ∈ {0, 1, . . . , `}Ar(R)}∪{s1, . . . , sm}∪{t1, . . . , tr},
where the si and tj are additional constant symbols and the arity of each relation
symbol Rw equals the arity of R. Let f and n0 be as guaranteed to exist by Lemma 4.16
applied to k and σ, and let n with n ≥ n0 be a number such that the conditions of the
lemma can be satisfied.
Let A def= [n]∪{a1, . . . , am, u1, . . . , ur} and let B, C ⊆
( A
k+1
)
be the partition guaranteed
to exist by property (S2) of Lemma 4.16.
Let D, P, a¯, u¯ and β be as in the condition of the statement of the lemma. Let Aux
be the auxiliary structure over schema σaux that is obtained by P when the tuples of D
are inserted in some arbitrary order to an initially empty structure, after an arbitrary
initialisation. Our goal is to identify sequences p¯1, . . . , p¯k+1 and p¯′1, . . . , p¯′k+1 of tuples
over P such that Lemma 4.16 and Lemma 4.17 imply that P gives the same answer when
β(p¯1, . . . , p¯k+1, u¯) and β(p¯′1, . . . , p¯′k+1, u¯) are applied starting from the state (D, Aux),
but the actual query results differ on the tuple a¯. As a consequence, P cannot maintain
(q,∆σ).
To be able to apply Lemma 4.16, we now construct a σ-structure S over domain
A that encodes the restriction of Aux to domain D def= P ∪ {a1, . . . , am, u1, . . . , ur},
intuitively by encoding the tuples p¯1, . . . , p¯n by numbers 1, . . . , n. We encode in the
relation names which element of a tuple is actually represented by the number. Let
R ∈ σaux and let (d1, . . . , dk′) ∈ RAux be a tuple with elements from D. We let
flat(d1, . . . , dk′) = (d′, . . . , d′k′) be the tuple with d′i = di if di ∈ D \ P and d′i = j if
di = pej for some e ≤ `. We set flat(d1, . . . , dk′) ∈ RSw if and only if for w def= w1 · · ·wk′ it
holds wi = 0 if di ∈ D \ P and wi = e if di = pej for some j ≤ n. The additional constant
symbols s1, . . . , sm, t1, . . . , tr of S are interpreted by a1, . . . , am, u1, . . . , ur.
Let A be ordered in the natural way by the linear order ≺ such that all elements from
[n] precede all other elements. Let A′ be the subset of A as guaranteed by property (S1)
of Lemma 4.16. Since all ordered tuples of A′ have the same quantifier-free type in S,
they cannot involve constants and thus A′ ⊆ [n]. Now property (S2) of Lemma 4.16
guarantees that there exist sets B,C ∈ ( A′k+1) with B ∈ B and C ∈ C. Let i1 < · · · < ik+1
and j1 < · · · < jk+1 be such that B = {i1, . . . , ik+1} and C = {i′1, . . . , i′k+1}. Let β1 and
β2 be the change sequences β(p¯i1 , . . . , p¯ik+1 , u¯) and β(p¯i′1 , . . . , p¯i′k+1 , u¯), respectively. It
follows from the construction that a¯ ∈ q(β1(D)), but a¯ /∈ q(β2(D)).
To reach the desired contradiction, we now prove that P gives the same query an-
swer for both change sequences, starting from the state (D, Aux). For this, using
Lemma 4.17 it is sufficient to observe that the substructures of (D, Aux) induced by
v¯1
def= (a¯, u¯, p¯i1 , . . . , p¯ik+1) and by v¯2
def= (a¯, u¯, p¯i′1 , . . . , p¯i′k+1) are isomorphic via the canon-
ical isomorphism pi that maps the i-th component of v¯1 to the i-th component of v¯2, for
every i. The change sequences clearly respect this isomorphism.
The substructures of D induced by v¯1 and v¯2 are isomorphic, as (D, v¯1) ≡0 (D, v¯2)
70
4.2 An arity hierarchy for DynProp
holds by the choice of the objects. It remains to show that also (Aux, v¯1) ≡0 (Aux, v¯2)
holds. Suppose, towards a contradiction, that this is not the case. As the {=}-type
of the tuples is clearly the same, that means there is a relation RAux and a tuple w¯1
over elements of v¯1 such that w¯1 ∈ RAux ⇔ pi(w¯1) /∈ RAux. Because R is at most
k-ary, w¯1 contains at most k elements a1, . . . , ak. Observe that if ai = pej for some i, j, e,
then pi(ai) = pej′ for some j′. It follows that there is a w ∈ {0, . . . , `}Ar(R) such that
flat(w¯1) ∈ RSw ⇔ flat(pi(w¯1)) /∈ RSw, and consequently there are ≺-ordered k-tuples
over [n] that have different quantifier-free types in S, contradicting Property (S1) of
Lemma 4.16. This concludes the proof.
4.2.3 Inexpressibility results for the arity hierarchy
In this subsection we prove that k-ary auxiliary relations are not sufficient to maintain
OddCoveredNodesdeg≤k+1.
Proposition 4.19. For every k ≥ 0, (OddCoveredNodesdeg≤k+1,∆σ) is not in k-ary
DynProp, even with arbitrary initialisation.
In the following, for a graph G = (V,E) and some set X ⊆ V of nodes we write N→(X)
for the set {v | ∃u ∈ X : E(u, v)} of out-neighbours of nodes in X. For singleton sets
X = {x} we just write N→(x) instead of N→({x}).
Proof. Let k ≥ 0 be a fixed natural number. We apply Lemma 4.18 to show that
(OddCoveredNodesdeg≤k+1,∆σ) is not in k-ary DynProp.
The basic proof idea is simple. Given a collection B ⊆ ( [n]k+1), we construct a graph
G = (V,E) with distinguished nodes P = {p1, . . . , pn} ⊆ V such that (1) each node has
in-degree at most k + 1 and (2) for each B ∈ ( [n]k+1) the set N→({pi | i ∈ B}) is of odd
size if and only if B ∈ B. Then applying a change sequence α which colours all nodes pi
with i ∈ B to G results in a positive instance of OddCoveredNodesdeg≤k+1 if and only
if B ∈ B. An invocation of Lemma 4.18 yields the intended lower bound.
It remains to construct the graph G. Let S be the set of all non-empty subsets of [n] of
size at most k+1. We choose the node set V of G as the union of P and S. Only nodes in
P will be coloured, and only nodes from S will be covered. A first attempt to realise the
idea mentioned above might be to consider an edge set Ek+1
def= {(pi, B) | B ∈ B, i ∈ B}:
then, having fixed some set B ∈ B, the node B becomes covered whenever the nodes pi
with i ∈ B are coloured. However, also some other nodes B′ 6= B will be covered, namely
if B′ ∩ B 6= ∅, and the number of these nodes influences the query result. We ensure
that the set of nodes B′ 6= B that are covered by {pi | i ∈ B} is of even size, so that
the parity of |N→({pi | i ∈ B})| is determined by whether B ∈ B holds. This will be
achieved by introducing edges to nodes
([n]
i
) ∈ S for i ≤ k such that for every subset P ′
of P of size at most k the number of nodes from S that have an incoming edge from all
nodes from P ′ is even. It follows that whenever k + 1 nodes pi1 , . . . , pik+1 are marked,
the number of covered nodes is odd precisely if there is one node in S that has an edge
from all nodes pi1 , . . . , pik+1 , which is the case exactly if {i1, . . . , ik+1} ∈ B.
71
Chapter 4 Expressibility under single-tuple changes
B
p1
p2
p3
p4
{1}
{2}
{3}
{4}
{1, 2}
{1, 3}
{1, 4}
{2, 3}
{2, 4}
{3, 4}
{1, 2, 3}
{1, 2, 4}
{1, 3, 4}
{2, 3, 4} ([n]
3
)
([n]
2
)
([n]
1
)
Figure 4.8: Example for the construction in the proof of Proposition 4.19, with k = 2
and n = 4.
We now make this precise. We have m = 0 and choose ` = 1, r = 0 in the statement of
Lemma 4.18. Let n be arbitrary and let P = {p1, . . . , pn}. For a set X ⊆ [n] we write
PX for the set {pi | i ∈ X}.
The structure D we construct consists of a coloured graph G = (V,E,R) with nodes
V
def= P ∪ S, where S def= ([n]1 ) ∪ · · · ∪ ( [n]k+1), and initially empty set R def= ∅ of coloured
nodes. The edge set E = E1 ∪ · · · ∪ Ek+1 is constructed iteratively in k + 1 steps. We
first define the set Ek+1 and define the set Ej based on the set E>j
def= ⋃k+1j′=j+1Ej′ .
The set Ek+1 consists of all edges (pi, B) such that B ∈ B and i ∈ B. For the
construction of the set Ej with j ∈ {1, . . . , k} we assume that all sets Ej′ with j′ > j
have already been constructed. Let X ∈ ([n]j ) be a set and let m be the number of nodes
Y ∈ S for which there are already edges (pi, Y ) ∈ E>j for all nodes pi in PX . If m is
odd, then there is so far an odd number of nodes from S that have an incoming edge
from all pi ∈ PX . As we want this number to be even, we let Ej contain edges (pi, X)
for all i ∈ X. If m is even, no edges are added to Ej . See Figure 4.8 for an example of
this construction. Note that for each X ∈ ([n]i ), for i ∈ {1, . . . , k + 1}, the degree of X
in G is at most i, and therefore also at most k + 1.
We now show that for a set B ∈ ( [n]k+1) the cardinality of N→(PB) is indeed odd if and
only if B ∈ B. This follows by an inclusion-exclusion argument. For a set X ⊆ [n] the
set N→(PX) contains all nodes with an incoming edge from a node in PX . It is therefore
equal to the union ⋃i∈X N→(pi). When we sum up the cardinalities of these sets N→(pi),
any node in N→(PX) with edges to both pi and pj , for numbers i, j ∈ X, is accounted for
twice. Continuing this argument, the cardinality of N→(X) can be computed as follows.
72
4.2 An arity hierarchy for DynProp
∣∣N→(PX)∣∣ = ∑
i∈X
∣∣N→(pi)∣∣− ∑
i,j∈X
i<j
∣∣N→(pi) ∩N→(pj)∣∣+ · · ·+ (−1)|X|−1∣∣ ⋂
i∈X
N→(pi)
∣∣
By construction of G, the set ⋂i∈Y N→(pi) is of even size, for all sets Y ⊆ [n] of size
at most k. Consequently, for each X ∈ ( [n]k+1) the parity of ∣∣N→(PX)∣∣ is determined by
the parity of
∣∣ ⋂
i∈X N→(pi)
∣∣, the last term in the above equation. Only the node X can
possibly have incoming edges from all nodes pi in PX , and these edges exist if and only
if X ∈ B.
Let α(x1), . . . , α(xk+1) be the change sequence insR(x1), . . . , insR(xk+1) that colours
the nodes x1, . . . , xk+1. Let B ∈
( [n]
k+1
)
be of the form {i1, . . . , ik+1} with i1 < · · · < ik+1.
The change sequence αB
def= α(pi1) · · ·α(pik+1) results in a graph where the set of coloured
nodes is exactly PB. As all nodes in N→(PB) have degree at most k + 1 and the set
N→(PB) is of odd size exactly if B ∈ B, we have that αB(D) is a positive instance of
OddCoveredNodesdeg≤k+1 if and only if B ∈ B.
It follows from Lemma 4.18 that (OddCoveredNodesdeg≤k+1,∆σ) cannot be main-
tained in k-ary DynProp.
Immediately from Proposition 4.19 we get the following general inexpressibility result.
Corollary 4.20. (OddCoveredNodes,∆σ) is not in DynProp.
4.2.4 Obtaining the arity hierarchy
We can now combine the results of the previous subsections and prove the main theorem
of this section.
Proof (of Theorem 4.13). For every k ≥ 1 we give a Boolean graph query that can be
maintained in k-ary DynProp but not in (k − 1)-ary DynProp under single-tuple changes.
For k ≥ 3, we choose the dynamic query (OddCoveredNodesdeg≤k,∆σ) which can
be maintained in k-ary DynProp but not in (k− 1)-ary DynProp, by Proposition 4.14 and
Proposition 4.19.
For k = 2, already Zeume and Schwentick [2015] prove that the query s-t-TwoPath
which asks whether there exists a path of length 2 between some distinguished vertices s
and t separates unary DynProp from binary DynProp [Zeume and Schwentick 2015,
Proposition 4.10].
For k = 1, we consider the Boolean graph query ParityDegreeDiv3 that asks
whether the number of nodes whose degree is divisible by 3 is odd. This query can easily
be maintained in DynProp using only unary auxiliary relations. In a nutshell, a dynamic
program can maintain for each node v the degree of v modulo 3. So, it maintains three
unary relations M0,M1,M2 with the intention that v ∈Mi if the degree of v is congruent
to i modulo 3. These relations can easily be updated under edge insertions and deletions.
Similar as for Parity, a bit P that gives the parity of |M0| can easily be maintained.
73
Chapter 4 Expressibility under single-tuple changes
On the other hand, ParityDegreeDiv3 cannot be maintained in DynProp using
nullary auxiliary relations. Suppose, towards a contradiction, that it can be maintained
by some dynamic program P that only uses nullary auxiliary relations, and consider
an input instance that contains five node V = {u1, u2, v1, v2, v3} as well as edges E =
{(u1, v1), (u1, v2), (u2, v1)}. No matter the auxiliary structure, P needs to give the same
answer after the changes α1
def= insE(u1, v3) and α2
def= insE(u2, v3), as it cannot
distinguish these tuples using quantifier-free first-order formulas. But α1 leads to a
yes-instance for ParityDegreeDiv3, and α2 does not. So, P does not maintain
ParityDegreeDiv3.
4.2.5 Allowing quantification
We have already mentioned that DynFO also has a strict arity hierarchy, witnessed by a
family of queries over input schemas of growing arity [Dong and Su 1998; Zeume 2015]. It
is an open question whether DynFO has an arity hierarchy with respect to graph queries.
In the following we see that the queries OddCoveredNodesdeg≤k do not witness an
arity hierarchy for dynamic programs that are allowed to use quantification: we prove
that (OddCoveredNodesdeg≤k,∆σ) is in binary DynFO for any k. We actually prove
that a linear order on the active domain, which can be maintained easily in DynFO
[Etessami 1998], is the only binary auxiliary relation we need.
Proposition 4.21. In the presence of a linear order, (OddCoveredNodesdeg≤k,∆σ)
is in unary DynFO, for every k ∈ N.
An intuitive reason why quantifier-free dynamic programs need auxiliary relations of
growing arity to maintain OddCoveredNodesdeg≤k is that for checking whether some
change, for instance the colouring of a node v, is “relevant” for some node w, it needs to
have access to all of w’s “important” neighbours. Without quantification, the only way
to do this is to explicitly list them as elements of the tuple for which the update formula
decides whether to include it in the auxiliary relation.
With quantification and a linear order, sets of neighbours can be defined more easily,
if the total number of neighbours is bounded by a constant. Let us fix a node w
with at most k (in-)neighbours, for some constant k. Thanks to the linear order, the
neighbours can be distinguished as first, second, . . . , k-th neighbour of w, and any subset
of these nodes is uniquely determined and can be defined in FO by the node w and a set
I ⊆ {1, . . . , k} that indexes the neighbours. With this idea, the proof of Proposition 4.14
can be adjusted appropriately for Proposition 4.21.
Proof sketch (of Proposition 4.21). Let k ∈ N be some constant. Again, we call a node
active if its in-degree is at most k. We sketch a dynamic program that uses a linear order
on the nodes and otherwise at most unary auxiliary relations.
Let I be a non-empty subset of {1, . . . , k}, and let w be an active node with at least
max(I) in-neighbours. The set N←I (w) of I-indexed in-neighbours of w includes a node v
if and only if (v, w) is an edge in the input graph and v is the i-th in-neighbour of w
with respect to the linear order, for some i ∈ I. The following notation is similar as in
74
4.2 An arity hierarchy for DynProp
N←{1,2,3}(w3)
N •◦(N←{1,2,3}(w3))
v1 v2 v3 v4 v5 v6 v7
w1 w2 w3 w4 w5
Figure 4.9: An illustration of the notation used in the proof of Proposition 4.21. The
set N •◦G (N←{1,2,3}(w3)) does not include w1, as there is no edge (v5, w1), and
it does not include w5, as there is an edge (v7, w5) for a coloured node
v7 6∈ N←{1,2,3}(w3).
the proof of Proposition 4.14. For a graph G and an arbitrary set C of (coloured and
uncoloured) nodes, we denote the set of active nodes that have an incoming edge from
every node in C and no coloured in-neighbour that is not in C by N •◦G (C). An example
for these notions is depicted in Figure 4.9.
For every I ⊆ {1, . . . , k} with I 6= ∅ we introduce an auxiliary relation PI with the
following intended meaning. An active node w with at least max(I) neighbours is in PI
if and only if (1) w has no coloured in-neighbours that are not contained in N←I (w), and
(2) the set N •◦G (N←I (w)) has odd size. Note that (1) implies that w ∈ N •◦G (N←I (w)).
An auxiliary relation PI basically replaces the relations P`,m with ` + m = |I| from
the proof of Proposition 4.14, and the updates are mostly analogous.
We explain how the query relation Ans and the relations PI are updated when a
modification to the input graph occurs. When a node v is coloured, the query relation
is only changed if v becomes the only coloured neighbour of an odd number of active
nodes. This is the case if and only if there actually is an active and previously uncovered
node w that v has an edge to and if w ∈ PI for the set I def= {i}, where i is the number
such that v is the i-th in-neighbour of w with respect to the linear order.
The update of a relation PI after the colouring of a node v is as follows. Let G be
the graph before the change is applied, and G′ the changed graph. Let w be any active
node. If v is an I-indexed in-neighbour of w, no change regarding w ∈ PI is necessary.
Otherwise, some nodes in N •◦G (N←I (w)) might now have a coloured neighbour v that
is not contained in N←I (w), and therefore are not contained in N •◦G′ (N←I (w)). Let w′
be such a node, that is, a node with an edge from v and every node in N←I (w), and
let I ′ be such that N←I′ (w′) = N←I (w) ∪ {v}. The parity of the number of nodes in
N •◦G (N←I (w)) \ N •◦G′ (N←I (w)) is odd if and only if w′ ∈ PI′ . This can be used to update
PI .
We do not present the updates for the remaining changes as they can be easily
constructed along the same lines.
It is easy to maintain a linear order on the non-isolated nodes of an input graph
75
Chapter 4 Expressibility under single-tuple changes
[Etessami 1998], which is all that is needed for the proof of Proposition 4.21. So,
OddCoveredNodesdeg≤k can also be maintained in DynFO without a predefined linear
order, at the expense of binary auxiliary relations.
Corollary 4.22. For every k ∈ N, (OddCoveredNodesdeg≤k,∆σ) is in binary DynFO.
Unfortunately we cannot generalise the proof technique from Proposition 4.21 for
OddCoveredNodesdeg≤k to OddCoveredNodes, but only to a similarly defined
variant OddCoveredNodesdeg≤logn that asks for the parity of the number of covered
nodes with in-degree at most logn. Here, n is the number of nodes of the graph.
Proposition 4.23. (OddCoveredNodesdeg≤logn,∆σ) is in binary DynFO, in the pres-
ence of a linear order and BIT.
Proof sketch. With the help of the linear order we identify the node set V of size n of
the input graph with the numbers {0, . . . , n− 1}, and use BIT to access the bit encoding
of these numbers. Any node v ∈ V then naturally encodes a set I(v) ⊆ {1, . . . , logn}:
i ∈ {1, . . . , logn} is contained in I(v) if and only if the i-th bit in the bit encoding of v
is 1.
The proof of Proposition 4.21 constructs a dynamic program that maintains unary rela-
tions PI with I ⊆ {1, . . . , k}, and w ∈ PI holds if w ∈ N •◦G (N←I (w)) and if |N •◦G (N←I (w))|
is odd. We replace these relations by a single binary relation P , with the intended meaning
that (v, w) ∈ P if w ∈ N •◦G (N←I(v)(w)) and if |N •◦G (N←I(v)(w))| is odd.
A dynamic program that maintains OddCoveredNodesdeg≤logn can then be con-
structed along the same lines as in the proof of Proposition 4.21.
In addition to a linear order, Etessami [1998] also shows how corresponding relations
addition and multiplication can be maintained for the active domain of a structure. As
BIT is first-order definable in the presence of addition and multiplication, and vice versa
[Immerman 1999], both a linear order and BIT on the active domain can be maintained,
still using only binary auxiliary relations. So, the variant ofOddCoveredNodesdeg≤logn
that considers n to be the number of non-isolated nodes, instead of the number of all
nodes, can be maintained in binary DynFO without assuming built-in relations. It is an
open problem whether OddCoveredNodes can be maintained in DynFO.
4.3 Outlook and bibliographic remarks
We briefly summarise the contributions of this chapter and state some open questions.
In Section 4.1 we have seen that a large class of queries, namely all MSO-definable
queries on graphs of bounded treewidth, can be maintained in DynFO under single-tuple
changes. Although this shows that a large subset of LOGSPACE-queries [Elberfeld et al.
2010] is in DynFO under single-tuple changes, it is still unknown whether all (some
variant of) domain independent queries from a complexity class C with AC0 ( C can be
maintained.
The AC0[2] query OddCoveredNodes introduced in Section 4.2 seems to be a good
candidate for research in this direction. We showed that OddCoveredNodesdeg≤k for
76
4.3 Outlook and bibliographic remarks
each k and evenOddCoveredNodesdeg≤logn can be maintained in DynFO with auxiliary
relations of small arities, but for the general query OddCoveredNodes, maintainability
is still unknown. This query demonstrates that static complexity and dynamic complexity
may behave very differently. Statically, Parity and OddCoveredNodes are very
similar, they ask for the parity of the size of a set that is definable by a quantifier-free
first-order formula and a formula with one existential quantifier, respectively. Both are
natural problems for the complexity class AC0[2] and can be solved by AC0[2] circuits
with a single “modulo 2” gate. Dynamically, the difference in complexity is striking.
Parity can be maintained easily in DynProp, while we do not even know membership in
DynFO for OddCoveredNodes.
If OddCoveredNodes cannot be maintained in DynFO under single-tuple changes,
and if we can actually prove this, then we can conclude that DynFO does not capture any
static complexity class C that includes AC0[2], which does not leave much open. On the
other hand, if we can actually maintain OddCoveredNodes, hopefully we can extract
from the dynamic program a strategy to dynamically maintain some greater class of AC0[2]
queries. Note also the relationship of this question to Chapter 5: OddCoveredNodes
under single-tuple changes can be interpreted as Parity under certain first-order definable
change operations.
We state some more open problems. For maintaining MSO-definable queries, the Mud-
dling technique helps to provide a tree decomposition, although a slightly outdated one
and with non-optimal width. Is it possible to maintain an up-to-date tree decomposition
for the input graph at hand? Note that the dynamic programs from Section 4.1 can
trivially be extended to maintain a tree decomposition of width O(logn): during the
logn changes that need to be processed, for each insertion of an edge (u, v) the dynamic
program selects bags Bu and Bv that contain u and v, respectively, adds u to all bags
on the path between Bu and the lowest common ancestor of Bu and Bv in the tree
decomposition, and analogously for v and Bv. Can we maintain for every graph with
treewidth k a tree decomposition of width ck, for some constant c? Even of width k?
Can we do this if k is restricted to 2?
A related question is whether one can maintain whether the input graph has treewidth
at most k, for a fixed k. Note that this is possible under the assumption that the input
graph at all times has treewidth at most k′, for some fixed constant k′ > k. This is
because for every k the class of graphs of treewidth at most k is characterised by a finite
list of forbidden minors8 [Robertson and Seymour 1990], one can express in MSO whether
a graph contains a fixed minor, and we can maintain this MSO property for graphs of
treewidth at most k′ by Theorem 4.3. However, we cannot actually construct a dynamic
program for arbitrary values of k, because the result of Robertson and Seymour [1990] is
non-constructive.
For the previous approach and also for the results in Section 4.1 we assume that the
treewidth of the input graph never exceeds a certain bound. Another open question
is whether this assumption can be weakened: can a dynamic program maintain MSO
8A graph G is a minor of a graph H if G can be constructed from H by deleting nodes and edges and
contracting edges.
77
Chapter 4 Expressibility under single-tuple changes
properties in the sense that it gives the correct answer whenever the input graph’s
treewidth is below a certain threshold, and does not give an answer otherwise? In
particular, can the dynamic program “recover” when the treewidth of the input graph
becomes “small” again? Note that the dynamic program from Theorem 4.3 achieves
this goal with a “delay” of logn changes: it can maintain an MSO-definable property for
logn changes after the input graph’s treewidth exceeds the bound, and it can answer the
query again when the treewidth has been below the bound for the last logn changes.
We have seen that MSO-definable queries can be maintained for graphs of bounded
treewidth. Can the Muddling technique be used to maintain interesting queries for
other classes of graphs, for example planar graphs? As already mentioned in Section 3.3,
preliminary results, obtained together with Samir Datta, Siddharth Iyer, Anish Mukherjee
and Thomas Zeume, suggest that an extension of Theorem 4.3 yields dynamic programs
that can maintain approximate solutions to certain MSO-definable optimisation problems
on planar graphs. The problems considered there are NP-hard, even when restricted
to planar graphs, so they probably cannot be maintained exactly in DynFO. Other
problems for planar graphs, like isomorphism [Datta et al. 2009], are in LOGSPACE; as
LOGSPACE ⊆ AC1, it is possible that the concept of (AC1, logn)-maintainability can be
applied.
The just mentioned graph isomorphism query is interesting also for other classes of
graphs. It can be maintained in DynFO under single-edge changes for trees [Etessami
1998]. Can this result be extended to graph classes of treewidth k ≥ 2?
The main result of the second part of this chapter is Theorem 4.13, an arity hierarchy
for DynProp under Boolean graph queries. This result underscores that Goal 4.2 might
be very hard to achieve in its full generality. In particular, we require stronger proof
techniques to show that certain dynamic graph queries cannot be maintained in DynProp.
Lemma 4.18 provides a relatively easy way of utilising the proof technique from Zeume
[2017], and it will in particular be used in Chapter 5 to show that Reach is not in
DynProp under certain first-order definable change operations (see Theorem 5.11), but so
far we cannot show the corresponding result for single-edge changes. The current frontier
is ternary DynProp (c.f. [Zeume and Schwentick 2015]): can (Reach,∆E) be maintained
in DynProp with at most ternary auxiliary relations?
Bibliographic remarks
The results of Section 4.1 were published first as [Datta et al. 2017]. They are joint
work with Samir Datta, Anish Mukherjee, Thomas Schwentick and Thomas Zeume. The
presentation in this thesis, in particular the proof of Theorem 4.7, follows the full version
[Datta et al. 2019]. The arity hierarchy for DynProp as presented in Section 4.2 is joint
work with Thomas Zeume and accepted for publication [Vortmeier and Zeume 2020].
The proof strategy for DynProp lower bound results was introduced by Zeume [2017],
its reformulation as Lemma 4.18 is published in a slightly weaker form in [Schwentick
et al. 2018] as joint work with Thomas Schwentick and Thomas Zeume. The result of
Proposition 4.15 is originally published in [Gelade et al. 2012, Proposition 6.2], and its
proof in this thesis is an adaptation of the proof of [Zeume 2015, Proposition 4.1.19].
78
CHAPTER 5
Expressibility under definable changes
The SQL standard allows for several ways to alter the contents of a database table beyond
single-tuple changes. We give an example for these capabilities.
Example 5.1. Suppose our database models a street network. One table streets with
attributes start and to represents the existing road segments that go from one crossing
to another one. Another table usable_streets with the same attributes represents
those road segments that can actually be used by vehicles. There is a third table
open_crossings with one attribute name that lists crossings that can be passed at the
moment.
Suppose now that some construction works at the crossing university are finished,
so all street segments that go from this crossing to another open crossing are now usable.
The corresponding change of the table usable_streets can be implemented by the
following SQL statement.
INSERT INTO usable_streets (start , to)
SELECT start , to
FROM streets , open_crossings
WHERE start = ’university ’ AND to = name;
On the other hand, the following statement removes all tuples from usable_streets
that mention a crossing that is not listed in open_crossings.
DELETE FROM usable_streets
WHERE start NOT IN (SELECT name FROM open_crossings)
OR to NOT IN (SELECT name FROM open_crossings );
/
So, SQL offers a declarative formalism to change table contents, allowing to insert
or to delete a set of tuples that is specified as the result of some query on the current
79
Chapter 5 Expressibility under definable changes
database. In this chapter we study the question which queries can be maintained in
DynFO and its subclasses under such declaratively defined changes. For this we allow
changes that replace each input relation by the result of a first-order query on the input
structure, possibly using elements as parameters. Again employing the equivalence
between first-order logic and relational algebra, this is a natural formalisation of the
update mechanism of SQL1, and several authors posed the study of such changes as an
open question [Patnaik and Immerman 1997; Etessami 1998; Grädel and Siebertz 2012;
Zeume 2015].
Consequently, the main goal of this chapter is to lift dynamic programs for single-tuple
changes to dynamic programs that support changes defined by first-order queries.
Goal 5.1. Show maintainability in DynFO under (classes of) first-order definable changes
for queries that are in DynFO under single-tuple changes.
Intuitively, it should be harder for a dynamic program to maintain a query under
definable changes than under single-tuple changes, as single-tuple changes are a special
case of first-order definable changes. On the one hand, this makes it more difficult to
show membership in DynFO, but on the other hand, non-maintainability proofs should
become easier, which leads to the formulation of another goal for this chapter.
Goal 5.2. Show for specific queries that they cannot be maintained dynamically in
DynFO under certain first-order definable changes.
Lower bound proofs of this kind come in (at least) two flavours. Firstly, they can
provide a barrier for maintainability, when they state that a query cannot be maintained
under first-order definable changes, although it can be maintained under some smaller set
of changes, for example single-tuple changes. Secondly, they can be a first step towards a
proof that the query in question cannot even be maintained under single-tuple changes.
Contributions The main contributions of this chapter concern maintainability of the
reachability query under first-order definable changes. We show that reachability in
undirected graphs can still be maintained in DynFO under first-order defined insertions,
that is, definable changes that are guaranteed to preserve all existing edges, while still
allowing single-edge deletions. For acyclic graphs, we show that reachability can be
maintained in DynFO under insertions that are defined by quantifier-free first-order
formulas, and single-edge deletions.
These results rely on the so-called bounded bridge property: we show that in each graph
obtained by an insertion as specified above, each reachable pair of nodes is connected by
a path that only uses a constant number of newly inserted edges, where this constant only
depends on the first-order formula defining the change. Updating the transitive closure
of a graph then only requires to concatenate a constant number of already existing paths
along newly inserted edges, which is easily doable in first-order logic. The size of the
1Of course, SQL sub-queries that go beyond relational algebra cannot be expressed in our setting.
See also [Ameloot et al. 2013] for a theoretical investigation of the expressive power of SQL update
primitives.
80
resulting dynamic programs are consequently proportional to the constant given by the
bounded bridge property, which can be huge. We therefore identify natural classes of
first-order definable insertions that yield small dynamic programs maintaining undirected
reachability. This investigation is the theoretical basis of the experimental evaluation of
the dynamic approach in Chapter 7.
So far, we have no results that reachability can be maintained for general directed
graphs under definable insertions, or for some non-trivial graph class under definable
deletions. We however provide barriers for this kind of results. Firstly, we show that
very easy first-order definable insertions for general directed graphs as well as for acyclic
directed graphs do not have the bounded bridge property, so the technique we introduce
for the maintainability results cannot be used in these settings. Building on this insight,
we show that arbitrary unary auxiliary relations and the transitive closure relation are
not sufficient to maintain Reach in these cases. Also, these relations do not suffice to
maintain reachability for directed graphs, even acyclic ones, under definable deletions.
We still do not know whether Reach is in DynProp under single-edge changes. In this
chapter we prove that this is not the case if we allow insertions or deletions definable by
quantifier-free first-order formulas.
Apart from the reachability query, we also study maintenance of formal languages under
definable changes. One result states that all regular languages can still be maintained
in DynProp under quantifier-free changes. Additionally, all context-free languages are
in DynFO under changes that are defined by first-order formulas without quantifier
alternation. We briefly discuss why changes that are defined via first-order formulas with
quantifier alternation seem to be harder to handle.
Intuition (as well as the formulation of Goal 5.1) suggests that all queries that are in
DynFO under some class of definable changes need to be in DynFO under single-tuple
changes. We show that this is not necessarily the case and study parameter-free changes,
that is, changes that are defined by first-order formulas that do not use parameters. In
SQL terminology, we can think of sub-queries that do not mention constants, and refer
to the second expression of Example 5.1 for an example for such a change. This class
does not subsume single-tuple changes and therefore maintainability result do not have
direct implications for maintainability under single-tuple changes. The last result of this
chapter states that all AC1 queries can be maintained in DynFO under parameter-free
first-order changes.
The results presented in this chapter are obtained jointly with Thomas Schwentick and
Thomas Zeume. More detailed bibliographic remarks are given at the end of the chapter.
Outline We formally introduce definable changes in Section 5.1. The next two sections
are devoted to the reachability query: Section 5.2 gives the positive results, Section 5.3
the negative results regarding maintainability of Reach under various classes of definable
insertions and deletions. We continue with the investigation of formal languages under
changes with restricted quantification in Section 5.4. In Section 5.5, we study AC1 queries
under parameter-free changes. We give an outlook and further bibliographic comments
in Section 5.6.
81
Chapter 5 Expressibility under definable changes
5.1 Logically definable change operations
In this section we introduce first-order definable change operations. Intuitively, these
operations replace some of the input relations by the result of first-order queries on the
input structure. Before me make this approach more formal, we illustrate this idea by a
few examples.
Example 5.2.
(a) Actually, a single-tuple insertion is just a special case of a first-order definable
change operation. It only affects one relation of the input structure. Consider the
change operation insE(p1, p2) that allows to insert edges into the edge relation E
of a graph. The formula µ(p1, p2;x, y)
def= E(x, y) ∨ (x = p1 ∧ y = p2) with free
variables x, y and parameters p1, p2 defines a relation that includes all previous edges
and (if that edge was not already present before) one additional new edge (p1, p2).
An edge insertion insE(a, b) can be expressed by instantiating the parameters of µ
with a and b and replacing E with the result of µ(x, y).
(b) Suppose that a user wants to connect some node of a directed graph to every other
node of this graph. This can be achieved by a change operation that replaces the
edge relation with the relation defined by the formula µ(p;x, y) def= E(x, y)∨ (x = p)
with parameter p.
(c) The following example allows to change two relations by one operation. Here,
we consider directed graphs with two additional unary relations C1, C2, which we
interpret as a colouring of the nodes. Let G = (V,E,C1, C2) be such a coloured
graph and suppose a user wants to swap the colours C1, C2 for all nodes that have an
edge to a node u ∈ V . This can be achieved by simultaneously replacing C1 with the
relation defined by µC1(p;x)
def=
(¬E(x, p)∧C1(x))∨(E(x, p)∧C2(x)) and replacing
C2 with the relation defined by µC2(p;x)
def=
(¬E(x, p)∧C2(x))∨ (E(x, p)∧C1(x)),
where for both formulas the parameter p is instantiated by the same node u. We
emphasise that these two formulas specify one change operation.
(d) Finally, we consider an example of a parameter-free insertion. It states that, in
a graph, all nodes u and v that are connected by a path of length 2 should be
connected directly. The new edge relation is defined by the formula µ(x, y) def=
E(x, y) ∨ ∃z(E(x, z) ∧ E(z, y)). /
We now formally define first-order definable change operations. A replacement rule ρR
for a relation symbol R is of the form replace R by µR(p¯; x¯). Here, µR(p¯; x¯) is a
first-order formula, the length of the tuple x¯ equals the arity of R and p¯ is a tuple of
element parameters. A replacement query ρ(p¯) is a set of replacement rules for distinct
relation symbols with the same parameter tuple p¯. In the case of replacement queries ρ
that consist of a single replacement rule, we usually do not distinguish between ρ and its
single replacement formula µR.
We define the semantics of replacement queries. Let D be a structure and α = ρ(a¯) a
change over the domain of D. The result α(D) of the application of α to D is defined in a
straightforward way: each relation RD in D, for which there is a replacement rule ρR in ρ,
82
5.2 Maintaining Reachability under first-order definable insertions
is replaced by the relation resulting from evaluating µR, that is, by {b¯ | D |= µR(a¯; b¯)}.
Other relations that are not explicitly replaced by ρ remain unchanged.
Some of our investigations will focus on (syntactically) restricted replacement queries
that either only remove or only insert tuples to relations. For an insertion rule ρR, the
replacement formula µR(p¯; x¯) has the form R(x¯) ∨ ϕR. Similarly, deletion rules have
replacement formulas µR(p¯; x¯) of the form R(x¯) ∧ ¬ϕR.
Other syntactic restrictions to be studied in this chapter are parameter-free replacement
queries (that allow no parameters in change formulas) and quantifier-free replacement
queries (that allow only quantifier-free change formulas). A special case of quantifier-
free changes are the single-tuple changes. As already sketched in Example 5.2, an
insertion insR(p¯) can be expressed by the insertion query replace R by µR(p¯; x¯), where
µR(p¯; x¯)
def= R(x¯)∨ (p¯ = x¯). Analogously, a deletion delR(p¯) is expressed by the deletion
query replace R by µR(p¯; x¯), where µR(p¯; x¯)
def= R(x¯) ∧ ¬(p¯ = x¯).
5.2 Maintaining Reachability under first-order definable
insertions
The reachability query is arguably the most studied query in the field of dynamic
complexity, and it was shown very early that in the setting of single-tuple changes it can
be maintained in DynFO on undirected as well as directed acyclic graphs [Patnaik and
Immerman 1997; Dong and Su 1995; 1998]. Here, we extend these results to first-order
definable insertions, while still allowing deletions of single edges.
These results rely on the so-called bounded bridge property. In a nutshell, a change
operation has the bounded bridge property, if whenever a graph resulting from such an
operation has a path from some node u to some node v, it also has such a path that uses
only a bounded number of newly inserted edges.
The remainder of this section consists of three parts. In the first part we consider
the undirected reachability query UReach and show that this query can be maintained
under first-order definable insertions (and single-edge deletions).
The dynamic programs we construct are unfortunately rather impractical, as their
size is non-elementary in the size of the first-order insertions, due to the fact that the
constant given by the bounded bridge property, the bridge bound, might be huge in
general. Therefore we identify in the second part subclasses of first-order definable
insertions that result in smaller programs: insertions that are defined by (unions of)
conjunctive queries, with and without negation. We show that these change operations
have small bridge bounds. These results motivate our experiments with a prototypical
implementation of the resulting dynamic programs in Chapter 7.
We do not know whether also the directed reachability query Reach can be maintained
under general first-order insertions. In the third part of this section we present the solution
for a special case: a dynamic program for Reach under quantifier-free insertions on
directed acyclic graphs.
All the programs we construct in this section basically use the same auxiliary relations
as in the case of single-tuple changes. In Section 5.3 we show that extending the dynamic
83
Chapter 5 Expressibility under definable changes
program for single-tuple insertions (see Example 2.8) with arbitrary unary auxiliary
relations is not sufficient to maintain Reach under first-order insertions in cyclic graphs.
There we also present barriers for maintainability under first-order deletions and lower
bounds for quantifier-free dynamic programs.
5.2.1 Undirected Reachability
In this subsection, we show that the undirected reachability query can be maintained
in DynFO under single-edge changes and any finite set of first-order definable insertions.
The respective dynamic programs rely on the fact that for every undirected path between
two nodes u, v in the graph after the insertion there is an undirected path between u and
v that only uses a bounded number of newly inserted edges.
We formalise this idea with the following notion. Let G′ be a graph that is obtained
from a graph G by an insertion of edges. For two nodes u, v of G′, the undirected bridge
distance ubdG,G′(u, v) is the minimal number d such that there is an undirected path
from u to v in G′ that uses at most d edges that are not in G. We will refer to the
new edges in such paths as bridges. Since the two graphs will always be clear from the
context, we usually simply write ubd(u, v) instead of ubdG,G′(u, v). We say that an
insertion query ρ has the undirected bounded bridge property if there is a constant c such
that, for every graph G′ resulting from a graph G by applying ρ, and all nodes u, v of G,
ubdG,G′(u, v) ≤ c. The smallest such c is called the undirected bridge bound of ρ.
The undirected bridge bound of a single-edge insertion is of course 1, but we now prove
that each first-order insertion also has a constant undirected bridge bound.
Proposition 5.3. Every first-order definable insertion query has the undirected bounded
bridge property.
Proof. We show that each first-order definable insertion operation ρ with underlying
first-order formula µ(p¯; x¯) of quantifier rank k has the undirected bounded bridge property.
Let ` be the length of p¯ and c′ the number of FO[k, 1]-types of directed2 graphs. We will
bound the number of bridges on undirected paths created by ρ by c def= `+ c′. For this
we use Theorem 2.2, which implies that if a change, defined by a first-order formula of
quantifier rank k, inserts edges (a, b) and (c, d) for nodes a, b, c, d from different weakly
connected components such that a and c as well as b and d have the same rank-k type,
respectively, then the change also inserts the edges (a, d) and (c, b).
Let G be a graph and let α = ρ(a¯) for some tuple a¯ of nodes of G. Let u, v be two
nodes that are connected by some undirected path pi of the form u = w0, w1, . . . , wr = v
in α(G). Let q be the number of bridges of pi, that is, edges that are not in G. Our
goal is to show that there exists such an undirected path with at most c bridges. For
q ≤ c, there is nothing to prove, so we assume q > c. It suffices to show that there is
a path from u to v with fewer than q bridges. Let (u1, v1), . . . , (uq, vq) be the bridges
in pi. If for some i, the nodes ui and vi are in the same weakly connected component
of G (before the application of α), we can replace the bridge (ui, vi) by a path of “old”
2Remember that we formally consider undirected paths in directed graphs.
84
5.2 Maintaining Reachability under first-order definable insertions
edges resulting in an overall path with q − 1 bridges. Similarly, if ui and uj are in the
same weakly connected component of G, for some i < j, we can shortcut pi by a path
from ui to uj inside G. Therefore, we can assume that, for every i, the nodes ui and vi
are in different weakly connected components of G, and likewise ui and uj for i < j.
We show that in this case there are i, j with i < j such that µ defines an edge between
ui and vj , and therefore a path with fewer bridges can be constructed by shortcutting
the path pi with the edge (ui, vj). Indeed, by the choice of m there must be two nodes ui
and uj , with i < j, in distinct weakly connected components of G that do not contain
any element from a¯, such that ui and uj have the same FO[k, 1]-type in their respective
connected components. By Theorem 2.2 it follows that (ui, vj , a¯) and (uj , vj , a¯) have the
same FO[k, `+ 2]-types and therefore, since µ defines an edge between uj and vj , it also
defines an edge between ui and vj . Clearly, the path u, . . . , ui, vj , . . . , v has fewer bridges
than pi.
With the help of Proposition 5.3 we can show the following result.
Theorem 5.4. Let ∆ be a finite set of first-order insertion queries. Then (UReach,∆∪
∆E) can be maintained in DynFO.
For the proof, we use the approach for maintainingUReach under single-edge insertions
and deletions from [Dong and Su 1998, Theorem 4.3] and maintain a directed spanning
forest and its transitive closure relation. The undirected bounded bridge property allows
the update of the spanning forest and its transitive closure in a first-order definable way.
Proof. The dynamic program presented in [Dong and Su 1998, Theorem 4.3] maintains
the undirected transitive closure of graphs under single-edge changes with the help
of auxiliary relations H and TCH . The binary relation H is a directed forest, whose
undirected version is a spanning forest of the undirected version of the input graph G, and
TCH is its transitive closure.3 Observe that two nodes u and v are in the same weakly
connected component if and only if {(w, u), (w, v)} ⊆ TCH holds, for some node w.
We show how to maintain the relations H and TCH for a single first-order insertion
query ρ. Since ∆ is finite, the update program for ∆ is just the union of the update
programs for each ρ ∈ ∆. For the moment we assume a predefined linear order ≤ on
the domain to be present; later we show how this assumption can be dropped. Let G
be a graph and α = ρ(a¯) an insertion, H a directed spanning forest of G and TCH
its transitive closure. We show how to define in first-order logic the updated auxiliary
relations H ′ and TC ′H for the modified graph G′ = α(G).
We first describe a strategy to define H ′ and then argue that it can be implemented by
a first-order formula. We call the smallest node of a weakly connected component C ′ of
G′ with respect to ≤ the queen u0 of C ′. For each weakly connected component C of G
that is a subgraph of C ′, we define its queen level as the (unique) number ubd(u0, u), for
3In [Dong and Su 1998], undirected graphs are considered but it is easy to see that UReach on directed
graphs can be basically maintained in the same way. Furthermore, the relation H is used slightly
differently, but the adjustments are straightforward.
85
Chapter 5 Expressibility under definable changes
nodes u ∈ C. Thanks to Proposition 5.3 the queen level of each component is bounded
by a constant c.
The spanning tree of a weakly connected component C ′ is constructed by inserting
bridges into the spanning forest of the subgraph of G that is induced by C ′. More
precisely, a directed edge (u, v) is inserted into the spanning forest if the weakly connected
components of u and v in G have queen levels ` and `+ 1 with respect to their queen u0
in G′, for some ` < c, and (u, v) is the lexicographically smallest pair with respect to ≤
for which α inserts (u, v) or (v, u). The lexicographically minimal bridges can be defined
by a first-order formula because the undirected bridge distance between any two nodes in
any component of α(G) is at most c and because there are formulas θh(x, y) expressing
that ubd(x, y) ≤ h, for each number h. Such a formula θh basically quantifies h bridges
and checks, using the stored result of UReach for G, that these bridges connect x and y.
Observe that an edge (u, v) might be inserted into the spanning tree such that v is not
the root of its spanning tree in H. To obtain a directed spanning forest H ′, the direction
of some edges in the directed spanning tree of the component of v needs to be flipped
accordingly, as described in [Dong and Su 1998, Lemma 4.2].
The construction of H ′ ensures that along each directed path in H ′ at most c
new edges are inserted, therefore it is straightforward to extend the update formula
of [Dong and Su 1998, Lemma 4.2] for TCH′ . The update formulas for the deletion of
edges are the same as in the single-edge modification case.
So far we have seen how the auxiliary relations can be updated in first-order logic
using ≤. It remains to show how the assumption of a predefined linear order can be
eliminated. For a change sequence β, we denote by Aβ the set of parameters used
in β. When applying β to an initially empty graph, a linear order on Aβ can be easily
constructed as in the case of single-tuple changes [Etessami 1998]. The crucial observation
is that for each node b ∈ V , either all the remaining nodes in V \ Aβ have an edge to
b or none of them. More precisely, one can show by induction on |β| that for all nodes
a ∈ V and b, b′ ∈ V \Aβ it holds (a, b) ∈ E ⇔ (a, b′) ∈ E (and likewise (b, a) ∈ E if and
only if (b′, a) ∈ E). The proof uses a similar idea as the proof of Proposition 5.3: one
can show that the insertion query cannot distinguish b from b′, therefore an edge (a, b) is
inserted if and only if an edge (a, b′) is inserted.
The dynamic program for maintaining UReach without a predefined linear order
maintains the relations H and TCH as described above, yet restricted to the induced
(and ordered) subgraph Gβ of G on Aβ. Whether two nodes are in the same weakly
connected component of Gβ can thus be inferred from H and TCH . Whether two nodes
are in the same weakly connected component of G can be easily determined as follows.
• Two nodes a, a′ ∈ Aβ are in the same weakly connected component of G if and
only if they are in the same weakly connected component of Gβ or there is a
node d ∈ V \ Aβ such that there are edges (b, d) (or (d, b)) and (b′, d) (or (d, b′))
connecting d with some node b in the weakly connected component of a in Gβ and
some node b′ in the weakly connected component of a′ in Gβ.
• Two nodes a ∈ Aβ and b ∈ V \Aβ are connected by a path if and only if there is
an edge between b and some node in the weakly connected component of a in Gβ.
86
5.2 Maintaining Reachability under first-order definable insertions
• Finally, two nodes a, a′ ∈ V \Aβ are connected if and only they are connected by
an edge or they are both connected by an edge to some node b ∈ Aβ.
5.2.2 Towards efficient dynamic programs
The size of the dynamic programs from the proof of Theorem 5.4 depends on the
undirected bridge bounds of the insertion queries in ∆. Unfortunately, in the worst case
the bridge bound grows non-elementarily in the quantifier rank, that is, its growth cannot
be bounded by a fixed-height tower of exponential functions. We therefore turn next to
insertion queries with more desirable bridge bounds (and consequently reasonably small
dynamic programs). More precisely, we consider insertion queries definable by variants of
conjunctive queries. We will see that for all these variants the bridge bounds are bounded
by the size s of the insertion queries. Therefore, the size of the dynamic update programs
grows at most linearly in s.
Conjunctive queries correspond to select-project-join queries in the relational algebra
and to select-from-where queries with conjunctions of atoms in where-clauses in SQL
[Abiteboul et al. 1995], and constitute one of the most investigated query languages for
relational databases.
More formally, the class UCQ contains all unions of conjunctive queries (short: UCQs),
that is, queries expressible by formulas of the form ϕ(w¯) = ∨`i=1 ∃z¯ψi, where each ψi is a
conjunction of atomic formulas. A conjunctive query is a UCQ for which ` = 1 holds.
The class of conjunctive queries is denoted by CQ. The extensions of CQ and UCQ by
allowing negated atoms are denoted by CQ¬ and UCQ¬, respectively. In the following, a
UCQ-definable insertion query is a replacement query ρ(p¯) with a formula of the form
E(x, y) ∨ ϕ(p¯, x, y), where ϕ is a union of conjunctive queries. We will refer to ϕ(p¯, x, y)
as insertion formula. Analogously, we define CQ-, CQ¬- and UCQ¬-definable insertion
queries.
We will see that the undirected bridge bound of CQ-definable insertions is at most two
and, more generally, for UCQ-definable insertions based on the union of ` conjunctive
queries it is at most 2`. There is no constant undirected bridge bound for CQ¬-definable
insertions, but it is still linear in the size of the query.
We first state the result for UCQ-definable insertions.
Proposition 5.5.
(a) For each UCQ-definable insertion query ρ that is a union of ` conjunctive queries,
the undirected bridge bound of ρ is at most 2`.
(b) The bound of Statement (a) is tight.
Before we prove the statements, we introduce some further notation. In the following,
we associate a (hyper-)graph H with a conjunctive query ψ as it is common in the
literature on CQs [Gottlob et al. 2001]: the nodes of Hψ are the variables of ψ and each
atom of ψ becomes a (hyper-)edge in Hψ. Note that Hψ does not need to be connected.
Each maximal weakly connected component induces a conjunctive query θ = ∃y¯ θ′, where
θ′ is the conjunction of all atoms of the component and y¯ contains only the quantified
87
Chapter 5 Expressibility under definable changes
variables of the component. We call these formulas θ the patterns of ψ and denote for
each variable x the unique pattern in which x occurs by θx.
Proof (of Proposition 5.5). (a) Let us assume, towards a contradiction, that there is a
UCQ-definable insertion query ρ with insertion formula ϕ(a¯;x, y), a graph G, a change
α = ρ(a¯) and two nodes u, v such that ubdG,α(G)(u, v) ≥ 2`+ 1, where ` is the number
of conjunctive queries that constitute ϕ = ∨`i=1 ∃z¯ ψi. That is, there is an undirected
path pi in α(G) from u to v with at least 2` + 1 bridges, but no such path with
fewer bridges. By the pigeonhole principle, at least three (not necessarily consecutive)
bridges (u1, v1), (u2, v2), (u3, v3) from pi are inserted by the same CQ ψi. In particular,
G |= ∃z¯ ψi(a¯, u1, v1) and G |= ∃z¯ ψi(a¯, u3, v3). Without loss of generality, we assume that
u1 is the first of these six nodes that occurs on the path from u to v and that (u3, v3) is
the last of the three edges on this path. We however do not specify whether u3 is visited
before or after v3 in the undirected path pi. We show that α also inserts the edge (u1, v3)
that can be used to shortcut pi, resulting in an undirected path from u to v with fewer
bridges, which is the desired contradiction.
We examine the patterns of ψi. Observe that the patterns θxi and θ
y
i are different:
if x and y were in the same connected component of ψi, then every assignment η that
satisfies ψi needs to map x and y to nodes that are connected by an undirected path, and
thus ψi only inserts edges inside weakly connected components of a graph, contradicting
the assumption that ψi inserts at least three bridges. Now, let η1 and η3 be assignments
witnessing that ψi inserts the edges (u1, v1) and (u3, v3), respectively. So, ηj maps (x, y)
to (uj , vj), p¯ to a¯ and the quantified variables z¯ to some nodes b¯j , and all patterns of
ψi are satisfied under both η1 and η3. Let η′ be the assignment that agrees with η1 on
the variables of the pattern θxi and with η3 on the remaining variables. In particular, η′
agrees with η3 on the variables of the pattern θyi . Observe that η′ satisfies every pattern
of ψi and therefore proves that G |= ∃z¯ ψi(a¯, u1, v3) and that α inserts the edge (u1, v3),
as desired.
(b) We now show that the bound of Statement (a) is actually tight. For the construction
we use, for natural numbers q, conjunctive queries χq(z)
def= ∃z1, . . . , zq−1E(z, z1) ∧
E(z1, z2) ∧ · · · ∧ E(zq−1, z) expressing that z lies on a (not necessarily simple) cycle of
length q.
We warm up with the case ` = 1 and show that there is a CQ-definable insertion with
bridge bound 2. For this, we consider the insertion formula ψ1(x, y) = χ2(x) ∧ χ3(y) and
the graph G that consists of three disjoint cycles, two of length 2 and one of length 3.
Let u, v be two nodes from the two different cycles of length 2. It is easy to see that
ψ1 introduces an edge from every 2-cycle node to every 3-cycle node and thus produces
an undirected path with 2 bridges from u and v, but there is no such path with fewer
bridges.
Now we generalise this approach for arbitrary ` > 1. Let Q = {q1, . . . , q2`} be a set of
2` prime number q1 < · · · < q2` such that q2` < 2q1. Such a set Q exists by the Prime
Number Theorem (see for example Selberg [1949]). For every i ≤ `, let ψi(x, y) be the
CQ χq2i−1(x)∧χq2i(y). That is, an edge (u, v) is inserted due to ψi whenever u is part of
a (not necessarily simple) cycle of length q2i−1 and v is part of a (not necessarily simple)
88
5.2 Maintaining Reachability under first-order definable insertions
Cq1
Cq2
Cq3
Cq1
Cq4
Cq5
Cq3
Cq6
Cq7
Cq5
Cq8
Cq7
ψ1
ψ1
ψ2
ψ2
ψ3
ψ3
ψ4
ψ4
Figure 5.1: Illustration of the graph from the proof of Proposition 5.5(b) for ` = 4. Each
Cqi depicts a cycle of length qi. Bridges are highlighted in blue and annotated
by the conjunct used for their insertion.
cycle of length q2i. Furthermore, for every prime q, let Cq be a cycle graph with q nodes.
We now show that the insertion formula ϕ(x, y) = ∨`i=1 ∃z¯ψi(x, y) constitutes an
insertion query with undirected bridge bound 2`. To this end, for numbers m1,m2, let
Dm1,m2 be a graph consisting of a cycle of length m1 and a cycle of length m2, which
have one node in common. We consider the graph G that is a disjoint union of
• `+ 2 cycles Cq1 , Cq2 , Cq4 . . . , Cq2`−2 , Cq2`−1 , Cq2` , and
• `− 1 graphs Dq1,q3 , Dq3,q5 , . . . , Dq2`−3,q2`−1 .
Let u be some node from Cq1 and v some node from Cq2`−1 . It is easy to see that
the application of ϕ yields an undirected path from u to v through the subgraphs
Cq2 , Dq1,q3 , Cq4 , Dq3,q5 , Cq6 , . . . , Dq2`−3,q2`−1 , Cq2` , in that order (see Figure 5.1). This
path contains 2` bridges.
There is no undirected path with fewer bridges, because edges are only inserted
between consecutive subgraphs in the above sequence. This is because G |= χq(u) holds
only for a node u that is part of a simple cycle of length q, for some q ∈ Q, as the length
of all non-simple cycles in G is larger than any prime number qi ∈ Q. However, for each
i ∈ {1, . . . , `}, simple cycles of length q2i−1 and q2i, respectively, only occur in consecutive
subgraphs in the above sequence.
We now proceed to the undirected bridge bounds for UCQ¬-definable insertions. For
the statement we need to extend the notion of a pattern slightly. We view a conjunctive
query ψ with negation as a formula of the form ∃z¯ ψ1 ∧ ψ2, where ψ1 is a conjunction of
positive atoms and ψ2 a conjunction of negated atoms. We recall that each variable of a
CQ with negation needs to occur in some positive atom. The positive patterns of ψ are
the patterns of the CQ ∃z¯ ψ1, that is, the weakly connected components of ψ when only
the positive atoms are considered.
Proposition 5.6.
(a) For each UCQ¬-defined insertion query ρ that is a union of ` conjunctive queries
with negations with at most k positive patterns each, the undirected bridge bound of
ρ is at most 2(k − 1)`.
(b) For each n ∈ N there is a CQ¬-defined insertion query ρ with undirected bridge
bound ≥ n.
Proof.
89
Chapter 5 Expressibility under definable changes
(a) We only need to extend the argument of Proposition 5.5(a) slightly. Towards a
contradiction, we now assume that there is a UCQ¬-definable insertion query ρ with
insertion formula ϕ(p¯;x, y) = ∨`i=1 ∃z¯ ψi(p¯;x, y) with at most k positive patterns in each
CQ ψi, a graph G, a change α = ρ(a¯) and two nodes u, v such that ubdG,α(G)(u, v) ≥
2(k − 1)`+ 1, as witnessed by an undirected path pi in α(G) from u to v with at least
2(k − 1)`+ 1 bridges, and the absence of a path with fewer bridges.
By the pigeonhole principle, there is a CQ¬ ψi that inserts at least 2k − 1 (not
necessarily consecutive) bridges (u1, v1), . . . , (u2k−1, v2k−1) of pi. Let u1 be the first of
these nodes in pi and let η be an assignment that witnesses G |= ∃z¯ ψi(a¯, u1, v1). In
particular, all positive patterns of ψi are satisfied under η. To satisfy the at most k − 2
positive patterns of ψi which do not contain x or y, the variables of z¯ that appear in
these patterns are mapped into at most k− 2 weakly connected components C1, . . . , Ck−2
of G.
If some weakly connected component of G would contain more than two of the nodes
v3, . . . , v2k−1, pi could be shortcut yielding a path from u to v with fewer bridges. Likewise,
none of them can be from the components of u1 or v1. Therefore, at least one of the nodes
v3, . . . , v2k−1 must be from a weakly component component other than C1, . . . , Ck−2. Let
vj be such a node and let η′ be an assignment that witnesses G |= ∃z¯ ψi(a¯, uj , vj). Since
η′ maps the variables of θyi , the positive pattern of ψi that contains y, into a component
of G that is not in the range of η, the valuation η′′ that coincides with η′ on the variables
of θyi and with η everywhere else witnesses G |= ψi(a¯, u1, vj). Note that we can be sure
that for all negated atoms of the form ¬E(z1, z2), where z1 is from θyi and z2 from some
other positive pattern of ψi, there is no edge from η′′(z1) to η′′(z2), since these nodes are
in different weakly connected components of G (and likewise for atoms ¬E(z2, z1)). It
follows that there is a shortcut edge from u1 to vj and consequently an undirected path
from u to v with fewer bridges than pi, the desired contradiction.
(b) Let n ∈ N be some natural number. Our approach is to construct a CQ¬-definable
insertion query ρ as well as a graph G consisting of connected components D0, . . . , Dn,
each Di with a distinguished node vi, such that the application of ρ inserts exactly the
edges (v0, v1), . . . (vn−1, vn). Then the undirected bridge distance between v0 and vn will
be n.
The nodes vi will be distinguished by having an edge and a path of length 2 to
some other node. The remaining challenge is to construct the insertion query and
the components of G such that no edge between components Di and Dj is inserted if
|i− j| > 1.
Let Q = {q(i, j) | 0 ≤ i < j ≤ n, j − i > 1} be a set of distinct prime numbers
q(0, 2) < q(0, 3) < · · · < q(n− 2, n) such that q(n− 2, n) < 2q(0, 2). Such a set Q exists
by the Prime Number Theorem (see Selberg [1949]). For each i ∈ {0, . . . , n}, let Qi be
the set Qi
def= {q(i, j) | q(i, j) ∈ Q, i < j ≤ n} ∪ {q(j, i) | q(j, i) ∈ Q, 0 ≤ j < i}.
For a finite set N of numbers, let DN be a graph with one distinguished node v
(so, v has an edge and a path of length 2 to some node v′), and one directed cycle of
length q, for each q ∈ N , all of them containing the node v and otherwise being disjoint.
Finally, the graph G is the disjoint union of graphs DQ0 , . . . , DQn , where in each DQi
the distinguished node is denoted by vi (see Figure 5.2).
90
5.2 Maintaining Reachability under first-order definable insertions
C7
C11
v0
C13
v1
C7
v2
C11
C13
v3
Figure 5.2: Illustration of the graph from the proof of Proposition 5.6(b) for n = 3 with
Q = {7, 11, 13}, q(0, 2) = 7, q(0, 3) = 11, and q(1, 3) = 13. Each Ci depicts a
cycle of length i and bridges are highlighted in blue. As an example, the edge
(v0, v2) is not inserted because all cycles of length 7 contain either v0 or v2.
Now we construct a CQ¬-definable insertion query that inserts exactly the edges
(v0, v1), . . . , (vn−1, vn) (and the edges in reverse direction) into G. To this end, it inserts
an edge (u, v) if (1) u and v have each have an edge and a path of length 2 to some other
node, and (2) for each q ∈ Q there is some (not necessarily simple) cycle of length q in G
which avoids u and v. For each k, let
χk(x, y, x1, . . . , xk)
def=
k−1∧
i=1
E(xi, xi+1) ∧ E(xk, x1) ∧
k∧
i=1
(¬E(x, xi) ∧ ¬E(y, xi)).
That is, χk expresses that the variables x1, . . . , xk are mapped to a (not necessarily
simple) cycle of length k and that x and y do not have an edge to any node of that cycle,
so in particular they are not part of that cycle. Finally, the insertion query ρ is induced
by the CQ¬ formula
ϕ(x, y) def= ∃z1, z2, z3, z4 E(x, z1) ∧ E(z1, z2) ∧ E(x, z2) ∧ E(y, z3) ∧ E(z3, z4) ∧ E(y, z4)
∧
∧
q∈Q
∃x1, . . . , xq χq(x, y, x1, . . . , xq).
We claim that the undirected bridge distance between v0 and vn in G is n with respect
to the insertion ρ. It is at most n since the edges (v0, v), . . . (vn−1, vn) are inserted by ρ.
On the other hand, no edge between components Di and Dj is inserted if |i− j| > 1. To
see this, we observe first that ϕ only allows to insert edges between distinguished nodes
vi and vj due to the required edge and path of length 2 to some other node. Towards a
contradiction, let us assume that ρ inserts an edge between nodes vi and vj with j− i > 1.
Let q = q(i, j). Simple cycles of length q only occur in Di and Dj and can thus not be
used to satisfy ∃x1, . . . , xq χq(x, y, x1, . . . , xq). All non-simple cycles in G have length
larger than q by choice of Q and construction of G, and can therefore also not be used to
satisfy ∃x1, . . . , xq χq(x, y, x1, . . . , xq). So, ∃x1, . . . , xq χq(x, y, x1, . . . , xq) is not satisfied,
the desired contradiction.
With the bridge bounds of Proposition 5.5(a) and Proposition 5.6(a) the dynamic
program constructed in the proof of Theorem 5.4 becomes actually usable. Indeed,
Statement (a) of Proposition 5.5 is the basis for our implementation and experiments
that are described in Chapter 7.
91
Chapter 5 Expressibility under definable changes
5.2.3 Reachability in acyclic graphs
Now we turn to the other restriction for which DynFO maintainability under complex
insertions (and single-edge deletions) is preserved: directed reachability on directed,
acyclic graphs. However, we are only able to show this result for quantifier-free insertions.
In [Patnaik and Immerman 1997, Theorem 4.2], edge insertions are only allowed if they
do not add cycles. Of course, given the transitive closure of the current edge relation
it can be easily checked by a first-order formula (a guard) whether a new edge closes a
cycle. We will see that this is also possible for the complex insertions we consider.
As in Subsection 5.2.1, we begin by stating a bounded bridge property. However, we
have to modify it a bit to cope with the fact that insertion queries can introduce cycles
into acyclic graphs. The directed bridge distance bd is defined as the undirected bridge
distance ubd, but with respect to directed paths. Let G′ be a graph that is obtained
from a graph G by inserting edges. For two nodes u, v of G′, bdG,G′(u, v) is the minimal
number d such that there is a directed path from u to v in G′ that uses d edges that are
not in G.
We say that an insertion query ρ has the bounded bridge property on directed acyclic
graphs if there is a constant c such that, for every graph G′ resulting from a directed
acyclic graph G by applying ρ, G′ either contains a directed cycle with at most c bridges,
or G′ is acyclic and bdG,G′(u, v) ≤ c for all nodes u, v of G′. We call the smallest such c
the directed bridge bound of ρ on acyclic graphs.
Proposition 5.7. Every quantifier-free insertion query has the bounded bridge property
on directed acyclic graphs.
Proof. We show that each quantifier-free insertion operation ρ with underlying formula
µ(p¯; x¯) has the bounded bridge property on acyclic graphs. Let ` be the length of the
parameter tuple of ρ and c′ the number of FO[0, `+ 1]-types of graphs. We will bound
the number of bridges on paths created by ρ by c def= c′ + 1.
Let G be a directed, acyclic graph, let α = ρ(a¯) be a change, and let u, v be nodes
of G. As in the proof of Proposition 5.3, we show that each path pi from u to v with
q > c′ bridges can be transformed into a path with fewer bridges, unless a cycle with at
most c bridges is introduced.
To this end, let (u1, v1), . . . , (uq, vq) be the bridges in pi. Since q is larger than the
number c′ of FO[0, ` + 1]-types, there are i, j with 1 ≤ i < j ≤ c′ + 1 such that (vi, a¯)
and (vj , a¯) have the same FO[0, ` + 1]-types. We distinguish three cases. In case (1),
the edge (vi, ui) is in G and thus α introduces a cycle of length 2. In case (2), the edge
(vj , ui) is in G and, together with the sub-path from ui to vj , constitutes a cycle with at
most c bridges. In case (3), ui is neither connected to vi nor to vj by an edge. Therefore
(ui, vi, a¯) and (ui, vj , a¯) have the same FO[0, `+ 2]-types by Theorem 2.2, and α inserts
an edge (ui, vj) as well, the desired shortcut.
This property allows extending the technique for maintaining the transitive closure
relation of acyclic graphs under single-tuple changes (see Example 2.9 together with
Example 2.8) to quantifier-free insertions. As in the single-edge case, no further auxiliary
92
5.3 Inexpressibility results for Reachability
relations besides the transitive closure relation are needed. In Section 5.3 we show that
the transitive closure relation does not suffice for maintaining Reach for acyclic graphs
under insertions definable with existential quantifiers.
Theorem 5.8. Let ∆ be any finite set of quantifier-free insertion queries. Then
(Reach,∆ ∪ ∆E) can be maintained in DynFO for directed acyclic graphs. Further-
more, for each quantifier-free insertion, there is a first-order guard which checks whether
the insertion destroys the acyclicity of the graph.
Proof. In [Patnaik and Immerman 1997, Theorem 4.2], a dynamic program is given
that maintains the transitive closure of acyclic graphs under single-edge changes, using
only the transitive closure as auxiliary relation (see Examples 2.9 and 2.8). Thanks to
Proposition 5.7, this program can be easily extended. Let ρ be a quantifier-free insertion
query with replacement formula µE(x, y), and let c be the directed bridge bound of ρ on
acyclic graphs. To update the transitive closure relation Q of the input graph, it suffices
to existentially quantify up to c newly inserted edges and check that they connect x and
y using already existing paths. The corresponding update formula is thus given by the
formula
ϕQρ (x, y)
def= Q(x, y) ∨
∨
1≤c′≤c
∃u1∃v1 · · · ∃uc′∃vc′
(
Q(x, u1) ∧Q(v1, u2) ∧ · · · ∧Q(vc′ , y)
∧ µE(u1, v1) ∧ · · · ∧ µE(uc′ , vc′)
)
.
Analogously, a first-order guard formula can be constructed that checks whether a
cycle with at most c newly inserted edges is created.
5.3 Inexpressibility results for Reachability
Until now we have seen that (different variants of) the reachability query can still be
maintained under some classes of definable insertions and single-tuple deletions. However,
we have not seen so far a result that the reachability query can be maintained under
definable insertions and definable deletions, even for restricted graph classes. On the
other hand we have no proof that Reach cannot be maintained in DynFO under all
FO-definable changes.
We provide preliminary results from two directions. First, in Subsection 5.3.1, we prove
that Reach cannot be maintained in DynFO under complex changes with arity-restricted
auxiliary relations. Then, in Subsection 5.3.2, we show that reachability cannot be
maintained in DynProp under changes defined by quantifier-free formulas.
5.3.1 Inexpressibility of Reachability with Restricted Auxiliary
Relations
We show that, in the presence of complex changes, the transitive closure relation as the
only binary auxiliary relation does not suffice to maintain reachability. For this result to
93
Chapter 5 Expressibility under definable changes
hold, it suffices to allow single-tuple insertions and one complex change query, which can
be chosen either as an insertion or a deletion query. This change query can be restricted
in several ways and in most cases the result even holds for acyclic graphs. These results
should be contrasted with Theorem 5.8. By ∃∗FO we denote the existential fragment of
first-order logic, that is, the set of first-order formulas in prenex normal form with only
existential quantification.
Theorem 5.9.
(a) There is an insertion query ρ1 such that (Reach, {insE , ρ1}) cannot be maintained
in DynFO, if all auxiliary relations besides the query relation are unary. The
insertion query ρ1 can be chosen as quantifier-free and parameter-free.
The result even holds on acyclic graphs, in which case ρ1 can be chosen as ∃∗FO-
definable and parameter-free.
(b) There is a deletion query ρ2 such that (Reach, {insE , ρ2}) cannot be maintained in
DynFO, if all auxiliary relations besides the query relation are unary. The deletion
query ρ2 can be chosen as quantifier-free and parameter-free.
The result also holds on acyclic graphs, even with a quantifier-free ρ2.
Theorem 5.9(a) even holds in the case where the nodes of G are linearly ordered, but
the proof becomes considerably more involved. It can be found in [Schwentick et al.
2017b, Theorem 10].4
The proof of Theorem 5.9 makes use of suitable static lower bounds. This approach
has often been used before and was made precise by Zeume [2015].
We say that a k-ary query q is expressed by a formula ϕ(x¯) with help relations of
schema τ , if for every structure D there is a τ -structure H over the same domain such
that for every k-tuple a¯ it holds: a¯ ∈ q(D) if and only if (D, H) |= ϕ(a¯). This notion
should not be confused with definability of the query q in existential second-order logic.
In the latter case, the relations can be chosen depending on a¯, but here the relations
need to “work” for all tuples a¯.
The proof of Theorem 5.9 relies on the fact that unary help relations do not suffice
to express the transitive closure for path graphs, i.e., graphs consisting of just one
path, and for unions of paths even if a certain special binary relation is present. Before
stating this formally, we define the latter class more precisely. For n,m ∈ N, let
Gn,m = (Vn,m, En,m, pos) where (Vn,m, En,m) is the disjoint union of n paths of m nodes
each, that is, Vn,m = {vi,j | i ∈ {1, . . . , n}, j ∈ {1, . . . ,m}}, and En,m = {(vi,j , vi,j+1) |
i ∈ {1, . . . , n}, j ∈ {1, . . . ,m − 1}. The binary relation pos contains tuples (vi,j , vi′,j′)
with j < j′, that is, vi,j is at a smaller position than vi′,j′ in its path. We denote the
class of all structures of the form Gn,m, for arbitrary n,m ∈ N, by Gup.
Lemma 5.10. The query Reach cannot be expressed by a first-order formula with unary
help relations on (a) path graphs and (b) structures from Gup.
Statement (a) follows from an easy locality-based argument and was proved in [Zeume
2015, Lemma 4.3.2]. Statement (b) follows from a standard Ehrenfeucht-Fraïssé argument.
4We strongly conjecture that Theorem 5.9(b) holds in the case with a linear order as well.
94
5.3 Inexpressibility results for Reachability
Proof (of Theorem 5.9). In order to prove (a), let us assume, towards a contradiction, that
for every quantifier-free and parameter-free insertion query ρ1, (Reach, {insE , ρ1}) can
be maintained in DynFO on directed graphs with k unary auxiliary relations B1, . . . , Bk.
Our goal is to show that then the transitive closure of path graphs could be expressed
by a first-order formula with unary help relations B1, . . . , Bk and C0, C1, C2.
We refer to Figure 5.3 for an illustration of the following. Let G be some arbitrary
path graph. Let C0, C1, C2 be unary relations as indicated in Figure 5.3: the relation Ci
contains all nodes whose position in the path is i modulo 3. Let G1 be derived from G
by reversing all edges from C1-nodes to C2-nodes and by adding loops to all C1-nodes
and C2-nodes. Let G2 be the result of applying the insertion query ρ1 = µE(x, y; )
def=
E(x, y)∨ (E(x, x)∧E(y, y)∧E(y, x)) to G1. Intuitively, ρ1 adds all edges (x, y) for which
there is an edge (y, x) and both x and y have self-loops.
By our assumption, Reach(G2) can be defined by a first-order formula ψ2 using
Reach(G1) and unary auxiliary relations B1, . . . , Bk. We next show why this implies
that the transitive closure of path graphs G can be expressed by a first-order formula
with B1, . . . , Bk and C0, C1, C2 as unary help relations.
First, there is a simple formula ϕ1 which defines the edge relation E1 of G1 in terms of
the edge relation E of G and C0, C1, C2. Since all directed paths in G1 have length at
most 2, Reach(G1) can be defined by a first-order formula ψ1 on G1.
By combining ϕ1, ψ1, ρ1, and ψ2 in a suitable way it is straightforward to construct a
first-order formula θ(x, y) which defines Reach(G2) on (G,C0, C1, C2, B1, . . . , Bk). It is
also straightforward to define Reach(G) from Reach(G2) in a first-order fashion, when
C0, C1, C2 and Reach(G2) are given (basically ignore all pairs (u, v), for which there is
an edge (v, u) in G).
Altogether, we can conclude that the transitive closure query can be defined on G with
the help of unary help relations C0, C1, C2, B1, . . . , Bk, the desired contradiction.
We observe that both graphs G1 and G2 in the above proof are not acyclic. Yet
a slight modification of the construction yields acyclic graphs G′1 and G′2. However,
it uses existential quantifiers in the definition of the change operation. The graphs
are also depicted in Figure 5.3. The graph G′2 is obtained by applying the operation
ρ′1 = µE(x, y; )
def= E(x, y)∨ (∃z(E(x, z)∧E(y, z))∧∃z′E(z′, x)) to G′1. The proof is now
analogous to (a) except that G′1 has to be first-order interpreted into the path graph
G, as it uses a slightly larger domain. The rest of the argument for acyclic graphs is
analogous.
For (b), a similar approach is used, this time starting from a structure G′′ = Gn,m
from Gup. The construction is illustrated in Figure 5.4. It only involves acyclic graphs.
Let G′′1 be a graph derived from G′′ by adding n(m− 1) nodes {wi,j | i ∈ {1, . . . , n}, j ∈
{1, . . . ,m− 1}} that are used to connect the paths: for each i, i′ ∈ {1, . . . , n} and each
j ∈ {1, . . . ,m− 1}, the edges (vi,j , wi′,j) and (wi′,j , vi,j+1) are added. Notice that in G′′1
there is a path from a node vi,j to a node vi′,j′ with (i, j) 6= (i′, j′) if and only if j < j′.
Additionally, we add a node u and edges (wi,j , u) for all i ∈ {1, . . . , n}, j ∈ {1, . . . ,m−1}.
This node will be used as the parameter for the deletion. Let G′′2 be the result of applying
the deletion query ρ2(p) = µE(p;x, y; )
def= E(x, y) ∧ ¬E(x, p) ∧ ¬E(y, p) ∧ y 6= p with
parameter u, which deletes all edges that are incident to a node that has an edge to u,
95
Chapter 5 Expressibility under definable changes
G:
v0
C0
v1
C1
v2
C2
v3
C0
v4
C1
v5
C2
v6
C0
v7
C1
v8
C2
v9
C0
v10
C1
v11
C2
G1:
v0 v1 v2 v3 v4 v5 v6 v7 v8 v9 v10 v11
G2:
v0 v1 v2 v3 v4 v5 v6 v7 v8 v9 v10 v11
G′1:
v0 v1
v′1
v2 v3 v4
v′4
v5 v6 v7
v′7
v8 v9 v10
v′10
v11
G′2:
v0 v1
v′1
v2 v3 v4
v′4
v5 v6 v7
v′7
v8 v9 v10
v′10
v11
Figure 5.3: The graphs from the proof of Theorem 5.9(a).
as well as all edges to u.
The argument is now analogous to the previous proof. To this end we observe that
the graph G′′1 can be first-order interpreted into G′′ using pos5. Also, Reach(G′′1) can
be expressed by a first-order formula with the help of pos. Hence the combination of
the interpretation formulas and an assumed formula for updating the transitive closure
yields a formula expressing the transitive closure on Gup. This proves the stated result
for acyclic graphs.
In order to show that the result even holds for quantifier-free and parameter-free
deletion queries on general graphs, the nodes wi,j can be equipped with self-loops, and
the construction hence does not need the parameter node u any more.
5.3.2 Inexpressibility of Reachability in the Quantifier-Free Fragment
Although we believe that Reach cannot be maintained in DynProp using only quantifier-
free update formulas even under single-edge changes, we have so far no proof for this
conjecture. It is only known that binary auxiliary relations are not sufficient in this
setting [Zeume and Schwentick 2015]. There the authors also show that certain restricted
classes of initialisation procedures cannot be used to maintain Reach whenever arbitrary
graphs are allowed as initial databases.
5Because of this step, G′′1 contains n(m− 1) nodes of the form wi,j . All other steps would also work
using m− 1 nodes of the form wj .
96
5.3 Inexpressibility results for Reachability
G′′:
v1,1
v2,1
v3,1
v1,2
v2,2
v3,2
v1,3
v2,3
v3,3
v1,4
v2,4
v3,4
v1,5
v2,5
v3,5
v1,6
v2,6
v3,6
G′′1 :
v1,1
v2,1
v1,2
v2,2
v1,3
v2,3
v1,4
v2,4
v1,5
v2,5
v1,6
v2,6
v3,1 v3,2 v3,3 v3,4 v3,5 v3,6
w1,1 w1,2 w1,3 w1,4 w1,5
w2,1 w2,2 w2,3 w2,4 w2,5
w3,1 w3,2 w3,4 w3,5w3,3
u
G′′2 :
v1,1
v2,1
v1,2
v2,2
v1,3
v2,3
v1,4
v2,4
v1,5
v2,5
v1,6
v2,6
v3,1 v3,2 v3,3 v3,4 v3,5 v3,6
w1,1 w1,2 w1,3 w1,4 w1,5
w2,1
w3,1
w2,2
w3,2
w2,3
w3,3
w2,4
w3,4
w2,5
w3,5
u
Figure 5.4: The graphs from the proof of Theorem 5.9(b) (for n = 3 and m = 6)
Here we provide a general unmaintainability result for Reach with quantifier-free
update formulas in the presence of definable changes. More precisely, we give simple
quantifier-free change queries (a deletion or two insertions), such that Reach and even
UReach cannot be maintained in DynProp under these changes.
Theorem 5.11.
(a) There is a quantifier-free deletion query ρ such that (Reach, {insE , ρ}) cannot be
maintained in DynProp, even with arbitrary initialisation.
(b) There are quantifier-free insertion queries ρ1, ρ2 such that (Reach, {insE , ρ1, ρ2})
cannot be maintained in DynProp, even with arbitrary initialisation.
The results also hold for the reachability query Reach over acyclic graphs and for the
undirected reachability query UReach.
97
Chapter 5 Expressibility under definable changes
For the proof of Theorem 5.11 we lift the proof technique for DynProp inexpressibility
results formalised by Lemma 4.18 from single-tuple changes to definable changes. The
only difference in the statements of Lemma 4.18 and the following Lemma 5.12 is the
class of change operations we consider.
Lemma 5.12. Let q be an m-ary σin-query and let ∆ be a set of quantifier-free replace-
ment queries that includes all single-tuple insertions insR for R ∈ σin. Then (q,∆) is not
in k-ary DynProp, even with arbitrary initialisation, if there are natural numbers `, r and,
for each n0 ∈ N, a natural number n ≥ n0, such that for all subsets B ⊆
( [n]
k+1
)
there exist
• a σin-structure D, a set P = {p11, . . . , p`1, . . . , p1n, . . . , p`n} of distinct elements, and
tuples a¯ = a1, . . . , am and u¯ = u1, . . . , ur, such that
– P and {a1, . . . , am, u1, . . . , ur} are disjoint and contained in the domain of D,
– (D, a¯, u¯, p¯i1 , . . . , p¯ik+1) ≡0 (D, a¯, u¯, p¯j1 , . . . , p¯jk+1) for all strictly increasing se-
quences i1, . . . , ik+1 and j1, . . . , jk+1 over [n], where p¯i = (p1i , . . . , p`i), and
• a sequence β(x11, . . . , x`1, . . . , x1k+1, . . . , x`k+1, y1, . . . , yr) = α1(x¯i1 , y¯) · · ·αs(x¯is , y¯) of
∆-changes, where no change αj uses variables xtij and x
t′
ij′
with ij 6= ij′ as parame-
ters,
such that for all strictly increasing sequences i1, . . . , ik+1 over [n] it holds that
a¯ ∈ q(β(p¯i1 , . . . , p¯ik+1 , u¯)(D)) ⇐⇒ {i1, . . . , ik+1} ∈ B.
Proof idea. The proof of Lemma 4.18 uses only once that the change operations are
limited to single-tuple changes: when it applies Lemma 4.17, the Substructure Lemma
from [Zeume and Schwentick 2015]. The proof of the Substructure Lemma for single-tuple
changes as presented in [Zeume and Schwentick 2015] immediately carries over to change
operations definable by quantifier-free first-order formulas. Then Lemma 5.12 is a simple
corollary of Lemma 4.18.
Now we can prove Theorem 5.11.
Proof (of Theorem 5.11).
(a) We apply Lemma 5.12 and prove that for some quantifier-free deletion query ρ the
dynamic query (Reach, {insE , ρ}) is not in k-ary DynProp with arbitrary initialisation,
for any k.
The idea for the construction of D and β, given some collection B ⊆ ( [n]k+1), is to
start from a graph consisting of paths of length 2 of the form a1, B, a2, for every B ∈ B.
The sequence β induced by a sequence i1, . . . , ik+1 deletes all edges (Y, a2) for which
some ij is not in Y . The only path from a1 to a2 that can possibly remain is the path
a1, {i1, . . . , ik+1}, a2, which is present from the start if and only if {i1, . . . , ik+1} ∈ B, just
as required by Lemma 5.12.
We make this more precise now. Let ρ(p) be the quantifier-free deletion defined as
ρ(p) = µ(p;x, y) def= E(x, y) ∧ ¬E(p, x). So, ρ(p) deletes an edge (x, y) if there is an edge
(p, x); this is because an edge (x, y) is not deleted only if no edge (p, x) exists.
98
5.3 Inexpressibility results for Reachability
a1
a2
. . . B. . .B
{p1, . . . , pn} p i, i
/∈ B
pj,
j ∈ B
(a) The graph for the proof of Theorem 5.11(a).
a1
a2
u1 u2 u3
. . . B. . .B
{p1, . . . , pn} p i, i
/∈ B
pj,
j ∈ B
(b) The graph for the proof of Theorem 5.11(b).
Figure 5.5: The graphs from the proof of Theorem 5.11. Edges used for controlling the
changes are highlighted in blue.
Towards a contradiction, let us assume that Reach can be maintained in k-ary
DynProp under changes from {insE , ρ} for some k. We choose m = 2, ` = 1 and r = 0 in
the statement of Lemma 5.12. Let n ≥ k + 1 and B ⊆ ( [n]k+1) be arbitrary.
The structure D that we construct (see Figure 5.5(a) for an illustration) is a graph
with node set {p1, . . . , pn} ∪ {a1, a2} ∪ B. The graph has the following edges:
• For each B ∈ B there are edges (a1, B) and (B, a2).
• For each i ∈ [n] and for each B ∈ B there is an edge (pi, B) if i 6∈ B.
Since there are no edges between vertices from {a1, a2, p1, . . . , pn}, the requirements
of Lemma 5.12 on D are fulfilled.
Let β(x1, . . . , xk+1) be the change sequence ρ(x1) · · · ρ(xk+1). For a strictly increasing
sequence i1, . . . , ik+1 over [n], let Y = {i1, . . . , ik+1} and let βY be the change sequence
β(pi1 , . . . , pik+1). This change sequence removes all edges (X, a2) with ij 6∈ X, for some
j ≤ k + 1. Therefore, the only edge of the form (X, a2) that might remain after applying
βY is (Y, a2) which only exists if Y ∈ B. Since there is a path from a1 to a2 in βY (D) if
and only if such an edge remains, we conclude that a¯ ∈ Reach(βY (D)) if and only if
Y = {i1, . . . , ik+1} ∈ B. By Lemma 5.12 we conclude that (Reach, {insE , ρ}) cannot be
maintained in k-ary DynProp, the desired contradiction.
(b) The proof is very similar to the proof of part (a). Now, in the structure D that we
construct, the possible paths from a node a1 to a node a2 are of the form a1, B, u1, a2, for
some other node u1 and some B ∈ B. However, initially there are no edges of the form
(B, u1). The first k + 1 changes of the to-be-constructed change sequence β, induced
by a sequence i1, . . . , ik+1, will insert edges (Y, u2) for some particular node u2 if some
ij is not in Y . Therefore, after these changes there is at most one node Y that is not
connected to u2, namely Y = {i1, . . . , ik+1}, if this node exists. A last change inserts an
edge (B, u1) for all B that are not connected to u2. Such a node exists, and consequently
99
Chapter 5 Expressibility under definable changes
there will again be a path from a1 to a2, if and only if {i1, . . . , ik+1} ∈ B. Lemma 5.12
then says that no k-ary DynProp program can maintain the query result.
We choose the insertion queries ρ1 and ρ2 as follows.
• ρ1(p, p′) = µ1(p, p′;x, y) def= E(x, y) ∨
(
y = p ∧ E(p′, x)) and
• ρ2(p, p′, p′′) = µ2(p, p′, p′′;x, y) def= E(x, y) ∨
(
y = p ∧ E(p′, x) ∧ ¬E(x, p′′)).
The query ρ1 inserts edges (x, p) for all x with an edge (p′, x), whereas ρ2 inserts edges
(x, p) for all x with an edge (p′, x) but without an edge (x, p′′).
Towards a contradiction, let us assume that (Reach, {insE , ρ1, ρ2}) can be maintained
in k-ary DynProp, for some k. Again, we aim to use Lemma 5.12 and we let m = 2, ` = 1
and r = 3 in the statement of the lemma. Let n ≥ k + 1 and B ⊆ ( [n]k+1) be arbitrary.
The database D is a graph with node set {p1, . . . , pn} ∪ {a1, a2, u1, u2, u3} ∪ B and
the following edges:
• For each B ∈ B there are edges (a1, B) and (u3, B).
• For each i ∈ [n] and for each B ∈ B there is an edge (pi, B) if i 6∈ B.
• There is an edge (u1, a2).
It can be easily verified that the conditions imposed by Lemma 5.12 are satisfied.
Let β(x1, . . . , xk+1) be the change sequence ρ1(u2, x1) · · · ρ1(u2, xk+1)ρ2(u1, u3, u2).
For any set Y = {i1, . . . , ik+1} with 1 ≤ i1 < · · · < ik+1 ≤ n we define βY def=
β(pi1 , . . . pik+1). After the first k + 1 changes of βY , a node B ∈ B is connected to
u2 unless B = Y . Therefore, the last change ρ2(u1, u3, u2) inserts at most one edge,
(Y, u1), but only if Y ∈ B. Thus, in βY (D) there is a path from a1 to a2 if and only if
Y = {i1, . . . , ik+1} ∈ B.
An application of Lemma 5.12 again yields the desired contradiction.
An inspection of the proofs shows that the used graphs are acyclic and that both
constructions immediately work for undirected reachability.
We note that Theorem 5.11(b) can also by proved using only one definable insertion ρ
instead of two insertions ρ1, ρ2. This insertion ρ could use two parameters, in addition
to the parameters of ρ1 and ρ2, behave as ρ1 if the parameter values are equal and else
behave as ρ2.
5.4 Maintaining formal languages
Similar to the reachability query for different graph classes, the membership problem
for formal languages under changes of single positions is widely studied in the dynamic
complexity literature. Already Patnaik and Immerman [1997] observed that in this setting
all regular languages can be maintained in DynFO using the BIT predicate; Hesse [2003b]
showed that DynFO with unary auxiliary relations suffices. Later, Gelade et al. [2012]
proved that the class of regular languages is exactly the class of languages maintainable
in DynProp. All context-free languages can be maintained in DynFO [Gelade et al. 2012],
some of them and even some non-context-free languages even with only unary auxiliary
relations (cf. [Gelade et al. 2012, Proposition 4.2] and [Dong and Su 1997, Example 5]).
100
5.4 Maintaining formal languages
This is not possible for all context-free languages [Zeume 2015; Vortmeier 2013].
Numerous researchers studied the question which classes of path queries to graphs can
be maintained in DynFO and its subclasses [Weber and Schwentick 2007; Datta et al.
2018a; Zeume 2015; Muñoz et al. 2016; Bouyer and Jugé 2017].
In this section, we show that the membership problem for regular and context-free
languages can still be maintained under certain kinds of definable changes. We first
introduce some notation.
A word over an alphabet Σ is encoded by an ordered structure, that is, a structure D
with a built-in linear order ≤ on its domain that is not modified by any change. We
use this order to identify the domain with the set {1, . . . , n}. Intuitively, the positions
of the word correspond to the elements of the structure’s domain. For every σ ∈ Σ the
structure has a unary relation Rσ that encodes all positions of the input word that carry
the letter σ.
We demand that at any point of time an element of the domain is in at most one
relation Rσ. If an element i is in no relation Rσ, then we consider the position i to be
labelled with the empty word . A structure of the presented form with domain {1, . . . , n}
therefore encodes a word w = w1 · · ·wm of length m ≤ n. As an example, the structure
with domain {1, 2, 3, 4, 5} and Ra = {2, 4} and Rb = {1} represents the word baa.
We assume that structures have constants min and max that represent the smallest
and the largest element, 1 and n, respectively. This assumption is not necessary for our
results but facilitates the proofs: these constants can of course be quantified in FO; for
Theorem 5.13, the only result of this section concerning DynProp, the assumption can be
avoided by using additional auxiliary relations for prefixes and suffixes, just as in [Gelade
et al. 2012].
In the following, we will not distinguish between a structure and the word it represents.
We consider the problem of maintaining (the membership problem for) formal languages
under first-order definable change operations. We assume that only replacement queries
are used whose application results in structures where each position is in at most one
set Rσ. For a given formal language L we denote the membership query for L as
Member(L).
Under definable changes we cannot hope for strong maintenance result for DynFO
without initialisation. Even the simple regular language L((aa)∗) of words with an even
number of as cannot be maintained under a change operation that inserts all positions
into the relation Ra: a dynamic program would need to determine the parity of the
domain, which is not possible in first-order logic. In this section we consequently allow
non-empty initialisations of the auxiliary relations and state their complexity along with
the results.
As an alternative to initialisations one could restrict the replacement queries in a
way that only a constant number of elements may enter the active domain by every
change. For example, one could demand that only elements that are parameters to the
change might change their label from  to some symbol σ ∈ Σ. However, we are more
interested in maintaining queries under stronger change operations than in optimising
the complexity of the initialisation, so we do not consider this alternative approach.
We prove that regular and context-free languages can be maintained dynamically for
101
Chapter 5 Expressibility under definable changes
large classes of change operations: all regular languages can be maintained in DynProp
under quantifier-free change operations and all context-free languages can be maintained
in DynFO under ∃∗FO-definable (and, dually, ∀∗FO-definable) change operations. So
far we cannot deal in general with definable changes that utilise quantifier alternation.
Towards the end of this section we discuss the challenges using an example.
For quantifier-free change operations, the results are obtained by generalisations of
the techniques of [Gelade et al. 2012]. The most important observation is that large
parts of an input word change rather uniformly under the changes we consider: the
label of a position before the change essentially implies the label after the change, so
the transformation of the input can be described by a relabelling function f : Σ → Σ,
where for an alphabet Σ we denote by Σ the set Σ ∪ {}.
Theorem 5.13. Let L be a regular language and ∆ a finite set of quantifier-free re-
placement queries. Then (Member(L),∆) can be maintained in DynProp with NC1
initialisation.
Proof. Let L be a regular language over alphabet Σ and let A = (Q,Σ, δ, s, F ) a cor-
responding deterministic finite automaton with set Q of states, transition function δ,
initial state s, and set F of accepting states. In [Gelade et al. 2012, Proposition 3.3],
the main auxiliary relations are of the form Sq,r(i, j), where q, r are states of A and
i, j are positions of the string under consideration. The intended meaning of Sq,r is
that (i, j) ∈ Sq,r if and only if δ∗(q, wi+1 · · ·wj−1) = r, where δ∗ is the extension of δ to
strings6. Notice that wi and wj are not relevant for determining whether (i, j) ∈ Sq,r.
We show how to extend this approach to replacement queries of the form ρ(p) with one
parameter p. The general case of more parameters works analogously, but is notationally
more involved. A replacement query with one parameter consists of one quantifier free
formula µσ(p;x) for each symbol σ ∈ Σ. Each formula µσ(p;x) determines whether
position x carries σ after the change. Whether this is the case only depends on (1)
the current symbol at position x, (2) the current symbol at position p, and (3) on the
relative order of x and p. Thus, the impact of a change can be described as follows:
some relabelling function f← is applied at all positions x < p, some relabelling occurs at
position p, and some relabelling function f→ is applied at all positions x > p.
We generalise the approach of [Gelade et al. 2012] and maintain binary auxiliary
relations Sfq,r for each relabelling function f and each pair q, r ∈ Q of states. The
intended meaning is that (i, j) ∈ Sfq,r if and only if δ∗(q, f(wi+1 · · ·wj−1)) = r, where f
is extended to strings in the straightforward way. Clearly, Sq,r = Sidq,r. When a change
ρ(i) occurs, an update formula ϕfq,r(p;x, y) has to select the relabelling functions f τ← and
f τ→ as well as the symbol στ that characterise the effect of the change, based on the
symbol τ at position i. When some relabelling f ′ is applied to a subword, whether a
relabelling f yields a subword w with δ∗(q, w) = r is given by the auxiliary relation Sf◦f ′q,r
6The relations Sq,r are named Rq,r in [Gelade et al. 2012], but we want to avoid confusion with
the relations Rσ. Since [Gelade et al. 2012] does not use constants min and max, it uses further
auxiliary relations of the form Ir and Fq that contain all positions i with δ∗(s, w1 · · ·wi−1) = r, and
δ∗(q, wi+1 · · ·wn) ∈ F , respectively.
102
5.4 Maintaining formal languages
for the relabelling function f ◦ f ′. So, the quantifier-free update formula ϕfq,r(p;x, y) can
be chosen as
ϕfq,r(p;x, y)
def= x < y ∧
∨
τ∈Σ
(
Rτ (p) ∧
(
(p ≤ x ∧Rf◦fτ→q,r (x, y)) ∨ (y ≤ p ∧Rf◦f
τ←
q,r (x, y))∨
(
x < p < y ∧
∨
q′,r′∈Q
δ(q′,στ )=r′
(Rf◦f
τ←
q,q′ (x, p) ∧Rf◦f
τ→
r′,r (p, y))
)))
.
The (Boolean) query relation Ans can be updated by the formula
ϕ(p) def=
∨
q,r∈Q,σ,σ′∈Σ
δ(s,σ)=q,δ(r,σ′)∈F
Rσ(min) ∧ ϕidq,r(p; min,max) ∧Rσ′(max).
The initialisation of the relations Sfq,r is straightforward. If f() = σ for a relabelling
function f , then a pair (i, j) with i < j is in Sfq,r if and only if δ∗(q, σj−i−1) = r, where
σm is the word of length m that only contains the letter σ. If f() = , then (i, j) is in
Sfq,r if and only if q = r. This initialisation can be computed in NC1, as the membership
problem for regular languages is in NC1, see [Ladner and Fischer 1980].
We next turn to context-free languages. Using the ideas of the proof of Theorem 5.13
we can extend the result of [Gelade et al. 2012] that context-free languages can be
maintained in DynFO under changes of single positions [Gelade et al. 2012, Theorem
4.1] to quantifier-free changes. In a second step we extend this result to ∃∗FO-definable
change operations, and dually, ∀∗FO-definable change operations.
Proposition 5.14. Let L be a context-free language and ∆ a finite set of quantifier-
free replacement queries. Then (Member(L),∆) can be maintained in DynFO with
P initialisation.
Proof. We adapt the proof of [Gelade et al. 2012, Theorem 4.1], using the same ideas as
in the proof of Theorem 5.13. Let L be a context-free language over alphabet Σ and G a
corresponding context-free grammar in Chomsky normal form, that is, only with rules
of the form X → Y Z and X → σ, for non-terminals X,Y, Z and symbols σ ∈ Σ. We
extend G, slightly violating the Chomsky normal form, with a new non-terminal E and a
new rule E → , as well as with new rules X → XE,X → EX, for every non-terminal X
of G. Clearly, this extension does not change the language generated by G, but simplifies
for example the update formulas for deletions, see [Gelade et al. 2012] for details.
The proof from [Gelade et al. 2012] uses (a slight variation of the) auxiliary rela-
tions SX,Y (i1, j1, i2, j2), for each pair X,Y of non-terminals of the grammar G, with
the intention that (i1, j1, i2, j2) ∈ SX,Y if and only if i1 ≤ j1 < i2 ≤ j2 and X ⇒∗
wi1 · · ·wj1Y wi2 · · ·wj2 holds. The latter statement means that from X one can derive the
sentential form wi1 · · ·wj1Y wi2 · · ·wj2 using rules of G. In particular, if (i1, j1, i2, j2) ∈
SX,Y and Y ⇒∗ wj1+1 · · ·wi2−1, then X ⇒∗ wi1 · · ·wj2 . If (min, i, i+ 1,max) ∈ SX,E for
some position i, then X generates the input word w.
103
Chapter 5 Expressibility under definable changes
X
U1 U2
U
Y
Z
i1 u1 i u2 j1 i2 u3 j2
f1 f2
f← στ f→ f→
wi1 τ · · · wj1 wi2 · · · wj2
Figure 5.6: Conditions for the update of Sf1,f2X,Y in the proof of Proposition 5.14 (see also
[Gelade et al. 2012, Fig. 2]). The bottom contains the input word string
before the change. The layer above depicts the effect of the change, and the
next layer shows the relabelling assumed by the auxiliary relation. On top,
we see the partial derivation tree checked by the update formula.
We adapt this approach as in the proof of Theorem 5.13 and use auxiliary relations of
the form Sf1,f2X,Y for relabelling functions f1, f2. The intention is that (i1, j1, i2, j2) ∈ Sf1,f2X,Y
if and only if i1 ≤ j1 < i2 ≤ j2 and X ⇒∗ f1(wi1 · · ·wj1)Y f2(wi2 · · ·wj2).
We show next how to maintain these relations under quantifier-free replacement queries
ρ(p) with one parameter p. As in the proof of Theorem 5.13, the generalisation for any
number of parameters is straightforward, but tedious.
Let α = ρ(i) be a change to the input word w. Similarly as in the proof of Theorem 5.13,
from ρ and the symbol τ at position i one can derive relabelling functions f←, f→ and a
symbol στ that characterise the effect of α on the positions smaller and larger than i,
and on the position i itself, respectively.
We explain how an update formula can determine whether a tuple (i1, j1, i2, j2) shall be
included in Sf1,f2X,Y after a change ρ(i) with i1 < i < j1. The case i2 < i < j2 is symmetric,
all other cases are also very similar. We also omit several border sub-cases.
The idea behind the update formula is illustrated in Figure 5.6. It essentially “guesses”
parts of a derivation tree that witnesses that X can produce the sentential form
f1(α(w)i1 · · ·α(w)j1))Y f2(α(w)i2 · · ·α(w)j2)) and checks consistency of that guess us-
ing the old auxiliary relations. More formally, it iterates over all rules U → U1U2 of G
and all non-terminals Z with a rule Z → f1(στ ) and checks whether there are positions
u1, u2, u3 with i1 ≤ u1 < i < u2 < j1 and i2 ≤ u3 ≤ j2 such that
• (i1, u1 − 1, u3 + 1, j2) ∈ Sf1◦f←,f2◦f→X,U ,
• (u1, i− 1, i+ 1, u2) ∈ Sf1◦f←,f1◦f→U1,Z , and
• (u2 + 1, j1, i2, u3) ∈ Sf1◦f→,f2◦f→U2,Y hold.
104
5.4 Maintaining formal languages
The initialisation of the relations Sf1,f2X,Y is straightforward. If f1() = σ1 or f2() = σ2
holds for relabelling functions f1, f2 and σ1, σ2 ∈ Σ, then a tuple (i1, j1, i2, j2) is in
Sf1,f2X,Y if and only if X ⇒∗ σj1−i1+11 Y σj2−i2+12 . This can clearly be checked in polynomial
time.
We now lift this result to ∃∗FO-definable replacement queries and essentially prove that
every ∃∗FO-definable replacement query for words can be simulated by a quantifier-free
replacement query. Intuitively this means that an ∃∗FO-definable replacement query
splits the input word into constantly many subwords, and each subword changes with
respect to some relabelling function.
Theorem 5.15. Let L be a context-free language and ∆ a finite set of ∃∗FO-definable
replacement queries. Then (Member(L),∆) can be maintained in DynFO with P initial-
isation.
Proof. We show that for each ∃∗FO-definable change operation ρ(p¯) there exists a
quantifier-free change operation ρ′(p¯, q¯), using (many) additional parameters q¯, such that
for each string w and each change α = ρ(a¯) there is a change α′ = ρ′(a¯, c¯) such that
α(w) = α′(w). Moreover, we show that the tuple c¯ can be determined in FO. As by
Proposition 5.14 we can maintain Member(L) under quantifier-free replacement queries,
the result follows: an update formula for ρ only has to determine the tuple c¯ and then
simulate the corresponding update formula for ρ′.
We construct the change operation ρ′ from ρ. Let µσ(p¯;x) be the replacement formula
of ρ for some relation Rσ, that is, the ∃∗FO-formula that expresses whether after a
change ρ(a¯) the element at position x carries the symbol σ. Without loss of generality,
µσ(p¯;x) =
∨m
i=1 θi(p¯;x) where each θi(p¯;x) is of the form ∃y1 . . . , yk ψi(y¯, p¯;x) with
quantifier-free ψi, and ψi describes a full atomic type over p¯, y¯, x with respect to the
linear order and the relations Rσ. So, ψi is a conjunction of literals such that for any
pair z, z′ of variables from p¯, y¯, x either the literal z ≤ z′ or ¬(z ≤ z′) and either Rσ(z)
or ¬Rσ(z) is present in ψi.
Towards the construction of a replacement formula µ′σ for ρ′, we claim that for every i,
every string w and every tuple a¯ of position there exists a tuple c¯ = (c1, . . . , ck) of positions
in w such that for every position b it holds w |= θi(a¯, b) if and only if w |= ψi(a¯, c¯; b).
Let y1 < · · · y` < x < y`+1 < · · · < yk be the order among the quantified variables
y¯ and x that is enforced by ψi. If w |= θi(a¯, b) holds, let positions (c1, . . . , c`) be the
lexicographically minimal and (ck, . . . , c`+1) the lexicographically maximal positions such
that w |= ψi(a¯, c¯, b) holds. Intuitively, this assignment for the quantified variables y¯ picks
the positions that are as far away as possible from the position b. This tuple satisfies the
condition of the claim above, that is, for every position b′, the statements w |= θi(a¯, b′)
and w |= ψi(a¯, c¯; b′) are equivalent. We call c¯ the canonical tuple for θi with respect to w
and a¯.
As the other direction is trivial, we only argue the direction w |= θi(a¯, b′) ⇒ w |=
ψi(a¯, c¯; b′). We assume that w |= θi(a¯, b′) holds for an arbitrary position b′ and let c¯′
be a tuple of positions such that w |= ψi(a¯, c¯′; b′) holds. We need to show that also
w |= ψi(a¯, c¯; b′) holds. Because w |= ψi(a¯, c¯; b) holds, we only have to show that all
105
Chapter 5 Expressibility under definable changes
statements cj ≤ b′ and b′ ≤ cj hold exactly if c′j ≤ b′ and b′ ≤ c′j hold, respectively. We
show that cj ≤ b′ holds if and only if c′j ≤ b′, the other case is symmetric.
From the order enforced by ψi we see that c′j ≤ b′ holds if and only if j ≤ `. If j ≤ `,
then cj ≤ c′j by the choice of c¯ and therefore also cj ≤ b′ holds. If j > `, then cj ≥ c′j
and cj ≤ b′ does not hold, either. This completes the proof of the claim.
Let µ′σ(p¯, q¯1, . . . , q¯m;x) be the quantifier-free replacement formula
∨m
i=1 ψ
′
i(q¯i, p¯;x),
where ψ′i results from ψi by renaming of the variables y¯ to q¯i. It is clear from the observa-
tions above that, for each word w and each tuple a¯ of positions, w |= µ′σ(a¯, c¯1, . . . , c¯m; b)
if and only if w |= µσ(a¯; b), when the tuples ci are chosen as the canonical tuples for θi
with respect to w and a¯. Note that these tuples can easily be determined in FO, given w
and a¯.
The change ρ′(p¯, (q¯σ)σ∈Σ) is the composition of all replacement formulas µ′σ(p¯, q¯σ;x).
Because every ∀∗FO-formula is just the negation of an ∃∗FO-formula, the next corollary
follows immediately.
Corollary 5.16. Let L be a context-free language and ∆ a finite set of ∀∗FO-definable
replacement queries. Then (Member(L),∆) can be maintained in DynFO with P initial-
isation.
Proof. This can be proven along the same lines as Theorem 5.15, as every ∀∗FO replace-
ment formula can be written in the form ¬∨mi=1 θi with θi as in the previous proof.
Wrapping up the results we have seen so far, we can maintain context-free languages in
DynFO under changes that are definable by FO formulas without quantifier alternation.
So far we do not know whether regular or context-free languages can be maintained under
definable changes that use quantifier alternation. We exemplify the challenges imposed by
these changes with the parameter-free replacement query ρab→aa that intuitively replaces
every subword ab in the input word with aa. It is defined by the replacement formulas
µRa(x)
def= Ra(x) ∨
(
Rb(x) ∧ Ra(x− 1)
)
and µRb(x)
def= Rb(x) ∧ ¬Ra(x− 1), where − 1
denotes the ∃∀FO-definable predecessor function.
When we consider the effect of this change operation to input words of the form
abbabb· · · abb we see that a reduction to quantifier-free replacement queries is not
possible any more: the change cannot be characterised by dividing the input string into
constantly many parts and applying some relabelling function to each part independently.
This change operation can indeed raise the complexity of maintaining a regular language
in DynFO, at least to a small extent. We show this with the example of the regular
language Laa
def= L((b∗ab∗ab∗)∗) over alphabet Σ = {a, b} that contains all strings with
an even number of a’s. Analogously to Example 2.11 one can show that Member(Laa)
can be maintained in DynFO under change operations ∆Σ that allow to set or erase the
label of a single position, using only nullary auxiliary relations, that is, auxiliary bits.
This is not the case any more when the change operation ρab→aa is available.
Proposition 5.17. The dynamic query (Member(Laa),∆Σ∪{ρab→aa}) cannot be main-
tained in DynFO using only nullary auxiliary relations and arbitrary initialisation.
106
5.4 Maintaining formal languages
Proof sketch. To obtain a contradiction, we assume that there is a dynamic program P
with m auxiliary bits that maintains this dynamic query. Let k be the maximal quantifier
rank of any update formula of P.
We consider strings of the form (ab)n1(abb)n2 · · · (abm+1)nm+1 with ni ∈ N. For I ⊆
{1, . . . ,m + 1}, let w(I) be the string of this form such that ni = 2k if i ∈ I and
ni = 2k + 1 otherwise. There are 2m different valuations of m auxiliary bits and 2m+1
different strings w(I), so there are different index sets I1, I2 ⊆ {1, . . . ,m+ 1} such that
the dynamic program stores the same auxiliary structure Aux for both strings w(I1) and
w(I2) after an arbitrary initialisation and after applying the canonical change sequences
that yield w(I1) and w(I2), respectively.
We argue that P cannot maintain the query result when ρab→aa is applied repeatedly.
This is because the update formulas cannot distinguish the two input strings, even after
an arbitrary number of applications of ρab→aa, and therefore P assigns the same auxiliary
relations after each change step. However, after some number ` of applications of ρab→aa,
exactly one of the strings ρ`ab→aa(w(I1)) and ρ`ab→aa(w(I2)) is in Laa. Because P gives
the same query result for both of them, it cannot correctly maintain Member(Laa).
For the first part, observe that for each number i of applications of ρab→aa the strings
ρiab→aa(w(I1)) and ρiab→aa(w(I2)) only differ in the number of substrings of the form
abj , and both numbers n1j and n2j are at least 2k. By a standard Ehrenfeucht-Fraïsse
argument, these numbers cannot be distinguished by first-order formulas of quantifier
rank at most k.
We argue the second part. By applying ρab→aa, substrings of the form b∗ get smaller
and eventually disappear. When by an application of ρab→aa an even (odd) number of
positions change their label from b to a and an even number of b∗-substrings disappears,
then also the next application of ρab→aa changes the label of an even (odd) number of
positions from b to a. If an odd number of b∗-substrings disappears, the parity of the
number of changed positions is different between the two applications of the change.
Let `′ be the smallest index on which I1 and I2 differ, that is `′ ∈ I1 ⇔ `′ /∈ I2. Without
loss of generality we assume that `′ ∈ I1 and `′ /∈ I2. The string w(I1) has an even
number of substrings ab`′ (that are not succeeded by another b), and w(I2) has an odd
number of these substrings. That means that with the `′-th application of ρab→aa an even
number of b∗-substrings disappears in the first string, and an odd number of b∗-substrings
disappears in the second string. Assuming that P gave the correct query answers up to
the `′-th application of ρab→aa for both input strings, for `
def= ` + 1 the parity of the
number of a’s is different among ρ`ab→aa(w(I1)) and ρ`ab→aa(w(I2)), as claimed.
Note that (Member(Laa),∆Σ ∪ {ρab→aa}) can be maintained in DynFO(≤,+,×) with
one unary auxiliary relation (in addition to the ternary arithmetic relations). For a
word with n positions, it suffices to store the parity of the number of occurrences of
substrings abb· · · b with exactly ` b’s, for every ` ≤ n. We omit the details. Whether
(Member(Laa),∆Σ ∪ {ρab→aa}) can be maintained in DynFO with only unary auxiliary
relations is open, as well as whether this dynamic query can be maintained under all
change operations definable by quantifier-free formulas that may use the predecessor and
successor functions.
107
Chapter 5 Expressibility under definable changes
5.5 AC1 queries under parameter-free changes
In the last sections we extended known maintainability results for DynFO under single-
tuple changes to more expressive change operations, characterised by fragments of
first-order logic. We extended not only the results but the known dynamic programs
for the single-tuple cases: basically, the techniques we used for the single-tuple case still
work for the complex changes we consider.
Remarkably, in this section we prove a maintainability result under a fragment of FO-
definable changes that we do not know to hold under single-tuple changes. Of course this
means that single-tuple changes cannot be expressed directly by these change operations.
We now study replacement queries without parameters, or equivalently, changes
expressible as relational algebra queries that do not use constant values. One might
suspect that parameter-free replacement queries are not very powerful, especially when
they are applied to an initially empty input structure. However, when the input structure
comes with a built-in linear order, one can actually construct every finite graph with
relatively simple replacement queries (and similarly for other kinds of structures). For
instance, one can cycle through all pairs of nodes in lexicographic order. If (u, v) is
the current maximal edge, operation keep can move to (u, v + 1), that is, insert this
edge into E, while leaving (u, v) in E, and drop can move to (u, v + 1) while deleting
(u, v) from E. For this reason we include a linear order into the input structure and
yield stronger change operations, and therefore stronger results. Note that in practical
database scenarios a linear order on the domain is often available.
It turns out that in the setting of ordered structures under parameter-free changes a
large class of queries can be maintained in DynFO: all queries that can be expressed in
(uniform) AC1 and thus, in particular, all queries that can be answered in logarithmic
space. This result exploits the fact that for a fixed set of replacement queries without
parameters there is only a constant number of possible changes to a structure, in each
step.
We prove this result in two steps. First we show that the dynamic queries of interest
are (AC1, logn)-maintainable as defined in Section 3.1. Then we adapt the Muddling
technique for single-tuple changes (Theorem 3.3) to prove an analogue of Corollary 3.4
for parameter-free replacement queries.
We start with the proof that AC1 queries are (AC1, logn)-maintainable under parameter-
free replacement queries.
Theorem 5.18. Let q be an AC1 query over ordered structures and ∆ a finite set of
parameter-free first-order definable replacement queries. Then (q,∆) is (AC1, logn)-
maintainable.
Proof. Let q be an r-ary AC1 query over ordered structures and let ∆ be a finite
set of parameter-free first-order definable replacement queries. We construct an AC1
algorithm A and a dynamic program P such that P maintains (q,∆) for logn many
changes starting from auxiliary relations initialised by A. We use a technique that is
inspired from the squirrel technique of [Zeume and Schwentick 2017] and equip P with
the necessary information to answer q after all change sequences of length logn.
108
5.5 AC1 queries under parameter-free changes
For a fixed set ∆ of parameter-free change operations there is only a polynomial number
of change sequences of length at most logn based on these operations. Therefore the
algorithm A can compute the effect of applying each of these change sequences to the
input structure, evaluate q on the resulting structure and store the query result in the
auxiliary relations. The program P then only needs to record the change sequence that
is actually applied and return the corresponding query result.
We make this more formal now but, for simplicity, only consider the case of ordered
graphs (σin = {E,≤}) and assume that ∆ = {ρ0, ρ1} contains only two change operations.
Let G = (V,E,≤) be an ordered graph with n vertices. The initialisation A first
computes the predicate BIT that is compatible to the linear order ≤. With this predicate,
bit sequences of length logn can be represented by one node of G. We use nodes of the
graph to encode change sequences as follows: A sequence β = α1 · · ·αm with m ≤ logn
is encoded by the node vβ whose bit string representation has 1 at position i ≤ m if and
only if αi = ρ1, and has 0 at all other positions.
The algorithm A initialises an (r + 2)-ary auxiliary relation R as follows. For each
m ≤ logn and each v ∈ V such that v = vβ for a change sequence β of length m, A
computes the query result Ansv = q(β(G)). Note that A can compute the graph β(G)
because the iterated evaluation of logn first-order formulas is clearly in AC1. For every
tuple q¯ ∈ Ansv, A inserts the tuple (v,m, q¯) in R.
The dynamic program P only stores the number m of changes that are already applied
and the node v that encodes these changes. The updates for these auxiliary relations are
clear. The program then only needs to output the relation {q¯ | (v,m, q¯) ∈ R}.
We now show the analogue of Corollary 3.4 for parameter-free replacement queries.
Contrary to Corollary 3.4 we here do not assume that the query to be maintained is
almost domain independent: the following theorem is valid for all AC1 queries. However,
this comes with the price that we need an initialisation of the auxiliary relations.
Theorem 5.19 (Muddling for parameter-free definable changes). Let q be an AC1
query over ordered structures and ∆ a finite set of parameter-free first-order definable
replacement queries. If (q,∆) is (AC1, logn)-maintainable, then (q,∆) is in DynFO with
AC1 initialisation.
Proof sketch. Our goal is to construct a dynamic program P that maintains q after an
AC1 initialisation. The construction of this program follows the construction in the proof
of Theorem 3.3 that implies membership in DynFO(≤,+,×) for (AC1, logn)-maintainable
dynamic queries (q′,∆′) where q′ is almost domain independent and ∆′ is a set of
single-tuple changes.
In the proof of Theorem 3.3, the constructed program uses threads that work slightly
differently depending on whether they are supposed to give the query result for a time
point t ≥ logn or for an earlier time point. Observe that only for time points before logn
the assumptions that q′ is almost domain independent and that ∆′ is a set of single-tuple
changes are actually used. This means that the same construction as in the proof of
Theorem 3.3 gives a dynamic FO(≤,+,×)-program P without initialisation that can give
the query result for (q,∆) from time point logn onwards.
109
Chapter 5 Expressibility under definable changes
It only remains to show that the query result can also be obtained for the first logn
time steps. By Theorem 5.18 there is an AC1 initialisation A′ and a dynamic program P ′
that maintains (q,∆) for logn many steps from auxiliary relations initialised by A′.
So, the initialisation algorithm A′′ that first initialises the relations + and × and then
executes A′ together with the dynamic program P ′′ that simulates in parallel P and
P ′ and outputs the result of P ′ until time point logn and then outputs the result of P
witness that (q,∆) is in DynFO with AC1 initialisation.
The main theorem of this section is now an immediate corollary of the preceding
theorems.
Corollary 5.20. Let q be an AC1 query over ordered structures and ∆ a finite set of
parameter-free first-order definable replacement queries. Then (q,∆) is in DynFO with
AC1 initialisation.
5.6 Outlook and bibliographic remarks
In this chapter we studied three classes of queries and three classes of first-order definable
changes. We saw that undirected reachability as well as reachability in acyclic directed
graphs can be maintained under (restricted) first-order insertions, extended the dynamic
programs for regular and context-free languages to changes defined by quantifier-restricted
first-order formulas, and obtained membership in DynFO for all AC1 queries under
parameter-free first-order changes.
We comment on the remark in Section 2.3 that the results in this thesis also hold
in the variant of the FOIES framework of Dong et al. [1995] obtained by extending
the DynFO framework with change operations add(x) and remove(x) that change the
underlying domain by one element. These operations do not fit in the parameter-free
setting of Section 5.5, but we can allow similar change operations add and remove that
add some new element to the domain as the largest element of ≤ or remove that element,
respectively. The other results transfer immediately.
We give some pointers for future research. It is still unclear whether the general Reach
query can be maintained under non-trivial first-order definable insertions, we could only
show that unary auxiliary relations are not sufficient, see Theorem 5.9(a). We have also
no results so far regarding the undirected reachability query under first-order deletions.
One problem for proving membership in DynFO is that one deletion could render the
auxiliary information basically useless in the following sense. Suppose the input graph
contains a node v that has an edge to every other node, and the spanning tree stored as
auxiliary information consists precisely of these edges. A first-order deletion can remove
all of v’s edges at once, and the update formulas need to define a spanning tree for the
remaining graph from scratch. It therefore seems that a dynamic program that maintains
UReach under such first-order deletions needs to store a spanning tree with small, ideally
constant, node degrees, as long as this is possible. Unfortunately, the dynamic program
for first-order insertions from Theorem 5.4 constructs spanning trees with high degree.
Possibly this observation can be used to show that UReach cannot be maintained under
110
5.6 Outlook and bibliographic remarks
both first-order insertions and deletions, at least if only the auxiliary relations from the
proof of Theorem 5.4 are allowed.
The positive result for reachability rely on the bounded bridge property: it is only
necessary to concatenate a constant number of old paths to connect a reachable pair of
nodes in the changed graph.7 We relied on a similar property for maintaining formal
languages in Section 5.4. The quantifier-restricted change operations studied there split
the input string into a constant number of fragments, and in each fragment the positions
are relabelled according to the same function. First-order formulas with ∃∀ quantifier
prefix can express the successor function on strings, and changes defined using this
function do not have the property stated above, as already discussed towards the end of
Section 5.4. There we have already sketched that we can maintain some regular language
in DynFO under some ∃∀FO-definable change operation. The approach used there is
tailored towards the specific language and the specific changes, generalisations in any
direction are left open. Can this language be maintained under all changes that only use
the successor function and no quantifier? Are there regular languages that are not in
unary DynFO under ∃∀FO-definable changes?
The state of art regarding lower bound techniques and results is still unsatisfactory.
We were able to prove unmaintainability of Reach in DynProp under quantifier-free
first-order insertions and deletions, but the proof crucially relies on the fact that the
update formulas do not have access to the endpoints of the changed edges. It is therefore
unclear how this result could be generalised to show that Reach is not in DynProp under
single-edge changes. Further research is necessary to answer this question.
In this chapter we focussed on reachability and on formal language recognition. Of
course there are a lot more interesting queries for which we know maintainability in
DynFO under single-tuple changes. For all of them we can now ask the question: “Can
we do this under first-order changes?” As we already remarked several times, for example
when we compared the queries Parity and OddCoveredNodes in Section 4.3, results of
this form may also have consequences for query maintenance under single-tuple changes.
Bibliographic remarks
The results of this chapter are obtained jointly with Thomas Schwentick and Thomas
Zeume and are published as [Schwentick et al. 2017a] and [Schwentick et al. 2018]. The
detailed distribution is as follows. The results from Section 5.2 appear in both versions,
except of the findings of Subsection 5.2.2, which are only contained in [Schwentick
et al. 2018]. Theorem 5.9(a) and Theorem 5.11(a) are again published in both versions,
Theorem 5.9(b) and Theorem 5.11(b) are only in [Schwentick et al. 2018]. The presentation
of the proof of Theorem 5.11 follows the presentation of [Schwentick et al. 2018]. The
7Actually, the bounded bridge property is the only necessary assumption for the dynamic programs
to work. These dynamic programs are therefore also able to maintain reachability under other, not
necessarily first-order definable, changes with the bounded bridge property. For example, we could
also allow changes that insert plain sets of edges with the structural restriction that the associated
bridge bound is bounded by some constant. See Chapter 6 for further results concerning changes that
insert or delete sets of edges.
111
Chapter 5 Expressibility under definable changes
results concerning formal languages from Section 5.4 only appear in [Schwentick et al.
2017a]. The result of Corollary 5.20 is published in both [Schwentick et al. 2017a] and
[Schwentick et al. 2018], although with a different proof.
112
CHAPTER 6
Expressibility under size-restricted bulk changes
A database table can be modified in ways that cannot be captured by a fixed set of
first-order definable change operations. For example, SQL’s INSERT statement can add a
list of tuples without a pre-determined size bound. Specific database systems support
non-standard instructions, like for example PostgreSQL’s COPY command1 which similarly
can insert a long list of tuples. Additionally, if for performance reasons it is not possible to
update the auxiliary data after every change but this can only be done periodically, then
many changes may accumulate between the updates, even if only single-tuple changes
are allowed.
These aspects lead to the requirement that dynamic programs need to be able to update
a query result when a set of tuples is changed that does not adhere to any structural
limitations. In this chapter we study these bulk changes that may insert and delete sets
of tuples. Of course we cannot allow these changes to replace the input structure with an
arbitrary different one, as a dynamic program then essentially needs to re-compute the
query result from scratch. Therefore we restrict these changes quantitatively and demand
that the size of the changed sets of tuples is bounded by a function in the domain size.
For example, we study changes that allow to insert up to logn edges into a graph with n
nodes.
The goals of this chapter are basically analogous to the goals of Chapter 5. Immediately
the question comes up which maintainability results for single-tuple changes survive when
bulk changes are allowed, and how large the change may be.
Goal 6.1. Show maintainability in DynFO under bulk changes of non-constant size for
queries that can be maintained under single-tuple changes.
1see https://www.postgresql.org/docs/current/sql-copy.html
113
Chapter 6 Expressibility under size-restricted bulk changes
On the other hand, as more complex changes make query maintenance more difficult,
we want to prove that certain queries cannot be maintained under changes that exceed
some size bound.
Goal 6.2. Show for specific queries that they cannot be maintained dynamically under
bulk changes of a certain size.
Contributions As in the previous chapter we mostly consider the reachability query
for directed and undirected graphs and the membership problem for formal languages.
We first show a barrier for these and other queries: we can only hope to maintain them
under changes of polylogarithmic size with respect to the size of the domain, at least if we
only allow FO(≤,+,×) update formulas. Consequently, we then try to prove matching
maintainability results. We succeed for reachability in undirected graphs as well as
for the membership problems of regular languages on words and trees. These results
rely on an investigation of the expressive power of a restricted version of MSO logic
where second-order quantification is restricted to sets of small size, which might be of
independent interest.
For Reach on general directed graphs we only show maintainability under insertions
of polylogarithmic size. It is known that Reach can be maintained under simultaneous
insertions and deletions of lognlog logn edges [Datta et al. 2018b]. We extend this result, which
uses an appropriate extension of the Muddling technique from Chapter 3, to more general
change operations. Then we proceed to study the Distance query which asks for the
length of the shortest path between two graph nodes. We show that this query can be
maintained under changes of polylogarithmic size by FO+Maj(≤,+,×) update formulas,
that is, FO(≤,+,×) formulas that additionally are allowed to use majority quantifiers.
It is still unknown whether Distance is in DynFO under single-tuple changes. We
prove as an intermediate result that DynFO(≤,+,×) can maintain auxiliary relations
under changes of size lognlog logn such that the query result can be extracted from these
relations by an FO+Maj(≤,+,×) formula.
These results are joint work with Thomas Zeume, Samir Datta and Anish Mukherjee.
Again, more detailed bibliographic remarks are given at the end of the chapter.
Outline We start by extending the dynamic complexity framework to bulk changes in
Section 6.1, together with a discussion on principal limits. In Section 6.2 we introduce a
restriction of MSO that still expresses Reach and the membership problem for regular
languages on words and trees. We show that FO(≤,+,×) can simulate this restricted
logic on substructures of polylogarithmic size in the size of the domain, which we use
to show that reachability in undirected graphs as well as regular (tree) languages can
be maintained under changes of polylogarithmic size. Afterwards, in Section 6.3, we
formulate a variant of the Muddling theorem for bulk changes and provide the results we
mentioned above regarding Reach and Distance. These results build on the techniques
from [Datta et al. 2018a] and we introduce the necessary background from linear algebra
before. We conclude in Section 6.4 by giving an outlook and further bibliographic
remarks.
114
6.1 Changing sets of tuples
6.1 Changing sets of tuples
In this section, we extend our framework to change operations that insert or delete sets
of tuples, and discuss the principal limits of query maintenance under these changes.
A single-tuple change α as defined in Chapter 2 consists of a change operation δ that
specifies which relation is modified and whether the tuple is inserted or deleted, and the
actual tuple that is inserted or deleted. The update formulas access that tuple via explicit
first-order parameters. We generalise this approach to size-restricted bulk changes that
provide a set of changed tuples via an explicit second-order parameter, and introduce the
change operations insert f(n) tuples P into R and delete f(n) tuples P from R, for
a relations symbol P and a function f : N→ N. We usually denote these operations as
insR[f(n)] and delR[f(n)], respectively. Given a structure D over some domain D and an
input schema that contains a k-ary relation symbol R, a bulk insertion α = (insR[f(n)], A)
for D provides a k-ary set A of size |A| ≤ f(|D|). The result α(D) is defined according to
intuition: the relation Rα(D) is chosen as RD ∪A, all other relations remain unchanged.
Bulk deletions are defined analogously.
For any set ∆ of single-tuple change operations, we write ∆[f(n)] for the set {δ[f(n)] |
δ ∈ ∆} of bulk change operations that lift the operations from ∆ to changes of size f(n).
If (q,∆[f(n)]) ∈ DynFO, we also say that q can be maintained in DynFO under f(n)
changes from ∆. Note that in all our subsequent maintainability results stating that
some query can be maintained under f(n) changes, we can replace f(n) by O(f(n)), as
each change of size O(f(n)) can be replaced by O(1) changes of size f(n).
In the remainder of this section we discuss which kind of maintainability results we
can aim for. For reasons that we discuss later, we mainly consider FO(≤,+,×) update
formulas that use built-in arithmetic relations. Much of our reasoning is based on the
expressive power of FO(≤,+,×) on small structures. For example, although FO(≤,+,×)
cannot express Parity [Ajtai 1983; Furst et al. 1984], it can determine whether a
sufficiently small relation has odd size.
Theorem 6.1 ([Denenberg et al. 1986; Fagin et al. 1985]).
(a) For every d ∈ N there is a FO(≤,+,×) formula ϕ(A) that determines whether a
set A with |A| ≤ logd n has odd size, where n is the size of the domain.
(b) Such a formula does not exist for sets A that may be as large as f(n), for any
function f(n) ∈ logω(1) n.
Part (a) of this result enables us to extend Example 2.11 and show that the query
Parity can be maintained in DynFO(≤,+,×) under polylogarithmically many insertions
and deletions of elements into and from the unary relation U .
Proposition 6.2. The dynamic query (Parity,∆U [logd n]) is in DynFO(≤,+,×), for
every d ∈ N.
Proof sketch. Suppose a bulk change inserts or deletes a set A of elements to or from U .
Without loss of generality, we assume that A ∩ U = ∅ in case of an insertion, and A ⊆ U
in case of a deletion. The FO(≤,+,×) update formula determines the parity of the
115
Chapter 6 Expressibility under size-restricted bulk changes
number of elements in A, which is possible thanks to Theorem 6.1(a), and updates the
query answer accordingly.
Part (b) of Theorem 6.1 provides a barrier for maintaining Parity under bulk changes.
Proposition 6.3. Let f(n) ∈ logω(1) n be an arbitrary function. The dynamic query
(Parity,∆U [f(n)]) is not in DynFO(≤,+,×).
Proof sketch. Let f(n) ∈ logω(1) n be fixed. The assumption (Parity,∆U [f(n)]) ∈
DynFO(≤,+,×) leads to a contradiction with Theorem 6.1(b). Let ϕ be the FO(≤,+,×)
formula for the change operation insU [f(n)] that updates the query result. When the first
change insU [f(n)](A) is applied starting from an empty input structure, the auxiliary
relations are empty and the formula ϕ needs to determine directly whether A is of odd
size. It is therefore easy to obtain from ϕ a formula that expresses whether a set of size
at most f(n) has odd size, contradicting Theorem 6.1(b).
In light of Proposition 6.3 we see that we cannot hope for meaningful results that a
query can be maintained in DynFO(≤,+,×) under more than polylogarithmically many
changes. This in particular concerns formal language and graph reachability maintenance,
as the membership problem for the regular language of strings with an odd number of a
symbols is equivalent to Parity, and the latter problem can also be reduced easily to
graph reachability, see Example 2.4.
The previous propositions are stated in terms of DynFO(≤,+,×), not DynFO. For single-
tuple changes, the difference is neglectable for most interesting queries, see Proposition 3.2,
but this is not true any more for bulk changes. Using standard arguments one can show
that FO without built-in linear order and arithmetic can only determine whether a set
has odd size if the size of this set is bounded by some fixed constant. The same way
we concluded Proposition 6.3 from Theorem 6.1(b) it follows that DynFO without any
built-in relations cannot maintain Parity under bulk changes of non-constant size, and
consequently neither regular languages nor reachability. Therefore, all results in this
chapter will be stated with respect to DynFO(≤,+,×) and its generalisations.
If the size of bulk changes is given as a function in the size of the active domain instead
of the whole domain and we make sure that a linear order and arithmetic is always
available on the active domain, then some results of this chapter transfer to DynFO
without built-in arithmetic. This will be discussed in more detail in Section 6.4.
6.2 Utilising second-order logics with restricted quantifiers
An important insight we use in Section 4.1 is that MSO formulas can be evaluated via
FO(≤,+,×) formulas on substructures of logarithmic size, as second-order quantification
over sets of size logn can be simulated by first-order quantification over sets of size n.
This is possible because each subset of a set with logn elements can be represented by
one of n nodes.
In this chapter’s setting, a change might insert many edges into a graph. If we, for
example, want to maintain the transitive closure of the graph, we necessarily need to be
116
6.2 Utilising second-order logics with restricted quantifiers
able to express the transitive closure of the graph that only consists of the just inserted
edges. If logn edges are inserted into a graph with n nodes, the same reasoning as
above gives that an FO(≤,+,×) update formula can express the transitive closure of the
inserted edges, as Reach is MSO-expressible, see Example 2.1. Of course, in a second
step, this information needs to be combined with the maintained transitive closure of the
graph before the change.
But what happens if polylogarithmically many edges are inserted? We cannot simulate
MSO on the set of nodes that is affected by such a change, simply because we cannot
represent all of its subsets by constant-sized tuples over all nodes, as there are super-
polynomially many of these subsets. We circumvent this problem and show that we can
represent all subsets of affected nodes up to a certain size, and even relations of arbitrary
arity over this set, as long as they contain only few tuples. It follows that we can simulate
a weak form of second-order quantification that is only able to quantify relations of
small size. We demonstrate that this restricted form of second-order logic is still very
expressive, and deduce that FO(≤,+,×) update formulas can express reachability and
regular languages on words and trees on the substructures that are affected by a change
of polylogarithmic size. Then we use this ability to show that the corresponding query
results can be updated for the whole structure.
6.2.1 The expressive power of restricted second-order quantification
In this section we are particularly interested in second-order logics with size-restricted
quantification. In model theory, the restriction of MSO that allows only quantification
over finite sets is called weak MSO. Over finite structures, in contrast to arbitrary
structures, this restriction of course does not change the expressive power of MSO. Here
we introduce a more restricted variant of MSO that only allows quantification over sets
of bounded size with respect to the underlying domain.
Let f : N → N be a fixed function. The syntax of f(n)-MSO over a given schema σ
is defined exactly as for MSO. We do not give a formal definition of its semantics,
but rather explain the difference to regular MSO. Intuitively, an f(n)-MSO formula
∃X ϕ(X) is satisfied in a σ-structure D with universe D, if there is a set D′ ⊆ D with
|D′| ≤ f(|D|) such that D |= ϕ(D′); a formula ∀X ϕ(X) is satisfied if for all sets D′ ⊆ D
with |D′| ≤ f(|D|) it holds that D |= ϕ(D′).
In the following we list results highlighting that MSO is still very expressive when
quantified sets are bounded by i
√
n, for arbitrary i. We first recall some definitions.
Remember that words over an alphabet Σ are encoded as ordered structures with unary
relations Rσ for every σ ∈ Σ, see also Section 5.4. To define languages of trees instead
of words, we consider ranked alphabets Σ, that is, every symbol σ ∈ Σ comes with an
associated natural number, called its rank. A tree language over a ranked alphabet Σ
is a set of rooted, Σ-labelled, ranked and ordered trees that we represent by relations
FirstChild and NextSibling with the obvious meaning, with the property that the
number of children of a node equals the rank of its symbol σ ∈ Σ. A tree language
is regular if it is accepted by a deterministic finite (bottom-up) tree automaton (see
c.f. [Comon et al. 2007]).
117
Chapter 6 Expressibility under size-restricted bulk changes
Theorem 6.4. The following queries can be expressed in i
√
n-MSO, for every i ≥ 1.
(a) Reachability in directed graphs,
(b) the word problem for any regular language,
(c) the word problem for any regular tree language.
Proof. Let i ≥ 1 be fixed. We prove the parts separately, but with similar proof ideas.
(a) We use an approach that is well-known in the context of small-depth circuits com-
puting small distance connectivity, c.f. [Chen et al. 2016]. From the MSO formula for
Reach from Example 2.1 we can easily construct an i
√
n-MSO formula that expresses
reachability via paths of length at most i
√
n in a graph G of size n. Such a formula
ϕ i√n(s, t) basically states that there is a subset V ′ of nodes (of size at most i
√
n+ 1) such
that t is reachable from s in G[V ′]. As we cannot quantify over sets of size i
√
n+ 1, the
formula2 asserts that there is an edge from s to a node v such that t is reachable from v
via a path of length at most i
√
n− 1.
ϕ i√n(s, t)
def= s = t ∨ ∃V ′ ∃v ∈ V ′ ∀X ⊆ V ′(
E(s, v) ∧
(
X(u) ∧ ∀x ∈ X ∀y ∈ V ′ (E(x, y)→ X(y)))→ X(t))
With a Savich-like construction, this formula can be lifted to paths of greater length.
A path of length i
√
n2 can be decomposed into i
√
n paths of length i
√
n. So, the i
√
n-
MSO formula ϕ i√
n2
(s, t) that we obtain from ϕ i√n(s, t) by replacing atoms E(x, y) with
ϕ i√n(x, y) expresses reachability along paths of length
i
√
n2. Repeating this step i times
results in an i
√
n-MSO formula for Reach.
(b) Let L be some regular language over an alphabet Σ and let A = (Q,Σ, δ, q0, F ) be
a deterministic finite automaton (DFA, with state set Q, transition function δ, initial
state q0 and set F of accepting states) that decides L. We give an i
√
n-MSO formula for
Member(L), the word problem for L.
The usual approach to expressMember(L) in MSO is to construct a formula ψp,q(j, k)
for any pair p, q of states of A such that ψp,q(j, k) is satisfied on a string w = w1 · · ·wn if
A goes from state p to state q while reading the word wj · · ·wk, so, if δ∗(p, wj · · ·wk) = q.
This formula existentially quantifies for all states q ∈ Q the set of positions from w such
that A assumes the state q after reading this position. The formula then checks whether
for all pairs of neighbouring positions these choices are consistent with the transition
function δ.
Under i
√
n-MSO semantics, an almost identical formula ψp,qi√n(j, k) can check whether
δ∗(p, wj+1 · · ·wk) = q holds, as long as wj+1 · · ·wk is a substring of length at most i
√
n.
Similar to part (a) of this proof, we extend this formula to substrings of larger size.
The formula ψp,qi√
n2
(j, k) that extends ψp,qi√n(j, k) to substrings of length
i
√
n2 existentially
quantifies a set P of positions with the intention that these positions divide the substring
of length i
√
n2 into i
√
n substrings of length i
√
n. Additionally, it uses existential quantifiers
2For the sake of notational clarity we use abbreviations “⊆” and “∈” that are easily expressible in
i
√
n-MSO.
118
6.2 Utilising second-order logics with restricted quantifiers
to assign a state from Q to every position from P . For all pairs j′ < k′ of positions from
P such that no ` ∈ P with j′ < ` < k′ exists, the formula checks that ψp1,p2i√n (j′, k′) is
satisfied for the states p1, p2 that are assigned to the positions j′ and k′.
We obtain formulas ψp,qn (j, k) by repeating this construction. The formula ϕL that
expresses Member(L) only needs to iterate over all pairs p ∈ Q and f ∈ F of states and
check that for one such pair δ(q0, w1) = p and ψp,fn (1, n) hold.
(c) We follow a similar approach as for part (b) of the proof and construct formulas
for larger and larger parts of the input tree. To define subtrees of a tree easily, we
use a concept similar to the concept of triangles as defined in Subsection 4.1.1. Let
T = (V,E, r) be a rooted tree, t ∈ V and B ⊆ V . The tree T (t, B) is the subtree of
T rooted at t, but all inner nodes of this subtree that are in B become leaves. More
formally, let T (t) = (V ′, E′) be the subtree of T rooted at t, and let B′ def= B ∩ V ′. The
tree T (t, B) results from T (t) by removing all children of B′ and their subtrees.
Let L be a regular tree language over a ranked alphabet Σ with maximal rank k and
let A = (Q,Σ, δ, F ) be a corresponding deterministic finite tree automaton. Suppose we
are given a set B of tree nodes and a partition Bq1 , . . . , Bq` of this set that associates a
state qi ∈ Q to every node from B. Analogously to part (b), there is an i
√
n-MSO formula
ψqi√n(t, Bq1 , . . . , Bq`) that expresses for trees T (t, B) of size O( i
√
n) that A assigns the
state q to t in T (t, B) if the initial states on the leaf nodes from B are as given by the
partition. This formula existentially quantifies for each state from Q a set of nodes3 and
verifies that A actually assigns these states to the corresponding nodes.
We now present a formula ψqi√
n2
(t, Bq1 , . . . , Bq`) for subtrees T (t, B) of size O( i
√
n2).
This formula existentially quantifies a set B′ of nodes as well as a partition into ` sets
B′q1 , . . . , B
′
q`
with the intention that for each t′ ∈ B′ ∪ {t} the subtrees T (t′, B ∪B′) have
size at most k i
√
n. It then checks that for all t′ ∈ B′ ∪ {t} the formula ψqi√n(t′, Bq1 ∪
B′q1 , . . . , Bq` ∪B′q`) is satisfied.
We iterate this step and obtain a formula ψqn(t) that expresses whether A assigns the
state q to t in T (t), and the final formula for the word problem of L only needs to check
whether ψfn(r) holds for any accepting state f ∈ F .
We will actually not directly apply parts (b) and (c) of Theorem 6.4 later. We
state and prove them here because they show that i
√
n-MSO and MSO have the same
expressive power on words and ranked trees, respectively, for each i ∈ N, which might be
of independent interest. This is not true any more if the quantifiers are further restricted.
Proposition 6.5. Let f(n) be a function from no(1). There is no f(n)-MSO formula ϕ
that expresses Member(Laa), the word problem for the regular language of strings with
an even number of a’s.
Proof. Let f(n) ∈ no(1) be fixed. We assume that there is an f(n)-MSO formula ϕ
for Member(Laa) and derive a contradiction to Håstad’s lower bound on the size of
constant-depth circuits for Parity [Håstad 1986].
3We allow here and in the following to quantify sets of size O( i√n), which is actually realised by
quantifying O(1) sets of size i√n.
119
Chapter 6 Expressibility under size-restricted bulk changes
Let d be the depth of the syntax tree of ϕ. Given some n, from the syntax tree
of ϕ we construct in the natural way a depth-d Boolean circuit Cn for input strings of
length n as follows. Every existential quantifier is replaced by an “or” gate, with fan-in n
for a first-order quantifier and with fan-in upper-bounded by nf(n) for an f(n)-MSO
second-order quantifier. Analogously, universal quantifiers are replaced by “and” gates
with fan-in n or nf(n), respectively. The obtained circuit is a tree of depth d and of degree
at most nf(n), so the number of gates is upper-bounded by (nf(n))d+1 = 2logn(d+1)f(n)
which for f(n) ∈ no(1) is not in 2Ω(n
1
d−1 ), the lower bound for depth-d circuits that
compute a parity function on n input bits [Håstad 1986].
The same way we defined the restriction f(n)-MSO of MSO we can restrict full second-
order logic that allows to quantify over relations instead of plain sets. Let rSO, for r ∈ N,
be the subset of second-order logic that allows quantification over r-ary relations. The
logic f(n)-rSO is the logic that restricts rSO to quantification over r-ary relations with
at most f(n) many tuples, where n is the size of the underlying domain.
Also the restriction of SO to quantification over relations of size i√n is very expressive,
especially if we allow formulas to use a built-in linear order.
Theorem 6.6. Let q be a query that is computable in NL. There is an r ∈ N such that
q is expressible in i
√
n-rSO(≤), for every i ∈ N.
Proof. Let q and i be fixed. As Reach is NL-complete under first-order reductions
[Immerman 1987], there is an r-dimensional first-order reduction Ψ from q to Reach,
for some r ∈ N. Note that Ψ uses a built-in linear order on the domain. Thanks to
Theorem 6.4 there is an i
√
n-MSO formula ϕ that expresses Reach. Combining ϕ with
Ψ yields an i
√
n-rSO(≤) formula that expresses q.
6.2.2 Reachability, regular languages and changes of polylogarithmic
size
With the help of the results of the previous subsection we now construct dynamic programs
that maintain UReach and regular (tree) languages under changes of polylogarithmic
size. In light of Proposition 6.3, these results are optimal with respect to the size of the
change: no DynFO(≤,+,×) program can maintain these queries under changes of larger
size.
Our approach is as follows. We show that FO(≤,+,×) formulas can simulate i√n-SO
formulas on substructures of polylogarithmic size, for some i ∈ N. For the different
dynamic queries we then prove that the query result and all other necessary auxiliary
relations can be updated using i
√
n-SO expressible relations on the set of elements that
are affected by the change.
The following lemma is an adaptation of Proposition 4.11 in combination with the
explanations in the proof of Theorem 4.3 on how to encode subsets of a set of size logn.
Lemma 6.7. Let d and r be arbitrary natural numbers, and let i def= 2d. Let ψ(x¯)
be an i
√
n-rSO(≤) formula over some relational schema σ. There is an FO(≤,+,×)
120
6.2 Utilising second-order logics with restricted quantifiers
formula ϕ(x¯) such that for every σ-structure D with domain D, every subset D′ ⊆ D of
size |D′| ≤ logd |D| and every tuple a¯ of elements from D′ it holds that D[D′] |= ψ(a¯) if
and only if (D, D′) |= ϕ(a¯).
Proof. Let d, i, r and ψ be as in the statement of the lemma. The idea to construct
ϕ is as follows. Given a structure D with universe D of size n, the formula ψ is only
applied to substructures of D of size logd n. So, it only quantifies relations with at most
i
√
logd n =
√
logn many tuples over a set of logd n many elements. We will see that
relations of this form and size can be encoded by O(logn) many bits, which in turn can
be represented by constantly many elements from D. An FO(≤,+,×) formula ϕ can
quantify these elements and decode the represented relation.
Let D′ ⊆ D be a set of size at most logd n. We explain step by step how relations of
small size over D′ can be encoded. We will use the well-known fact that FO(≤,+,×)
formulas can identify a set D′ of polylogarithmic size with the first |D′| elements of the
domain D with respect to ≤, see for example [Durand et al. 2006, Corollary 1]. So, we
can represent any element from D′ by a number between 0 and |D′| − 1, and, using the
FO(≤,+,×)-expressible BIT predicate, by a bit string of length log |D′| = d log logn, for
a set D′ of size logd n. An arbitrary r-tuple over elements from D′ can consequently be
represented by a bit string of length dr log logn, and a set of at most i
√
logd n =
√
logn
of these tuples by a bit string of length dr
√
logn log logn. The latter number is smaller
than dr logn, for all n larger than some number n0, which in turn is the length of a bit
string that can be represented by rd quantified elements in the presence of BIT. Small
structures with less than n0 elements can be dealt with separately.
We construct the FO(≤,+,×) formula ϕ from ψ by first replacing every first-order
existential quantification ∃xψ′(x) by ∃x(D′(x)∧ψ′(x)), and every universal quantification
∀x ψ′′(x) by ∀x (D′(x)→ ψ′′(x)). Then, we replace every existential second-order quan-
tifier ∃X by first-order quantifiers ∃x1 · · · ∃xrd, and symmetrically universal quantifiers
∀X by ∀x1 · · · ∀xdr. Every atom X(y¯) is replaced by an FO(≤,+,×) subformula ϕX(y¯)
that checks whether the bit string of length dr log logn associated with the interpretation
of y¯ is contained in the bit string represented by x1, . . . , xdr. We make this more precise.
Let (a1, . . . , ar) ∈ D′r be the interpretation of y¯, and s1, . . . , sr the bit strings of length
d log logn associated with these elements as explained above. The formula ϕX checks
whether there is a j ≤ √logn such that for each r′ ≤ r, each d′ ≤ d and each p ≤ log logn
the bit at position (d′− 1) log logn+p of sr′ equals the bit at position (j− 1) log logn+p
in the bit string represented by x(d′−1)r+r′ .
Lemma 6.7 in conjunction with Theorem 6.6 enables us to express in FO(≤,+,×) any
NL-computable query on structures of polylogarithmic size.
Corollary 6.8. Let k and d be arbitrary natural numbers, and let q be a k-ary, NL-
computable query on σ-structures. There is an FO(≤,+,×) formula ϕ over schema
σ ∪ {D′} such that for any σ-structure D with n elements, any subset D′ of its domain
of size at most logd n and any k-tuple a¯ ∈ D′k it holds that a¯ ∈ q(D[D′]) if and only if
(D, D′) |= ϕ(a¯).
121
Chapter 6 Expressibility under size-restricted bulk changes
In the following, we simulate different NL computations on the part of the input
structure that is directly affected by a change of polylogarithmic size. As a first example,
we show that reachability in directed graphs can be maintained under insertions of
polylogarithmically many edges.
Theorem 6.9. The dynamic query (Reach, {insE [logd n]}) is in DynFO(≤,+,×), for
every d ∈ N.
Proof. Let d ∈ N be fixed. We extend the dynamic program from Example 2.8 to
insertions of logd n many edges. So, the dynamic program we construct only maintains
the query relation Ans, the transitive closure of the input graph. Whenever a set E+ of
edges is inserted into a graph G = (V,E), the dynamic program updates Ans with the
help of the transitive closure relation of some graph H that is defined on the set Vaff of
affected nodes, that is, the set of incident nodes of the edges in E+. The edge set EH of
H contains the newly inserted edges E+, and additionally edges (u, v) for all pairs (u, v)
of nodes from Vaff that are connected by a path in G. The transitive closure of H equals
the transitive closure relation of G′ def= (V,E ∪ E+) restricted to Vaff: it accounts for all
paths from a node u ∈ Vaff to another node v ∈ Vaff that may use both newly inserted
edges as well as edges that are already present in G. The transitive closure relation of G′
can then easily be expressed as follows. Every path ρ in G′ consists of three consecutive
subpaths ρ1ρ2ρ3 = ρ, where ρ1 and ρ3 are defined as the maximal subpaths of ρ that
do not rely on edges from E+. These subpaths already exist in G and are represented
in Ans. The subgraph ρ2 by definition starts and ends at nodes from Vaff, so its existence
is given by the transitive closure relation of H.
We now give a more formal construction. Let G = (V,E) be a graph with |V | = n and
Ans the transitive closure of E, and let E+ be a set of at most logd n edges. Let H be
the graph with node set VH
def= {u, v | (u, v) ∈ E+} and edge set EH def= E+ ∪ {(u, v) ∈
V 2H | (u, v) ∈ Ans}. The graph H is clearly FO-definable from G,E+ and Ans, and by
Corollary 6.8 its transitive closure relation TCH can be expressed in FO(≤,+,×), as VH is
of polylogarithmic size with respect to n. The transitive closure of G′ def= (V,E ∪E+) can
then be defined by the formula ϕ(s, t) def= Ans(s, t)∨∃x1∃x2
(
Ans(s, x1)∧TCH(x1, x2)∧
Ans(x2, t)
)
.
So far we do not know how to maintain directed reachability under polylogarithmi-
cally many deletions in DynFO(≤,+,×). There are preliminary results in this direction,
showing that Reach can be maintained under fewer, but non-constantly many dele-
tions in DynFO(≤,+,×), or under polylogarithmically many deletions in extensions of
DynFO(≤,+,×), which we will discuss in more detail in the following Section 6.3.
When it comes to undirected reachability, we can actually extend the dynamic program
from Dong and Su [1998] for UReach to edge insertions and deletions of polylogarithmic
size.
Theorem 6.10. The dynamic query (UReach,∆E [logd n]) is in DynFO(≤,+,×), for
every d ∈ N.
122
6.2 Utilising second-order logics with restricted quantifiers
Proof. The dynamic program from [Dong and Su 1998] that maintains undirected reach-
ability in DynFO under single-edge changes uses, in addition to the transitive closure
relation of the input graph, two binary auxiliary relations that represent a directed
spanning forest of the input graph and its transitive closure, respectively. We show that
these relations can still be maintained in DynFO(≤,+,×) under changes of logd n many
edges, for some fixed d.
For edge insertions, the construction idea is very similar to the proof of Theorem 6.9.
We define a graph H, where nodes correspond to connected components of the input
graph that include an affected node, and edges indicate that some inserted edge connects
the respective connected components. As this graph is of polylogarithmic size, thanks
to Corollary 6.8 we can express a spanning forest for H and its transitive closure in
FO(≤,+,×), which is sufficient to update the respective relations for the whole input
graph.
In the case of edge deletions, the update formulas need to replace deleted spanning tree
edges, whenever this is possible. Our approach is very similar to the case of edge insertions.
The spanning tree decomposes into polylogarithmically many connected components
when edges are deleted. These components can be merged again if non-tree edges exist
that connect them, and these edges become tree edges of the spanning forest. For a
correspondingly defined graph of polylogarithmic size we can again define a spanning
forest and its transitive closure, and from this information select the new tree edges.
We explain both cases in more detail. Let G = (V,E) be the undirected input graph of
size n with transitive closure Ans, and let S and TCS be a directed spanning forest for
G and its transitive closure, respectively. Suppose that a set E+ of size at most logd n is
inserted. We define a graph H as follows. It contains a node v ∈ V if (1) v is affected,
that is, if E+ contains an edge of v, and (2) v is the smallest affected node in its connected
component of G with respect to ≤. It contains an edge (u, v) if (u′, v′) ∈ E+ for some
nodes u′, v′ with (u, u′) ∈ Ans and (v, v′) ∈ Ans, so, if the connected components of u
and v are connected by an inserted edge. The graph H is easily seen to be FO-definable
using Ans. Because H is of polylogarithmic size with respect to n and a spanning forest
of a graph can be computed in NL (for example the breadth-first spanning forest with the
minimal nodes of each component, with respect to ≤, as roots), we can define a spanning
tree SH and its transitive closure TCSH in FO(≤,+,×), thanks to Corollary 6.8.
The update formulas define updated auxiliary relations for the graph G′ = (V,E ∪E+)
as follows. Intuitively, an edge (u, v) ∈ SH means that the connected components of u
and v in G shall be connected in G′ directly by a new tree edge. There might be several
edges in E+ that may serve this purpose, and we need to choose one of them. So, an edge
(u′, v′) ∈ E+ becomes part the updated spanning forest if there is an edge (u, v) ∈ SH such
that u′ and u as well as v′ and v are in the same connected component of G, respectively,
and (u′, v′) is the lexicographically minimal edge with these properties. This is clearly
FO(≤,+,×)-expressible using the old auxiliary relations. The old tree edges from S are
taken over to the updated version, although some directions need to be inverted, see
[Dong and Su 1998] and Theorem 5.4 for how this can be expressed in first-order logic.
The update formulas for TCS from [Dong and Su 1998] can easily be extended to our
more general case, using TCSH . We note that Ans is first-order expressible from TCS .
123
Chapter 6 Expressibility under size-restricted bulk changes
In conclusion, all auxiliary relations can be updated in FO(≤,+,×).
Now suppose that a set E− of at most logd n edges is deleted. Let S′ be the spanning
forest that results from S after all tree edges from E− are removed, and let TCS′ be
its transitive closure, which is easily FO-expressible from TCS . Similarly as above we
define a graph H, with nodes being the minimal affected nodes in a weakly connected
component of S′, which are connected by an edge if the respective weakly connected
components of S′ are connected by some edge from E \ E−. The same way as above,
FO(≤,+,×) formulas can define a spanning forest and its transitive closure for H and
then use this information to define a spanning forest and its transitive closure for the
changed graph G′ = (V,E \ E−).
We have shown in Theorem 6.4 that regular languages on words and trees can be
expressed in i
√
n-MSO, for any i ∈ N. Now we use this result to show that these languages
can be maintained under changes of the labels of polylogarithmically many positions,
very similarly to the proof of Theorem 6.9. We refer to Section 5.4 and to [Gelade et al.
2012] for technical details of the setting.
Theorem 6.11. The following dynamic queries are in DynFO(≤,+,×), for every d ∈ N:
(a) (Member(L),∆Σ[logd n]), for any regular language L over some alphabet Σ,
(b) (Member(L),∆Σ[logd n]), for any regular tree language L over some ranked alpha-
bet Σ.
Proof.
(a) Let L be a regular language over an alphabet Σ, and let A = (Q,Σ, δ, q0, F ) be a
corresponding DFA. To maintain Member(L) under changes of single positions, Gelade
et al. [2012] use auxiliary relations of the form Sp,q(i, j), where p, q ∈ Q are states of A
and (i, j) ∈ Sp,q holds if and only if δ∗(p, wi+1 · · ·wj−1) = q. Here, w = w1 · · ·wn is the
input word and δ∗ is the extension of the transition function δ from symbols σ ∈ Σ to
words from Σ∗.
We show that these auxiliary relations can be maintained in DynFO(≤,+,×) under
changes of polylogarithmic size. In a nutshell, we show that using the auxiliary relations,
the behaviour of A on a string with polylogarithmically many changed positions can
be characterised by another DFA that only reads a string of polylogarithmic length.
Whether this automaton accepts can be determined in FO(≤,+,×) by Corollary 6.8; we
show that is enough to update all auxiliary relations.
We elaborate further on this approach. Suppose the labels of the set P = {i1, . . . , im}
of at most logd n positions are changed, resulting in an input word w′ = w′1 · · ·w′n.
We show how the auxiliary relations can be updated for the substring w′i1 · · ·w′im that
contains all changed positions. More precisely, we construct for every pair p, q of states
of A an FO(≤,+,×) formula ϕp,q that checks whether δ∗(p, w′i1 · · ·w′im) = q. From these
formulas we easily obtain another FO(≤,+,×) formula that updates the query relation:
as the prefix of w′ up to position i1 − 1 and the suffix from position im + 1 onwards are
unchanged and therefore the old auxiliary relations are still valid, it suffices to check
that for some pair p, q of states δ∗(q0, w′1 · · ·w′i1−1) = p and δ∗(q, w′im+1 · · ·w′n) ∈ F hold,
124
6.2 Utilising second-order logics with restricted quantifiers
which is easy given the auxiliary relations, and that ϕp,q is satisfied. The other auxiliary
relations can be updated analogously, and we do not spell out further details.
It remains to explain how the formulas ϕp,q are constructed. Using the existing
auxiliary relations, we can determine the behaviour of A on the substring w′ij · · ·w′ij+1−1
of w′i1 · · ·w′im that for each j < m starts with a changed position ij and ends just before
the next changed position, and that only consists of the symbol at position im for j = m.
More formally, an FO formula can associate with every j < m a function γj : Q→ Q such
that γj(p′) = q′ if and only if δ∗(p′, w′ij · · ·w′ij+1−1) = q′ holds, for any p′, q′ ∈ Q, so, if A
ends in state q′ if it starts in state p′, reads the new symbol at position ij , and then reads
the unchanged subword w′ij+1 · · ·w′ij+1−1 = wij+1 · · ·wij+1−1. The formula can associate
with the last position im the function γm(p′)
def= δ(p′, w′im).
It now holds that δ∗(p, w′i1 · · ·w′im) = q if and only if (γm ◦ · · · ◦ γ1)(p) = q. The set
Lp,q of sequences γ′1 · · · γ′` such that (γ′` ◦ · · · ◦ γ′1)(p) = q is clearly a regular language
over the alphabet Γ of all functions from Q→ Q. The formula ϕp,q thus only needs to
FO-define the word γ1 · · · γm and check that it is included in Lp,q, which is possible in
FO(≤,+,×) by Corollary 6.8.
(b) The proof is very similar to part (a). Let L be a regular tree language over a ranked
alphabet Σ. We assume that the symbols in Σ have rank 0 or 2, so, that L is a regular
language of binary trees. Let A = (Q,Σ, δ, F ) be a finite deterministic bottom-up tree
automaton that decides L. We denote the labelled rooted input tree by T .
To maintain Member(L) when the label of a single node changes, Gelade et al. [2012]
mainly use binary auxiliary relations that are again of the form Sp,q(u, v), for states
p, q ∈ Q. Intuitively, (u, v) ∈ Sp,q means that A assigns the state p to the node u, if the
state for node v is forced to be q — no matter which state A actually assigns to v based
on its subtree and the transition function δ.
We show that these auxiliary relations can be updated after changes of the labels
of polylogarithmically many nodes. For this we assume the existence of the descendant
relation on the tree nodes, which can be made available as an auxiliary relation. Our
strategy is analogous to part (a): we FO-define a labelled binary tree with polylogarith-
mically many nodes such that from the information whether this tree is member of some
regular tree language we can infer whether the changed input tree is in L, and how the
other auxiliary relations need to be updated. Thanks to Corollary 6.8, this approach can
be implemented by FO(≤,+,×) update formulas.
We give some details. Let P be the set of at most logd n nodes whose labels are
modified by a change, resulting in an input tree T ′. Without loss of generality, we assume
that for two nodes v, v′ ∈ P also their lowest common ancestor is in P , and that if for
a node v ∈ P a descendant from its left or right subtree is contained in P , also some
descendant from its other subtree is in P . So, with the help of the descendant relation
on T , we can define in FO a binary tree TP with node set P and edges from a node to its
“nearest descendants” in T .
We assign labels to the nodes of TP according to the behaviour of A on the subtree of
T ′ that is rooted in the respective node. If v ∈ P is a leaf of TP , then its label is just
the state q ∈ Q that A assigns to this node in T ′. As the only difference in the subtree
125
Chapter 6 Expressibility under size-restricted bulk changes
rooted at v between T and T ′ is the label of v itself, this state can easily be expressed
using the old auxiliary relations and the transition function of A. If v ∈ P is an inner
node of TP , its label is a function γ : Q×Q→ Q. Intuitively, this labels says that if A
assigns in T ′ some states p, q to the nodes u1, u2 that are the children of v in TP , then it
assigns the state γ(p, q) to v in T ′. This label can be expressed in first-order logic from
the old auxiliary relations as follows. Let v1, v2 be the children of v in T such that u1 is
a descendant of v1 and u2 is a descendant of v2. If the state p is assigned to u1 in T ′,
then the state p′ is assigned to v1 that satisfies (u1, v1) ∈ Sp,p′ . Symmetrically one can
determine the state q′ that is assigned to v2 in T ′. The state that is assigned to v can
then be read from the transition function of A.
Another bottom-up tree automaton can “evaluate” TP in the natural way. By
Corollary 6.8, an FO(≤,+,×) update formula can determine the state that this automaton
assigns to every inner node, and in particular the state q′ ∈ Q that it assigns to the root
of Tp. From this state, the update formula can determine the state q that A assigns to
the root r of T ′, and therefore can check whether T ′ ∈ L. The auxiliary relations of the
form Sp,q can be updated similarly.
6.3 The linear algebra approach for bulk changes
The results of the previous section are obtained by lifting the known dynamic programs
from [Dong and Su 1998] and [Gelade et al. 2012] towards changes of polylogarithmic size,
using basically the same auxiliary relations. The important idea that makes these results
possible is that restricted second-order logics can express the respective queries, and
even all NL-computable queries, and that these logics can be simulated in FO(≤,+,×)
on substructures of polylogarithmic size.
This way we obtained maintainability results for undirected reachability and the
word problem for regular languages and regular tree languages that are optimal for
DynFO(≤,+,×) with respect to the size of the change. For the directed reachability
query Reach, we could only give a dynamic program for insertions of polylogarithmic size.
This dynamic program, as constructed in the proof of Theorem 6.9, does not even support
single-edge deletions. Naturally, we wonder whether Reach can also be maintained in
DynFO(≤,+,×) under insertions and deletions of polylogarithmically many edges.
The approach of Section 6.2 does not seem to be applicable to the dynamic program
from [Datta et al. 2018a] that maintains Reach under single-edge changes. In [Datta
et al. 2018a], the problem to determine whether there is a directed path from some
node s to some node t in a graph G is reduced to the question whether a certain matrix
has full rank. In [Datta et al. 2018b], a different approach is used to show that Reach
can be maintained in DynFO(≤,+,×) under insertions and deletions of non-constant
size, namely of size lognlog logn , where n is the number of nodes. The authors show that
using the (appropriately extended) Muddling technique, the approach for maintaining
Reach under single-edge changes from [Datta et al. 2018a] can be reformulated as a
reduction to maintaining the inverse of a matrix. They then show that the inverse of
a matrix of dimension lognlog logn × lognlog logn over an appropriate field can be determined
126
6.3 The linear algebra approach for bulk changes
in FO(≤,+,×). The Sherman-Morrison-Woodbury formula (cf. [Henderson and Searle
1981], see Equation 6.1) states that the inverse of an n× n matrix can be updated after
changes of k entries on the basis of the inverse of a k×k matrix. It then can be concluded
that (Reach,∆E [ lognlog logn ]) is (AC
2, log2 n)-maintainable.
We propose the following variant of the Muddling Theorem 3.3 for bulk changes.
Theorem 6.12 (Muddling Theorem for bulk changes). Let f, g ∈ logO(1) n be first-order
constructible functions. Let q be a NL-computable, almost domain independent query, and
∆ a set of single-tuple change operations. If (q,∆[g(n)]) is (AC[f(n)], f(n))-maintainable,
then (q,∆[g(n)]) ∈ DynFO(≤,+,×).
Proof sketch. The proof follows the proof of Theorem 3.3. The only difference is how
the query result is given during the first f(n) change steps. Up to step f(n), the active
domain of the input structure contains O(f(n)g(n)) elements, which is bounded by logd n
for some d ∈ N that only depends on f, g and ∆. So, by Corollary 6.8 an FO(≤,+,×)
formula can express the query result.
Theorem 6.12 together with the insight that (Reach,∆E [ lognlog logn ]) is (AC
2, log2 n)-
maintainable proves the main result of [Datta et al. 2018b].
Theorem 6.13 ([Datta et al. 2018b]). The dynamic query (Reach,∆E [ lognlog logn ]) is in
DynFO(≤,+,×).
Actually, the result of [Datta et al. 2018b] is stronger, as it allows not only insertions
and deletions of lognlog logn edges, but insertions and deletions that affect
logn
log logn nodes.
This result can in fact be extended to bulk changes operations that can modify more
than lognlog logn edges. Let us for example consider a change operation clear-node(p) that
deletes all incident edges of a given node.
Theorem 6.14. The dynamic query (Reach, {insE ,delE ,clear-node}[ lognlog logn ]) is in
DynFO(≤,+,×).
Proof. In [Datta et al. 2018b] it is shown that Reach is (AC2, log2 n)-maintainable under
edge changes that affect O( lognlog logn) nodes. So, let A be an AC2-algorithm with depth
` log2 n for some constant ` and P a dynamic program that maintains Reach for log2 n
of these changes starting from auxiliary relations initialised by A.
We show that from A and P we can construct an AC2-algorithm A′ and a dynamic
program P ′ that witness that Reach is still (AC2, log2 n)-maintainable when additionally
clear-node changes for lognlog logn nodes are allowed. It then follows from Theorem 6.12
that Reach can be maintained in DynFO(≤,+,×) under these changes.
Let G = (V,E) be the input graph, with |V | = n. The main idea is to maintain the
reachability information in a slightly modified graph G′ where each clear-node change
can be simulated by a constant number of edge insertions and deletions. The graph G′ is
defined as follows: for every node v of G, G′ contains the nodes vin1 , . . . , vinlog2 n that will
receive the incoming edges of v, and the nodes vout1 , . . . , voutlog2 n that will hold the outgoing
edges from v. At each time, for every node v only one pair vini and vouti is actually used,
127
Chapter 6 Expressibility under size-restricted bulk changes
in the sense that they are connected by an edge to v. We call these nodes the active
gateways for v. Initially, the nodes vin1 and vout1 are the active gateways of v, and the
edges (vin1 , v) and (v, vout1 ) are present in G′, for every v ∈ V . Every edge (u,w) in G is
represented by the edge (uout1 , win1 ) in G′.
The initialisation A′ just computes G′ and applies A to this graph. The graph G′
has n′ def= n log2 n many nodes, so the depth of A′ is ` log2 n′ = ` log2(n log2 n) which is
bounded by 2` log2 n for sufficiently large n. So, A′ is indeed an AC2-algorithm.
It remains to explain P ′. For edge insertions and deletions, P ′ just simulates P in
the obvious way for the corresponding changes on G′, using the unique active gateways
of the nodes that are affected by the change. For clear-node changes, we observe
that every change clear-node(v) for a node v in G can be simulated by a constant
number of edge deletions and insertions in G′: if vini and vouti are the active gateways
of v, it suffices to delete the edges between v, vini and vouti and to insert edges (vini+1, v)
and (v, vouti+1). Because at most log2 n changes clear-node(v) can occur for each node v
during a change sequence of length log2 n, enough “fresh” gateways are available. So,
whenever clear-node changes for lognlog logn nodes occurs, P ′ only needs to simulate P for
the corresponding O( lognlog logn) edge insertions and deletions. In all cases, P ′ also applies
the changes to G′.
It is clear from the construction that at all times there is a path from some node u
to some node v in G if and only if such a path also exists in G′. Because P maintains
Reach in G′ for change sequences of length log2 n, so does P ′ in G.
In the same way, also change operations that activate and deactivate nodes, as used by
Kähler and Wilke [2003], can be supported.
In the remainder of this section we apply the ideas from the proof of Theorem 6.13 to
the Distance query, which asks, given a directed graph G, for the length of the shortest
path between two nodes s and t. More formally, the query result Distance(G) is a
ternary relation with (s, t, `) ∈ Distance(G) if and only if (s, t) ∈ Reach(G) and the
shortest path from s to t consists of ` edges.4
It is not even known whether Distance can be maintained in DynFO under single-edge
changes. Hesse [2003a] shows that (Reach,∆E) is in DynFO+Maj(≤,+,×), the extension
of DynFO(≤,+,×) that is defined via FO+Maj(≤,+,×) update formulas. These formulas
may use majority quantifiers Maj(x)ϕ(x), and such a formula is satisfied in a structure D
with domain D, if D |= ϕ(a) holds for more than half of the elements a ∈ D. It is
well-known that FO+Maj(≤,+,×) corresponds to the circuit complexity class uniform
TC0 [Barrington et al. 1990], which is defined analogously to AC0, but additionally allows
for majority gates.
The dynamic program from [Hesse 2003a] actually witnesses that (Distance,∆E) ∈
DynFO+Maj(≤,+,×). In the following subsections we generalise this result in two
directions. First we show that Distance can still be maintained in DynFO+Maj(≤,+,×)
under edge changes of polylogarithmic size. Then we show, towards a DynFO maintenance
4We suppose that the node set V of G is linearly ordered, so we can identify nodes with numbers from
{0, . . . , |V | − 1}.
128
6.3 The linear algebra approach for bulk changes
result for the distance query, that DynFO(≤,+,×) can maintain auxiliary relations under
edge changes of size lognlog logn such that a FO+Maj(≤,+,×) formula can extract the query
result for the distance query. We start be introducing the necessary notions from linear
algebra.
6.3.1 Basic concepts from linear algebra
In the following subsections we use matrix operations and standard results from linear
algebra. An n× n matrix M of natural numbers up to n is formally represented by a
ternary relation M with (i, j,m) ∈M if the entry in the i-th row and the j-column of M
is m. We denote this entry by [M ]ij . Analogously, matrices of dimension nk × nk with
polynomial entries up to nk, for some fixed k ∈ N, are represented by 3k-ary relations.
Numbers m up to size 2nk − 1 can be represented by a k-ary relation that includes a
tuple a¯ if the a¯-th bit in the binary representation of m is 1; here we view a¯ as a number
between 1 and nk. So, we can represent matrices with entries of exponential size in n.
Matrices with entries from sets other than N can be represented in a similar fashion.
Let Z denote the integers. By Z[[x]] we denote the ring of formal power series over Z,
that is, the ring with elements ∑∞i cixi, with ci ∈ Z for each i, and the natural addition
and multiplication. An element ∑∞i cixi ∈ Z[[x]] is normalised if c0 = 1. All normalised
elements of Z[[x]] have an inverse, see [Godsil 1993, Lemma 3.3.1]. The ring Z[[X]] can be
embedded into the field of fractions Z((x)) def= { g(x)h(x) | g(x), h(x) ∈ Z[[x]] and h(x) 6= 0}.
We denote the subring of Z[[x]] consisting of all finite polynomials by Z[x], and the field
of fractions of elements from Z[x] by Z(x).
A matrix A ∈ Z[[x]]n×n is invertible over Z[[x]] if there is a matrix B ∈ Z[[x]]n×n with
AB = I, where I is the identity matrix with 1 on its diagonal and 0 anywhere else. Given
an invertible matrix A, the inverse A−1 of A can be computed by [A−1]ij = (−1)i+j detAjidetA ,
where the matrix Cji results from a matrix C by removing the j-th row and the i-th
column, and detC denotes the determinant of C. Therefore, a matrix A ∈ Z[[x]]n×n is
invertible over Z((x)) if its determinant is non-zero, and it is invertible over Z[[x]] if the
constant term of det(A) is 1. If A is of the form I + xC for some matrix C ∈ Z[[x]]n×n
then A is invertible over Z[[x]] as det(I + xC) is normalised.
For a polynomial g(x) ∈ Z[x] we abbreviate its degree by deg g(x) and write ‖g(x)‖
for the value of its largest coefficient. The degree of a representation g(x)h(x of an element
of Z(x) is the maximum of the degrees of g(x) and h(x), and similarly for the largest
coefficient. Degree and maximal coefficient are defined similarly for matrices over Z[x]
and Z(x).
6.3.2 Distances and update formulas that use majority
The goal of this subsection is to extend a result from [Hesse 2003a] to changes of
polylogarithmically many edges, leading to the following theorem.
Theorem 6.15. The dynamic query (Distance,∆E [logd n]) is in DynFO+Maj(≤,+,×),
for every d ∈ N.
129
Chapter 6 Expressibility under size-restricted bulk changes
We follow the approach of Hesse [2003a] and use formal power series for counting the
number of paths of each length. Fix a graph G with adjacency matrix Ad(G) ∈ Zn×n
and a formal variable x. Then D def= ∑∞i=0(xAd(G))i is a matrix of formal power series
from Z[[x]] such that if [D]st =
∑∞
i=0 cix
i then ci is the number of paths from s to t of
length i. In particular, the distance between s and t is the smallest i such that ci is
non-zero. Note that if such an i exists, then i < n.
The matrix D is invertible over Z[[x]] and can be written as (I−xAd(G))−1 (cf. [Godsil
1993, Example 3.6.1]). The maintenance of distances thus reduces to maintaining for a
matrix A ∈ Z[[x]] the smallest i < n such that the i-th coefficient of [A−1]st is non-zero,
for each pair s, t ≤ n. In particular, Theorem 6.15 is a consequence of the following
theorem.
Theorem 6.16. Suppose A ∈ Z[[x]]n×n stays invertible over Z[[x]]. For all s, t ∈ [n] one
can maintain the smallest i < n such that the i-th coefficient of the st-entry of A−1 is
non-zero in DynFO+Maj(≤,+,×) under changes of logd n entries, for each fixed d ∈ N.
We gradually explain the idea. When the matrix A is changed to A+ ∆A, where ∆A
has logd n non-zero entries, then one can decompose the change matrix ∆A into a matrix
product UBV where the matrices U,B, V have dimension n× logd n, logd n× logd n and
logd n× n, respectively, by the following lemma.
Lemma 6.17 ([Datta et al. 2018b, Lemma 7]). Fix a ring R. Suppose M ∈ Rn×n with
non-zero rows ri1 , . . . , rik and columns cj1 , . . . , cjk . Then M = UBV with U ∈ Rn×k, B ∈
Rk×k, and V ∈ Rk×n where
1. B is obtained from M by removing all-zero rows and columns.
2. U =
u¯1...
u¯n
 where u¯i =
{
0¯T if i /∈ {i1, . . . , ik}
e¯Tm if i = im
3. V =
(
v¯1, . . . , vn
)
where v¯j =
{
0¯ if j /∈ {j1, . . . , jk}
e¯m if j = jm
Here, e¯m denotes the m-th unit vector.
The following equation, called here the Sherman-Morrison-Woodbury identity, describes
how the inverse of a matrix can be updated after a change that is characterised by a
matrix product UBV [Henderson and Searle 1981, Equation 23]:
(A+ UBV )−1 = A−1 −A−1U(I +BV A−1U)−1BV A−1 (6.1)
Note that if the involved matrices are from Z[[x]] then the inverse in general is from
the field Z((x)) of fractions of formal power series. For the proof of Theorem 6.16 the
resulting inverses are in Z[[x]] by assumption.
In the remainder of this subsection we explain how the update described by Equation 6.1
can be “implemented” in FO+Maj(≤,+,×).
Of course, computing with inherently infinite formal power series is not possible in
DynFO+Maj(≤,+,×). However, as stated in Theorem 6.16, in the end we are only
130
6.3 The linear algebra approach for bulk changes
interested in the first i < n coefficients of a power series. We therefore show that it
suffices to truncate all occurring power series at the n-th term and use FO+Maj(≤,+,×)’s
ability to define iterated sums and products of polynomials [Hesse et al. 2002].
Formally, we have to show that no precision for the first i < n coefficients is lost when
computing with truncated power series. This motivates the following definition. A formal
power series g(x) = ∑i cixi ∈ Z[[x]] is an m-approximation of a formal power series
h(x) = ∑i dixi ∈ Z[[x]], denoted by g(x) ≈m h(x), if ci = di for all i ≤ m. This notion
naturally extends to matrices over Z[[x]]: a matrix A ∈ Z[[x]]`×k is an m-approximation
of a matrix B ∈ Z[[x]]`×k if each entry of A is an m-approximation of the corresponding
entry of B. The notion of m-approximation is preserved under all arithmetic operations
that will be relevant.
Lemma 6.18. Fix an m ∈ N.
(1) Suppose g(x), g′(x), h(x), h′(x) ∈ Z[[x]] with g(x) ≈m g′(x) and h(x) ≈m h′(x).
Then
(i) g(x) + h(x) ≈m g′(x) + h′(x),
(ii) g(x)h(x) ≈m g′(x)h′(x), and
(iii) 1g(x) ≈m 1g′(x) whenever g(x) and g′(x) are normalised.
(2) Suppose A,A′, B,B′ ∈ Z[[x]]n×n with A ≈m A′ and B ≈m B′. Then
(i) A+B ≈m A′ +B′,
(ii) AB ≈m A′B′,
(iii) If A is invertible over Z[[x]] then so is A′, and A−1 ≈m A′−1.
Remember that a formal power series is normalised if its constant term is 1.
Proof. The first two parts of (1) are straightforward. For the last part, first observe
that 1g(x) and
1
g′(x) actually exist, as g(x) and h(x) are normalised. Suppose that
1
g(x) =
∑
i dix
i and 1g′(x) =
∑
i d
′
ix
i. We write g(x) and g′(x) as g(x) = ∑mi=0 cixi + r(x)
and g′(x) = ∑mi=0 cixi + r′(x) where the formal power series r(x) and r′(x) are divisible
by xm+1 (and therefore the coefficient of xm+1 is the first that is allowed to be non-zero).
Because g(x) 1g(x) = 1, we have that c0d0 = 1 and
∑j
i=0 cidj−i = 0 for all j ∈ [m].
Similarly, c0d′0 = 1 and
∑j
i=0 cid
′
j−i = 0 hold. Solving both systems of equations yields
di = d′i for i ∈ [m]0.
The first two parts of (2) follow immediately from (1). For the third part, recall that A
is invertible over Z[[x]] if and only if det(A) is non-zero and normalised. This translates
to the matrix A′, because A ≈m A′. Furthermore
[A−1]st = (−1)s+tdet(Ats)det(A) ≈m (−1)
s+tdet(A′ts)
det(A′) = [A
′−1]st
An approximation of the inverse of a matrix A ∈ Z[[x]]n×n can be updated using
Equation 6.1.
131
Chapter 6 Expressibility under size-restricted bulk changes
Proposition 6.19. Suppose A ∈ Z[[x]]n×n is invertible over Z[[x]], and C ∈ Z[[x]]n×n
is an m-approximation of A−1. If A+ ∆A is invertible over Z[[x]] and ∆A = UBV with
U ∈ Z[[x]]n×k, B ∈ Z[[x]]k×k and V ∈ Z[[x]]k×n for some k ≤ n, then
(A+ ∆A)−1 ≈m C − CU(I +BV CU)−1BV C
Proof. This follows immediately from Equation 6.1 and Lemma 6.18.
The application of Equation 6.1 for Theorem 6.16 involves inverting logd n × logd n
matrices, which reduces to computing the determinant of such matrices. We show that
this is possible in FO+Maj(≤,+,×) for matrices of polynomials.
Lemma 6.20. Fix a domain of size n and let k = logd n, for some fixed d ∈ N. The
determinant of a matrix A ∈ Z[x]k×k, with entries of polynomial degree in n, can be
defined in FO+Maj(≤,+,×).
Proof. We show that the value can be computed in uniform TC0, which is as powerful as
FO+Maj(≤,+,×) [Barrington et al. 1990].
Computing the determinant of an k× k matrix is equivalent to computing the iterated
matrix product of k matrices [Cook 1985]. The reduction produces k matrices of dimension
at most (k + 1)× (k + 1) and can be computed in uniform TC0, as can be seen implicitly
in [Mahajan and Vinay 1999, p. 482]. Thus, the lemma statement follows from the fact
that iterated products of matrices A1, . . . , Ak ∈ Z[x]k×k with k = logd n can be computed
in uniform TC0, which can be proven in the spirit of [Agrawal and Vinay 2008, p. 69].
For the sake of completeness we outline the proof. We first explain how
√
logn such
matrices can be multiplied. Each entry in such a product is the sum of (logd n)
√
logn
many products of
√
logn polynomials. Such products can be computed in uniform TC0
due to [Hesse et al. 2002, Corollary 6.5]. The sum can be computed in TC0 as it is over
at most polynomially many terms:
(logd n)
√
logn = 2log((logd n)
√
logn) = 2d
√
logn log logn ∈ 2O(
√
logn
√
logn) = nO(1)
The idea for computing the product of A1, . . . , Ak with k = logd n is to partition the
sequence into fragments of length
√
logn each. The product Bi of the matrices of the
i-th fragment can be computed by the procedure from above, and there are logd− 12 n
such fragments. This procedure can now be repeated recursively, that is, the sequence
Bi is partitioned into fragments of length
√
logn, and so on. After 2d repetitions, the
final product is obtained.
Finally we turn to the proof of Theorem 6.16.
Proof (of Theorem 6.16). The dynamic program maintains an n-approximation C ∈
Z[x]n×n of A−1 that truncates A−1 at degree n. When A is updated to A+ ∆A then:
1. ∆A is decomposed into suitable U ∈ Z[x]n×logd n, B ∈ Z[x]logd n×logd n, and V ∈
Z[x]logd n×n;
2. C is updated via C ′ def= C − CU(I +BV CU)−1BV C;
132
6.3 The linear algebra approach for bulk changes
3. All entries of C ′ are truncated at degree n.
The steps can be defined in FO+Maj(≤,+,×) due to Lemma 6.17, Lemma 6.20,
and the fact that iterated addition and multiplication of polynomials can be defined
in FO+Maj(≤,+,×), see [Hesse et al. 2002]. The maintained matrix C is indeed an
n-approximation of A−1 due to Proposition 6.19.
6.3.3 Towards distances in DynFO
It is still open whether Distance can be maintained in DynFO under single-edge changes,
let alone under changes of non-constant size. In this subsection we combine ideas
from the proofs of Theorem 6.13 and Theorem 6.15 to show that, even under changes
of non-constant size, DynFO(≤,+,×) can maintain auxiliary relations from which a
FO+Maj(≤,+,×) formula can extract the result for the Distance query.
Theorem 6.21. The query result for Distance can be defined by a FO+Maj(≤,+,×)
query from auxiliary relations that can be maintained in DynFO(≤,+,×) under changes
of lognlog logn edges, where n is the number of nodes of the graph.
The general proof approach is again via maintenance of matrix inverses, just as for
the proof of Theorem 6.15. We establish the following theorem, which is a variation
of Theorem 6.16. In particular, the described auxiliary relations can be maintained for
log2 n many change steps after an AC2-computable initialisation, so Theorem 6.21 follows
by the reduction explained in the previous subsection and Theorem 6.12.
Theorem 6.22. Suppose A ∈ Z[[x]]n×n stays of the form I−xA′ for some A′ ∈ {0, 1}n×n
and is thus invertible over Z[[x]]. After an AC2-computable initialisation to an arbitrary
initial matrix of the stated form, one can maintain in DynFO(≤,+,×) auxiliary relations
under changes of lognlog logn entries, from which for all s, t ∈ [n] the smallest i < n such that
the i-th coefficient of the st-entry of A−1 is non-zero can be defined in FO+Maj(≤,+,×).
The proof strategy is the same as for Theorem 6.16. However, here it does not suffice
to just truncate formal power series and maintain approximated polynomials, as the
involved polynomials have polynomial degree and coefficients with exponential values
in n. Consequently, the update formulas in the proof of Theorem 6.16 need to multiply
O(n)-bit numbers and express the sum of polynomially many of these numbers, which
both are possible in FO+Maj(≤,+,×) but not in FO(≤,+,×).
Therefore our goal is to maintain implicit representations of n-approximations C(x) of
the entries from A−1 from which the smallest non-zero term of each of the entries can be
extracted. The idea is to store and update the evaluation C(a) for several numbers a ∈ N.
If the entries of C(x) have small degree, then the smallest non-zero term can be extracted
from this via Cauchy interpolation, which is possible in FO+Maj(≤,+,×) [Hesse et al.
2002]. However, when only storing C(x) implicitly via C(a) it is not possible to truncate
the polynomials after each step, as in the proof of Theorem 6.16. Furthermore, the
update formula for C(x) provided by Proposition 6.19 does not ensure that polynomials
keep a small degree if they are not truncated.
133
Chapter 6 Expressibility under size-restricted bulk changes
For this reason we proceed as follows. We first introduce a representation where each
entry of C(x) is represented by a fraction g(x)h(x) such that g(x) and h(x) are polynomials
of degree O(nc) for some c. The program then stores g(a) and h(a) for several numbers
a ∈ N. Actually the numbers g(a) and h(a) might be very large, indeed exponential in n,
and therefore we will store all numbers in Chinese remainder representation.
Next we prepare by proving several intermediate propositions and lemmas that will
ensure the correctness of this course of action. Afterwards we prove Theorem 6.22 by
presenting the dynamic program in detail.
We start by introducing a representation of the matrix C(x) in terms of fractions.
A quotient m-approximation of a formal power series f(x) is a fraction g(x)h(x) ∈ Z(x)
with normalised h(x) such that g(x)h(x) ≈m f(x) when g(x)h(x) is treated as a formal power
series. Quotient m-approximations for matrices are defined analogously. Using this
representation and a specific way to apply Equation 6.1, see Algorithm 1, we bound the
degree and the size of the coefficients in an updated m-approximation.
Proposition 6.23. Suppose A ∈ Z[[x]]n×n is invertible over Z[[x]], and C ∈ Z(x)n×n is
a quotient m-approximation of A−1. If A+ ∆A is invertible over Z[[x]] and ∆A can be
written as UBV with U ∈ Z[x]n×k, B ∈ Z[x]k×k, and V ∈ Z[x]k×n for some k ≤ n, then
C ′ def= C − CU(I +BV CU)−1BV C
is an m-approximation of (A+ ∆A)−1. Furthermore, if C ′ is computed with Algorithm 1,
B is of the form xB∗ for some B∗ ∈ Z[x]k×k that takes only values from {−1, 0, 1}, and
U, V take only values from {0, 1}, then
(1) degC ′ ∈ O(k3 degC) and ‖C ′‖ ∈ (‖C‖k degC)O(k3), and
(2) if the denominators in C are normalised then the denominators of C ′ are normalised
as well.
Proof. That C ′ is an m-approximation of (A+ ∆A)−1 follows from Proposition 6.19 and
the assumption that C is a quotient m-approximation. If all denominators of C are
normalised and B is of the form xB∗, an inspection of Algorithm 1 yields that also all
denominators in C ′ are normalised.
It remains to show the bounds on degC ′ and ‖C ′‖. Note that for polynomials
h1, . . . , hk ∈ Z[x] one has deg(
∑
i hi) = maxi deg(hi) and deg(
∏
i hi) ∈ O(
∑
i deg(hi))
when∑i hi and ∏i hi are computed in the naive way. Hence in Algorithm 1, E has degree
O(degC) and f(x) has degree O(k2 degC), and therefore det(f(x)E) and (I+BV CU)−1
have degree O(k3 degC). It follows that degC ′ ∈ O(k3 degC).
The estimation of ‖C ′‖ is similar, using the facts that ‖∑i hi‖ ∈ O(∑i ‖hi‖) and
‖∏i hi‖ ∈ O(∏i deg(hi)∏i ‖hi‖). We observe that ‖E‖ ∈ O(‖C‖) and ‖f(x)‖ ∈
O(‖C‖k2k2 degC) ⊆ (‖C‖ degC)O(k2), which is still true for ‖f(x)E‖, and therefore
‖ det(f(x)E)‖ ∈ (‖C‖k degC)O(k3). It follows that the same bound holds for ‖C ′‖.
Our dynamic program will maintain an implicit representation of the matrix C from
the previous theorem. For extracting the smallest non-zero terms it suffices to look at the
134
6.3 The linear algebra approach for bulk changes
Algorithm 1 Updating a quotient m-approximation
Input: Matrices C ∈ Z(x)n×n, U ∈ Z[x]n×k, B ∈ Z[x]k×k, and V ∈ Z[x]k×n
Output: Matrix C ′ def= C − CU(I +BV CU)−1BV C
1: Compute E = I +BV CU ∈ Z(x)k×k
2: Compute the product f(x) of all denominators of E.
3: for all i, j ≤ k do
4: Compute the ij-th entry of the inverse E∗ of f(x)E as (−1)i+j det(f(x)Eji)det(f(x)E) .
5: Compute the inverse of E as f(x)E∗.
6: Compute C ′.
numerators of C, as long as the denominators are normalised, as given by the following
lemma.
Lemma 6.24. Suppose g(x)h(x) ∈ Z(x) for some g(x) =
∑
i cix
i and normalised h(x), and
that g(x)h(x) =
∑
i dix
i. Then i is the smallest number such that ci 6= 0 if and only if it is
the smallest number such that di 6= 0.
Proof. Suppose h(x) = ∑i eixi with e0 = 1. Then c0 = e0d0 and hence c0 = 0 if and only
if d0 = 0. Further, ci =
∑i
j=0 ejdi−j . Hence, if c` = d` = 0 for all ` < i then ci = 0 if and
only if di = 0.
Instead of working with quotient approximations directly, the dynamic program will
maintain in DynFO(≤,+,×) evaluations of numerators and denominators under several
numbers from a ∈ N. By interpolating the polynomials we can extract the smallest
non-zero term from this representation in FO+Maj(≤,+,×). That is even true if the
evaluated values are stored in a Chinese remainder representation, that is, the evaluation
is done modulo several primes.
Lemma 6.25. For all numbers d, d′, e ∈ N with d′ > d there are numbers e′, b ∈ N such
that the following is true. Fix a domain of size n. Suppose g(x) = ∑i cixi ∈ Z[x] with
deg g(x) ≤ nd and ‖g(x)‖ ≤ 2ne. Let S ⊆ [nd′ ]0 ⊆ N with |S| ≥ nd + 1, and P a set of
ne
′ primes (among the first nb numbers). The smallest i such that ci 6= 0 can be defined
in FO+Maj(≤,+,×) from a relation that stores the value g(a) (mod p) for each a ∈ S
and each p ∈ P .
Proof. The value g(a) with a ≤ nd′ is bounded by 2ne′ for some e′ that only depends
on d, d′ and e. As the product of the primes in P exceeds this number, g(a) is uniquely
determined by the Chinese remainder representation given by the values g(a) (mod p),
and can be decoded in FO+Maj(≤,+,×) [Hesse et al. 2002, Theorem 4.1]. The statement
follows from the fact that g(x) of degree at most nd is uniquely determined by the
values g(a) for nd + 1 pairwise distinct data points a, and Cauchy interpolation is in
FO+Maj(≤,+,×) [Hesse et al. 2002, Corollary 6.5].
We can finally proceed to the proof of Theorem 6.22.
135
Chapter 6 Expressibility under size-restricted bulk changes
Proof sketch (of Theorem 6.22). Suppose that the matrix C ∈ Z(x) is a normalised
quotient n-approximation of A−1. Then by Lemma 6.24, the smallest i < n such that
the i-th coefficient of the st-entry of A−1 is non-zero is equal to the smallest such i for
the numerator of the st-entry of C. This i can be extracted from the relations stated in
Lemma 6.25 by an FO+Maj(≤,+,×) formula.
Our goal is therefore to maintain relations that store the values from Lemma 6.25 for
each entry of C. We will apply a variant of Theorem 6.12 and exhibit a dynamic program
that maintains such relations for k def= lognlog logn changes of size k each, starting from initial
auxiliary relations that represent a normalised quotient n-approximation C ∈ Z(x) with
numerators of degree at most n and denominator 1. More details on this part, and in
particular why such a dynamic program implies that the relations can be maintained
under change sequences of arbitrary length, are given towards the end of this sketch.
We show that the auxiliary relations can be maintained for k = lognlog logn change steps,
when initial auxiliary data as described above is given. For a domain of size n, let S def= [nλ]
and let P be the set of the first nµ primes, for numbers λ and µ to be determined later.
The dynamic program implicitly maintains a quotient n-approximation C ∈ Z(x) of A−1
as follows. For each a ∈ S, each p ∈ P and each entry (s, t) of C, it maintains g(a)
(mod p) and h(a) (mod p) if g(x)h(x) is the st-th entry of C. Whenever h(a) = 0 (mod p)
then p is declared invalid for a; whenever h(a) = 0 for some denominator h(x) and a ∈ S
then a is declared invalid. The set of primes valid for a value a is denoted by Pa.
The initial amount of values in S and in P is chosen such that after k changes, sufficiently
many valid values remain in S and in each Pa in order to apply Lemma 6.25 for extracting
the smallest non-zero coefficients of denominators from this implicit representation.
Suppose C is a quotient n-approximation of A−1 implicitly stored by the dynamic
program. Then we denote by Ca (mod p) the evaluation of C at position a modulo
prime p for valid a and p. By Ca we denote the tuple (Ca (mod p1), . . . , Ca (mod pη))
where p1, . . . , pη are the primes still valid for a. By C we denote the tuple Ca1 , . . . , Caκ
where a1, . . . , aκ are valid numbers.
Formally the program uses a relation D that stores a tuple (a¯, p¯, s, t, v¯), where m¯
denotes the base-n encoding of a number m, if and only if (i) a is valid, (ii) p is valid
for a, and (iii) v is the value of the denominator of the st-th entry modulo p. Similarly,
a relation N stores the numerators. These relations encode Ca (mod p) as well as the
sets S and Pa. In the following we abstain from using this formal perspective for the sake
of clarity. The descriptions to follow can be easily translated to this formal framework.
When a change ∆A occurs, by the restrictions of Theorem 6.22 regarding the form
of A, this change matrix has only entries from {−x, 0, x}. The dynamic program updates
the sets S and Pa as well as the tuple C according to Algorithm 2.
The steps mainly consist of multiplying numbers that are polynomial in n (and thus
can be encoded by O(logn) bits) and addition of k ∈ O(logn) of these numbers, which
can be done in FO(≤,+,×), see for example [Hesse et al. 2002, Theorem 5.1]. The loops
from Lines 2–4 translate to parallel evaluation in FO(≤,+,×).
Let us analyse the necessary amount of values in S and P . By Proposition 6.23(1), the
degrees of denominators h(x) of C grow by a factor k3 after each change. Thus, after k
136
6.3 The linear algebra approach for bulk changes
Algorithm 2 Updating the auxiliary data for Theorem 6.22
Input: A change ∆A with entries from {−x, 0, x}
1: Decompose ∆A into UBV with U ∈ Zn×k, B ∈ Z[x]k×k, and V ∈ Zk×n
according to Lemma 6.17.
2: for each a ∈ S do
3: for each prime p ∈ Pa do
4: for all (s, t) ∈ [k]2 do
5: Compute gst(a) (mod p) and hst(a) (mod p)
where gst(x)hst(x) is the st-th entry of (I +BV CU)
−1, following Algorithm 1.
6: If hst(a) = 0 (mod p) then remove p from Pa
7: If Pa = ∅ then remove a from S
8: If Pa 6= ∅ then compute C′a (mod p)
according to C − CU(I +BV CU)−1BV C and following Algorithm 1.
change steps starting from fractions of polynomials with maximal degree n, the degree
of the denominators is bounded by O(nk3k) ⊆ O(nr) for some r ∈ N. Therefore each
denominator h evaluates to 0 for at most O(nr) many a ∈ S. All in all there are at most
O(nr+2) many a ∈ S such that h(a) = 0 for some denominator h of C after one update
step, and at most O(nr+3) such a in the course of k change steps.
By Proposition 6.23(2), the values of coefficients of C ′ are bounded by (‖C‖k degC)O(k3)
after one change step. Thus, after k changes, the values h(a) for a ∈ S are bounded
by 2O(ns(r)) for some s ∈ N→ N. Thus, if h(a) 6= 0 then h(a) ≡ 0 (mod p) for at most
O(ns(r)) many primes p and for a denominator h(x) of C. All in all, at most O(ns(r)+2)
many primes p are removed from Pa in Line 6 in one step. If h(a) 6= 0 after each of k
many changes, then at most O(ns(r)+3) many primes p have been removed from Pa. If
h(a) = 0 after some change, then h(a) ≡ 0 (mod p) for all primes p and therefore a is
removed from S in Line 7.
Thus, there exists numbers λ, µ such that if one starts with S = [nλ]0 and nµ primes P ,
then after k = lognlog logn many changes there are still at least nr numbers a in S, each of
them with a set Pa of size at least ne
′ , for the number e′ that is guaranteed to exist by
Lemma 6.25 applied to d def= r, d′ def= λ, e def= s(r). In particular one can define distances
with an FO+Maj(≤,+,×) formula according to Lemma 6.25.
We remark on how the described dynamic program can be used to maintain the
auxiliary relations for an arbitrary number of changes, starting from relations that are
initialised in AC2. Observe that for an arbitrary input matrix A of the stated form the
computation of a normalised quotient n-approximation C of A−1 and its evaluation on
multiple points modulo many primes can be done in AC2. The dynamic program from
above thus can maintain the auxiliary relations for the first k change steps.
A variant of Theorem 6.12 yields that this can also be done arbitrarily long. In the proof
of Theorem 6.12 (or Theorem 3.3, respectively), the initialisation algorithm is only applied
to the input structure. Inspecting the proof one can see that the initialisation also has
access to the auxiliary relations that are maintained for its input. So, given the auxiliary
137
Chapter 6 Expressibility under size-restricted bulk changes
relations that are maintained up to a time point, an FO+Maj(≤,+,×) initialisation
can interpolate the polynomials g(x), h(x) for each entry g(x)h(x) of C, compute the first
n + 1 coefficients d0, . . . , dn of g(x)h(x) , and therefore obtain a new entry for the quotient
n-approximation with numerator ∑ni=0 dixi, which has degree n, and denominator 1.
Additionally, the initialisation can compute C(a) (mod p) for all a ∈ [nλ]0 and all nµ
primes p ∈ P . As FO+Maj(≤,+,×) = uniform TC0 ⊆ NC1 ⊆ AC[ lognlog logn ] [Chandra et al.
1984, Theorem 4.3] and the necessary relations can be maintained for k = lognlog logn change
steps, a generalisation of the proof of Theorem 6.12, that applies the initialisation also to
the auxiliary database, shows the claim.
6.4 Outlook and bibliographic remarks
In this chapter we introduced changes that insert and delete sets of tuples with bounded
size, and studied which queries can still be maintained under these changes. We observed
that UReach as well as all regular languages of strings and trees can be maintained in
DynFO(≤,+,×) under changes of polylogarithmic size, as well as Reach under insertions.
These results are optimal with respect to the size of the change. Also, we extended the
result from [Datta et al. 2018b] that Reach can be maintained under insertions and
deletions of size lognlog logn to more general change operations, and showed that Distance
can be maintained in DynFO+Maj(≤,+,×) under changes of polylogarithmic size. For
insertions and deletions of lognlog logn edges, we saw that Distance can be maintained in
DynFO(≤,+,×) with an additional FO+Maj(≤,+,×) query to extract the query result.
Several open questions remain. Of course our result for Distance implies that Reach
can be maintained under insertions and deletions of polylogarithmically many edges
in DynFO+Maj(≤,+,×). Is this also true for DynFO(≤,+,×)? Possible steps towards
such a result is to study Reach under logd n insertions and single-edge deletions, or to
consider restricted classes of input graphs. For example, can Reach be maintained under
logd n insertions and deletions on graphs of bounded treewidth or on planar graphs, for
every d ∈ N?
Regarding the Distance query, membership in DynFO is open even for single-edge
changes. See also [Mukherjee 2019] for multiple preliminary results in that direction.
It is open whether the result of Theorem 4.3 on maintenance of MSO-definable queries
for graphs of bounded treewidth can be extended to changes of non-constant size. Probably
such a result cannot be obtained by combining the techniques from Section 4.1 and
Section 6.2 directly, because this would (1) require auxiliary data of size nlogd n, and (2)
involve to translate arbitrary MSO sentences to i√n-MSO sentences.
While we showed by Theorem 6.4(b) and (c) that i
√
n-MSO is equally expressive as
MSO on strings and tree for any i ∈ N, we strongly believe that on general structures
MSO is more expressive than i√n-MSO if i > 1, and even i√n-rSO for any r ∈ N, as
otherwise well-known hypotheses fail. We recall the NP-complete problem 3-SAT, which
asks whether a propositional formula, given in conjunctive normal form with at most
three literals per clause, is satisfiable. It is easy to show that 3-SAT can be expressed by
138
6.4 Outlook and bibliographic remarks
an MSO formula. We show that if there is an i√n-rSO formula for 3-SAT, for any r and
any i > 1, then the exponential time hypothesis is false. This hypothesis, formulated by
Impagliazzo and Paturi [2001], states that 3-SAT cannot be solved in time 2o(n).
Proposition 6.26. If 3-SAT ∈ i√n-rSO for some i, r ∈ N with i > 1, then the exponen-
tial time hypothesis fails.
Proof sketch. We argue similarly as for Proposition 6.5. Suppose there is an i
√
n-rSO
formula ϕ that expresses 3-SAT, where i > 1 and r is arbitrary. For each n we can
construct from ϕ a Boolean circuit Cn for inputs of 3-SAT encoded by strings of length n.
The construction is analogous to the proof of Proposition 6.5, with the extension that
i
√
n-rSO quantifiers are replaced by gates whose fan-in is upper-bounded by nr i
√
n. The
size of the resulting circuit Cn is bounded by 2O(logn
i√n) which is in 2o(n) for i > 1.
It follows that 3-SAT can be solved in time 2o(n), contradicting the exponential time
hypothesis.
It is also open whether formal languages beyond regular languages can be maintained
under bulk changes. As MSO expresses exactly the regular languages, the techniques of
Section 6.2 do not seem to be applicable.
When we use FO(≤,+,×) update formulas, we cannot hope for meaningful maintain-
ability results for changes of size beyond polylogarithmic, see Proposition 6.3. This is
not true any more if we allow update formulas from stronger logics. Of course, Parity
can trivially be maintained under any changes in DynFO+Maj(≤,+,×), but for Reach
and Distance update mechanisms that cannot express all of NL are interesting. In
[Datta et al. 2018b] it is shown that via NC1-computable updates, both queries can be
maintained under changes that affect O(2
√
logn/ log∗ n) nodes. Here, log∗ n denotes the
smallest number i such that i-fold application of log yields a number smaller than 1.
We already mentioned in Section 6.1 that FO update formulas, without built-in
arithmetic, cannot be used in general to maintain queries under changes of non-constant
size. This is basically the case because, unlike for single-tuple changes, first-order update
formulas cannot extend a linear order on the active domain when a non-constant number
of elements enter the active domain at once. If the change would contain more information
than just the changed tuples, as for example a linear order and the BIT predicate on
the domain of the change (that is, all elements that are affected by the change), then
all results for DynFO(≤,+,×) of this chapter can be translated to DynFO. However, the
functions f(n) that give the maximal size of a change need to be functions in the size of
the active domain, not of the whole domain. In this way, the results also transfer to the
appropriately extended FOIES framework.
Bibliographic remarks
The framework for changes of non-constant size is introduced in [Datta et al. 2018b], which
is joint work with Samir Datta, Anish Mukherjee and Thomas Zeume. The results from
Section 6.3, apart from Theorem 6.14, are published there, together with the discussions
on stronger update formalisms and on first-order updates without built-in arithmetic
139
Chapter 6 Expressibility under size-restricted bulk changes
from the end of Section 6.4. The use of [Datta et al. 2018b, Theorem 4] in that paper
is flawed, here we give a slightly restricted variant as Theorem 6.12 that nevertheless
is sufficient to replace [Datta et al. 2018b, Theorem 4]. Theorem 6.14 was originally
developed for [Schmidt et al. 2020] jointly with Jonas Schmidt, Thomas Schwentick,
Thomas Zeume and Ioannis Kokkinis, but appears there only in a weaker version.
The results from Section 6.2 are obtained jointly with Thomas Zeume, they are not
published so far.
140
CHAPTER 7
Experimental evaluation of the dynamic approach
In Chapter 1 we motivated the study of the dynamic approach, amongst others, by
its relative efficiency with respect to static algorithms. We expressed the hope that
applications would run faster if they would update a previous result instead of recomputing
it from scratch. The choice of first-order logic as update language is justified by its
relationship to SQL and the widespread use of relational database management systems
for which dynamic programs can straightforwardly be implemented. The bold message
here is: DynFO makes database systems run faster!
Of course, dynamic complexity is a theoretical framework and its main purpose is
to understand the fundamentals of dynamic query maintenance, not to directly yield
efficient algorithms for real-life applications. Still, as dynamic programs can be transferred
comparatively easily into runnable code, we should test how well they actually work in
practice.
Goal 7.1. Evaluate the performance of dynamic programs in practical scenarios.
Actual implementations and evaluations of the dynamic approach of query evaluation
are rare. Dong et al. [1999] present SQL code for maintaining reachability for directed
acyclic, undirected, and general directed graphs under single-edge changes1 and analyse
the number of joins needed to process a single-edge change. Empirical results for shortest
distances in weighted undirected graphs under deletions, reported by Pang et al. [2005],
indicate that the dynamic complexity approach can outperform recomputation from
scratch significantly. The compilation of a class of non-recursive queries into incremental
programs has been studied and implemented in [Koch 2010; Koch et al. 2014] as well as
1The code for reachability in general directed graphs cannot be translated to a DynFO-program, as it
uses exponentially many constants not present in the input graph. As a result the update formulas
may need to be evaluated for exponentially many tuples per change step.
141
Chapter 7 Experimental evaluation of the dynamic approach
[Nikolic et al. 2016]. Evaluations of different strategies to maintain joins of base relations
are given by Vista [1998].
Contributions In this chapter we empirically compare the evaluation of the undirected
reachability query UReach under first-order defined insertions of edges using
(1) dynamic programs for complex changes, following the approach of Theorem 5.4;
(2) dynamic programs that process complex changes by treating them as a sequence of
single-edge changes;
(3) evaluation from scratch using SQL’s recursive queries; and
(4) evaluation from scratch using standard imperative algorithms, implemented in
Python.
We do not include graph database systems into our comparison. In a preliminary test
using Neo4j2 and its standard Cypher interface, query evaluation did not terminate in
reasonable time even on very small instances. For general recursively defined queries a
similar behaviour was observed in a recent study [Bagan et al. 2017]. We note that plug-
ins3 are available that enable Neo4j to evaluate UReach efficiently. The corresponding
approach of query evaluation is, except of the chosen programming language, covered by
method (4) in our comparison.
We perform three experiments to evaluate the four methods. For the first two experi-
ments we use families of random graphs that consist of graphs with a varying number of
disjoint subgraphs of varying edge density. One of these experiments involves a change
that connects many formerly disjoint connected components, the second experiment
in turn uses a change that inserts several edges between a small number of connected
components. The change operations have small bridge bounds, following the guiding
principle of Subsection 5.2.2 that such changes lead to small and thus efficient dynamic
programs. For the third experiment, both changes are evaluated on a very large graph
with nearly two million nodes, obtained from the DBLP database.
Our results give strong arguments for the use of the dynamic approach for query
evaluation. For complex changes, our implementations of dynamic programs are consider-
ably faster than recomputation from scratch within SQL in almost all of the considered
scenarios, and in some scenarios even faster than the Python-based recomputation from
scratch. Furthermore, the dynamic program for complex changes performs in general
better than the dynamic program for single-edge changes which is invoked once for every
changed edge. Especially the former program still performs well in the third experiment
on the large DBLP-based graph.
All implemented variants can also deal with single-edge deletions, yet we only evaluate
them for first-order definable insertions, since Pang et al. [2005] already evaluated the
(single-tuple) deletion of edges.
Outline We give more details on our implementation in Section 7.1. In Section 7.2 we
present our experiments and their results. We close with a short outlook in Section 7.3.
2see https://neo4j.com/
3e.g. https://neo4j-contrib.github.io/neo4j-graph-algorithms/
142
7.1 Implementation
7.1 Implementation
In this section we describe our implementation in more detail. We implemented change
operations and the evaluation methods (1)-(3) we presented in the introduction of this
chapter as PL/pgSQL functions for a PostgreSQL database system. PL/pgSQL4 is a
procedural language that enables the definition of functions and allows the use of control
structures in addition to SQL queries. From the latter, we only use if-then-else blocks.
In our implementation, a change function writes the set of all inserted edges to a table
delta. The PL/pgSQL function DYN-complex, which implements the dynamic program
that processes complex changes, accesses this table and updates the auxiliary relations
accordingly. It is invoked before the changes are actually applied to the database. For
processing a complex change as a sequence of edge insertions, the dynamic program
for single-edge insertions from [Dong and Su 1998] is used. The PL/pgSQL function
DYN-single implementing this program is invoked by a trigger when the changes of
delta are actually applied to the database, once for every edge. The PL/pgSQL function
STATIC-SQL that recomputes UReach from scratch is invoked after the changes are
applied and uses Common Table Expressions, more specifically the WITH RECURSIVE
construct5, an SQL construct that allows evaluating recursively defined queries like
reachability, although it is not expressible in relational algebra. This function also has
access to delta, which is used, e.g., to avoid recomputation when no edge between two
distinct connected components was inserted. The algorithm STATIC-Python computes
the connected components of an undirected input graph and writes the result to the
database.
In order to store the query result for UReach in a compact way, the functions use a
binary table connected_components that contains for each node a unique representative
of its connected component. In the implementation, nodes are represented by integers
and the representative of a connected component is its minimal node with respect to the
natural linear order. The actual transitive closure relation of the graph is easily definable
in SQL or first-order logic using this table. The reason to use connected_components is
its succinctness: it is of linear size with respect to the number of nodes, whereas the full
transitive closure might be of quadratic size.
For [Dong and Su 1998, Theorem 4.3] and Theorem 5.4, some effort is made to
maintain a linear order on the nodes, which is then used by the corresponding dynamic
programs. As each database comes with a linear order—in our case, the natural order on
the integers—it is not necessary to maintain such an order explicitly. Apart from this
difference and the rather straightforward maintenance of connected_components, the
implementations of the dynamic programs follow the proofs of the corresponding results
closely.
Our implementations can actually maintain UReach under first-order definable inser-
tions on node coloured graphs, that is, graphs with additional unary relations C1, . . . , C`
representing ` colours. For our experiments we hence use coloured graphs.
4https://www.postgresql.org/docs/current/static/plpgsql-overview.html
5https://www.postgresql.org/docs/current/static/queries-with.html
143
Chapter 7 Experimental evaluation of the dynamic approach
7.2 Experiments and discussions
We conducted three experiments comparing the performance of the four approaches for
evaluating UReach under complex insertions. The first two experiments explore how the
approaches differ depending on the number of connected components that are joined by
bridges. The third experiment examines how they scale to large graphs. The experiments
use insertions with small bridge bounds, following the insights from Proposition 5.5 and
Proposition 5.6.
All experiments were conducted on a machine with 28 CPU cores (56 threads) with
2.6 GHz base frequency, of which we use only one, and 52 GB main memory, running
Ubuntu 16.04, with a local default installation of PostgreSQL 11devel. STATIC-Python
uses Python 3.5.2 and version 1.11 of the NetworkX package6. If not stated otherwise,
running times are obtained from six runs per graph instance. The first run is discarded,
as well as the fastest and the slowest run. The reported time is the average of the
three remaining runs (cf. [Bagan et al. 2017]). Individual timings include the time to
update connected_components and, for the dynamic programs, a spanning forest and
its transitive closure. Not included is the time needed to compute delta, to apply the
changes to the database, and, for STATIC-Python, to build the input graph object.
First experiment In the first experiment, we tested the evaluation methods for
a change that connects many different connected components of the input graph.
The considered change operation, expressed by the formula ρ1(v) = µE(v;x, y) =
E(x, y) ∨ ((x = v) ∧ C1(y)), inserts edges from some specified node v to all C1 coloured
nodes in a coloured graph. This insertion query is CQ-definable, and its bridge bound
is 2. We tested this change on graphs of different sizes and for a varying num-
ber of edges. For each n ∈ {10000, 20000, 30000, 40000, 50000, 75000, 100000} and
p ∈ {0, 0.05, 0.1, 0.15, 0.2, 0.25, 0.3} we constructed five graphs with n nodes, whose
colours were chosen uniformly at random from ten colours. Each graph was the disjoint
union of an appropriate number of graphs with 50 nodes each. For each of these graphs,
the probability of an (undirected) edge to be present was p. For each n we randomly
chose a node vn, applied the change ρ1(vn) to each graph of size n, and measured the
time it took each evaluation method to (re-)compute the table connected_components.
Table 7.1 and Figure 7.1 give details on the graphs and the results of the experiment,
where the stated values for every pair n, p average over the five instances. Because of the
large running time, STATIC-SQL was, for p 6= 0, only tested for instances with at most
30000 nodes. Values marked ∗ were obtained with a reduced testing schedule: due to
large running times, only two runs on two instances were performed for every pair n, p,
and the average over the four runs is reported.
We observe that in this setting, where many connected components can be joined by
one change, DYN-complex runs around two orders of magnitude faster than DYN-single.
This is as expected, since the latter program has to update the auxiliary information for
many nodes multiple times. Apart from the case p = 0 of initially empty graphs, direct
6see https://networkx.github.io/
144
7.2 Experiments and discussions
Table 7.1: Results of the first experiment (insertion of a star into graphs with n nodes,
consisting of random subgraphs of size 50 and edge probability p).
p n |E| cc’s ∆|E| ∆cc’s DYN-c DYN-s STAT-SQL STAT-Py DYN-sDYN-c STAT-SQLDYN-c
STAT-Py
DYN-c
0 10000 0 10000 943 -943 1.74 27.43 1.2 0.33 15.79 0.69 0.19
0 20000 0 20000 2048 -2048 9.75 118.9 4.86 0.55 12.19 0.5 0.06
0 30000 0 30000 3118 -3118 19.51 234.48 11.3 0.78 12.02 0.58 0.04
0 40000 0 40000 3970 -3970 39.06 1426.98* 18.36 1.05 36.53* 0.47 0.03
0 50000 0 50000 5038 -5038 33.19 1404.24* 31.55 1.22 42.31* 0.95 0.04
0 75000 0 75000 7490 -7490 47.47 2473.57* 69.59 1.91 52.11* 1.47 0.04
0 100000 0 100000 10120 -10120 145.93 5588.25* 130.78 2.42 38.29* 0.9 0.02
0.05 10000 12259.6 1124 998.8 -308.2 0.91 93.42 113.06 0.34 102.78 124.39 0.37
0.05 20000 24577.6 2229.8 1991.8 -600.8 2.38 402.25 487.48 0.53 168.86 204.64 0.22
0.05 30000 36835.8 3377.2 2988 -922.6 3.79 777.28 1237.56* 0.79 205.15 326.64* 0.21
0.05 40000 48991.6 4530.2 3969.6 -1230.2 7.85 2887.69* - 0.97 367.98* - 0.12
0.05 50000 61259 5649.8 4943 -1523.4 7.93 2867.01* - 1.25 361.65* - 0.16
0.05 75000 91862 8457.6 7406.2 -2289.4 12.55 5643.84* - 1.88 449.56* - 0.15
0.05 100000 122644 11291 9975.2 -3063.6 20.16 12227.03* - 2.35 606.38* - 0.12
0.1 10000 24546.4 261.4 983.2 -203 0.75 54.03 192.53 0.31 71.8 255.85 0.41
0.1 20000 48911 519.4 1988.2 -407.4 1.8 209.54 865.77 0.53 116.63 481.87 0.3
0.1 30000 73412.4 789.8 2977 -613.2 2.59 410.03 2145.46* 0.76 158.07 827.08* 0.29
0.1 40000 97919.4 1034.6 4002.4 -817.8 4.43 1432.73* - 0.91 323.42* - 0.2
0.1 50000 122317 1297.6 5003.8 -1023.8 5.08 1473.1* - 1.26 289.72* - 0.25
0.1 75000 183908.8 1923.6 7505.8 -1538 7.84 2595.37* - 1.84 330.89* - 0.23
0.1 100000 245126.4 2575.8 9966.8 -2043 11.24 6004.65* - 2.50 534.15* - 0.22
0.15 10000 36659.4 204.2 985 -198.8 0.76 46.35 248.44 0.3 60.76 325.67 0.39
0.15 20000 73491.2 406 1982 -397 1.58 179.11 1106.47 0.54 113.02 698.19 0.34
0.15 30000 110142.8 609.8 2975.4 -598 2.52 355.1 2732.98* 0.81 140.84 1083.99* 0.32
0.15 40000 147038 815.4 3996.2 -797 4.01 1253.63* - 1.05 312.48* - 0.26
0.15 50000 183608.2 1017.6 5027.2 -996.6 4.74 1252.73* - 1.26 264.24* - 0.27
0.15 75000 275649.8 1527.8 7377.8 -1491.2 7.02 2148.04* - 1.84 306.06* - 0.26
0.15 100000 367722.8 2037.6 10038.2 -1992.6 10.36 4390.39* - 2.56 423.73* - 0.25
0.2 10000 48912 200.2 1001.6 -198 0.64 42.45 301.25 0.3 66.55 472.25 0.48
0.2 20000 97934.6 400.6 2004.8 -396.4 1.48 177.88 1350.96 0.55 120.14 912.48 0.37
0.2 30000 146978.6 600.6 2991.8 -597.6 2.17 327.65 3321.56* 0.8 151.03 1531.12* 0.37
0.2 40000 195696.4 801 3984.6 -795 3.86 1189.67* - 1.04 307.87* - 0.27
0.2 50000 245184.2 1000.8 4975.8 -992.6 4.29 1190.05* - 1.29 277.59* - 0.3
0.2 75000 367597.4 1501.2 7560.6 -1493.8 6.63 2064.55* - 1.85 311.61* - 0.28
0.2 100000 490189.4 2002 9961.6 -1988.8 10.07 3912.13* - 2.56 388.51* - 0.25
0.25 10000 61323.2 200 975.4 -198.6 0.65 41.86 369.78 0.3 64.59 570.55 0.47
0.25 20000 122483.6 400 2022 -397 1.43 175.22 1549.37 0.55 122.58 1083.88 0.38
0.25 30000 183640.2 600 2993.2 -596.8 2.17 308.71 3992.07* 0.83 142.37 1841.06* 0.38
0.25 40000 244909.2 800 3977 -794 3.44 1089.95* - 1.04 316.77* - 0.3
0.25 50000 306185 1000 4979 -993.4 4.07 1141.95* - 1.31 280.37* - 0.32
0.25 75000 459241.8 1500.4 7537.2 -1493 6.77 1954.61 - 1.88 288.89* - 0.28
0.25 100000 612608 2000 9956.4 -1987.4 9.86 3817.87* - 2.61 387.37* - 0.27
0.3 10000 73516.6 200 987.6 -199 0.58 40.34 417.09 0.32 69 713.43 0.54
0.3 20000 146868.2 400 1998.2 -395.8 1.36 162.49 1780.68 0.62 119.43 1308.86 0.46
0.3 30000 220461.2 600 2999 -596.6 2.13 295.67 4408.44* 0.8 139.1 2074.04* 0.38
0.3 40000 294246.4 800 3992 -793.2 3.33 1078.65* - 1.11 323.76* - 0.33
0.3 50000 367958.6 1000 4939.6 -994.4 4.16 1134.23* - 1.29 272.53* - 0.31
0.3 75000 551294.2 1500 7481.2 -1489.6 6.19 1908.72* - 1.92 308.38* - 0.31
0.3 100000 735921.8 2000 10018.6 -1987 9.64 3789.45* - 2.6 393.14* - 0.27
The columns |E| and cc’s provide the number of edges and connected components before the change.
∆|E| provides the number of inserted edges by the change, and ∆cc’s gives the difference of the number
of connected components. The columns DYN-c, DYN-s, STAT-SQL, STAT-Py give the runtime in seconds
of the algorithms DYN-complex, DYN-single, STATIC-SQL, STATIC-Python, respectively. The values are
avaraged over five instances for each pair (p, n).
145
Chapter 7 Experimental evaluation of the dynamic approach
0 0.1 0.2 0.3
0.1
1
10
100
1,000
p
ra
tio
to
DY
N-
co
mp
le
x
(a) n = 10000
0 0.1 0.2 0.3
0.1
1
10
100
1,000
p
ra
tio
to
DY
N-
co
mp
le
x
(b) n = 20000
0 0.1 0.2 0.3
0.1
1
10
100
1,000
p
ra
tio
to
DY
N-
co
mp
le
x
(c) n = 30000
0 0.1 0.2 0.3
0.1
1
10
100
1,000
p
ra
tio
to
DY
N-
co
mp
le
x
(d) n = 100000
5 10
0.1
1
10
100
1,000
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(e) p = 0
5 10
0.1
1
10
100
1,000
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(f) p = 0.05
5 10
0.1
1
10
100
1,000
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(g) p = 0.1
5 10
0.1
1
10
100
1,000
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(h) p = 0.3
Figure 7.1: Ratios of the running times in Experiment 1 of the different evaluation
methods compared to DYN-complex, for different values of n and p, as listed
in Table 7.1. Dashed lines indicate that the values are obtained with a reduced
testing schedule. : DYN-singleDYN-complex :
STATIC-SQL
DYN-complex :
STATIC-Python
DYN-complex
processing of complex changes is also faster than computation from scratch using SQL.
In our tests, DYN-complex ran three to four orders of magnitude faster than STATIC-SQL,
which did not even terminate in reasonable time on larger graphs. For p = 0, STATIC-SQL
is faster than DYN-complex. This is as expected: because no edge was present before the
change, no prior auxiliary information is available to the dynamic programs. Basically,
they also have to recompute the query result from scratch.
In all cases, the dynamic approaches are outperformed by the Python-based evaluation
from scratch. For p 6= 0, the speed-up is between 2 and 8.
Second experiment In the second experiment, we compared the evaluation meth-
ods for the same graphs but for a change operation that connects only few connected
components. More precisely, the change operation ρ2 with parameters v1, . . . , v7 con-
nects all neighbours of v1 and v2 with the nodes v3, . . . , v7 through the rule ρ2(v¯) =
µE(v1, . . . , v7;x, y) = E(x, y) ∨
(
(E(x, v1) ∨ E(x, v2)
) ∧ (y = v3 ∨ y = v4 ∨ y = v5 ∨ y =
v6 ∨ y = v7)
)
. This change can be expressed by a union of ten conjunctive queries. Its
bridge bound is easily seen to be 2, which is much better than the bound of 20 given by
Proposition 5.5.
The set-up of this experiment is analogous to the first experiment. The case p = 0
was omitted, since ρ2 does not change empty graphs at all. The results are provided in
Table 7.2 and Figure 7.2.
146
7.2 Experiments and discussions
Table 7.2: Results of the second experiment (insertion of edges between up to seven
connected components of graphs with n nodes, consisting of random
subgraphs of size 50 and edge probability p).
p n |E| cc’s ∆|E| ∆cc’s DYN-c DYN-s STAT-SQL STAT-Py DYN-sDYN-c STAT-SQLDYN-c
STAT-Py
DYN-c
0.05 10000 12259.6 1124 28 -5.8 0.25 0.56 1.07 0.3 2.21 4.24 1.19
0.05 20000 24577.6 2229.8 32 -6 0.4 1.06 1.83 0.56 2.67 4.6 1.4
0.05 30000 36835.8 3377.2 21 -5.6 0.54 1.42 2.66 0.77 2.64 4.95 1.43
0.05 40000 48991.6 4530.2 28 -6 0.73 1.94 3.55 0.98 2.65 4.86 1.34
0.05 50000 61259 5649.8 27 -5.6 0.84 2.21 4.39 1.22 2.65 5.25 1.45
0.05 75000 91862 8457.6 31 -6 1.26 3.45 6.12 1.84 2.75 4.87 1.47
0.05 100000 122644 11291 20 -5.8 1.59 4.39 8.25 2.34 2.76 5.19 1.47
0.1 10000 24546.4 261.4 34 -6 0.22 0.5 1.53 0.29 2.27 6.92 1.31
0.1 20000 48911 519.4 49 -6 0.33 0.88 2.71 0.55 2.67 8.19 1.66
0.1 30000 73412.4 789.8 43 -6 0.46 1.16 3.84 0.78 2.53 8.34 1.69
0.1 40000 97919.4 1034.6 56 -6 0.57 1.41 4.86 0.99 2.48 8.52 1.74
0.1 50000 122317 1297.6 47 -6 0.64 1.69 5.81 1.23 2.64 9.05 1.92
0.1 75000 183908.8 1923.6 47 -6 0.83 2.56 8.68 1.84 3.08 10.43 2.21
0.1 100000 245126.4 2575.8 48 -6 1.13 3.39 11.66 2.52 3 10.33 2.23
0.15 10000 36659.4 204.2 72 -6 0.21 0.44 1.9 0.32 2.09 8.92 1.51
0.15 20000 73491.2 406 71 -6 0.30 0.68 3.14 0.56 2.24 10.38 1.85
0.15 30000 110142.8 609.8 70 -6 0.35 0.87 4.57 0.78 2.49 13.05 2.23
0.15 40000 147038 815.4 73 -6 0.42 1.14 5.83 1.01 2.72 13.88 2.4
0.15 50000 183608.2 1017.6 59 -6 0.48 1.45 7.04 1.24 3.01 14.61 2.58
0.15 75000 275649.8 1527.8 62 -6 0.73 2.25 10.74 1.86 3.07 14.67 2.55
0.15 100000 367722.8 2037.6 65 -6 0.98 2.95 14.4 2.57 3.02 14.76 2.63
0.2 10000 48912 200.2 99 -6 0.19 0.42 2.19 0.31 2.21 11.65 1.63
0.2 20000 97934.6 400.6 94 -6 0.23 0.54 3.44 0.56 2.35 14.93 2.42
0.2 30000 146978.6 600.6 108 -6 0.29 0.79 5.34 0.8 2.72 18.36 2.75
0.2 40000 195696.4 801 98 -6 0.38 1.05 6.79 1.01 2.79 18.07 2.68
0.2 50000 245184.2 1000.8 89 -6 0.43 1.33 8.56 1.28 3.11 20 2.99
0.2 75000 367597.4 1501.2 91 -6 0.68 2.08 12.84 1.87 3.05 18.87 2.74
0.2 100000 490189.4 2002 107 -6 0.9 2.77 17.19 2.65 3.09 19.19 2.96
0.25 10000 61323.2 200 124 -6 0.20 0.39 2.51 0.31 1.92 12.4 1.55
0.25 20000 122483.6 400 107 -6 0.22 0.5 4.06 0.57 2.21 18.11 2.53
0.25 30000 183640.2 600 127 -6 0.28 0.76 6.06 0.8 2.7 21.46 2.87
0.25 40000 244909.2 800 119 -6 0.35 1 8.23 1.03 2.82 23.22 2.9
0.25 50000 306185 1000 116 -6 0.41 1.26 10.08 1.29 3.08 24.6 3.15
0.25 75000 459241.8 1500.4 134 -6 0.64 1.98 14.94 1.92 3.08 23.18 2.98
0.25 100000 612608 2000 104 -6 0.86 2.65 20.17 2.69 3.07 23.33 3.12
0.3 10000 73516.6 200 137 -6 0.16 0.28 2.56 0.3 1.8 16.36 1.93
0.3 20000 146868.2 400 138 -6 0.2 0.49 4.6 0.57 2.42 22.62 2.82
0.3 30000 220461.2 600 135 -6 0.28 0.75 6.96 0.81 2.68 24.9 2.89
0.3 40000 294246.4 800 159 -6 0.35 0.97 9.02 1.11 2.76 25.64 3.16
0.3 50000 367958.6 1000 160 -6 0.4 1.22 11.11 1.3 3.05 27.76 3.25
0.3 75000 551294.2 1500 115 -6 0.63 1.93 16.84 1.92 3.06 26.74 3.05
0.3 100000 735921.8 2000 142 -6 0.83 2.56 22.59 2.72 3.1 27.35 3.29
See Table 7.1 for an explanation of the columns.
147
Chapter 7 Experimental evaluation of the dynamic approach
0.1 0.2 0.3
1
10
100
p
ra
tio
to
DY
N-
co
mp
le
x
(a) n = 10000
0.1 0.2 0.3
1
10
100
p
ra
tio
to
DY
N-
co
mp
le
x
(b) n = 20000
0.1 0.2 0.3
1
10
100
p
ra
tio
to
DY
N-
co
mp
le
x
(c) n = 30000
0.1 0.2 0.3
1
10
100
p
ra
tio
to
DY
N-
co
mp
le
x
(d) n = 100000
5 10
1
10
100
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(e) p = 0.05
5 10
1
10
100
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(f) p = 0.1
5 10
1
10
100
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(g) p = 0.2
5 10
1
10
100
n (× 10000)
ra
tio
to
DY
N-
co
mp
le
x
(h) p = 0.3
Figure 7.2: Ratios of the running times in Experiment 2 of the different evaluation
methods compared to DYN-complex, for different values of n and p, as listed
in Table 7.2. : DYN-singleDYN-complex :
STATIC-SQL
DYN-complex :
STATIC-Python
DYN-complex
In this experiment, DYN-complex was again considerably faster than STATIC-SQL and
even slightly faster than STATIC-Python. Also it outperformed DYN-single. This pro-
gram in turn performed very well on large graphs even in comparison to STATIC-Python.
This seems to be because it can tell rapidly whether an inserted edge lies inside a con-
nected component, and thus can be ignored. STATIC-SQL performed better than in the
first experiment, since the connected components after the change are smaller here.
Third experiment For the last experiment, we tested the performance of the dynamic
programs on a large real-world graph. The graph Gdblp is obtained from a snapshot of
the DBLP dataset from June 20177. The nodes of Gdblp correspond to authors, and
edges are based on co-authorship: an (undirected) edge (u, v) implies that the authors
corresponding to u and v are co-authors of some publication.
We tested the performance for both insertions ρ1 and ρ2. As ρ1 is an insertion query
for coloured graphs, we coloured one out of thousand nodes with colour C1. Details on
the DBLP graph and the results of the experiment are provided in Table 7.3.
While DYN-complex updated the query result in a reasonable amount of time in
both tests, DYN-single was only able to do so for the “easier” change ρ2. It needed
more than 100 minutes to process the change ρ1. However, for ρ2, DYN-single was
actually the fastest method. Again, DYN-complex was slightly faster than STATIC-Python.
7see http://dblp.dagstuhl.de/xml/release/
148
7.3 Outlook and bibliographic remarks
Table 7.3: Results of the third experiment (application of the changes of the first two
experiments to a large real-world graph).
Graph |V | |E| cc’s change ∆|E| ∆cc’s DYN-c DYN-s STAT-SQL STAT-Py DYN-sDYN-c STAT-SQLDYN-c
STAT-Py
DYN-c
Gdblp 1949121 8823792 151874 ρ1 1950 -248 51.29 6270.86 − 59.65 122.26 − 1.16
Gdblp 1949121 8823792 151874 ρ2 140 -3 42.1 38.11 − 52.31 0.91 − 1.24
See Table 7.1 for an explanation of the columns. Timings for STATIC-SQL are not listed, as the experiment
did not finish within twelve hours.
STATIC-SQL reached its limits in this experiment: it did not finish within twelve hours.
Discussion We conclude from the results of the experiments that using dynamic
programs and in particular DYN-complex to maintain undirected reachability can reduce
the running time significantly, especially compared with STATIC-SQL. Not surprisingly, a
special purpose program evaluating from scratch can achieve better results. That said,
we actually expected the gap to DYN-complex to be larger, and we are surprised that
for some test cases, and in particular for the large real-world graph Gdblp, DYN-complex
even performs slightly better. In any case, if no such program is at hand, the dynamic
solution can be by far the best solution.
The results indicate that, as one would expect, the running time of DYN-single and
DYN-complex is dominated by the number of connected components that are connected
by the change; the running time of STATIC-SQL and STATIC-Python on the other hand
is dominated by the size of the graph, and for STATIC-SQL in particular by the size and
the density of the connected components. So, the relative performances of the different
evaluation methods very much depend on the test case at hand.
7.3 Outlook and bibliographic remarks
The encouraging results concerning the performance of the dynamic approach that we
obtained in this chapter can be seen as a practical proof of concept of the results obtained
in Section 5.2 and especially Subsection 5.2.2. They were not at all predictable, since the
benefit of reusing previously computed information might have been smaller than the
cost of updating all auxiliary relations.
It is however too early to attest superiority of the dynamic approach over static query
evaluation, even restricted to UReach. We have only evaluated the different approaches
in a very limited number of scenarios. For better-founded conclusions we would need to
consider more divers graphs and changes, preferably extracted from real-world data.
Now that we have promising results for UReach, the canonical next step is to evaluate
the dynamic program for directed reachability [Datta et al. 2018a]. Preliminary tests
suggest that one cannot expect competitive performances here. We mention some
challenges. The dynamic program from [Datta et al. 2018a] maintains matrices of
quadratic size in the number of nodes, so it requires space Ω(n2), which might be too
much for input graphs of “interesting” size. Of course this argument already applies to
149
Chapter 7 Experimental evaluation of the dynamic approach
the query relation, but that relation could be stored more succinctly, for example as a
tree of strongly connected components, analogously to the table connected_components
we used above. Additionally, such matrices are maintained for many primes, which also
increases the work significantly. If the computations are not done modulo many primes,
the involved numbers increase in each step, so the algorithm might run into numerical
problems.
In general, dynamic programs that rely on the Muddling technique do not seem to be
good choices for being implemented, as this technique involves to simulate many instances
of a dynamic program in parallel. While this gives low parallel time, the overall work to
be done is high. In practical parallel settings with a constant number of processors, the
efficiency of update routines largely depends on the work, which roughly corresponds to
the number of gates in a circuit that expresses the update. The question for update rules
with low parallel complexity and simultaneously low overall work needs to be further
investigated. The class DynQF, defined via quantifier-free update formulas that may use
functions [Hesse 2003b], provides a good starting point for this investigation, in particular
when only unary auxiliary relations are used. See [Hesse 2003b; Gelade et al. 2012; Zeume
2015] for some upper and lower bounds for this class.
Implementing dynamic programs is tedious work. A high-level programming language
together with a compiler would greatly facilitate this process.
Bibliographic remarks
The contents of this chapter are joint work with Thomas Schwentick and Thomas Zeume,
and are published as [Schwentick et al. 2018]. The framework for the empirical evaluation
and parts of the implementation of the dynamic programs were programmed by Dennis
Ciba. We thank the Database and Information Systems group at TU Dortmund led by
Jens Teubner for providing access to a compute server for the experiments.
150
CHAPTER 8
Conclusion
Whether the reachability query Reach for arbitrary directed graphs can be maintained
in DynFO under single-edge insertions and deletions was by far the most important
open question in the dynamic complexity framework, until Datta, Kulkarni, Mukherjee,
Schwentick, and Zeume [2015; 2018a] gave a positive answer.
Solving this question brought new research goals within reach, in particular the question
whether Reach, as well as other queries that are maintainable in DynFO under single-
tuple changes, can still be maintained under more general changes. This thesis is a first
comprehensive investigation of questions of this kind. We have shown multiple strong
maintainability results, both for first-order definable changes as well as size-restricted
bulk changes. We summarise them briefly; afterwards we briefly discuss results concerning
static analysis problems in dynamic complexity and give an outlook on further research.
Summary of the results Our strongest results concern the undirected reachability
query UReach, which we showed to be maintainable in DynFO and DynFO(≤,+,×)
under all first-order definable insertions as well as under insertions and deletions of
polylogarithmically many edges (with respect to the number of nodes), respectively
(Theorem 5.4 and Theorem 6.10). The latter result is optimal in the sense that UReach
cannot be maintained in DynFO(≤,+,×) under larger changes, see the discussion of
Proposition 6.3.
We presented partial results for the directed reachability query Reach: it can be
maintained under quantifier-free first-order definable insertions for acyclic graphs (Theo-
rem 5.8) and under insertions of polylogarithmic size (Theorem 6.9). The result that
Reach can be maintained under edge insertions and deletions that affect lognlog logn many
nodes [Datta et al. 2018b], see also Theorem 6.13 and Theorem 6.14, is a very instructive
example on how the techniques of [Datta et al. 2018a] can be generalised to complex
151
Chapter 8 Conclusion
changes, but is not part of this thesis. We use its techniques here to show that the
Distance query can be maintained in DynFO+Maj(≤,+,×) under polylogarithmically
many insertions and deletions of edges (Theorem 6.15), and discuss in Subsection 6.3.3 a
result on Distance that requires FO+Maj(≤,+,×) formulas only to extract the query
result—with the downside that only changes of size lognlog logn are supported.
Despite the high expressive power of first-order update formulas we were able to
provide non-maintainability results, at least in restricted dynamic settings. We have
shown that Reach is not in DynProp under quantifier-free first-order insertions or
deletions (Theorem 5.11), and that Reach cannot be maintained in DynFO under very
weak first-order definable insertions or deletions if only unary additional auxiliary relations
are allowed (Theorem 5.9).
In addition to these results on the graph reachability queries, we extended the results
of Gelade et al. [2012] on maintenance of formal languages. We proved that all context-
free languages are in DynFO under changes that are defined by first-order formulas
without quantifier alternation, see Section 5.4. All regular languages of strings and
ranked trees can be maintained in DynFO(≤,+,×) under changes of polylogarithmically
many positions (Theorem 6.11), which is again an optimal result for DynFO(≤,+,×)
with respect to the size of the change, see once more Proposition 6.3. We also showed
that all AC1 queries are in DynFO under parameter-free first-order definable changes
(Corollary 5.20).
Apart from the reachability query and formal language maintenance, which are both
well-studied in the dynamic complexity framework, we started to investigate the class
of MSO-definable queries and proved a dynamic version of Courcelle’s Theorem for
single-tuple changes: all MSO-definable queries are in DynFO for graphs with bounded
treewidth under changes of single tuples, see Theorem 4.3 and Theorem 4.4. The Muddling
technique as presented in Chapter 3 was an essential tool in the proofs of these results,
and was used beyond that. We clarified the structure of DynProp and proved an arity
hierarchy for Boolean graph queries under single-edge changes (Theorem 4.13).
We were not only able to prove expressibility results that are surprisingly strong, at
least in the author’s opinion, but our experimental evaluation presented in Chapter 7
showed that a direct implementation of dynamic programs may run considerably faster
than naive re-computation from scratch and can achieve performance results that are
comparable with those of special-purpose static algorithms. It also confirmed the intuition
that a direct treatment of complex changes is advantageous compared to handling them
as a sequence of single-tuple changes, which further justifies the research agenda pursued
in this thesis.
Static analysis of dynamic programs This work focussed on the expressive power
of first-order update formulas under complex changes and single-tuple changes. We now
briefly discuss questions regarding the static analysis of dynamic programs: given a
dynamic program, what is the complexity of deciding whether this program has some
specific property? This line of work originated in the author’s master’s thesis [Vortmeier
2013], and was considerably extended jointly with Thomas Schwentick and Thomas
152
Zeume, and published as [Schwentick et al. 2015].
One motivation for studying static analysis of dynamic programs as stated in [Schwentick
et al. 2015] is the wish to understand what a dynamic program at hand “does”, and,
more generally, to understand what various classes of restricted dynamic programs “can
do” in general. Insights regarding these questions might ultimately lead to methods for
proving non-maintainability result in restricted dynamic settings.
In [Schwentick et al. 2015], the vague question “what does a given dynamic program do?”
is translated into three static analysis problems for classes of dynamic programs. The first
problem concerns the emptiness of dynamic programs: given a dynamic program, is there
a sequence of changes such that the query relation maintained by the dynamic program is
non-empty? Related static analysis problems are standard in database theory and other
fields. More specific for the dynamic setting is the consistency problem: given a dynamic
program, do all sequences of changes that lead to the same input structure also lead
to the same maintained query relation? This question basically asks whether the given
dynamic program actually maintains some query. For a consistent dynamic program,
the query relation is functionally dependent from the input relations. Some dynamic
programs satisfy even the stronger condition that all auxiliary relations are functionally
dependent from the input structure. We call such dynamic programs history independent,
other authors use the terms memoryless [Patnaik and Immerman 1997] and deterministic
[Dong and Su 1997]. History independent dynamic programs are studied for example
in [Dong and Su 1997; Grädel and Siebertz 2012]. The history independence problems
asks whether the given dynamic program is history independent. For all three problems,
[Schwentick et al. 2015] only considers change sequences that consist of insertions and
deletions of single tuples.
Not surprisingly, these three problems are undecidable for general dynamic programs,
due to the undecidability of the satisfiability problem for first-order logic. Therefore
[Schwentick et al. 2015] studies these problems for restricted classes of dynamic programs
and investigates the precise border of undecidability. The considered classes are defined
according to the following parameters:
• whether update formulas may use quantifiers,
• the maximal arity of input relations,
• the maximal arity of auxiliary relations, and
• only for the emptiness problem: whether the input program is guaranteed to be
consistent.
It is observed that the emptiness problem is equivalent to the consistency problem for
all considered classes of not necessarily consistent dynamic programs. Both problems
are decidable only for a severely restricted class of dynamic programs. Slightly stronger
decidability results were obtained for the emptiness problem for consistent dynamic
programs and for the history independence problem. The precise results are summarised
in Table 8.1.
Outlook: open problems and further research Several questions regarding the
expressive power of first-order update formulas remain open. Some of them have already
153
Chapter 8 Conclusion
Emptiness,
Consistency
Emptiness for
consistent programs
History
Independence
Undecidable
DynFO(1-in, 0-aux)
DynProp(2-in, 0-aux)
DynProp(1-in, 2-aux)
DynFO(1-in, 2-aux)
DynFO(2-in, 0-aux) DynFO(2-in, 0-aux)
Decidable DynProp(1-in, 1-aux)
DynFO(1-in, 1-aux)
DynProp(1-in)
DynProp(1-aux)
DynFO(1-in)
DynProp(1-aux)
Open — DynProp(2-in, 2-aux)and beyond
DynProp(2-in, 2-aux)
and beyond
Table 8.1: Summary of the results of [Schwentick et al. 2015], see [Schwentick et al.
2015, Table 1]. DynFO(`-in,m-aux) stands for dynamic first-order pro-
grams with (at most) `-ary input relations and m-ary auxiliary relations.
DynFO(m-aux) and DynFO(`-in) represent programs with m-ary auxiliary re-
lations (and input relations with arbitrary arity) and programs with `-ary
input relations (and auxiliary relations with arbitrary arity), respectively.
DynProp(`-in,m-aux),DynProp(`-in) and DynProp(m-aux) are defined analo-
gously for classes of dynamic programs with quantifier-free first-order update
formulas.
been stated at the end of the preceding chapters, and we repeat the most important ones.
We also present topics for further research.
The precise dynamic complexity of the reachability query While we know
many DynFO maintainability results for Reach and UReach, there are comparatively
few results with respect to complex deletions. It is for example open whether UReach is
in DynFO under (some non-trivial class of) first-order definable deletions, even if besides
these deletions only single-edge insertions are allowed.
We showed in this thesis that Reach is in DynFO(≤,+,×) under polylogarithmically
many edge insertions; the corresponding dynamic program cannot handle any edge
deletions. It is unclear whether this result can be extended to allow additionally deletions
of single edges. The so far best result for DynFO(≤,+,×) that allows for insertions and
deletions covers changes of size lognlog logn [Datta et al. 2018b], and we ask whether this result
can be extended towards insertions and deletions of logarithmic or even polylogarithmic
size. A first step towards answering this question might be to only consider restricted
graph classes, like the class of planar graphs or graphs of bounded treewidth.
We still do not know whether Reach is in DynProp under single-edge changes, although
some lower bounds for special cases are known [Zeume and Schwentick 2015]. We ask for
a proof of the general DynProp non-maintainability result.
Stronger reductions for dynamic complexity Bounded first-order reductions,
as introduced by Patnaik and Immerman [1997], have the property that whenever such a
154
reduction Υ maps an instance D to an instance Υ(D), then for any single-tuple change α
to D the structures Υ(D) and Υ(α(D)) only differ by some constant number of tuples.
See Example 2.4 for an application.
The bulk changes we studied in Chapter 6 give rise to the stronger form of polyloga-
rithmically bounded first-order reductions, which allow any single-tuple change α of D
to lead to logd n many changes to Υ(D), where n is the size of the domain of D and
d ∈ N is some global constant. All classes that include DynFO(≤,+,×) are closed under
these reductions. As we have shown that UReach is in DynFO(≤,+,×) under changes
of size logd n for every fixed d ∈ N, we can infer (q,∆) ∈ DynFO(≤,+,×) for every
dynamic query (q,∆) such that (1) q can be reduced to UReach by a polylogarithmically
bounded first-order reduction, and (2) ∆ is a set of bulk change operations of at most
polylogarithmic size. This approach can be pursued also for any other query that can be
maintained under changes of polylogarithmic size.
We hope for new maintainability results by means of polylogarithmically bounded
first-order reductions, and in particular for results for whole classes of queries. Remember
that we cannot deduce from the fact (Reach,∆E) ∈ DynFO [Datta et al. 2018a] that
all NL queries are in DynFO under single-tuple changes, because Reach is not NL-hard
under bounded first-order reductions as defined by Patnaik and Immerman [Patnaik and
Immerman 1997]. Can this result be extended to polylogarithmically bounded first-order
reductions? Are there natural classes of queries such that Reach or UReach is hard for
these classes under polylogarithmically bounded first-order reductions?
Dynamic complexity, dynamic algorithms and practice We have seen in
Chapter 7 that direct implementations of dynamic programs can exhibit competitive
running times. This insight may lead to further research in two directions: more
evaluations of dynamic programs and more emphasis on “efficient” dynamic programs.
We briefly discuss these directions.
The conducted experiments we presented in Section 7.2 naturally cover only a limited
number of use cases. Further experiments using the different implemented routines
answering UReach might be helpful to get a more comprehensive overview of their
efficiency. Most notably, such experiments should involve definable changes with a bridge
bound larger than 2. Because of the encouraging results we obtained for UReach we
advocate the implementation and evaluation of dynamic problems for other queries, as
for example Reach. Further research might facilitate the implementation process. As
dynamic programs are usually only sketched in proofs, a high-level specification language
would be helpful to obtain a first complete formal description of a dynamic program.
In a second step, a compiler could translate such a description into a prototypical SQL
program. In might however still be necessary that domain experts subsequently optimise
such a program to actually obtain an efficient implementation.
More experience regarding efficiency of implemented dynamic programs might also
help to identify classes of dynamic programs in the dynamic complexity framework that
“inherently” lead to efficient implementations, for example because the overall work of
the dynamic program is low. Here, the work of a dynamic program is the (sequential)
155
Chapter 8 Conclusion
time that is necessary to evaluate it.
One key factor that affects the work is the number of tuples for which the update
formulas are evaluated. This number is linear for unary auxiliary relations, which makes
constructing dynamic programs with unary auxiliary structures a prime goal. Our results
for UReach suggest however that also binary auxiliary relations can be updated in
reasonable time if the relations are sparse. An orthogonal approach might be to study
dynamic programs that only update the auxiliary relations for tuples that are within a
certain distance to a changed tuple.
The other key factor that affects the overall work is the work that is necessary to
evaluate an update formula once. Quantifier-free update formulas can be evaluated in
constant time even when they are allowed to use functions, at least if we assume that
functions and atomic formulas can be evaluated in constant time. Apart from update
formulas of this form one could also consider update formalisms that do not lead to
constant parallel time updates but to low work, as for example updates specified by NC1
circuits of linear size.
These options lead to multiple different models of parallel update routines whose
expressive power might be worth to be investigated. Maybe they could also help to bridge
the gap between the research fields of dynamic complexity and dynamic algorithms,
which so far do not seem to profit much from each other.
156
Bibliography
[Abboud and Williams 2014] Amir Abboud and Virginia Vassilevska Williams. 2014.
Popular Conjectures Imply Strong Lower Bounds for Dynamic Problems. In 55th IEEE
Annual Symposium on Foundations of Computer Science, FOCS 2014, Philadelphia,
PA, USA, October 18-21, 2014. IEEE Computer Society, 434–443. https://doi.
org/10.1109/FOCS.2014.53 (Cited on page 2.)
[Abiteboul et al. 1995] Serge Abiteboul, Richard Hull, and Victor Vianu. 1995. Foundations
of Databases. Addison-Wesley, Boston, MA, USA. http://webdam.inria.fr/Alice/
(Cited on pages 3, 9, 11, and 87.)
[Agrawal and Vinay 2008] Manindra Agrawal and V. Vinay. 2008. Arithmetic Circuits: A
Chasm at Depth Four. In 49th Annual IEEE Symposium on Foundations of Computer
Science, FOCS 2008, October 25-28, 2008, Philadelphia, PA, USA. IEEE Computer
Society, 67–75. https://doi.org/10.1109/FOCS.2008.32 (Cited on page 132.)
[Ajtai 1983] Miklós Ajtai. 1983. Σ11-Formulae on finite structures. Annals of Pure
and Applied Logic 24, 1 (1983), 1–48. https://doi.org/10.1016/0168-0072(83)
90038-6 (Cited on pages 19 and 115.)
[Amarilli et al. 2019] Antoine Amarilli, Pierre Bourhis, Stefan Mengel, and Matthias
Niewerth. 2019. Enumeration on Trees with Tractable Combined Complexity and Effi-
cient Updates. In Proceedings of the 38th ACM SIGMOD-SIGACT-SIGAI Symposium
on Principles of Database Systems, PODS 2019, Amsterdam, The Netherlands, June
30 - July 5, 2019., Dan Suciu, Sebastian Skritek, and Christoph Koch (Eds.). ACM,
89–103. https://doi.org/10.1145/3294052.3319702 (Cited on page 9.)
[Ameloot et al. 2013] Tom J. Ameloot, Jan Van den Bussche, and Emmanuel Waller.
2013. On the Expressive Power of Update Primitives. In Proceedings of the 32Nd ACM
SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems (PODS ’13).
157
Bibliography
ACM, New York, NY, USA, 139–150. https://doi.org/10.1145/2463664.2465218
(Cited on page 80.)
[Arnborg et al. 1987] Stefan Arnborg, Derek G. Corneil, and Andrzej Proskurowski. 1987.
Complexity of Finding Embeddings in a K-tree. SIAM J. Algebraic Discrete Methods
8, 2 (April 1987), 277–284. https://doi.org/10.1137/0608024 (Cited on page 40.)
[Arnborg et al. 1991] Stefan Arnborg, Jens Lagergren, and Detlef Seese. 1991. Easy
Problems for Tree-Decomposable Graphs. J. Algorithms 12, 2 (1991), 308–340. https:
//doi.org/10.1016/0196-6774(91)90006-K (Cited on page 39.)
[Bagan et al. 2017] Guillaume Bagan, Angela Bonifati, Radu Ciucanu, George H. L.
Fletcher, Aurélien Lemay, and Nicky Advokaat. 2017. gMark: Schema-Driven Genera-
tion of Graphs and Queries. IEEE Trans. Knowl. Data Eng. 29, 4 (2017), 856–869.
https://doi.org/10.1109/TKDE.2016.2633993 (Cited on pages 142 and 144.)
[Balmin et al. 2004] Andrey Balmin, Yannis Papakonstantinou, and Victor Vianu. 2004.
Incremental validation of XML documents. ACM Trans. Database Syst. 29, 4 (2004),
710–751. https://doi.org/10.1145/1042046.1042050 (Cited on page 9.)
[Barbosa et al. 2004] Denilson Barbosa, Alberto O. Mendelzon, Leonid Libkin, Laurent
Mignet, and Marcelo Arenas. 2004. Efficient Incremental Validation of XML Documents.
In Proceedings of the 20th International Conference on Data Engineering, ICDE 2004,
30 March - 2 April 2004, Boston, MA, USA, Z. Meral Özsoyoglu and Stanley B. Zdonik
(Eds.). IEEE Computer Society, 671–682. https://doi.org/10.1109/ICDE.2004.
1320036 (Cited on page 9.)
[Barrington et al. 1990] David A. Mix Barrington, Neil Immerman, and Howard Straubing.
1990. On Uniformity within NC1. J. Comput. Syst. Sci. 41, 3 (1990), 274–306.
https://doi.org/10.1016/0022-0000(90)90022-D (Cited on pages 4, 15, 22, 128,
and 132.)
[Berkholz et al. 2017] Christoph Berkholz, Jens Keppeler, and Nicole Schweikardt. 2017.
Answering Conjunctive Queries under Updates. In Proceedings of the 36th ACM
SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2017,
Chicago, IL, USA, May 14-19, 2017, Emanuel Sallinger, Jan Van den Bussche, and
Floris Geerts (Eds.). ACM, 303–318. https://doi.org/10.1145/3034786.3034789
(Cited on page 9.)
[Berkholz et al. 2018a] Christoph Berkholz, Jens Keppeler, and Nicole Schweikardt. 2018a.
Answering FO+MOD Queries under Updates on Bounded Degree Databases. ACM
Trans. Database Syst. 43, 2 (2018), 7:1–7:32. https://doi.org/10.1145/3232056
(Cited on page 9.)
[Berkholz et al. 2018b] Christoph Berkholz, Jens Keppeler, and Nicole Schweikardt.
2018b. Answering UCQs under Updates and in the Presence of Integrity Constraints.
In 21st International Conference on Database Theory, ICDT 2018, March 26-29, 2018,
158
Bibliography
Vienna, Austria (LIPIcs), Benny Kimelfeld and Yael Amsterdamer (Eds.), Vol. 98.
Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 8:1–8:19. https://doi.org/10.
4230/LIPIcs.ICDT.2018.8 (Cited on page 9.)
[Björklund et al. 2010] Henrik Björklund, Wouter Gelade, and Wim Martens. 2010.
Incremental XPath evaluation. ACM Trans. Database Syst. 35, 4 (2010), 29:1–29:43.
https://doi.org/10.1145/1862919.1862926 (Cited on page 9.)
[Blakeley et al. 1989] José A. Blakeley, Neil Coburn, and Per-Åke Larson. 1989. Updating
Derived Relations: Detecting Irrelevant and Autonomously Computable Updates. ACM
Trans. Database Syst. 14, 3 (Sept. 1989), 369–400. https://doi.org/10.1145/68012.
68015 (Cited on page 9.)
[Blumensath et al. 2008] Achim Blumensath, Thomas Colcombet, and Christof Löding.
2008. Logical theories and compatible operations. In Logic and Automata: History and
Perspectives [in Honor of Wolfgang Thomas] (Texts in Logic and Games), Jörg Flum,
Erich Grädel, and Thomas Wilke (Eds.), Vol. 2. Amsterdam University Press, 73–106.
(Cited on pages 14 and 49.)
[Bodlaender 1996] Hans L. Bodlaender. 1996. A Linear-Time Algorithm for Finding
Tree-Decompositions of Small Treewidth. SIAM J. Comput. 25, 6 (1996), 1305–1317.
https://doi.org/10.1137/S0097539793251219 (Cited on page 40.)
[Bouyer and Jugé 2017] Patricia Bouyer and Vincent Jugé. 2017. Dynamic Complexity of
the Dyck Reachability. In Foundations of Software Science and Computation Structures
- 20th International Conference, FOSSACS 2017, Held as Part of the European Joint
Conferences on Theory and Practice of Software, ETAPS 2017, Uppsala, Sweden,
April 22-29, 2017, Proceedings (Lecture Notes in Computer Science), Javier Esparza
and Andrzej S. Murawski (Eds.), Vol. 10203. 265–280. https://doi.org/10.1007/
978-3-662-54458-7_16 (Cited on page 101.)
[Bouyer-Decitre et al. 2017] Patricia Bouyer-Decitre, Vincent Jugé, and Nicolas
Markey. 2017. Courcelle’s Theorem Made Dynamic. CoRR abs/1702.05183 (2017).
arXiv:1702.05183 http://arxiv.org/abs/1702.05183 (Cited on page 37.)
[Chandra et al. 1984] Ashok K. Chandra, Larry J. Stockmeyer, and Uzi Vishkin. 1984.
Constant Depth Reducibility. SIAM J. Comput. 13, 2 (1984), 423–439. https:
//doi.org/10.1137/0213028 (Cited on page 138.)
[Chen et al. 2016] Xi Chen, Igor Carboni Oliveira, Rocco A. Servedio, and Li-Yang
Tan. 2016. Near-optimal small-depth lower bounds for small distance connectivity. In
Proceedings of the 48th Annual ACM SIGACT Symposium on Theory of Computing,
STOC 2016, Cambridge, MA, USA, June 18-21, 2016, Daniel Wichs and Yishay Man-
sour (Eds.). ACM, 612–625. https://doi.org/10.1145/2897518.2897534 (Cited
on page 118.)
159
Bibliography
[Comon et al. 2007] H. Comon, M. Dauchet, R. Gilleron, C. Löding, F. Jacquemard, D.
Lugiez, S. Tison, and M. Tommasi. 2007. Tree Automata Techniques and Applications.
Available on: http://www.grappa.univ-lille3.fr/tata. release October, 12th 2007.
(Cited on page 117.)
[Cook 1985] Stephen A. Cook. 1985. A Taxonomy of Problems with Fast Parallel
Algorithms. Information and Control 64, 1-3 (1985), 2–21. https://doi.org/10.
1016/S0019-9958(85)80041-3 (Cited on page 132.)
[Courcelle 1990] Bruno Courcelle. 1990. The Monadic Second-Order Logic of Graphs.
I. Recognizable Sets of Finite Graphs. Inf. Comput. 85, 1 (1990), 12–75. https:
//doi.org/10.1016/0890-5401(90)90043-H (Cited on pages 39 and 40.)
[Courcelle 1994] Bruno Courcelle. 1994. The Monadic Second order Logic of Graphs
VI: on Several Representations of Graphs By Relational Structures. Discrete Applied
Mathematics 54, 2-3 (1994), 117–149. https://doi.org/10.1016/0166-218X(94)
90019-1 (Cited on page 46.)
[Courcelle and Engelfriet 2012] Bruno Courcelle and Joost Engelfriet. 2012. Graph Struc-
ture and Monadic Second-Order Logic - A Language-Theoretic Approach. Encyclopedia
of mathematics and its applications, Vol. 138. Cambridge University Press, New York,
NY, USA. (Cited on page 46.)
[Datta et al. 2014] Samir Datta, William Hesse, and Raghav Kulkarni. 2014. Dynamic
Complexity of Directed Reachability and Other Problems. In Automata, Languages, and
Programming - 41st International Colloquium, ICALP 2014, Copenhagen, Denmark,
July 8-11, 2014, Proceedings, Part I (Lecture Notes in Computer Science), Javier
Esparza, Pierre Fraigniaud, Thore Husfeldt, and Elias Koutsoupias (Eds.), Vol. 8572.
Springer, 356–367. https://doi.org/10.1007/978-3-662-43948-7_30 (Cited on
pages 5 and 36.)
[Datta et al. 2015] Samir Datta, Raghav Kulkarni, Anish Mukherjee, Thomas Schwentick,
and Thomas Zeume. 2015. Reachability is in DynFO. In Automata, Languages, and
Programming - 42nd International Colloquium, ICALP 2015, Kyoto, Japan, July
6-10, 2015, Proceedings, Part II (Lecture Notes in Computer Science), Magnús M.
Halldórsson, Kazuo Iwama, Naoki Kobayashi, and Bettina Speckmann (Eds.), Vol. 9135.
Springer, 159–170. https://doi.org/10.1007/978-3-662-47666-6_13 (Cited on
pages 27, 33, 36, and 151.)
[Datta et al. 2018a] Samir Datta, Raghav Kulkarni, Anish Mukherjee, Thomas Schwentick,
and Thomas Zeume. 2018a. Reachability Is in DynFO. J. ACM 65, 5, Article 33 (Aug.
2018), 24 pages. https://doi.org/10.1145/3212685 (Cited on pages 3, 5, 19, 20,
21, 22, 23, 25, 26, 28, 29, 32, 33, 36, 101, 114, 126, 149, 151, and 155.)
[Datta et al. 2009] Samir Datta, Nutan Limaye, Prajakta Nimbhorkar, Thomas Thierauf,
and Fabian Wagner. 2009. Planar Graph Isomorphism is in Log-Space. In Proceedings
of the 24th Annual IEEE Conference on Computational Complexity, CCC 2009, Paris,
160
Bibliography
France, 15-18 July 2009. IEEE Computer Society, 203–214. https://doi.org/10.
1109/CCC.2009.16 (Cited on page 78.)
[Datta et al. 2017] Samir Datta, Anish Mukherjee, Thomas Schwentick, Nils Vortmeier,
and Thomas Zeume. 2017. A Strategy for Dynamic Programs: Start over and Muddle
Through. In 44th International Colloquium on Automata, Languages, and Programming,
ICALP 2017, July 10-14, 2017, Warsaw, Poland (LIPIcs), Ioannis Chatzigiannakis,
Piotr Indyk, Fabian Kuhn, and Anca Muscholl (Eds.), Vol. 80. Schloss Dagstuhl -
Leibniz-Zentrum fuer Informatik, 98:1–98:14. https://doi.org/10.4230/LIPIcs.
ICALP.2017.98 (Cited on pages 28, 33, and 78.)
[Datta et al. 2019] Samir Datta, Anish Mukherjee, Thomas Schwentick, Nils Vortmeier,
and Thomas Zeume. 2019. A Strategy for Dynamic Programs: Start over and Muddle
through. Logical Methods in Computer Science Volume 15, Issue 2 (May 2019).
https://doi.org/10.23638/LMCS-15(2:12)2019 (Cited on pages 28, 33, 52, and 78.)
[Datta et al. 2018b] Samir Datta, Anish Mukherjee, Nils Vortmeier, and Thomas Zeume.
2018b. Reachability and Distances under Multiple Changes. In 45th International
Colloquium on Automata, Languages, and Programming, ICALP 2018, July 9-13,
2018, Prague, Czech Republic (LIPIcs), Ioannis Chatzigiannakis, Christos Kaklamanis,
Dániel Marx, and Donald Sannella (Eds.), Vol. 107. Schloss Dagstuhl - Leibniz-Zentrum
fuer Informatik, 120:1–120:14. https://doi.org/10.4230/LIPIcs.ICALP.2018.120
(Cited on pages 32, 33, 114, 126, 127, 130, 138, 139, 140, 151, and 154.)
[Demetrescu et al. 2010] Camil Demetrescu, David Eppstein, Zvi Galil, and Giuseppe F.
Italiano. 2010. Algorithms and Theory of Computation Handbook. Chapman &
Hall/CRC, Chapter Dynamic Graph Algorithms, 9–9. http://dl.acm.org/citation.
cfm?id=1882757.1882766 (Cited on page 2.)
[Denenberg et al. 1986] Larry Denenberg, Yuri Gurevich, and Saharon Shelah. 1986.
Definability by Constant-Depth Polynomial-Size Circuits. Information and Control 70,
2/3 (1986), 216–240. https://doi.org/10.1016/S0019-9958(86)80006-7 (Cited
on page 115.)
[Dong et al. 1999] Guozhu Dong, Leonid Libkin, Jianwen Su, and Limsoon Wong.
1999. Maintaining Transitive Closure of Graphs in SQL. Int. Journal of Information
Technology 51, 1 (1999), 46–78. (Cited on pages 10 and 141.)
[Dong et al. 2003] Guozhu Dong, Leonid Libkin, and Limsoon Wong. 2003. Incremental
recomputation in local languages. Inf. Comput. 181, 2 (2003), 88–98. https://doi.
org/10.1016/S0890-5401(03)00017-8 (Cited on pages 5 and 37.)
[Dong and Pang 1997] Guozhu Dong and Chaoyi Pang. 1997. Maintaining Transitive
Closure in First Order After Node-Set and Edge-Set Deletions. Inf. Process. Lett. 62,
4 (1997), 193–199. https://doi.org/10.1016/S0020-0190(97)00066-5 (Cited on
pages 4 and 6.)
161
Bibliography
[Dong and Su 1993] Guozhu Dong and Jianwen Su. 1993. First-Order Incremental Evalu-
ation of Datalog Queries. In Database Programming Languages (DBPL-4), Proceedings
of the Fourth International Workshop on Database Programming Languages - Object
Models and Languages, Manhattan, New York City, USA, 30 August - 1 September
1993 (Workshops in Computing), Catriel Beeri, Atsushi Ohori, and Dennis E. Shasha
(Eds.). Springer, 295–308. (Cited on page 2.)
[Dong and Su 1995] Guozhu Dong and Jianwen Su. 1995. Incremental and Decremental
Evaluation of Transitive Closure by First-Order Queries. Inf. Comput. 120, 1 (1995),
101–106. https://doi.org/10.1006/inco.1995.1102 (Cited on pages 5, 20, 22, 23,
36, and 83.)
[Dong and Su 1997] Guozhu Dong and Jianwen Su. 1997. Deterministic FOIES are
Strictly Weaker. Ann. Math. Artif. Intell. 19, 1-2 (1997), 127–146. https://doi.org/
10.1023/A:1018951521198 (Cited on pages 5, 20, 100, and 153.)
[Dong and Su 1998] Guozhu Dong and Jianwen Su. 1998. Arity Bounds in First-Order
Incremental Evaluation and Definition of Polynomial Time Database Queries. J.
Comput. Syst. Sci. 57, 3 (1998), 289–308. https://doi.org/10.1006/jcss.1998.
1565 (Cited on pages 5, 10, 22, 36, 37, 61, 74, 83, 85, 86, 122, 123, 126, and 143.)
[Dong et al. 1995] Guozhu Dong, Jianwen Su, and Rodney W. Topor. 1995. Nonrecursive
Incremental Evaluation of Datalog Queries. Ann. Math. Artif. Intell. 14, 2-4 (1995),
187–223. https://doi.org/10.1007/BF01530820 (Cited on pages 2, 6, 20, 22, 23,
36, and 110.)
[Dong and Topor 1992] Guozhu Dong and Rodney W. Topor. 1992. Incremental Evaluation
of Datalog Queries. In Database Theory - ICDT’92, 4th International Conference,
Berlin, Germany, October 14-16, 1992, Proceedings (Lecture Notes in Computer
Science), Joachim Biskup and Richard Hull (Eds.), Vol. 646. Springer, 282–296. https:
//doi.org/10.1007/3-540-56039-4_48 (Cited on page 2.)
[Dong and Zhang 2000] Guozhu Dong and Louxin Zhang. 2000. Separating Auxiliary
Arity Hierarchy of First-Order Incremental Evaluation Systems Using (3k+1)-ary Input
Relations. Int. J. Found. Comput. Sci. 11, 4 (2000), 573–578. https://doi.org/10.
1142/S0129054100000302 (Cited on pages 5 and 61.)
[Durand et al. 2006] Arnaud Durand, Clemens Lautemann, and Malika More. 2006. Count-
ing Results in Weak Formalisms. In Circuits, Logic, and Games, 08.11. - 10.11.2006
(Dagstuhl Seminar Proceedings), Thomas Schwentick, Denis Thérien, and Heribert
Vollmer (Eds.), Vol. 06451. Internationales Begegnungs- und Forschungszentrum fuer
Informatik (IBFI), Schloss Dagstuhl, Germany. http://drops.dagstuhl.de/opus/
volltexte/2007/976 (Cited on page 121.)
[Eisenberg and Melton 1999] Andrew Eisenberg and Jim Melton. 1999. SQL: 1999,
Formerly Known As SQL3. SIGMOD Rec. 28, 1 (March 1999), 131–138. https:
//doi.org/10.1145/309844.310075 (Cited on page 3.)
162
Bibliography
[Elberfeld et al. 2016] Michael Elberfeld, Martin Grohe, and Till Tantau. 2016. Where
First-Order and Monadic Second-Order Logic Coincide. ACM Trans. Comput. Log. 17,
4 (2016), 25:1–25:18. https://doi.org/10.1145/2946799 (Cited on page 47.)
[Elberfeld et al. 2010] Michael Elberfeld, Andreas Jakoby, and Till Tantau. 2010. Logspace
Versions of the Theorems of Bodlaender and Courcelle. In 51th Annual IEEE Symposium
on Foundations of Computer Science, FOCS 2010, October 23-26, 2010, Las Vegas,
Nevada, USA. IEEE Computer Society, 143–152. https://doi.org/10.1109/FOCS.
2010.21 (Cited on pages 39, 41, and 76.)
[Etessami 1998] Kousha Etessami. 1998. Dynamic Tree Isomorphism via First-Order
Updates. In Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Sympo-
sium on Principles of Database Systems, June 1-3, 1998, Seattle, Washington, USA,
Alberto O. Mendelzon and Jan Paredaens (Eds.). ACM Press, 235–243. https:
//doi.org/10.1145/275487.275514 (Cited on pages 4, 5, 21, 22, 23, 36, 37, 74, 76,
78, 80, and 86.)
[Fagin et al. 1985] Ronald Fagin, Maria M. Klawe, Nicholas Pippenger, and Larry J.
Stockmeyer. 1985. Bounded-Depth, Polynomial-Size Circuits for Symmetric Functions.
Theor. Comput. Sci. 36 (1985), 239–250. https://doi.org/10.1016/0304-3975(85)
90045-3 (Cited on page 115.)
[Feferman 1957] Solomon Feferman. 1957. Some recent work of Ehrenfeucht and Fraïssé.
In Proc. Summer Institute of Symbolic Logic. 201–209. (Cited on page 14.)
[Feferman and Vaught 1959] Solomon Feferman and Robert L. Vaught. 1959. The first
order properties of algebraic systems. Fund. Math. 47 (1959), 57–103. (Cited on
page 14.)
[Furst et al. 1984] Merrick L. Furst, James B. Saxe, and Michael Sipser. 1984. Parity,
Circuits, and the Polynomial-Time Hierarchy. Mathematical Systems Theory 17, 1
(1984), 13–27. https://doi.org/10.1007/BF01744431 (Cited on pages 19 and 115.)
[Gelade et al. 2012] Wouter Gelade, Marcel Marquardt, and Thomas Schwentick. 2012.
The dynamic complexity of formal languages. ACM Trans. Comput. Log. 13, 3 (2012),
19. https://doi.org/10.1145/2287718.2287719 (Cited on pages 5, 21, 22, 25, 36,
37, 38, 39, 58, 60, 66, 67, 78, 100, 101, 102, 103, 104, 124, 125, 126, 150, and 152.)
[Godsil 1993] Chris D. Godsil. 1993. Algebraic combinatorics. Chapman and Hall. (Cited
on pages 129 and 130.)
[Gottlob et al. 2001] Georg Gottlob, Nicola Leone, and Francesco Scarcello. 2001. The
complexity of acyclic conjunctive queries. J. ACM 48, 3 (2001), 431–498. https:
//doi.org/10.1145/382780.382783 (Cited on page 87.)
[Grädel and Siebertz 2012] Erich Grädel and Sebastian Siebertz. 2012. Dynamic definabil-
ity. In 15th International Conference on Database Theory, ICDT ’12, Berlin, Germany,
163
Bibliography
March 26-29, 2012, Alin Deutsch (Ed.). ACM, 236–248. https://doi.org/10.1145/
2274576.2274601 (Cited on pages 4, 5, 19, 21, 22, 23, 36, 37, 80, and 153.)
[Grohe 2014] Martin Grohe. 2014. Algorithmic Meta Theorems for Sparse Graph Classes.
In Computer Science - Theory and Applications - 9th International Computer Science
Symposium in Russia, CSR 2014, Moscow, Russia, June 7-11, 2014. Proceedings
(Lecture Notes in Computer Science), Edward A. Hirsch, Sergei O. Kuznetsov, Jean-
Éric Pin, and Nikolay K. Vereshchagin (Eds.), Vol. 8476. Springer, 16–22. https:
//doi.org/10.1007/978-3-319-06686-8_2 (Cited on pages 38 and 39.)
[Gupta and Mumick 1999] Ashish Gupta and Iderpal Singh Mumick (Eds.). 1999. Materi-
alized Views: Techniques, Implementations, and Applications. MIT Press, Cambridge,
MA, USA. (Cited on page 8.)
[Håstad 1986] Johan Håstad. 1986. Almost Optimal Lower Bounds for Small Depth
Circuits. In Proceedings of the 18th Annual ACM Symposium on Theory of Computing,
May 28-30, 1986, Berkeley, California, USA, Juris Hartmanis (Ed.). ACM, 6–20.
https://doi.org/10.1145/12130.12132 (Cited on pages 119 and 120.)
[Henderson and Searle 1981] H. Henderson and S. Searle. 1981. On Deriving the Inverse of a
Sum of Matrices. SIAM Rev. 23, 1 (1981), 53–60. https://doi.org/10.1137/1023004
(Cited on pages 127 and 130.)
[Henzinger and Fredman 1998] Monika Rauch Henzinger and Michael L. Fredman. 1998.
Lower Bounds for Fully Dynamic Connectivity Problems in Graphs. Algorithmica 22,
3 (1998), 351–362. https://doi.org/10.1007/PL00009228 (Cited on page 2.)
[Hesse 2003a] William Hesse. 2003a. The dynamic complexity of transitive closure is in
DynTC0. Theor. Comput. Sci. 296, 3 (2003), 473–485. https://doi.org/10.1016/
S0304-3975(02)00740-5 (Cited on pages 5, 36, 128, 129, and 130.)
[Hesse 2003b] William Hesse. 2003b. Dynamic Computational Complexity. Ph.D. Disser-
tation. University of Massachusetts Amherst. (Cited on pages 5, 10, 19, 21, 22, 23, 32,
36, 37, 100, and 150.)
[Hesse et al. 2002] William Hesse, Eric Allender, and David A. Mix Barrington. 2002. Uni-
form constant-depth threshold circuits for division and iterated multiplication. J. Com-
put. Syst. Sci. 65, 4 (2002), 695–716. https://doi.org/10.1016/S0022-0000(02)
00025-9 (Cited on pages 131, 132, 133, 135, and 136.)
[Hesse and Immerman 2002] William Hesse and Neil Immerman. 2002. Complete Problems
for Dynamic Complexity Classes. In 17th IEEE Symposium on Logic in Computer
Science (LICS 2002), 22-25 July 2002, Copenhagen, Denmark, Proceedings. IEEE
Computer Society, 313. https://doi.org/10.1109/LICS.2002.1029839 (Cited on
pages 5 and 19.)
164
Bibliography
[Idris et al. 2017] Muhammad Idris, Martín Ugarte, and Stijn Vansummeren. 2017. The
Dynamic Yannakakis Algorithm: Compact and Efficient Query Processing Under
Updates. In Proceedings of the 2017 ACM International Conference on Management
of Data, SIGMOD Conference 2017, Chicago, IL, USA, May 14-19, 2017, Semih
Salihoglu, Wenchao Zhou, Rada Chirkova, Jun Yang, and Dan Suciu (Eds.). ACM,
1259–1274. https://doi.org/10.1145/3035918.3064027 (Cited on page 9.)
[Idris et al. 2018] Muhammad Idris, Martín Ugarte, Stijn Vansummeren, Hannes Voigt,
and Wolfgang Lehner. 2018. Conjunctive Queries with Inequalities Under Updates.
PVLDB 11, 7 (2018), 733–745. https://doi.org/10.14778/3192965.3192966
(Cited on page 9.)
[Immerman 1987] Neil Immerman. 1987. Languages that Capture Complexity Classes.
SIAM J. Comput. 16, 4 (1987), 760–778. https://doi.org/10.1137/0216051 (Cited
on page 120.)
[Immerman 1999] Neil Immerman. 1999. Descriptive complexity. Springer. https:
//doi.org/10.1007/978-1-4612-0539-5 (Cited on pages 2, 3, 13, 15, 30, and 76.)
[Impagliazzo and Paturi 2001] Russell Impagliazzo and Ramamohan Paturi. 2001. On
the Complexity of k-SAT. J. Comput. Syst. Sci. 62, 2 (2001), 367–375. https:
//doi.org/10.1006/jcss.2000.1727 (Cited on page 139.)
[Jenner et al. 1998] Birgit Jenner, Pierre McKenzie, and Jacobo Torán. 1998. A Note on
the Hardness of Tree Isomorphism. In Proceedings of the 13th Annual IEEE Conference
on Computational Complexity, Buffalo, New York, USA, June 15-18, 1998. IEEE
Computer Society, 101–105. https://doi.org/10.1109/CCC.1998.694595 (Cited
on page 38.)
[Kähler and Wilke 2003] Detlef Kähler and Thomas Wilke. 2003. Program Complexity
of Dynamic LTL Model Checking. In Computer Science Logic, 17th International
Workshop, CSL 2003, 12th Annual Conference of the EACSL, and 8th Kurt Gödel
Colloquium, KGC 2003, Vienna, Austria, August 25-30, 2003, Proceedings (Lecture
Notes in Computer Science), Matthias Baaz and Johann A. Makowsky (Eds.), Vol. 2803.
Springer, 271–284. https://doi.org/10.1007/978-3-540-45220-1_23 (Cited on
page 128.)
[Kara et al. 2019] Ahmet Kara, Hung Q. Ngo, Milos Nikolic, Dan Olteanu, and Haozhe
Zhang. 2019. Counting Triangles under Updates in Worst-Case Optimal Time. In
22nd International Conference on Database Theory, ICDT 2019, March 26-28, 2019,
Lisbon, Portugal (LIPIcs), Pablo Barceló and Marco Calautti (Eds.), Vol. 127. Schloss
Dagstuhl - Leibniz-Zentrum fuer Informatik, 4:1–4:18. https://doi.org/10.4230/
LIPIcs.ICDT.2019.4 (Cited on page 9.)
[Koch 2010] Christoph Koch. 2010. Incremental query evaluation in a ring of databases.
In Proceedings of the Twenty-Ninth ACM SIGMOD-SIGACT-SIGART Symposium on
165
Bibliography
Principles of Database Systems (PODS). 87–98. https://doi.org/10.1145/1807085.
1807100 (Cited on pages 9 and 141.)
[Koch et al. 2014] Christoph Koch, Yanif Ahmad, Oliver Kennedy, Milos Nikolic, Andres
Nötzli, Daniel Lupei, and Amir Shaikhha. 2014. DBToaster: higher-order delta
processing for dynamic, frequently fresh views. VLDB J. 23, 2 (2014), 253–278.
https://doi.org/10.1007/s00778-013-0348-4 (Cited on pages 9 and 141.)
[Ladner and Fischer 1980] Richard E. Ladner and Michael J. Fischer. 1980. Parallel
Prefix Computation. J. ACM 27, 4 (Oct. 1980), 831–838. https://doi.org/10.
1145/322217.322232 (Cited on page 103.)
[Libkin 2003] Leonid Libkin. 2003. Expressive power of SQL. Theor. Comput. Sci. 296,
3 (2003), 379–404. https://doi.org/10.1016/S0304-3975(02)00736-3 (Cited on
page 3.)
[Libkin 2004] Leonid Libkin. 2004. Elements of Finite Model Theory. Springer. (Cited
on page 11.)
[Libkin and Wong 1997] Leonid Libkin and Limsoon Wong. 1997. Incremental Re-
computation of Recursive Queries with Nested Sets and Aggregate Functions. In
Database Programming Languages, 6th International Workshop, DBPL-6, Estes
Park, Colorado, USA, August 18-20, 1997, Proceedings (Lecture Notes in Com-
puter Science), Sophie Cluet and Richard Hull (Eds.), Vol. 1369. Springer, 222–238.
https://doi.org/10.1007/3-540-64823-2_13 (Cited on pages 9 and 10.)
[Mahajan and Vinay 1999] Meena Mahajan and V. Vinay. 1999. Determinant: Old
Algorithms, New Insights. SIAM J. Discrete Math. 12, 4 (1999), 474–490. https:
//doi.org/10.1137/S0895480198338827 (Cited on page 132.)
[Makowsky 2004] Johann A. Makowsky. 2004. Algorithmic uses of the Feferman-Vaught
Theorem. Ann. Pure Appl. Logic 126, 1-3 (2004), 159–213. https://doi.org/10.
1016/j.apal.2003.11.002 (Cited on pages 14 and 55.)
[Miltersen 1999] Peter Bro Miltersen. 1999. Cell Probe Complexity - a Survey. In 19th
Conference on the Foundations of Software Technology and Theoretical Computer
Science (FSTTCS), 1999. Advances in Data Structures Workshop. (Cited on page 2.)
[Miltersen et al. 1994] Peter Bro Miltersen, Sairam Subramanian, Jeffrey Scott Vitter,
and Roberto Tamassia. 1994. Complexity Models for Incremental Computation. Theor.
Comput. Sci. 130, 1 (1994), 203–236. https://doi.org/10.1016/0304-3975(94)
90159-7 (Cited on page 9.)
[Mukherjee 2019] Anish Mukherjee. 2019. Static and Dynamic Complexity of Reachability,
Matching and Related Problems. Ph.D. Dissertation. Chennai Mathematical Institute.
(Cited on page 138.)
166
Bibliography
[Muñoz et al. 2016] Pablo Muñoz, Nils Vortmeier, and Thomas Zeume. 2016. Dynamic
Graph Queries. In 19th International Conference on Database Theory, ICDT 2016,
Bordeaux, France, March 15-18, 2016 (LIPIcs), Wim Martens and Thomas Zeume
(Eds.), Vol. 48. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 14:1–14:18. https:
//doi.org/10.4230/LIPIcs.ICDT.2016.14 (Cited on pages 36 and 101.)
[Nikolic et al. 2016] Milos Nikolic, Mohammad Dashti, and Christoph Koch. 2016. How
to Win a Hot Dog Eating Contest: Distributed Incremental View Maintenance with
Batch Updates. In Proceedings of the 2016 International Conference on Management
of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01,
2016, Fatma Özcan, Georgia Koutrika, and Sam Madden (Eds.). ACM, 511–526.
https://doi.org/10.1145/2882903.2915246 (Cited on pages 9 and 142.)
[Otto 2000] Martin Otto. 2000. Epsilon-Logic Is More Expressive Than First-Order Logic
Over Finite Structures. J. Symb. Log. 65, 4 (2000), 1749–1757. https://doi.org/
10.2307/2695073 (Cited on page 28.)
[Pang et al. 2005] Chaoyi Pang, Guozhu Dong, and Kotagiri Ramamohanarao. 2005.
Incremental maintenance of shortest distance and transitive closure in first-order logic
and SQL. ACM Trans. Database Syst. 30, 3 (2005), 698–721. https://doi.org/10.
1145/1093382.1093384 (Cited on pages 22, 141, and 142.)
[Patnaik and Immerman 1994] Sushant Patnaik and Neil Immerman. 1994. Dyn-FO: A
Parallel, Dynamic Complexity Class. In Proceedings of the Thirteenth ACM SIGACT-
SIGMOD-SIGART Symposium on Principles of Database Systems, May 24-26, 1994,
Minneapolis, Minnesota, USA, Victor Vianu (Ed.). ACM Press, 210–221. https:
//doi.org/10.1145/182591.182614 (Cited on pages 2, 22, and 23.)
[Patnaik and Immerman 1997] Sushant Patnaik and Neil Immerman. 1997. Dyn-FO:
A Parallel, Dynamic Complexity Class. J. Comput. Syst. Sci. 55, 2 (1997), 199–209.
https://doi.org/10.1006/jcss.1997.1520 (Cited on pages 2, 4, 5, 9, 11, 17, 18,
19, 20, 21, 22, 23, 35, 36, 80, 83, 92, 93, 100, 153, 154, and 155.)
[Przymus et al. 2010] Piotr Przymus, Aleksandra Boniewicz, Marta Burzanska, and
Krzysztof Stencel. 2010. Recursive Query Facilities in Relational Databases: A Survey.
In Database Theory and Application, Bio-Science and Bio-Technology - International
Conferences, DTA and BSBT 2010. Proceedings (Communications in Computer and
Information Science), Yanchun Zhang, Alfredo Cuzzocrea, Jianhua Ma, Kyo-Il Chung,
Tughrul Arslan, and Xiaofeng Song (Eds.), Vol. 118. Springer, 89–99. https://doi.
org/10.1007/978-3-642-17622-7_10 (Cited on page 3.)
[Robertson and Seymour 1986] Neil Robertson and Paul D. Seymour. 1986. Graph
Minors. II. Algorithmic Aspects of Tree-Width. J. Algorithms 7, 3 (1986), 309–322.
https://doi.org/10.1016/0196-6774(86)90023-4 (Cited on page 39.)
167
Bibliography
[Robertson and Seymour 1990] Neil Robertson and Paul D. Seymour. 1990. Graph minors.
IV. Tree-width and well-quasi-ordering. J. Comb. Theory, Ser. B 48, 2 (1990), 227–254.
https://doi.org/10.1016/0095-8956(90)90120-O (Cited on page 77.)
[Schmidt et al. 2020] Jonas Schmidt, Thomas Schwentick, Nils Vortmeier, Thomas Zeume,
and Ioannis Kokkinis. 2020. Dynamic Complexity Meets Parameterised Algorithms.
(2020). Accepted for publication at CSL 2020. (Cited on pages 32 and 140.)
[Schwentick et al. 2015] Thomas Schwentick, Nils Vortmeier, and Thomas Zeume. 2015.
Static Analysis for Logic-based Dynamic Programs. In 24th EACSL Annual Conference
on Computer Science Logic, CSL 2015, September 7-10, 2015, Berlin, Germany
(LIPIcs), Stephan Kreutzer (Ed.), Vol. 41. Schloss Dagstuhl - Leibniz-Zentrum fuer
Informatik, 308–324. https://doi.org/10.4230/LIPIcs.CSL.2015.308 (Cited on
pages 153 and 154.)
[Schwentick et al. 2017a] Thomas Schwentick, Nils Vortmeier, and Thomas Zeume. 2017a.
Dynamic Complexity under Definable Changes. In 20th International Conference on
Database Theory, ICDT 2017, March 21-24, 2017, Venice, Italy (LIPIcs), Michael
Benedikt and Giorgio Orsi (Eds.), Vol. 68. Schloss Dagstuhl - Leibniz-Zentrum fuer
Informatik, 19:1–19:18. https://doi.org/10.4230/LIPIcs.ICDT.2017.19 (Cited
on pages 33, 111, and 112.)
[Schwentick et al. 2017b] Thomas Schwentick, Nils Vortmeier, and Thomas Zeume.
2017b. Dynamic Complexity under Definable Changes. CoRR abs/1701.02494 (2017).
arXiv:1701.02494 http://arxiv.org/abs/1701.02494 (Cited on page 94.)
[Schwentick et al. 2018] Thomas Schwentick, Nils Vortmeier, and Thomas Zeume. 2018.
Dynamic Complexity Under Definable Changes. ACM Trans. Database Syst. 43, 3,
Article 12 (Oct. 2018), 38 pages. https://doi.org/10.1145/3241040 (Cited on
pages 17, 23, 29, 78, 111, 112, and 150.)
[Schwentick and Zeume 2016] Thomas Schwentick and Thomas Zeume. 2016. Dynamic
complexity: recent updates. SIGLOG News 3, 2 (2016), 30–52. http://doi.acm.
org/10.1145/2948896.2948899 (Cited on pages 17, 20, and 23.)
[Selberg 1949] Atle Selberg. 1949. An elementary proof of the prime-number theorem.
Ann. Math 50, 2 (1949), 305–313. (Cited on pages 88 and 90.)
[Shalygina and Novikov 2017] Galina Shalygina and Boris Novikov. 2017. Implement-
ing Common Table Expressions for MariaDB. In Second Conference on Software
Engineering and Information Management (SEIM-2017). 12–17. (Cited on page 3.)
[Shelah 1975] Saharon Shelah. 1975. The monadic theory of order. Annals of Mathematics
(1975), 379–419. https://doi.org/10.2307/1971037 (Cited on page 49.)
[Siebertz 2011] Sebastian Siebertz. 2011. Dynamic Definability. Diploma Thesis. RWTH
Aachen. (Cited on page 6.)
168
Bibliography
[Vista 1998] Dimitra Vista. 1998. Integration of Incremental View Maintenance into
Query Optimizers. In Advances in Database Technology - EDBT’98, 6th International
Conference on Extending Database Technology, Valencia, Spain, March 23-27, 1998,
Proceedings (Lecture Notes in Computer Science), Hans-Jörg Schek, Fèlix Saltor,
Isidro Ramos, and Gustavo Alonso (Eds.), Vol. 1377. Springer, 374–388. https:
//doi.org/10.1007/BFb0100997 (Cited on page 142.)
[Vollmer 1999] Heribert Vollmer. 1999. Introduction to Circuit Complexity - A Uniform
Approach. Springer. (Cited on page 57.)
[Vortmeier 2013] Nils Vortmeier. 2013. Komplexitätstheorie verlaufsunabhängiger dy-
namischer Programme. Master’s thesis. TU Dortmund University. (Cited on pages 37,
101, and 152.)
[Vortmeier and Zeume 2020] Nils Vortmeier and Thomas Zeume. 2020. Dynamic Com-
plexity of Parity Exists Queries. (2020). Accepted for publication at CSL 2020. (Cited
on page 78.)
[Weber and Schwentick 2007] Volker Weber and Thomas Schwentick. 2007. Dynamic
Complexity Theory Revisited. Theory Comput. Syst. 40, 4 (2007), 355–377. https:
//doi.org/10.1007/s00224-006-1312-0 (Cited on pages 5, 21, 23, 36, and 101.)
[Zeume 2015] Thomas Zeume. 2015. Small dynamic complexity classes. Ph.D. Dissertation.
TU Dortmund University, Germany. http://hdl.handle.net/2003/34163 (Cited
on pages 4, 10, 20, 21, 36, 37, 61, 68, 74, 78, 80, 94, 101, and 150.)
[Zeume 2017] Thomas Zeume. 2017. The dynamic descriptive complexity of k-clique. Inf.
Comput. 256 (2017), 9–22. https://doi.org/10.1016/j.ic.2017.04.005 (Cited
on pages 5, 21, 37, 38, 61, 66, 67, and 78.)
[Zeume and Schwentick 2015] Thomas Zeume and Thomas Schwentick. 2015. On the
quantifier-free dynamic complexity of Reachability. Inf. Comput. 240 (2015), 108–129.
https://doi.org/10.1016/j.ic.2014.09.011 (Cited on pages 5, 21, 23, 25, 37, 58,
61, 66, 67, 68, 73, 78, 96, 98, and 154.)
[Zeume and Schwentick 2017] Thomas Zeume and Thomas Schwentick. 2017. Dynamic
conjunctive queries. J. Comput. Syst. Sci. 88 (2017), 3–26. https://doi.org/10.
1016/j.jcss.2017.03.014 (Cited on pages 5, 19, 21, 22, 33, and 108.)
169