On the reasonable effectiveness of Relational Diagrams: explaining relational query patterns and the pattern expressiveness of relational languages

@Article{Gatterbauer2024ReasonableEffectiveness,
  author    = {Wolfgang Gatterbauer, Cody Dunne},
  journal   = {Proc.\ ACM on Management of Data—SIGMOD/PACMMOD},
  title     = {On the reasonable effectiveness of {Relational Diagrams}: explaining relational query patterns and the pattern expressiveness of relational languages},
  year      = {2024},
  note      = {SIGMOD '24 Best Paper Honorable Mention (1/3)! Preprint at \url{https://arxiv.org/abs/2401.04758}. Supplemental material at \url{https://osf.io/q9g6u/}.},
  number    = {1},
  volume    = {2},
  abstract  = {Comparing relational languages by their logical expressiveness is well understood. Less well understood is how to compare relational languages by their ability to represent relational query patterns. Indeed, what are query patterns other than "a certain way of writing a query"? And how can query patterns be defined across procedural and declarative languages, irrespective of their syntax? To the best of our knowledge, we provide the first semantic definition of relational query patterns by using a variant of structure-preserving mappings between the relational tables of queries. This formalism allows us to analyze the relative pattern expressiveness of relational language fragments and create a hierarchy of languages with equal logical expressiveness yet different pattern expressiveness. Notably, for the non-disjunctive language fragment, we show that relational calculus can express a larger class of patterns than the basic operators of relational algebra.

Our language-independent definition of query patterns opens novel paths for assisting database users. For example, these patterns could be leveraged to create visual query representations that faithfully represent query patterns, speed up interpretation, and provide visual feedback during query editing. As a concrete example, we propose Relational Diagrams, a complete and sound diagrammatic representation of safe relational calculus that is provably (i) unambiguous, (ii) relationally complete, and (iii) able to represent all query patterns for unions of non-disjunctive queries. Among all diagrammatic representations for relational queries that we are aware of, ours is the only one with these three properties. Furthermore, our anonymously preregistered user study shows that Relational Diagrams allow users to recognize patterns meaningfully faster and more accurately than SQL.},
  articleno = {61},
  doi       = {10.1145/3639316},
  series    = {SIGMOD/PACMMOD},
}

Cody Dunne, Vis Lab — Northeastern University
West Village H, Room 302F
440 Huntington Ave, Boston, MA 02115, USA