Continued from A1: Foundations of Data Analysis Workflow Validation
Description
With the execution of data analysis workflows (DAWs) spanning multiple data centers, the assumption of centralized access to provenance traces is problematic: policies prevent access to low-level data, and transferring all monitoring data would incur severe overhead. A decentralized model is therefore preferable, in which the required computation is pushed to the data sources as far as possible. This subproject explores how query discovery can be realized at distributed sources.

Scientists
- Luisa Gerlach
- Hannes Ueck
Publications
2025
A Quantum-Leap into Schema Matching: Beyond 1-to-1 Matchings Proceedings Article
In: Association for Computing Machinery, New York, NY, USA, 2025.
DISCES: Systematic Discovery of Event Stream Queries Journal Article
In: Proc. ACM Manag. Data, vol. 3, no. 1, pp. 32:1–32:26, 2025.
Embracing Change: Incremental Updates of Discovered Event Queries Proceedings Article
In: Klettke, Meike; Schenkel, Ralf; Henrich, Andreas; Nicklas, Daniela; Schüle, Maximilian E.; Meyer-Wegener, Klaus (Ed.): Datenbanksysteme für Business, Technologie und Web (BTW 2025), 21. Fachtagung des GI-Fachbereichs „Datenbanken und Informationssysteme“ (DBIS), 03.-07. März 2025, Bamberg, Germany, Proceedings, pp. 417–437, Gesellschaft für Informatik e.V., 2025.
Reaching New Limits: Discovery of Multi-Dimensional Disjunctive Subsequence-Queries with Intervals Proceedings Article
In: Klettke, Meike; Schenkel, Ralf; Henrich, Andreas; Nicklas, Daniela; Schüle, Maximilian E.; Meyer-Wegener, Klaus (Ed.): Datenbanksysteme für Business, Technologie und Web (BTW 2025), 21. Fachtagung des GI-Fachbereichs „Datenbanken und Informationssysteme“ (DBIS), 03.-07. März 2025, Bamberg, Germany, Proceedings, pp. 49–70, Gesellschaft für Informatik e.V., 2025.
2024
Validity constraints for data analysis workflows Journal Article
In: Future Generation Computer Systems, vol. 157, pp. 82–97, 2024, ISSN: 0167-739X.
2023
Puzzling over Subsequence-Query Extensions: Disjunction and Generalised Gaps Proceedings Article
In: Kimelfeld, Benny; Martinez, Maria Vanina; Angles, Renzo (Ed.): Proceedings of the 15th Alberto Mendelzon International Workshop on Foundations of Data Management (AMW 2023), Santiago de Chile, Chile, May 22-26, 2023, CEUR-WS.org, 2023.
Discovering Multi-Dimensional Subsequence Queries from Traces - From Theory to Practice Proceedings Article
In: König-Ries, Birgitta; Scherzinger, Stefanie; Lehner, Wolfgang; Vossen, Gottfried (Ed.): Datenbanksysteme für Business, Technologie und Web (BTW 2023), 20. Fachtagung des GI-Fachbereichs „Datenbanken und Informationssysteme“ (DBIS), 06.-10. März 2023, Dresden, Germany, Proceedings, pp. 511–533, Gesellschaft für Informatik e.V., 2023.
Validity Constraints for Data Analysis Workflows Miscellaneous
2023.
2022
Discovering Event Queries from Traces: Laying Foundations for Subsequence-Queries with Wildcards and Gap-Size Constraints Proceedings Article
In: Olteanu, Dan; Vortmeier, Nils (Ed.): 25th International Conference on Database Theory, ICDT 2022, March 29 to April 1, 2022, Edinburgh, UK (Virtual Conference), pp. 18:1–18:21, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022.
Predicate-based push-pull communication for distributed CEP Proceedings Article
In: Zhou, Yongluan; Chrysanthis, Panos K.; Gulisano, Vincenzo; Zacharatou, Eleni Tzirita (Ed.): 16th ACM International Conference on Distributed and Event-based Systems, DEBS 2022, Copenhagen, Denmark, June 27 - 30, 2022, pp. 31–42, ACM, 2022.