
Research interests
My general research interests lie within the broad field of data management and data engineering.
Recent research directions include:
Somewhat less recent research directions:
- Reasoning on data partitioning for single-round and multi-round join evaluation in massively parallel systems (PODS 2015, EDBT 2015, ICDT 2016, VLDB 2016, ICDT 2018, PODS 2019, ICDT 2019, ICDT 2020)
- Schema languages for tabular data (WWW 2015, WWW 2018, ICDT 2018)
- Programming the cloud: Calm conjecture (PODS 2011, PODS 2014, ICDT 2015a, ICDT 2015b,ICDT 2015c )
Publications from a distant past that I am the most proud of:
- Succinctness of the Complement and Intersection of Regular Expressions. ACM Trans. Comput. Log. 13(1): 4:1-4:19 (2012)
- We prove that negation can make a regular expression much more concise: eliminating a single complement operator can cause a double exponential blow-up of its length. This work is featured in the Wikipedia article on regular expressions.
- Inference of concise regular expressions and DTDs. ACM Trans. Database Syst. 35(2): 11:1-11:47 (2010)
- We were among the first to show that simple regular expressions can be learned from examples in the context of XML schema inference.
- Expressiveness and complexity of XML Schema. ACM Trans. Database Syst. 31(3): 770-813 (2006)
- This paper sheds light on the effects of the various structural constraints that were imposed by the XML Schema standard.
- Finite state machines for strings over infinite alphabets. ACM Trans. Comput. Log. 5(3): 403-435 (2004).
- We investigate an automaton model for strings over data values. It instigated a surge of new research. To date this work remains my most cited paper.
Lists of publications
Awards
Invited Talks
Research
- Remaining CALM in Declarative Networking. ICDT 2014
- Logical Aspects of Massively Parallel and Distributed Systems. PODS 2016 (slides)
- Robustness Against Read Committed: A Free Transactional Lunch. PODS 2022 (slides)
General audience