TY - JOUR
T1 - Streamable regular transductions
AU - Alur, Rajeev
AU - Fisman, Dana
AU - Mamouras, Konstantinos
AU - Raghothaman, Mukund
AU - Stanford, Caleb
N1 - Funding Information:
This work was supported by NSF award CCF 1763514.
Funding Information:
This work was supported by NSF award CCF 1763514 .
Publisher Copyright:
© 2019
PY - 2020/2/6
Y1 - 2020/2/6
N2 - Motivated by real-time monitoring and data processing applications, we develop a formal theory of quantitative queries for streaming data that can be evaluated efficiently. We consider the model of unambiguous Cost Register Automata (CRAs), which are machines that combine finite-state control (for identifying regular patterns) with a finite set of data registers (for computing numerical aggregates). The definition of CRAs is parameterized by the collection of numerical operations that can be applied to the registers. These machines give rise to the class of streamable regular transductions (SR), and to the class of streamable linear regular transductions (SLR) when the register updates are copyless, i.e. every register appears at most once in the right-hand-side expressions of the updates. We give a logical characterization of the class SR (resp., SLR) using MSO-definable transformations from strings to DAGs (resp., trees) without backward edges. Additionally, we establish that the two classes SR and SLR are closed under operations that are relevant for designing query languages. Finally, we study the relationship with weighted automata (WA), and show that CRAs over a suitably chosen set of operations correspond to WA, thus establishing that WA are a special case of CRAs.
AB - Motivated by real-time monitoring and data processing applications, we develop a formal theory of quantitative queries for streaming data that can be evaluated efficiently. We consider the model of unambiguous Cost Register Automata (CRAs), which are machines that combine finite-state control (for identifying regular patterns) with a finite set of data registers (for computing numerical aggregates). The definition of CRAs is parameterized by the collection of numerical operations that can be applied to the registers. These machines give rise to the class of streamable regular transductions (SR), and to the class of streamable linear regular transductions (SLR) when the register updates are copyless, i.e. every register appears at most once in the right-hand-side expressions of the updates. We give a logical characterization of the class SR (resp., SLR) using MSO-definable transformations from strings to DAGs (resp., trees) without backward edges. Additionally, we establish that the two classes SR and SLR are closed under operations that are relevant for designing query languages. Finally, we study the relationship with weighted automata (WA), and show that CRAs over a suitably chosen set of operations correspond to WA, thus establishing that WA are a special case of CRAs.
KW - Cost Register Automata
KW - MSO transductions
KW - Quantitative automata
KW - Regular functions
KW - Stream processing
KW - Weighted automata
UR - http://www.scopus.com/inward/record.url?scp=85076598587&partnerID=8YFLogxK
U2 - 10.1016/j.tcs.2019.11.018
DO - 10.1016/j.tcs.2019.11.018
M3 - Article
AN - SCOPUS:85076598587
SN - 0304-3975
VL - 807
SP - 15
EP - 41
JO - Theoretical Computer Science
JF - Theoretical Computer Science
ER -