Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2452376.2452439acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Data exchange with arithmetic operations

Published: 18 March 2013 Publication History

Abstract

Data exchange is the problem of transforming data structured under a source schema into data structured under a target schema, taking into account structural relationships between the two schemas, which are described by a schema mapping. Existing schema-mapping languages lack the ability to express arithmetic operations, such as addition and multiplication, which naturally arise in data warehousing, ETL applications, and applications involving scientific data. We initiate the study of data exchange for arithmetic schema mappings, that is, schema mappings specified by source-to-target dependencies and target dependencies that may include arithmetic formulas interpreted over the algebraic real numbers (we restrict attention to algebraic real numbers to maintain finite presentability).
We show that, for arithmetic schema mappings without target dependencies, the existence-of-solutions problem can be solved in polynomial time, and, if a solution exists, then a universal solution (suitably defined) exists and can be computed in polynomial time. In the case of arithmetic schema mappings with a weakly acyclic set of target dependencies, a universal solution may not exist, but a finite universal basis exists (if a solution exists) and can be computed in polynomial space. The existence-of-solutions problem turns out to be NP-hard, and solvable in PSpace. In fact, we show it is ∃R-complete, which means that it has the same complexity as the decision problem for the existential theory of the real numbers, or, equivalently, the problem of deciding whether or not a quantifier-free arithmetic formula has a solution over the real numbers. If we allow only linear arithmetic formulas in the schema mapping and in the query, interpreted over the rational numbers, then the existence-of-solutions problem is NP-complete. We obtain analogous complexity results for the data complexity of computing the certain answers of arithmetic conjunctive queries and linear arithmetic conjunctive queries.

References

[1]
S. Abiteboul and O. M. Duschka. Complexity of Answering Queries Using Materialized Views. In PODS, pages 254--263, 1998.
[2]
F. N. Afrati, C. Li, and V. Pavlaki. Data exchange in the presence of arithmetic comparisons. In EDBT, pages 487--498, 2008.
[3]
B. Alexe, B. ten Cate, P. G. Kolaitis, and W.-C. Tan. Designing and refining schema mappings via data examples. In SIGMOD, pages 133--144. ACM, 2011.
[4]
E. Allender, P. Bürgisser, J. Kjeldgaard-Pedersen, and P. B. Miltersen. On the complexity of numerical analysis. In IEEE Conference on Computational Complexity, pages 331--339, 2006.
[5]
M. Arenas, P. Barceló, L. Libkin, and F. Murlak. Relational and XML Data Exchange. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2010.
[6]
M. Baudinet, J. Chomicki, and P. Wolper. Constraint-generating dependencies. Journal of Computer and System Sciences, 59(1):94--115, 1999.
[7]
C. Beeri and M. Y. Vardi. The implication problem for data dependencies. In ICALP, pages 73--85, 1981.
[8]
M. Ben-Or, D. Kozen, and J. Reif. The complexity of elementary algebra and geometry. In STOC, pages 457--464. ACM, 1984.
[9]
D. Bienstock. Some provably hard crossing number problems. Discrete and Computational Geometry, 6:443--459, 1991.
[10]
J. Canny. Some algebraic and geometric computations in PSPACE. In STOC, pages 460--467. ACM, 1988.
[11]
J. H. Davenport and J. Heintz. Real quantifier elimination is doubly exponential. J. Symb. Comput., 5:29--35, February 1988.
[12]
D. Dou and S. Coulondre. A sound and complete chase procedure for constrained tuple-generating dependencies. Journal of Intelligent Information Systems, Online First:1--22, 2012.
[13]
I. Z. Emiris, B. Mourrain, and E. P. Tsigaridas. Real algebraic numbers: Complexity analysis and experimentation. In Reliable Implementation of Real Number Algorithms, pages 57--82, 2008.
[14]
R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data Exchange: Semantics and Query Answering. Theoretical Computer Science, 336(1):89--124, 2005.
[15]
J. Ferrante and C. Rackoff. A decision procedure for the first order theory of real addition with order. SIAM J. Comput., 4(1):69--76, 1975.
[16]
A. Fuxman, P. G. Kolaitis, R. J. Miller, and W. C. Tan. Peer data exchange. ACM Trans. Database Syst., 31(4):1454--1498, 2006.
[17]
R. Greenlaw, H. Hoover, and W. Ruzzo. Limits to parallel computation: P-completeness theory. Oxford University Press, 1995.
[18]
J. Heintz, M.-F. Roy, and P. Solernó. Sur la complexité du principe de Tarski-Seidenberg. Bulletin de la Société Mathématique de France, tome 118(1):101--126, 1990.
[19]
L. Khachiyan. A polynomial algorithm in linear programming. Soviet Mathematics Doklady, 20:191--194, 1979.
[20]
P. G. Kolaitis. Schema mappings, data exchange, and metadata management. In PODS, pages 61--75, 2005.
[21]
A. Madry. Data exchange: On the complexity of answering queries with inequalities. Inf. Process. Lett., 94(6):253--257, 2005.
[22]
M. J. Maher. Constrained dependencies. Theor. Comput. Sci., 173(1):113--149, 1997.
[23]
M. J. Maher and D. Srivastava. Chasing constrained tuple-generating dependencies. In R. Hull, editor, PODS, pages 128--138. ACM Press, 1996.
[24]
The rectilinear crossing number project. http://www.ist.tugraz.at/staff/aichholzer/research/rp/triangulations/crossing/. Accessed 10/17/2012.
[25]
J. Renegar. On the computational complexity and geometry of the first-order theory of the reals. Part I. J. Symb. Comput., 13(3):255--299, 1992.
[26]
J. Robinson. Definability and decision problems in arithmetic. J. Symb. Logic, 14(2):pp. 98--114, 1949.
[27]
M. Schaefer. Complexity of some geometric and topological problems. In Graph Drawing, volume 5849 of LNCS, pages 334--344. 2010.
[28]
A. Tarski. A Decision Method for Elementary Algebra and Geometry. University of California Press, 1951.
[29]
B. ten Cate, L. Chiticariu, P. G. Kolaitis, and W. C. Tan. Laconic schema mappings: Computing the core with sql queries. PVLDB, 2(1):1006--1017, 2009.
[30]
J. Wang, R. W. Topor, and M. J. Maher. Reasoning with disjunctive constrained tuple-generating dependencies. In DEXA, pages 963--973, 2001.
[31]
V. Weispfenning. The complexity of linear problems in fields. J. Symb. Comput., 5:3--27, February 1988.
[32]
Wolfram MathWorld. Rectilinear crossing problem. http://mathworld.wolfram.com/RectilinearCrossingNumber.html. Accessed 10/17/2012.

Cited By

View all
  • (2024)Early detection of temporal constraint violationsInformation and Computation10.1016/j.ic.2023.105114296(105114)Online publication date: Jan-2024
  • (2023)Probing the quantitative–qualitative divide in probabilistic reasoningAnnals of Pure and Applied Logic10.1016/j.apal.2023.103339(103339)Online publication date: Jul-2023
  • (2022)IS CAUSAL REASONING HARDER THAN PROBABILISTIC REASONING?The Review of Symbolic Logic10.1017/S1755020322000211(1-26)Online publication date: 18-May-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '13: Proceedings of the 16th International Conference on Extending Database Technology
March 2013
793 pages
ISBN:9781450315975
DOI:10.1145/2452376
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 March 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. arithmetic
  2. data exchange
  3. schema mappings

Qualifiers

  • Research-article

Funding Sources

Conference

EDBT/ICDT '13

Acceptance Rates

Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Early detection of temporal constraint violationsInformation and Computation10.1016/j.ic.2023.105114296(105114)Online publication date: Jan-2024
  • (2023)Probing the quantitative–qualitative divide in probabilistic reasoningAnnals of Pure and Applied Logic10.1016/j.apal.2023.103339(103339)Online publication date: Jul-2023
  • (2022)IS CAUSAL REASONING HARDER THAN PROBABILISTIC REASONING?The Review of Symbolic Logic10.1017/S1755020322000211(1-26)Online publication date: 18-May-2022
  • (2019)Answering Queries Using Views, Second EditionSynthesis Lectures on Data Management10.2200/S00884ED2V01Y201811DTM05414:3(1-275)Online publication date: 15-Apr-2019
  • (2019)The homomorphism property in query containment and data integrationProceedings of the 23rd International Database Applications & Engineering Symposium10.1145/3331076.3331127(1-12)Online publication date: 10-Jun-2019
  • (2019)Temporal data exchangeInformation Systems10.1016/j.is.2019.07.004Online publication date: Jul-2019
  • (2018)Reflections on Schema Mappings, Data Exchange, and Metadata ManagementProceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems10.1145/3196959.3196991(107-109)Online publication date: 27-May-2018
  • (2017)Foundations of information integration under bag semanticsProceedings of the 32nd Annual ACM/IEEE Symposium on Logic in Computer Science10.5555/3329995.3330039(1-12)Online publication date: 20-Jun-2017
  • (2017)Answering Queries Using ViewsSynthesis Lectures on Data Management10.2200/S00805ED1V01Y201709DTM0469:2(1-235)Online publication date: Dec-2017
  • (2017)Foundations of information integration under bag semantics2017 32nd Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)10.1109/LICS.2017.8005104(1-12)Online publication date: Jun-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media