Nothing Special   »   [go: up one dir, main page]

skip to main content
article

A framework for management of concurrent XML markup

Published: 01 February 2005 Publication History

Abstract

The problem of concurrent markup hierarchies in XML encodings of documents has attracted attention of a number of humanities researchers in recent years. The key problem with using concurrent hierarchies to encode documents is that markup in one hierarchy is not necessarily well-formed with respect to the markup in another hierarchy. Previously proposed solutions to this problem rely on the XML expertise of the editors and their ability to maintain correct DTDs for complex markup languages. In this paper, we approach the problem of maintenance of concurrent XML markup from the Computer Science perspective. We propose a framework that allows the editors to concentrate on the semantic aspects of the encoding, while leaving the burden of maintaining XML documents to the software. The paper describes the formal notion of the concurrent markup languages and the algorithms for automatic maintenance of XML documents with concurrent markup.

References

[1]
{1} A. Renear, E. Mylonas, D. Durand, Refining our notion of what text really is: The problem of overlapping hierarchies, in: N. Ide, S. Hockey (Eds.), Research in Humanities Computing.
[2]
{2} C.M. Sperberg-McQueen, L. Burnard, Guidelines for Text Encoding and Interchange (P4), the TEI Consortium, 2001. Available from <http://www.tei-c.org/P4X/index.html>.
[3]
{3} P. Durusau, M.B. O'Donnell, Concurrent Markup for XML Documents, in: Proc. XML Europe, 2002.
[4]
{4} A. Witt, Meaning and interpretation of concurrent markup, in: Proc. Joint Conference of the ALLC and ACH, 2002, pp. 145-147.
[5]
{5} P. Durusau, M. O'Donnell, Declaring trees: the future of the evolution of markup? in: Proc. Conference on Extreme Markup Languages, 2002.
[6]
{6} S. Abiteboul, J. McHugh, M. Rys, V. Vassalos, J.L. Wiener, Incremental maintenance for materialized views over semistructured data, in: Proc. of VLDB, 1998, pp. 38-49.
[7]
{7} W. May, Integration of XML data in XPathLog, in: DIWeb, 2001, pp. 2-16.
[8]
{8} W. May, Lopix: a system for XML data integration and manipulation, in: The VLDB Journal, 2001, pp. 707-708.
[9]
{9} I. Manolescu, D. Florescu, D. Kossmann, Answering XML queries over heterogeneous data sources, in: Proc. of VLDB, Roma, Italy, 2001, pp. 241-250.
[10]
{10} C. Huitfeldt, C.M. Sperberg-McQueen, TexMECS: an experimental markup meta-language for complex documents, February 2001. Available from <http://www.hit.uib.no/claus/mlcd/papers/texmecs.html>.
[11]
{11} C.M. Sperberg-McQueen, C. Huitfeldt, GODDAG: A Data Structure for Overlapping Hierarchies, ACH-ALLC Conference, Charlottesville, June 1999.
[12]
{12} K. Kiernan, J. Jaromczyk, A. Dekhtyar, D. Porter, K. Hawley, S. Bodapati, I. Iacob, The ARCHway project: architecture for research in computing for humanities through research, teaching, and learning, Literary and Linguistic Computing, forthcoming.
[13]
{13} K. Kiernan, A. Prescott, E. Solopova, D. French, L. Cantara, M. Ellis, C. Yuan, I. Iacob, Electronic Beowulf, 2003. Available from <http://www.uky.edu/~kiernan/eBeowulf/guide.htm>.
[14]
{14} E. Solopova, Encoding a transcript of the Beowulf manuscript in SGML, in: Proc. ACH/ALCC, 1999.
[15]
{15} W. Scales, J. Griffioen, K. Kiernan, C.J. Yuan, L. Cantara, The digital atheneum: New technologies for restoring and preserving Old documents, Computers in Libraries 20 (2) (2000) 26-30.
[16]
{16} Canterbury tales project, De Monfort University, 1999. Available from <http://www.cta.dmu.ac.uk/projects/ctp/>.
[17]
{17} C. Sperberg-McQueen, D. Seaman, A TEI-based tag set for manuscript transcription, Digital Scriptorium.
[18]
{18} British Library MS Cotton Otho A. vi, fol. 38v.
[19]
{19} T. Bray, J. Paoli, C.M. Sperberg-McQueen, E. Maler, Extensible Markup Language (XML) 1.0, second ed., W3C, REC-xml-20001006, October 2000. Available from <http://www.w3.org/TR/REC-xml>.
[20]
{20} Simple API for XML (SAX) 2.0.1, SourceForge project, January 2002. Available from <http://www.saxproject.org>.
[21]
{21} A.L. Hors, P.L. Hégaret, L. Wood, G. Nicol, J. Robie, M. Champion, S. Byrne, DOM, November 2000. Available from <http://www.w3.org/TR/2000/REC-DOM-Level-2-Core-20001113/>.

Cited By

View all
  • (2012)Exploring manuscriptsProceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics10.1145/2254129.2254184(1-12)Online publication date: 13-Jun-2012
  • (2010)Reconciling two models of multihierarchical markupProcceedings of the 13th International Workshop on the Web and Databases10.1145/1859127.1859146(1-6)Online publication date: 6-Jun-2010
  • (2008)Towards the unification of formats for overlapping markupThe New Review of Hypermedia and Multimedia10.1080/1361456080231614514:1(57-94)Online publication date: 1-Jan-2008
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Data & Knowledge Engineering
Data & Knowledge Engineering  Volume 52, Issue 2
Special issue: XML schema and data management
February 2005
89 pages

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 01 February 2005

Author Tags

  1. XML
  2. concurrent markup

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2012)Exploring manuscriptsProceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics10.1145/2254129.2254184(1-12)Online publication date: 13-Jun-2012
  • (2010)Reconciling two models of multihierarchical markupProcceedings of the 13th International Workshop on the Web and Databases10.1145/1859127.1859146(1-6)Online publication date: 6-Jun-2010
  • (2008)Towards the unification of formats for overlapping markupThe New Review of Hypermedia and Multimedia10.1080/1361456080231614514:1(57-94)Online publication date: 1-Jan-2008
  • (2006)Representing and querying multi-dimensional markup for question answeringProceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing10.5555/1621034.1621036(3-9)Online publication date: 4-Apr-2006
  • (2006)Describing and querying hierarchical XML structures defined over the same textual dataProceedings of the 2006 ACM symposium on Document engineering10.1145/1166160.1166199(147-154)Online publication date: 10-Oct-2006
  • (2006)Support for XML markup of image-based electronic editionsInternational Journal on Digital Libraries10.1007/s00799-005-0123-26:1(55-69)Online publication date: 1-Feb-2006
  • (2006)Implementing a linguistic query language for historic textsProceedings of the 2006 international conference on Current Trends in Database Technology10.1007/11896548_45(601-612)Online publication date: 26-Mar-2006
  • (2005)A framework for processing complex document-centric XML with overlapping structuresProceedings of the 2005 ACM SIGMOD international conference on Management of data10.1145/1066157.1066280(897-899)Online publication date: 14-Jun-2005
  • (2005)Processing XML documents with overlapping hierarchiesProceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries10.1145/1065385.1065513(409-409)Online publication date: 7-Jun-2005
  • (2005)Searching multi-hierarchical XML documentsProceedings of the 16th international conference on Database and Expert Systems Applications10.1007/11546924_56(576-585)Online publication date: 22-Aug-2005
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media