Welcome to the ACM Symposium on Document Engineering (DocEng) 2002, the second in a series of meetings on the emerging area of document engineer-ing, which investigates, from the perspective of computer science, systems for creating, managing, and maintaining documents in any form and in all media.DocEng'02 is the second in what we hope will be a long series of meet-ings. The first of the series, DocEng'01, was held in Atlanta, GA during November of 2001. The symposium continues the tradition of several other con-ferences, including the long series of biennial Electronic Publishing (EP) confer-ences and Principles of Document Processing workshops, the 2000 Digital Documents and Electronic Publishing conference, the 1988 ACM Conference on Document Processing Systems, and the 1981 ACM Symposium on Text Manipulation.DocEng'02 once again joins with the ACM Conference on Information and Knowledge Management (CIKM) to present this symposium. We wish to thank the CIKM organizing committee for their assistance.The technical program results from the efforts of the Program Committee and additional reviewers. Each paper received at least three reviews. We thank the PC and reviewers for their careful reviews and prompt efforts, especially as our review schedule coincided with many committee members' vacation schedules. Forty-six papers were submitted and the committee selected twenty-one papers for presentation at DocEng'02.We hope you will agree with us that the Symposium's papers reflect a broad and dynamic area of research, and share with us our excitement over the directions and activities presented here.
Engineering broad-spectrum document software: lessons from ghostscript
Almost 14 years after its first public release, Ghostscript is still a commercially thriving, actively evolving, nearly-Open Source PDL interpreter. It has successfully evolved from its original function as a PostScript Level 1 previewer running on MS-...
A presentation language for controlling the formatting process in multimedia presentations
Multimedia information encapsulated inside documents is more and more specific because its content is specified using domain vocabularies. Their integration in space and time to form a document implies transformation steps to produce "presentation ...
Applying caT's programmable browsing semantics to specify world-wide web documents that reflect place, time, reader, and community
In this paper we discuss application of caT, which extends the Trellis Petri-net-based model of document/hypertext, towards specification of Web-browsable documents that respond to their reader's characteristics, browsing activities, use environment, ...
Multimedia document engineering in MCF
This article demonstrates how several of the general-purpose principles which have proved successful in the area of large-scale software development and maintenance are relevant to multimedia design. We present the Media Construction Formalism, MCF, ...
The relevance of software documentation, tools and technologies: a survey
This paper highlights the results of a survey of software professionals. One of the goals of this survey was to uncover the perceived relevance (or lack thereof) of software documentation, and the tools and technologies used to maintain, verify and ...
Supporting document and data views of source code
The paper describes the use of an XML format to store and represent program source code. A new XML application, srcML (SouRCe Markup Language), is presented. srcML presumes a document view of source code where information about the syntactic structure ...
Document engineering for e-business
It can be said that "document exchange" is the "mother of all patterns" for business (and for e-business). Yet, by itself this view isn't sufficiently prescriptive. In this paper, we present additional perspectives or frameworks that make this ...
XConnector: extending XLink to provide multimedia synchronization
This paper proposes XConnector, a language for the creation of complex hypermedia relations with causal or constraint semantics. XConnector allows the definition of relations independently of which resources are related. Another feature is the ...
XLinkProxy: external linkbases with XLink
In the linking model of the World Wide Web each link is stored in the referring document within an attribute of the A tag. All the hyperlink defined this way can reference a single resource or a single fragment. With the evolution of Web technologies ...
An open linking service supporting the authoring of web documents
- Renato Bulcao Neto,
- Claudia Akemi Izeki,
- Maria da Graça Pimentel,
- Renata Pontin Fortes,
- Khai Nhut Truong
Both content driven web authors and application designers may have their attention deviated from their main task when they have to be concerned with the generation of elaborated linking structures. This work aims to demonstrate how a metadata-enhanced ...
Managing and querying multi-version XML data with update logging
With the increasing popularity of storing content on the WWW and intranet in XML form, there arises the need for the control and management of this data. As this data is constantly evolving, users want to be able to query previous versions, query ...
Experimenting with the circus language for XML modeling and transformation
After a brief introduction to the Circus programming language, we present a simple type set to model XML structures. We then describe a transformation that takes a mail as input and produces a reply, showing how subtyping is used in order to refine the ...
Lazy XML processing
This paper formalizes the domain of tree-based XML processing and classifies several implementation approaches. The lazy approach, an original contribution, is presented in depth. Proceeding from experimental measurements, we derive a selection strategy ...
Mapping and displaying structural transformations between XML and PDF
Documents are often marked up in XML-based tagsets to delineate major structural components such as headings, paragraphs, figure captions and so on, without much regard to their eventual displayed appearance. And yet these same abstract documents, after ...
Towards automating of document structure transformations
In this paper we develop a syntax-directed approach to transformation of documents from one structure to another. The aim is to automate a transformation between two grammars that have common parts, although the grammars and names of elements may ...
Simple and accurate feature selection for hierarchical categorisation
Categorisation of digital documents is useful for organisation and retrieval. While document categories can be a set of unstructured category labels, some document categories are hierarchically structured. This paper investigates automatic hierarchical ...
Towards a semantics for XML markup
Although XML Document Type Definitions provide a mechanism for specifying, in machine-readable form, the syntax of an XML markup language, there is no comparable mechanism for specifying the semantics of an XML vocabulary. That is, there is no way to ...
Generation of images of historical documents by composition
This paper describes a system for efficient storage, indexing and network transmission of images of historical documents. The documents are first decomposed into their features such as paper texture, colours, typewritten parts, pictures, etc. Document ...
A dynamic user interface for document assembly
Document assembly has turned out to be a convenient approach to corporate publishing and reuse of large collections of documents. Automated assembly of a document reduces the amount of human effort when creating customized documents consisting of ...
Degraded character image restoration using active contours: a first approach
In this paper, we describe an example of the use of active contours for the reconstruction of degraded character images. Active contours were introduced fifteen years ago by Kass and al. [8], and have been widely used since then for segmentation purpose ...
Recognizing records from the extracted cells of microfilm tables
Microfilm documents contain a wealth of information, but extracting and organizing this information by hand is slow, error-prone, and tedious. As an initial step toward automating access to this information, we describe in this paper an algorithmic ...
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
DocEng '24 | 27 | 16 | 59% |
DocEng '23 | 27 | 9 | 33% |
DocEng '19 | 77 | 30 | 39% |
DocEng '17 | 71 | 13 | 18% |
DocEng '16 | 35 | 11 | 31% |
DocEng '15 | 31 | 11 | 35% |
DocEng '14 | 41 | 15 | 37% |
DocEng '13 | 50 | 16 | 32% |
DocEng '10 | 42 | 13 | 31% |
DocEng '08 | 62 | 21 | 34% |
DocEng '02 | 46 | 21 | 46% |
DocEng '01 | 55 | 18 | 33% |
Overall | 564 | 194 | 34% |