Efficient Memory Representation of XML Documents - CiteSeerX
Recommend Documents
17 Apr 2007 ... needed to store the tree structure of the XML document. ..... node u ∈ N∗ of t, the
pattern p matches t at u if there are trees t1,...,tk such that ... no possibility to
extend m and m′ at the leafs to matches of some larger common.
Sep 18, 2009 - ments, XHTML has become a standard for the XML repre- sentation of web pages [18]. Throughout this paper, a XML document is considered ...
Sep 18, 2009 - Document (ODF) [4] and Office OpenXML [17]. Both have been been approved as ISO standards, and rely on XML to ensure interoperability.
ditionally, measures are numeric values (e.g., units of sales or total sale amount) ..... budgets of the groups on the lowest level, followed by the sum of budgets of ...
sions in which no two different occurrences of a symbol ... #PCDATA) and other elements as its content. .... We represent strings by their symbols wι... wt , with.
ated HTML/XML documents are guaranteed to be well-formed. In addition ... with name compact and (string) value compact, and two child elements. The first.
Third, by indexing on sequences of elements organized in a trie structure and using a sophisticated matching algorithm,. XTrie is able to both reduce the number ...
Sep 9, 2014 - Business Analytics. IBM. Chicago ... Despite its popularity the usage of XML for "Big Data" is challenging due to ... demand for an efficient processing of large amount of data stored in XML using Apache Hadoop map reduce.
Denilson Barbosa, Gregory Leighton, and Andrew Smith. University of Calgary, Calgary AB T2N 1N4, Canada. {denilson, gleighto, smithaj}@cs.ucalgary.ca.
ELEMENT Vacation-Package (Description, Airfare, Hotel-Cost, Price)>. ELEMENT ..... Aurora generalizes the function of searching by de ning a single schema.
Sep 9, 2014 - solution" that is based, for example, on Apache Hive User Defined Functions (UDF). As XML processing is the common use case, this paper ...
schemas implies that the schema validators are not used to determine if a ...... http://www.oreillynet.com/xml/blog/2006/05/metrics_for_xml_projects_1_ele.html.
from business objects exported to XML. For example, consider the task of compiling a table of product prices and descriptions from different vendors, where each ...
Keywords. XML information retrieval, Web data indexing, semistruc- tured data indexing, full text ... tion 5 presents the ranking scheme and section 6 discusses.
Dat-. aChannel (www.datachannel.com) has proposed a coarser solution; their Server product links XML ... to XML documents and XML Schemas, to provide.
A data cube is a multidimensional data model used to conceptualize data in a ...... OLAP, International Journal of Data Warehousing and Mining, Idea Group Inc.,.
lacks full extensibility, as it is not possible to express general integrity requirements in the same way as .... as denials, which indicates that there must be no variable binding satisfying the condi- tion in the .... Consider the following stateme
The memory size of explicit MPC solutions is primarily determined by ..... 0.5] u,. (9) where the states and inputs are constrained, respectively, by |xi| ⤠5, i = 1, 2, ...
DTD), we produce a new XML document (a slice) that contains the relevant information ... (X)HTML or XSL-FO, that can be interpreted by any standard browser.
Nov 4, 2005 - Institute for Software Technology. Universität der Bundeswehr München. ABSTRACT. Office applications such as OpenOffice and Microsoft ...
Besides this, it is known that the correct interpretation of human action needs to ... element of this cycle is a representation of the data sources suitable for further.
Brian J. Young,1 Tim Otto,2 Gregory D. Fox,3 and Howard Eichenbaum4. 1Department ...... Kesner RP, Boland BL, Davis M (1993) Memory for spatial locations,.
amygdalofugal pathways. Exp Brain Res 57:554â561. Brown MW (1996) Neuronal responses and recognition memory. Semin. Neurosci 8:23â32. Brown MW ...
Memory Efficient Skeletonization of Utility Maps. Albert M. Vossepoel1 ... objects in the drawings, with vectorization or other .... 1, with the four tiles (delineated by solid lines) taken apart .... equals the total overlap in the y-direction, i.e.
Efficient Memory Representation of XML Documents - CiteSeerX
via, e.g., the DOM, suffer from huge memory demands: the space needed to load ...... A. Arion, A. Bonifati, G. Costa, S. D'Aguanno, I. Manolescu, and A. Pugliese.
Efficient Memory Representation of XML Documents Giorgio Busatto1 , Markus Lohrey2 , and Sebastian Maneth3 1
Abstract. Implementations that load XML documents and give access to them via, e.g., the DOM, suffer from huge memory demands: the space needed to load an XML document is usually many times larger than the size of the document. A considerable amount of memory is needed to store the tree structure of the XML document. Here a technique is presented that allows to represent the tree structure of an XML document in an efficient way. The representation exploits the high regularity in XML documents by “compressing” their tree structure; the latter means to detect and remove repetitions of tree patterns. The functionality of basic tree operations, like traversal along edges, is preserved in the compressed representation. This allows to directly execute queries (and in particular, bulk operations) without prior decompression. For certain tasks like validation against an XML type or checking equality of documents, the representation allows for provably more efficient algorithms than those running on conventional representations.