Alsubai, S. and North, S.D. orcid.org/0000-0002-8478-8960 (2017) A Prime Number Approach to Matching an XML Twig Pattern including Parent-Child Edges. In: Proceedings of the 13th International Conference on Web Information Systems and Technologies. 13th International Conference on Web Information Systems and Technologies, 25-27 Apr 2017, Porto, Portugal. SCITEPRESS , pp. 204-211. ISBN 978-989-758-246-2
Abstract
Twig pattern matching is a core operation in XML query processing because it is how all the occurrences of a twig pattern in an XML document are found. In the past decade, many algorithms have been proposed to perform twig pattern matching. They rely on labelling schemes to determine relationships between elements corresponding to query nodes in constant time. In this paper, a new algorithm TwigStackPrime is proposed, which is an improvement to TwigStack (Bruno et al., 2002). To reduce the memory consumption and computation overhead of twig pattern matching algorithms when Parent-Child (P-C) edges are involved, TwigStackPrime efficiently filters out a tremendous number of irrelevant elements by introducing a new labelling scheme, called Child Prime Label (CPL). Extensive performance studies on various real-world and artificial datasets were conducted to demonstrate the significant improvement of CPL over the previous indexing and querying techniques. The experimental results show that the new technique has a superior performance to the previous approaches.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2017 SCITEPRESS. This is an author produced version of a paper subsequently published in Proceedings of the 13th International Conference on Web Information Systems and Technologies. Uploaded in accordance with the publisher's self-archiving policy. |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 08 Jun 2017 15:25 |
Last Modified: | 19 Dec 2022 13:36 |
Published Version: | https://doi.org/10.5220/0006225602040211 |
Status: | Published |
Publisher: | SCITEPRESS |
Refereed: | Yes |
Identification Number: | 10.5220/0006225602040211 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:117467 |