A pattern based tokenization model for XML parsing on mobile devices
Authors: | Geerish Suddul, Nimal Nissanke, Nawaz Mohamudally |
---|---|
Year of publication: | 2013 |
Subject: | Document Structure Description; XML Encryption; Programming language; Computer science; business.industry; Lexical analysis; Efficient XML Interchange; XML validation; computer.file_format; computer.software_genre; Simple API for XML; Streaming XML; XML schema; Artificial intelligence; business; computer; Natural language processing; computer.programming_language |
Source: | 2013 Africon. |
DOI: | 10.1109/afrcon.2013.6757818 |
Description: | This paper presents a theoretical tokenization model for XML parsing on resource-constrained mobile devices. The model is based on the identification of sequentially repeating patterns within the structure of an XML document. As soon as it identifies a repeating structure, it relieves the parser from the computationally intensive conventional tokenization process, and focuses on extracting text-node-based values for further processing by the calling application. Our experiments demonstrate that the proposed tokenization model considerably relieves the processing bottlenecks encountered in conventional XML parsers. |
Database: | OpenAIRE |
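
The record gives no implementation details, so the following is only a minimal sketch of the general idea in the description: tokenize one record conventionally to learn its markup sequence, then walk later records against that known sequence and pull out only the text values. It is not the authors' model. The function names (`tokenize`, `learn_pattern`, `extract_values`) and the sample `<item>` records are hypothetical, and the sketch assumes simple XML without attributes that vary between records, comments, or CDATA sections.

```python
import re

# A tag (<...>) or a run of text up to the next tag.
TOKEN = re.compile(r"<[^>]+>|[^<]+")


def tokenize(fragment):
    """Conventional tokenization: split into tag and text tokens.

    Whitespace-only text tokens are dropped.
    """
    return [tok for tok in TOKEN.findall(fragment) if tok.strip()]


def learn_pattern(record):
    """Return the record's markup sequence, with None marking each text slot."""
    return [tok if tok.startswith("<") else None for tok in tokenize(record)]


def extract_values(record, pattern):
    """Extract text values by walking the known markup sequence.

    Returns None when the record does not follow the pattern, so the
    caller can fall back to conventional tokenization.
    """
    pos, values = 0, []
    for i, tag in enumerate(pattern):
        if tag is None:
            # Text slot: the value runs up to the next expected tag.
            if i + 1 >= len(pattern):
                return None
            end = record.find(pattern[i + 1], pos)
            if end < 0:
                return None
            values.append(record[pos:end].strip())
            pos = end
        else:
            # Markup slot: skip whitespace, then expect the literal tag.
            while pos < len(record) and record[pos].isspace():
                pos += 1
            if not record.startswith(tag, pos):
                return None
            pos += len(tag)
    return values


first = "<item><name>pen</name><price>2</price></item>"
pattern = learn_pattern(first)
later = "<item><name>ink</name><price>5</price></item>"
print(extract_values(later, pattern))
```

In this sketch a mismatch returns `None` rather than raising, on the assumption that a parser would then resume conventional tokenization for that record.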