LZ-Compressed String Dictionaries
Autor: | Johannes Fischer, Julian Arz |
---|---|
Rok vydání: | 2014 |
Předmět: |
FOS: Computer and information sciences
TheoryofComputation_MISCELLANEOUS Lossless compression Computer science business.industry String (computer science) Search engine indexing Pattern recognition Data_CODINGANDINFORMATIONTHEORY Data structure Substring Uncompressed video Computer Science - Data Structures and Algorithms Compression ratio Data Structures and Algorithms (cs.DS) Artificial intelligence business Data compression |
Zdroj: | DCC |
Popis: | We show how to compress string dictionaries using the Lempel-Ziv (LZ78) data compression algorithm. Our approach is validated experimentally on dictionaries of up to 1.5 GB of uncompressed text. We achieve compression ratios often outperforming the existing alternatives, especially on dictionaries containing many repeated substrings. Our query times remain competitive. |
Databáze: | OpenAIRE |
Externí odkaz: |