Description: |
Discovering and establishing similarities among web documents has been one of the key research streams in the web usage mining community in recent years. The knowledge obtained can be used in many applications, such as optimizing web cache organization and improving the quality of web document pre-fetching. This paper presents a matrix-based method for establishing similarities among web documents. The similarities are then applied to a similarity-aware web content management system, facilitating offline building of similarity-aware web caches and online updating of the system's similarity profiles. |