Popis: |
We have developed SimSearch, a tool that simplifies data exploration by enabling top-k similarity search over large collections of entities involving multiple heterogeneous attributes from different sources. We present the supported modes for data access, and the query mechanism orchestrating multi-attribute similarity search over diverse types of attributes, including textual, numerical and spatial. Users can specify their query parameters and preferences through a web interface, and visually inspect and compare the results through appropriate visualizations for the different types of attributes involved. We demonstrate SimSearch using a real-world, commercial dataset, highlighting its capabilities for interactive, user-friendly, and intuitive data exploration. |