Description: |
This abstract sketches our research towards Structured Semantic Embedding of multimedia data. Although a tag may have multiple senses with completely different visual imagery, current semantic embedding methods represent each tag by a single vector, regardless of its senses. We challenge this convention and argue for incorporating semantic structure into semantic embedding. In particular, we develop Hierarchical Semantic Embedding, a simple model that exploits the WordNet hierarchy to give the embedding a degree of structure. We demonstrate the viability of structured semantic embedding for tag disambiguation and zero-shot image tagging. |