Evaluating the descriptive power of Instagram hashtags
Author: | Stamatios Giannoulakis, Nicolas Tsapatsoulis |
---|---|
Year of publication: | 2016 |
Subject: |
Computer and Information Sciences
Information retrieval; Computer science; Online database; Software engineering; Context (language use); Engineering and technology; Computer-assisted web interviewing; Image tagging; Crowdsourcing; Image (mathematics); Metadata; Automatic image annotation; Machine learning; Electrical engineering, electronic engineering, information engineering; Artificial intelligence & image processing; Natural Sciences; Image retrieval; Hashtags |
Source: | Journal of Innovation in Digital Ecosystems |
ISSN: | 2352-6645 |
DOI: | 10.1016/j.jides.2016.10.001 |
Description: | Image tagging is an essential step in developing Automatic Image Annotation (AIA) methods that are based on the learning-by-example paradigm. However, manual image annotation, even when used only to create training sets for machine learning algorithms, requires considerable effort and is subject to human judgment errors and subjectivity. Thus, alternative ways of automatically creating training examples, i.e., pairs of images and tags, are pursued. In this work, we investigate whether the tags accompanying photos on Instagram can be considered image annotation metadata. If this claim holds, Instagram could be used as a very rich source of training data, easy to collect automatically, for the development of AIA techniques. Our hypothesis is that Instagram hashtags, and especially those provided by the photo owner/creator, express the content of a photo more accurately than the tags assigned to a photo during explicit image annotation processes such as crowdsourcing. In this context, we explore the descriptive power of hashtags by examining whether other users would use the same hashtags as the owner to annotate an image. For this purpose, 1000 Instagram images were collected, and one to four hashtags, considered the most descriptive ones for the image in question, were chosen from among the hashtags used by the photo owner. An online database was constructed to generate online questionnaires containing 20 images each, which were distributed to experiment participants so that they could choose the most suitable hashtag for every image according to their interpretation. Results show that, on average, 66% of the participants' hashtag choices coincide with those suggested by the photo owners, which provides initial evidence in support of our hypothesis. |
Database: | OpenAIRE |