Evaluating the descriptive power of Instagram hashtags

Authors:	Stamatios Giannoulakis, Nicolas Tsapatsoulis
Publication year:	2016
Subject:	Computer and Information Sciences; Information retrieval; Computer science; business.industry; Online database; 020207 software engineering; Context (language use); 02 engineering and technology; Computer-assisted web interviewing; Image tagging; Crowdsourcing; Image (mathematics); Metadata; Automatic image annotation; Machine learning; 0202 electrical engineering, electronic engineering, information engineering; Instagram; 020201 artificial intelligence & image processing; Natural Sciences; business; Image retrieval; Hashtags
Source:	Journal of Innovation in Digital Ecosystems
ISSN:	2352-6645
DOI:	10.1016/j.jides.2016.10.001
Description:	Image tagging is an essential step for developing Automatic Image Annotation (AIA) methods that are based on the learning-by-example paradigm. However, manual image annotation, even for creating training sets for machine learning algorithms, requires considerable effort and is prone to human judgment errors and subjectivity. Thus, alternative ways of automatically creating training examples, i.e., pairs of images and tags, are pursued. In this work, we investigate whether tags accompanying photos on Instagram can be considered image annotation metadata. If this claim holds, Instagram could serve as a very rich source of training data, easy to collect automatically, for the development of AIA techniques. Our hypothesis is that Instagram hashtags, and especially those provided by the photo owner/creator, express the content of a photo more accurately than the tags assigned to a photo during explicit image annotation processes such as crowdsourcing. In this context, we explore the descriptive power of hashtags by examining whether other users would use the same hashtags as the owner to annotate an image. For this purpose, 1000 Instagram images were collected, and one to four hashtags, considered the most descriptive ones for the image in question, were chosen from among the hashtags used by the photo owner. An online database was constructed to generate online questionnaires containing 20 images each, which were distributed to experiment participants so that they could choose the most suitable hashtag for every image according to their interpretation. Results show that, on average, 66% of the participants' hashtag choices coincide with those suggested by the photo owners; this provides initial evidence in support of our hypothesis.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ead45f268b3b8424d86692930ef1d2ea
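
The agreement measure mentioned in the description (the share of participants' hashtag choices that coincide with the owner's hashtags) could be computed roughly as in the sketch below. This is only an illustration under stated assumptions: the function names, the tag normalization, the per-image averaging, and the toy data are all hypothetical and are not taken from the paper, which does not specify these details in its abstract.

```python
from statistics import mean


def normalize(tag: str) -> str:
    """Assumed normalization: trim whitespace, drop leading '#', lowercase."""
    return tag.strip().lstrip("#").lower()


def agreement_rate(owner_tags, participant_choices):
    """Fraction of participant choices for one image that match an owner hashtag."""
    if not participant_choices:
        raise ValueError("at least one participant choice is required")
    owner = {normalize(t) for t in owner_tags}
    hits = sum(1 for choice in participant_choices if normalize(choice) in owner)
    return hits / len(participant_choices)


def mean_agreement(owner_by_image, choices_by_image):
    """Average of the per-image agreement rates (assumption: unweighted mean)."""
    return mean(
        agreement_rate(owner_by_image[image_id], choices)
        for image_id, choices in choices_by_image.items()
    )


def split_into_questionnaires(image_ids, per_questionnaire=20):
    """Group image ids into questionnaires of a fixed size (20 in the description)."""
    return [
        image_ids[start:start + per_questionnaire]
        for start in range(0, len(image_ids), per_questionnaire)
    ]


# Toy data, invented purely for illustration.
owner_by_image = {
    "img1": ["#sunset", "#beach"],
    "img2": ["#cat"],
}
choices_by_image = {
    "img1": ["sunset", "Beach", "dog", "sunset"],
    "img2": ["cat", "kitten"],
}

overall = mean_agreement(owner_by_image, choices_by_image)
questionnaires = split_into_questionnaires(list(range(1000)))
```

With 1000 images and 20 images per questionnaire, the grouping above yields 50 questionnaires. Whether the reported average was taken per image, per participant, or over all choices pooled together is not stated in the description, so the unweighted per-image mean used here is just one plausible reading.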