Audio-based age and gender identification to enhance the recommendation of TV content
Autor: | Søren Holdt Jensen, Sven Ewan Shepstone, Zheng-Hua Tan |
---|---|
Rok vydání: | 2013 |
Předmět: |
advertisement
Matching (statistics) Information retrieval Multimedia gender identification Computer science business.industry age identification Genetic algorithms Recommender system computer.software_genre Identification (information) Categorization demographic filtering Audio analyzer Media Technology proportional recommendation Digital television Electrical and Electronic Engineering Audio signal processing Adaptation (computer science) business computer |
Zdroj: | Shepstone, S E, Tan, Z-H & Jensen, S H 2013, ' Audio-based Age and Gender Identification to Enhance the Recommendation of TV Content ', I E E E Transactions on Consumer Electronics, vol. 59, no. 3, pp. 721-729 . https://doi.org/10.1109/TCE.2013.6626261 |
ISSN: | 0098-3063 |
DOI: | 10.1109/tce.2013.6626261 |
Popis: | Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm then selects for each user in the profile, a single personalized multimedia item for viewing. When the number of items to be presented is different to the number of viewers in the group, i.e. M ≠ N, a novel adaptation algorithm is proposed that first converts the M-user group profile to an N-slot content profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies of state-of-the-art age-and-gender detection systems, the proposed system has a significant ability to predict an item with a matching age and gender category. User studies were conducted where subjects were asked to rate a sequence of advertisements, where half of the advertisements were randomly selected, and the other half were selected using the audio-derived demographics. The recommended advertisements received a significant higher median rating of 7.75, as opposed to 4.25 for the randomly selected advertisements 1. Recommending TV content to groups of viewers is best carried out when relevant information such as the demographics of the group is available. However, it can be difficult and time consuming to extract information for every user in the group. This paper shows how an audio analysis of the age and gender of a group of users watching the TV can be used for recommending a sequence of N short TV content items for the group. First, a state of the art audio-based classifier determines the age and gender of each user in an M-user group and creates a group profile. A genetic recommender algorithm then selects for each user in the profile, a single personalized multimedia item for viewing. When the number of items to be presented is different to the number of viewers in the group, i.e. M ≠ N, a novel adaptation algorithm is proposed that first converts the M-user group profile to an N-slot content profile, thus ensuring that items are proportionally allocated to users with respect to their demographic categorization. The proposed system is compared to an ideal system where the group demographics are provided explicitly. Results using real speaker utterances show that, in spite of the inaccuracies of state-of-the-art age-and-gender detection systems, the proposed system has a significant ability to predict an item with a matching age and gender category. User studies were conducted where subjects were asked to rate a sequence of advertisements, where half of the advertisements were randomly selected, and the other half were selected using the audio-derived demographics. The recommended advertisements received a significant higher median rating of 7.75, as opposed to 4.25 for the randomly selected advertisements 1. |
Databáze: | OpenAIRE |
Externí odkaz: |