The MediaMill TRECVID 2010 semantic video search engine

Autor: Snoek, C., Sande, K.E.A., Rooij, O., Huurnink, B., Gavves, E., Odijk, D., Rijke, Maarten, Gevers, T., Worring, Marcel, Koelma, D.C., Smeulders, Arnold
Přispěvatelé: Information and Language Processing Syst (IVI, FNWI), Intelligent Sensory Information Systems (IVI, FNWI)
Jazyk: angličtina
Rok vydání: 2010
Předmět:
Zdroj: TRECVID 2010 notebook
Popis: In this paper we describe our TRECVID 2010 video retrieval experiments. The MediaMill team participated in three tasks: semantic indexing, known-item search, and instance search. The starting point for the MediaMill concept detection approach is our top-performing bag-of-words system of TRECVID 2009, which uses multiple color SIFT descriptors, sparse codebooks with spatial pyramids, kernel-based machine learning, and multi-frame video processing. We improve upon this baseline system by further speeding up its execution times for both training and classification using GPU-optimized algorithms, approximated histogram intersection kernels, and several multi-frame combination methods. Being more efficient allowed us to supplement the Internet video training collection with positively labeled examples from international news broadcasts and Dutch documentary video from the TRECVID 2005-2009 benchmarks. Our experimental setup covered a huge training set of 170 thousand keyframes and a test set of 600 thousand keyframes in total. Ultimately leading to 130 robust concept detectors for video retrieval. For retrieval, a robust but limited set of concept detectors justifies the need to rely on as many auxiliary information channels as possible. For automatic known item search we therefore explore how we can learn to rank various information channels simultaneously to maximize video search results for a given topic. To further improve the video retrieval results, our interactive known item search experiments investigate how to combine metadata search and visualization into a single interface. The 2010 edition of the TRECVID benchmark has again been a fruitful participation for the MediaMill team, resulting in the top ranking for concept detection in the semantic indexing task. Again a lot has been learned during this year’s TRECVID campaign; we highlight the most important lessons at the end of this paper.
Databáze: OpenAIRE