Weaving the Web(VTT) of Data

Autor: Steiner, T., Mühleisen, H., Verborgh, R., Champin, P. -A, Encelle, B., Yannick Prié
Přispěvatelé: Prié, Yannick, Corpus, données et outils de la recherche en sciences humaines et sociales - Constitution de Corpus, analyse génétique de spectacles et nouvelles publications collaboratives à l'aide d'un dispositif pour la captation, l'indexation et le partage d'archives enrichies selon les standards du Web sémantique, audiovisuel et social - - Spectale en ligne(s)2012 - ANR-12-CORP-0015 - Corpus - VALID, Database Architectures, Supporting Interaction and Learning by Experience (SILEX), Laboratoire d'InfoRmatique en Image et Systèmes d'information (LIRIS), Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Institut National des Sciences Appliquées (INSA)-Université de Lyon-Institut National des Sciences Appliquées (INSA)-Centre National de la Recherche Scientifique (CNRS)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-École Centrale de Lyon (ECL), Université de Lyon-Université Lumière - Lyon 2 (UL2)-Institut National des Sciences Appliquées de Lyon (INSA Lyon), Université de Lyon-Université Lumière - Lyon 2 (UL2), Traces, Web, Education, Adaptation, Knowledge (TWEAK), Database Architectures Group, Centrum Wiskunde & Informatica (CWI), Multimedia Lab, Universiteit Gent = Ghent University [Belgium] (UGENT), Situated Interaction, Collaboration, Adaptation and Learning (SICAL), Laboratoire d'Informatique de Nantes Atlantique (LINA), Centre National de la Recherche Scientifique (CNRS)-Mines Nantes (Mines Nantes)-Université de Nantes (UN), ANR-12-CORP-0015,Spectale en ligne(s),Constitution de Corpus, analyse génétique de spectacles et nouvelles publications collaboratives à l'aide d'un dispositif pour la captation, l'indexation et le partage d'archives enrichies selon les standards du Web sémantique, audiovisuel et social(2012)
Jazyk: angličtina
Rok vydání: 2014
Předmět:
Zdroj: Scopus-Elsevier
LDOW 2014
LDOW 2014, Apr 2014, Seoul, South Korea. http://ceur-ws.org/Vol-1184/ldow2014_paper_11.pdf
Popis: International audience; Video has become a first class citizen on the Web with broad support in all common Web browsers. Where with struc- tured mark-up on webpages we have made the vision of the Web of Data a reality, in this paper, we propose a new vi- sion that we name the Web(VTT) of Data, alongside with concrete steps to realize this vision. It is based on the evolving standards WebVTT for adding timed text tracks to videos and JSON-LD, a JSON-based format to serial- ize Linked Data. Just like the Web of Data that is based on the relationships among structured data, the Web(VTT) of Data is based on relationships among videos based on WebVTT files, which we use as Web-native spatiotemporal Linked Data containers with JSON-LD payloads. In a first step, we provide necessary background information on the technologies we use. In a second step, we perform a large- scale analysis of the 148 terabyte size Common Crawl corpus in order to get a better understanding of the status quo of Web video deployment and address the challenge of integrat- ing the detected videos in the Common Crawl corpus into the Web(VTT) of Data. In a third step, we open-source an online video annotation creation and consumption tool, targeted at videos not contained in the Common Crawl cor- pus and for integrating future video creations, allowing for weaving the Web(VTT) of Data tighter, video by video.
Databáze: OpenAIRE