Cost-effective solution to synchronised audio-visual data capture using multiple sensors
Autor: | Lee, T., Lichtenauer, Jeroen, Soatto, S., Shen, Jie, Valstar, Michel, Pantic, Maja |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2011 |
Předmět: |
Audio recording
Offset (computer science) EC Grant Agreement nr.: ERC/203143 Computer science METIS-284996 Real-time computing Automatic identification and data capture Serial port HMI-MI: MULTIMODAL INTERACTIONS 02 engineering and technology EWI-21248 Synchronisation 0202 electrical engineering electronic engineering information engineering IR-79390 business.industry 020207 software engineering Sensor fusion EC Grant Agreement nr.: FP7/211486 Sound recording and reproduction Video recording Asynchronous communication Temporal resolution Signal Processing Multisensor systems 020201 artificial intelligence & image processing Computer Vision and Pattern Recognition Timestamp business Computer hardware |
Zdroj: | Image and Vision Computing Image and vision computing, 29(10), 666-680. Elsevier |
ISSN: | 0262-8856 |
DOI: | 10.1016/j.imavis.2011.07.004 |
Popis: | Applications such as surveillance and human behaviour analysis require high-bandwidth recording from multiple cameras, as well as from other sensors. In turn, sensor fusion has increased the required accuracy of synchronisation between sensors. Using commercial off-the-shelf components may compromise quality and accuracy due to several challenges, such as dealing with the combined data rate from multiple sensors; unknown offset and rate discrepancies between independent hardware clocks; the absence of trigger inputs or -outputs in the hardware; as well as the different methods for time-stamping the recorded data. To achieve accurate synchronisation, we centralise the synchronisation task by recording all trigger- or timestamp signals with a multi-channel audio interface. For sensors that don't have an external trigger signal, we let the computer that captures the sensor data periodically generate timestamp signals from its serial port output. These signals can also be used as a common time base to synchronise multiple asynchronous audio interfaces. Furthermore, we show that a consumer PC can currently capture 8-bit video data with 1024 × 1024 spatial- and 59.1 Hz temporal resolution, from at least 14 cameras, together with 8 channels of 24-bit audio at 96 kHz. We thus improve the quality/cost ratio of multi-sensor systems data capture systems. |
Databáze: | OpenAIRE |
Externí odkaz: |