Low Delay Robust Audio Coding by Noise Shaping, Fractional Sampling, and Source Prediction
Autor: | Jan Ostergaard |
---|---|
Přispěvatelé: | Bilgin, Ali, Marcellin, Michael W., Serra-Sagrista, Joan, Storer, James A. |
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
PEAQ
Audio signal Computer science Network packet low delay 020206 networking & telecommunications 02 engineering and technology fractional sampling Noise shaping 030507 speech-language pathology & audiology 03 medical and health sciences audio coding Packet loss 0202 electrical engineering electronic engineering information engineering source predictions Oversampling Multiple descriptions 0305 other medical science Algorithm noise shaping Decoding methods Data compression |
Zdroj: | Østergaard, J 2021, Low Delay Robust Audio Coding by Noise Shaping, Fractional Sampling, and Source Prediction . in A Bilgin, M W Marcellin, J Serra-Sagrista & J A Storer (eds), Proceedings-DCC 2021 : 2021 Data Compression Conference ., 9418676, IEEE Signal Processing Society, Data Compression Conference. Proceedings, pp. 273-282, 2021 Data Compression Conference (DCC), Snowbird, Utah, United States, 23/03/2021 . https://doi.org/10.1109/DCC50243.2021.00035 DCC |
DOI: | 10.1109/DCC50243.2021.00035 |
Popis: | It was recently shown that the combination of source prediction, two-times oversampling, and noise shaping, can be used to obtain a robust (multiple-description) audio coding frame- work for networks with packet loss probabilities less than 10%. Specifically, it was shown that audio signals could be encoded into two descriptions (packets), which were separately sent over a communication channel. Each description yields a desired performance by itself, and when they are combined, the performance is improved. This paper extends the previ- ous work to an arbitrary number of descriptions (packets) by using fractional oversampling and a new decoding principle. We demonstrate that, due to source aliasing, existing MSE optimized reconstruction rules from noisy sampled data, performs poorly from a perceptual point of view. A simple reconstruction rule is proposed, that improves the PEAQ objective difference grades (ODG) by more than 2 points. The proposed audio coder enables low- delay high-quality audio streaming on networks with late packet arrivals or packet losses. With a coding delay of 2.5 ms, and a total bitrate of 300 kbps, it is demonstrated that mean PEAQ ODGs around -0.65 can be obtained for 48 kHz (mono) music (pop & rock), and packet loss probabilities of 20%. |
Databáze: | OpenAIRE |
Externí odkaz: |