
- Read more about Stereo InSE-NET: Stereo Audio Quality Predictor Transfer Learned from Mono InSE-NET
- Log in to post comments
This is a presentation of the paper (http://www.aes.org/e-lib/browse.cfm?elib=21902), presented at the 153rd Audio Engineering Society Convention (https://sched.co/1CK3O).
- Categories:

- Read more about End-to-End Neural Speech Coding for Real-Time Communications
- Log in to post comments
- Categories:

- Read more about A Data-Driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment
- Log in to post comments
Objective audio quality assessment systems often use perceptual models to predict the subjective quality scores of processed signals, as reported in listening tests. Most systems map different metrics of perceived degradation into a single quality score predicting subjective quality. This requires a quality mapping stage that is informed by real listening test data using statistical learning (\iec a data-driven approach) with distortion metrics as input features.
- Categories:

- Read more about InSE-NET: A Perceptually Coded Audio Quality Model based on CNN
- Log in to post comments
This is a presentation of the paper (http://www.aes.org/e-lib/browse.cfm?elib=21478), presented virtually at the 151st Audio Engineering Society Convention (https://sched.co/mIhg).
- Categories:

- Read more about Low Delay Robust Audio Coding by Noise Shaping, Fractional Sampling, and Source Prediction
- Log in to post comments
It was recently shown that the combination of source prediction, two-times oversampling, and noise shaping, can be used to obtain a robust (multiple-description) audio coding frame- work for networks with packet loss probabilities less than 10%. Specifically, it was shown that audio signals could be encoded into two descriptions (packets), which were separately sent over a communication channel. Each description yields a desired performance by itself, and when they are combined, the performance is improved.
- Categories:

- Read more about Pruning of an Audio Enhancing Deep Generative Neural Network
- Log in to post comments
This is a presentation of the paper (http://www.aes.org/e-lib/browse.cfm?elib=20769), presented virtually at the 148th Audio Engineering Society Convention (https://www.eventscribe.com/2020/VirtualVienna/).
- Categories:

- Read more about Source Coding of Audio Signals with a Generative Model
- 1 comment
- Log in to post comments
These are the slides from the video presentation at ICASSP 2020 of the paper "Source Coding of Audio Signals with a Generative Model".
- Categories:

- Read more about Source Coding of Audio Signals with a Generative Model
- 2 comments
- Log in to post comments
We consider source coding of audio signals with the help of a generative model. We use a construction where a waveform is first quantized, yielding a finite bitrate representation. The waveform is then reconstructed by random sampling from a model conditioned on the quantized waveform. The proposed coding scheme is theoretically analyzed. Using SampleRNN as the generative model, we demonstrate that the proposed coding structure provides performance competitive with state-of-the-art source coding tools for specific categories of audio signals.
- Categories:

- Read more about A high efficient cascade coder with predictor blending method for lossless audio compression
- Log in to post comments
In this paper, the improvement of the cascaded prediction method was presented. The prediction method with backward adaptation and extended Ordinary Least Square (OLS+) was presented. An own approach to implementation of the effective context-dependent constant component removal block was used. Also the improved adaptive arithmetic coder with short, medium and long-term adaptation was used and the experiment was carried out comparing the results with other known lossless audio coders against which our method obtained the best efficiency.
- Categories:
- Read more about Learning about perception of temporal fine structure by building audio codecs
- Log in to post comments
This is a poster presented Thursday August 22, 2019 at the International Symposium on Auditory and Audiological Research (ISAAR). https://www.isaar.eu/index.php
SP.72 - Learning about perception of temporal fine structure by building audio codecs
- Categories: