Sorry, you need to enable JavaScript to visit this website.

RF-GML: Reference-Free Generative Machine Listener

Citation Author(s):
Arijit Biswas, Guanxin Jiang
Submitted by:
Arijit Biswas
Last updated:
29 April 2025 - 3:56am
Document Type:
Presentation Slides
Document Year:
2025
Event:
Presenters:
Arijit Biswas
Paper Code:
2199
Categories:
 

This paper introduces a novel reference-free (RF) audio quality metric called the RF-Generative Machine Listener (RF-GML), designed to evaluate coded mono, stereo, and binaural audio at a 48 kHz sample rate. RF-GML leverages transfer learning from a state-of-the-art full-reference (FR) Generative Machine Listener (GML) with minimal architectural modifications. The term "generative" refers to the model’s ability to generate an arbitrary number of simulated listening scores. Unlike existing RF models, RF-GML accurately predicts subjective quality scores across diverse content types and codecs. Extensive evaluations demonstrate its superiority in rating unencoded audio and distinguishing different levels of coding artifacts. RF-GML’s performance and versatility make it a valuable tool for coded audio quality assessment and monitoring in various applications, all without the need for a reference signal.

up
0 users have voted: