Documents
Poster
F0 ESTIMATION FROM TELEPHONE SPEECH USING DEEP FEATURE LOSS
- Citation Author(s):
- Submitted by:
- Supritha Shetty
- Last updated:
- 31 May 2023 - 7:00am
- Document Type:
- Poster
- Document Year:
- 2023
- Event:
- Presenters:
- K T Deepak
- Paper Code:
- 4051
- Categories:
- Keywords:
- Log in to post comments
Accurate pitch estimation in speech signal plays a vital role in several applications. Robust pitch estimation in telephone speech is still a challenge due to the narrow bandwidth of the signal. Electroglottograph (EGG) signal is a reliable means for pitch estimation, however, it’s not practically possible to
measure such a signal in many applications. In this work, a method is proposed to synthesize EGG signal from telephone speech using deep feature loss network and subsequently pitch contour is derived from synthesized EGG (SEGG) signal. In order to evaluate the proposed work, CMU-Arctic speech database is used as it contains simultaneous EGG signal recorded. The telephonic speech is derived using International Telecommunications Union ITU-T as specified in the
Blizzard Challenge. The robustness of the proposed method is demonstrated under different noisy conditions. The performance of the proposed work is encouraging when compared with other state-of-the-art methods.
Comments
The poster and the paper
The poster and the paper preprint materials are available here.