IEEE ICASSP 2024

IEEE ICASSP 2024 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The IEEE ICASSP 2024 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

HIM: DISCOVERING IMPLICIT RELATIONSHIPS IN HETEROGENEOUS SOCIAL NETWORKS

Read more about HIM: DISCOVERING IMPLICIT RELATIONSHIPS IN HETEROGENEOUS SOCIAL NETWORKS
Log in to post comments

To date, research on relation mining has typically focused on analyzing explicit relationships between entities, while ignoring the underlying connections between entities, known as implicit relationships. Exploring implicit relationships can reveal more about social dynamics and potential relationships in heterogeneous social networks to better explain complex social behaviors. The research presented in this paper explores implicit relationships discovery methods in the context of heterogeneous social networks.

HIM-Supplementary.pdf

Supplementary materials for the paper "HIM: DISCOVERING IMPLICIT RELATIONSHIPS IN HETEROGENEOUS SOCIAL NETWORKS" (353)

Xu.pdf

Paper for the paper "HIM: DISCOVERING IMPLICIT RELATIONSHIPS IN HETEROGENEOUS SOCIAL NETWORKS" (303)

ICASSP-HIM.pptx

ICASSP-HIM.pptx (282)

Categories:: Knowledge and Data Engineering
Other
Other

136 Views

INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA

Read more about INVERTIBLE VOICE CONVERSION WITH PARALLEL DATA
Log in to post comments

This paper introduces an innovative deep learning framework for parallel voice conversion to mitigate inherent risks associated with such systems. Our approach focuses on developing an invertible model capable of countering potential spoofing threats. Specifically, we present a conversion model that allows for the retrieval of source voices, thereby facilitating the identification of the source speaker. This framework is constructed using a series of invertible modules composed of affine coupling layers to ensure the reversibility of the conversion process.

zexin_poster.pdf

zexin_poster.pdf (303)

Categories:: Audio Processing Systems

84 Views

Joint Multi-Band DOA Estimation Using Low-Rank Matrix Recovery

Read more about Joint Multi-Band DOA Estimation Using Low-Rank Matrix Recovery
Log in to post comments

To address wideband direction of arrival (DOA) estimation problems, this paper proposes a gridless and covariance-free joint multi-band (JMB) DOA estimation method using low-rank matrix recovery. In contrast with subspace methods and sparse array-based methods, a unified frequency grid is established based on the concept of the greatest common divisor (GCD) to solve the nonlinearity of steering matrices from multiple frequencies. With the unified frequency grid, a low-rank master matrix is formed as a combination of the truncated Hankel matrices from different subbands and snapshots.

GuoZ_ICASSP24_poster.pdf

GuoZ_ICASSP24_poster.pdf (394)

Categories:: Signal and System Modeling, Representation and Estimation

128 Views

ZERO SHOT AUDIO TO AUDIO EMOTION TRANSFER WITH SPEAKER DISENTANGLEMENT

Read more about ZERO SHOT AUDIO TO AUDIO EMOTION TRANSFER WITH SPEAKER DISENTANGLEMENT
Log in to post comments

The problem of audio-to-audio (A2A) style transfer involves replacing the style features of the source audio with those from the target audio while preserving the content related attributes of the source audio. In this paper, we propose an efficient approach, termed as Zero-shot Emotion Style Transfer (ZEST), that allows the transfer of emotional content present in the given source audio with the one embedded in the target audio while retaining the speaker and speech content from the source.

2401.04511.pdf

2401.04511.pdf (297)

Categories:: Audio and Acoustic Signal Processing

122 Views