ICASSP 2021

ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2021 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

MULTI-GRANULARITY FEATURE INTERACTION AND RELATION REASONING FOR 3D DENSE ALIGNMENT AND FACE RECONSTRUCTION

In this paper, we propose a multi-granularity feature interaction and relation reasoning network (MFIRRN) which can recover a detail-rich 3D face and perform more accurate dense alignment in an unconstrained environment. Traditional 3DMM-based methods directly regress parameters, resulting in the lack of fine-grained details in the reconstruction 3D face. To this end, we use different branches to capture discriminative features at different granularities, especially local features at medium and fine granularities.

ICASSP-poster.pdf

ICASSP-poster.pdf (663)

Categories:: Image, Video, and Multidimensional Signal Processing

15 Views

FULLY-NEURAL APPROACH TO VEHICLE WEIGHING AND STRAIN PREDICTION ON BRIDGES USING WIRELESS ACCELEROMETERS

Bridge weigh-in-motion (BWIM) is a technique of estimating vehicle loads on bridges and can be used to assess a bridge's structural fatigue and therefore its life.
BWIM can be realized by analyzing the bridge deflection in terms of its response to moving axle loads.
To obtain accurate load estimates, current BWIM systems require strain sensors, whose (re-) installation costs have limited their application.

icassp21s.pdf

icassp21s.pdf (356)

Categories:: Applications of Sensor Array and Multi-channel Signal Processing

51 Views

Complex-Valued Vs. Real-Valued Neural Networks for Classification Perspectives: An Example on Non-Circular Data

This paper shows the benefits of using Complex-Valued Neural Network (CVNN) on classification tasks for non-circular complex-valued datasets. Motivated by radar and especially Synthetic Aperture Radar (SAR) applications, we propose a statistical analysis of fully connected feed-forward neural networks performance in the cases where real and imaginary parts of the data are correlated through the non-circular property.

Poster-40x60cm-BleuClair.pdf

Poster presentation ICASSP 2021 MLSP-9.6 (313)

Categories:: Neural network learning (MLR-NNLR)

42 Views

NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling

Read more about NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling
Log in to post comments

Many commercial and forensic applications of speech demand the extraction of information about the speaker characteristics, which falls into the broad category of speaker profiling. The speaker characteristics needed for profiling include physical traits of the speaker like height, age, and gender of the speaker along with the native language of the speaker. Many of the datasets available have only partial information for speaker profiling.

Icassp_poster_nisp.pdf

NISP- Multilingual Multi accent dataset for speaker profiling --Poster (448)

NISP_slides.pdf

poster-slides (303)

Categories:: Other

140 Views

Short-time spectral aggregation for speaker embedding

Read more about Short-time spectral aggregation for speaker embedding
Log in to post comments

State-of-the-art speaker verification systems take frame-level acoustics features as input and produce fixed-dimensional embeddings as utterance-level representations. Thus, how to aggregate information from frame-level features is vital for achieving high performance. This paper introduces short-time spectral pooling (STSP) for better aggregation of frame-level information. STSP transforms the temporal feature maps of a speaker embedding network into the spectral domain and extracts the lowest spectral components of the averaged spectrograms for aggregation.