Other

Estimation of Fields Using Binary Measurements From a Mobile Agent

Read more about Estimation of Fields Using Binary Measurements From a Mobile Agent
Log in to post comments

presentation_ICAS.pdf

presentation_ICAS.pdf (233)

Categories:: Other

16 Views

Improving a User's Haptic Perceptual Sensitivity by Optimizing Effective Manipulability of a Redundant User Interface

Human perceptual sensitivity of various types of forces, e.g., stiffness and friction, is important for surgeons during robotic surgeries such as needle insertion and palpation. However, force feedback from robot end-effector is usually a combination of desired and undesired force components which could have an effect on the perceptual sensitivity of the desired one. In presence of undesired forces, to improve perceptual sensitivity of desired force could benefit robotic surgical outcomes.

20210712_ICAS2021_paper112.pdf

20210712_ICAS2021_paper112.pdf (304)

Categories:: Other

15 Views

Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model

Modern Automatic Speech Recognition (ASR) systems can achieve high performance in terms of recognition accuracy. However, a perfectly accurate transcript still can be challenging to read due to disfluency, filter words, and other errata common in spoken communication. Many downstream tasks and human readers rely on the output of the ASR system; therefore, errors introduced by the speaker and ASR system alike will be propagated to the next task in the pipeline.

slides.pdf

Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model (330)

Categories:: Other

58 Views

Assisted Learning: Cooperative AI with Autonomy

Read more about Assisted Learning: Cooperative AI with Autonomy
Log in to post comments

ASCII_ICASSP_slides.pdf

ASCII_ICASSP_slides.pdf (191)

Categories:: Other

10 Views

Detecting Acoustic Reflector using a Robot's Ego-noise

Read more about Detecting Acoustic Reflector using a Robot's Ego-noise
Log in to post comments

In this paper, we propose a method to estimate the proximity of an acoustic reflector, e.g., a wall, using ego-noise, i.e., the noise produced by the moving parts of a listening robot. This is achieved by estimating the times of arrival of acoustic echoes reflected from the surface. Simulated experiments show that the proposed non-intrusive approach is capable of accurately estimating the distance of a reflector up to 1 meter and outperforms a previously proposed intrusive approach under loud ego-noise conditions.

Poster_for_ICASSP_2021.pdf

Poster_for_ICASSP_2021.pdf (251)

Detecting-acoustic-reflector-localization-using-robot-ego-noise-main.zip

Detecting-acoustic-reflector-localization-using-robot-ego-noise-main.zip (248)

Categories:: Signal and System Modeling, Representation and Estimation
Other

16 Views

A PARALLELIZABLE LATTICE RESCORING STRATEGY WITH NEURAL LANGUAGE MODELS

Read more about A PARALLELIZABLE LATTICE RESCORING STRATEGY WITH NEURAL LANGUAGE MODELS
Log in to post comments

This paper proposes a parallel computation strategy and a posterior-based lattice expansion algorithm for efficient lattice rescoring with neural language models (LMs) for automatic speech recognition. First, lattices from first-pass decoding are expanded by the proposed posterior-based lattice expansion algorithm. Second, each expanded lattice is converted into a minimal list of hypotheses that covers every arc. Each hypothesis is constrained to be the best path for at least one arc it includes.

ICASSP21_poster_Ke.pdf

ICASSP21_poster_Ke.pdf (247)

Categories:: Other

7 Views

Federated Learning With Local Differential Privacy: Trade-Offs Between Privacy, Utility, and Communication

Federated learning (FL) allows to train a massive amount of data privately due to its decentralized structure. Stochastic gradient descent (SGD) is commonly used for FL due to its good empirical performance, but sensitive user information can still be inferred from weight updates shared during FL iterations. We consider Gaussian mechanisms to preserve local differential privacy (LDP) of user data in the FL model with SGD. The trade-offs between user privacy, global utility, and transmission rate are proved by defining appropriate metrics for FL with LDP.

ICASSP 2021 poster v3.pdf

The pdf file of the poster used for the poster session. (369)

ICASSP 2021 presentation wo video.pptx

The slides (.pptx file) used for the recorded presentation. (279)

Categories:: Other

86 Views

NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling

Read more about NISP: A Multi-lingual Multi-accent Dataset for Speaker Profiling
Log in to post comments

Many commercial and forensic applications of speech demand the extraction of information about the speaker characteristics, which falls into the broad category of speaker profiling. The speaker characteristics needed for profiling include physical traits of the speaker like height, age, and gender of the speaker along with the native language of the speaker. Many of the datasets available have only partial information for speaker profiling.

Icassp_poster_nisp.pdf

NISP- Multilingual Multi accent dataset for speaker profiling --Poster (251)

NISP_slides.pdf

poster-slides (237)

Categories:: Other

121 Views

MULTIVIEW SENSING WITH UNKNOWN PERMUTATIONS: AN OPTIMAL TRANSPORT APPROACH

Read more about MULTIVIEW SENSING WITH UNKNOWN PERMUTATIONS: AN OPTIMAL TRANSPORT APPROACH
Log in to post comments

In several applications, including imaging of deformable objects while in motion, simultaneous localization and mapping, and unlabeled sensing, we encounter the problem of recovering a signal that is measured subject to unknown permutations. In this paper we take a fresh look at this problem through the lens of optimal transport (OT). In particular, we recognize that in most practical applications the unknown permutations are not arbitrary but some are more likely to occur than others.