Machine Learning for Signal Processing

AN ATTENTION ENHANCED MULTI-TASK MODEL FOR OBJECTIVE SPEECH ASSESSMENT IN REAL-WORLD ENVIRONMENTS

slide_1603.pdf

slide_1603.pdf (355)

Categories:: Audio and Acoustic Signal Processing
Machine Learning for Signal Processing

18 Views

Deep geometric knowledge distillation with graphs

Read more about Deep geometric knowledge distillation with graphs
Log in to post comments

GKD___ICASSP_presentation (3).pdf

GKD___ICASSP_presentation (3).pdf (313)

Categories:: Machine Learning for Signal Processing

23 Views

A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK FOR 3D FACE RECONSTRUCTION

Read more about A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK FOR 3D FACE RECONSTRUCTION
Log in to post comments

3D Morphable Model (3DMM) is a statistical tool widely employed in reconstructing 3D face shape. Existing methods are aimed at predicting 3DMM shape parameters with a single encoder but suffer from unclear distinction of different attributes. To address this problem, Two-Pathway Encoder-Decoder Network (2PEDN) is proposed to regress the identity and expression components via global and local pathways. Specifically, each 2D face image is cropped into global face and local details as the inputs for the corresponding pathways.

A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK .pptx

A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK .pptx (279)

Categories:: Machine Learning for Signal Processing

25 Views

Training machine learning on JPEG compressed images

Read more about Training machine learning on JPEG compressed images
Log in to post comments

DCC.pdf

DCC.pdf (661)

Categories:: Machine Learning for Signal Processing

87 Views

Lossless Multi-Component Image Compression based on Integer Wavelet Coefficient Prediction using Convolutional Neural Networks

DCC2020_Eze_v4.pdf

DCC2020_Eze_v4.pdf (488)

Categories:: Machine Learning for Signal Processing

57 Views

Segmentation of Text-Lines and Words from JPEG Compressed Printed Text Documents Using DCT Coefficients

Segmenting a document image into text-lines and words finds applications in many research areas of DIA(Document Image Analysis) such as OCR, Word Spotting, and document retrieval. However, carrying out segmentation operation directly in the compressed document images is still an unexplored and challenging research area. Since JPEG is most widely accepted compression algorithm, this research paper attempts to segment a JPEG compressed printed text document image into text-lines and words, without fully decompressing the image.

dccv.pdf

DCC2020 Paper ID 181 (457)

Categories:: Image, Video, and Multidimensional Signal Processing
Machine Learning for Signal Processing
Multimedia Signal Processing

54 Views

Improved Subspace K-Means Performance via a Randomized Matrix Decomposition

Read more about Improved Subspace K-Means Performance via a Randomized Matrix Decomposition
Log in to post comments

Subspace clustering algorithms provide the capability
to project a dataset onto bases that facilitate clustering.
Proposed in 2017, the subspace k-means algorithm simultaneously
performs clustering and dimensionality reduction with the goal
of finding the optimal subspace for the cluster structure; this
is accomplished by incorporating a trade-off between cluster
and noise subspaces in the objective function. In this study,
we improve subspace k-means by estimating a critical transformation

Improved Subspace K-means Performance via a Randomized Matrix Decomposition.pdf

Improved Subspace K-means Performance via a Randomized Matrix Decomposition.pdf (474)

Categories:: Machine Learning for Signal Processing

30 Views

Poster: Generative-Discriminative Crop Type Identification using Satellite Images

Read more about Poster: Generative-Discriminative Crop Type Identification using Satellite Images
Log in to post comments

Crop type identification refers to distinguishing certain crop from other landcovers, which is an essential and crucial task in agricultural monitoring. Satellite images are good data input for identifying different crops since satellites capture relatively wider area and more spectral information. Based on prior knowledge of crop phenology, multi-temporal images are stacked to extract the growth pattern of varied crops.

poster_2.pdf

Poster: Generative-Discriminative Crop Type Identification using Satellite Images (365)

Categories:: Machine Learning for Signal Processing

28 Views

A deep network for single-snapshot direction of arrival estimation

Read more about A deep network for single-snapshot direction of arrival estimation
Log in to post comments

This paper examines a deep feedforward network for beamforming with the single--snapshot Sample Covariance Matrix (SCM). The Conventional beamforming formulation, typically quadratic in the complex weight space, is reformulated as real and linear in the weight covariance and SCM. The reformulated SCMs are used as input to a deep feed--forward neural network (FNN) for two source localization. Simulations demonstrate the effect of source incoherence and performance in a noisy tracking example.

conference_poster_6.pdf

conference_poster_6.pdf (867)

Categories:: Machine Learning for Signal Processing

41 Views

Deep Reinforcement Learning Based Energy Beamforming for Powering Sensor Networks

Read more about Deep Reinforcement Learning Based Energy Beamforming for Powering Sensor Networks
1 comment
Log in to post comments

We focus on a wireless sensor network powered with an energy beacon, where sensors send their measurements to the sink using the harvested energy. The aim of the system is to estimate an unknown signal over the area of interest as accurately as possible. We investigate optimal energy beamforming at the energy beacon and optimal transmit power allocation at the sensors under non-linear energy harvesting models. We use a deep reinforcement learning (RL) based approach where multi-layer neural networks are utilized.

2019MLSP_poster.pdf

2019MLSP_poster.pdf (475)

Categories:: Machine Learning for Signal Processing

97 Views

Machine Learning for Signal Processing

Pages