Sorry, you need to enable JavaScript to visit this website.

Audio and Acoustic Signal Processing

Being low-rank in the time-frequency plane


When using optimization methods with matrix variables in signal processing and machine learning, it is customary to assume some low-rank prior on the targeted solution. Nonnegative matrix factorization of spectrograms is a case in point in audio signal processing. However, this low-rank prior is not straightforwardly related to complex matrices obtained from a short-time Fourier -- or discrete Gabor -- transform (STFT), which is generally defined from and studied based on a modulation operator and a translation operator applied to a so-called window.

Paper Details

Authors:
Valentin Emiya, Ronan Hamon, Caroline Chaux
Submitted On:
13 April 2018 - 4:18am
Short Link:
Type:
Event:
Document Year:
Cite

Document Files

2018_icassp_poster.pdf

(19 downloads)

Keywords

Subscribe

[1] Valentin Emiya, Ronan Hamon, Caroline Chaux, "Being low-rank in the time-frequency plane", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2651. Accessed: May. 26, 2018.
@article{2651-18,
url = {http://sigport.org/2651},
author = {Valentin Emiya; Ronan Hamon; Caroline Chaux },
publisher = {IEEE SigPort},
title = {Being low-rank in the time-frequency plane},
year = {2018} }
TY - EJOUR
T1 - Being low-rank in the time-frequency plane
AU - Valentin Emiya; Ronan Hamon; Caroline Chaux
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2651
ER -
Valentin Emiya, Ronan Hamon, Caroline Chaux. (2018). Being low-rank in the time-frequency plane. IEEE SigPort. http://sigport.org/2651
Valentin Emiya, Ronan Hamon, Caroline Chaux, 2018. Being low-rank in the time-frequency plane. Available at: http://sigport.org/2651.
Valentin Emiya, Ronan Hamon, Caroline Chaux. (2018). "Being low-rank in the time-frequency plane." Web.
1. Valentin Emiya, Ronan Hamon, Caroline Chaux. Being low-rank in the time-frequency plane [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2651

EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS


CPD.pdf

PDF icon CPD.pdf (17 downloads)

Paper Details

Authors:
Fangxuan Sun; Jun Lin; Zhongfeng Wang
Submitted On:
13 April 2018 - 2:49am
Short Link:
Type:
Paper Code:
Document Year:
Cite

Document Files

CPD.pdf

(17 downloads)

Keywords

Subscribe

[1] Fangxuan Sun; Jun Lin; Zhongfeng Wang, "EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2626. Accessed: May. 26, 2018.
@article{2626-18,
url = {http://sigport.org/2626},
author = {Fangxuan Sun; Jun Lin; Zhongfeng Wang },
publisher = {IEEE SigPort},
title = {EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS},
year = {2018} }
TY - EJOUR
T1 - EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS
AU - Fangxuan Sun; Jun Lin; Zhongfeng Wang
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2626
ER -
Fangxuan Sun; Jun Lin; Zhongfeng Wang. (2018). EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS. IEEE SigPort. http://sigport.org/2626
Fangxuan Sun; Jun Lin; Zhongfeng Wang, 2018. EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS. Available at: http://sigport.org/2626.
Fangxuan Sun; Jun Lin; Zhongfeng Wang. (2018). "EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS." Web.
1. Fangxuan Sun; Jun Lin; Zhongfeng Wang. EADNET: EFFICIENT ARCHITECTURE FOR DECOMPOSED CONVOLUTIONAL NEURAL NETWORKS [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2626

High-speed light field image formation analysis using wavefield modeling with flexible sampling


In order to understand the image formation inside plenoptic systems, a wave-optic-based model is proposed in this paper that uses the Fresnel diffraction equation to propagate the whole object field into the plenoptic systems. The proposed model is much flexible at sampling on propagation planes by utilizing the method of multiple partial propagations. In order to verify the effectiveness of the proposed model, numerical simulations are conducted by comparing with existing wave optic model under different optical configurations of plenoptic cameras.

Paper Details

Authors:
Xin Jin, Li Liu, Qionghai Dai
Submitted On:
13 April 2018 - 2:49am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP2018Poster.pdf

(16 downloads)

Keywords

Subscribe

[1] Xin Jin, Li Liu, Qionghai Dai, "High-speed light field image formation analysis using wavefield modeling with flexible sampling", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2625. Accessed: May. 26, 2018.
@article{2625-18,
url = {http://sigport.org/2625},
author = {Xin Jin; Li Liu; Qionghai Dai },
publisher = {IEEE SigPort},
title = {High-speed light field image formation analysis using wavefield modeling with flexible sampling},
year = {2018} }
TY - EJOUR
T1 - High-speed light field image formation analysis using wavefield modeling with flexible sampling
AU - Xin Jin; Li Liu; Qionghai Dai
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2625
ER -
Xin Jin, Li Liu, Qionghai Dai. (2018). High-speed light field image formation analysis using wavefield modeling with flexible sampling. IEEE SigPort. http://sigport.org/2625
Xin Jin, Li Liu, Qionghai Dai, 2018. High-speed light field image formation analysis using wavefield modeling with flexible sampling. Available at: http://sigport.org/2625.
Xin Jin, Li Liu, Qionghai Dai. (2018). "High-speed light field image formation analysis using wavefield modeling with flexible sampling." Web.
1. Xin Jin, Li Liu, Qionghai Dai. High-speed light field image formation analysis using wavefield modeling with flexible sampling [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2625

High-speed light field image formation analysis using wavefield modeling with flexible sampling


In order to understand the image formation inside plenoptic systems, a wave-optic-based model is proposed in this paper that uses the Fresnel diffraction equation to propagate the whole object field into the plenoptic systems. The proposed model is much flexible at sampling on propagation planes by utilizing the method of multiple partial propagations. In order to verify the effectiveness of the proposed model, numerical simulations are conducted by comparing with existing wave optic model under different optical configurations of plenoptic cameras.

Paper Details

Authors:
Xin Jin, Li Liu, Qionghai Dai
Submitted On:
13 April 2018 - 2:49am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP2018Poster.pdf

(19 downloads)

Keywords

Additional Categories

Subscribe

[1] Xin Jin, Li Liu, Qionghai Dai, "High-speed light field image formation analysis using wavefield modeling with flexible sampling", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2624. Accessed: May. 26, 2018.
@article{2624-18,
url = {http://sigport.org/2624},
author = {Xin Jin; Li Liu; Qionghai Dai },
publisher = {IEEE SigPort},
title = {High-speed light field image formation analysis using wavefield modeling with flexible sampling},
year = {2018} }
TY - EJOUR
T1 - High-speed light field image formation analysis using wavefield modeling with flexible sampling
AU - Xin Jin; Li Liu; Qionghai Dai
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2624
ER -
Xin Jin, Li Liu, Qionghai Dai. (2018). High-speed light field image formation analysis using wavefield modeling with flexible sampling. IEEE SigPort. http://sigport.org/2624
Xin Jin, Li Liu, Qionghai Dai, 2018. High-speed light field image formation analysis using wavefield modeling with flexible sampling. Available at: http://sigport.org/2624.
Xin Jin, Li Liu, Qionghai Dai. (2018). "High-speed light field image formation analysis using wavefield modeling with flexible sampling." Web.
1. Xin Jin, Li Liu, Qionghai Dai. High-speed light field image formation analysis using wavefield modeling with flexible sampling [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2624

ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION

Paper Details

Authors:
Hemant K. Kathania, S. Shahnawazuddin , Nagaraj Adiga and Waquar Ahmad
Submitted On:
13 April 2018 - 12:39am
Short Link:
Type:
Event:
Paper Code:
Document Year:
Cite

Document Files

ICASSP_2018_poster_final.pdf

(23 downloads)

Keywords

Subscribe

[1] Hemant K. Kathania, S. Shahnawazuddin , Nagaraj Adiga and Waquar Ahmad, "ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2588. Accessed: May. 26, 2018.
@article{2588-18,
url = {http://sigport.org/2588},
author = {Hemant K. Kathania; S. Shahnawazuddin ; Nagaraj Adiga and Waquar Ahmad },
publisher = {IEEE SigPort},
title = {ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION},
year = {2018} }
TY - EJOUR
T1 - ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION
AU - Hemant K. Kathania; S. Shahnawazuddin ; Nagaraj Adiga and Waquar Ahmad
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2588
ER -
Hemant K. Kathania, S. Shahnawazuddin , Nagaraj Adiga and Waquar Ahmad. (2018). ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION. IEEE SigPort. http://sigport.org/2588
Hemant K. Kathania, S. Shahnawazuddin , Nagaraj Adiga and Waquar Ahmad, 2018. ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION. Available at: http://sigport.org/2588.
Hemant K. Kathania, S. Shahnawazuddin , Nagaraj Adiga and Waquar Ahmad. (2018). "ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION." Web.
1. Hemant K. Kathania, S. Shahnawazuddin , Nagaraj Adiga and Waquar Ahmad. ROLE OF PROSODIC FEATURES ON CHILDREN’S SPEECH RECOGNITION [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2588

REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION

Paper Details

Authors:
Yuzhong WU, Tan LEE
Submitted On:
12 April 2018 - 11:39pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

icassp2018_yzwu_poster_ver5.pdf

(17 downloads)

Keywords

Subscribe

[1] Yuzhong WU, Tan LEE, "REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2572. Accessed: May. 26, 2018.
@article{2572-18,
url = {http://sigport.org/2572},
author = {Yuzhong WU; Tan LEE },
publisher = {IEEE SigPort},
title = {REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION},
year = {2018} }
TY - EJOUR
T1 - REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION
AU - Yuzhong WU; Tan LEE
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2572
ER -
Yuzhong WU, Tan LEE. (2018). REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION. IEEE SigPort. http://sigport.org/2572
Yuzhong WU, Tan LEE, 2018. REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION. Available at: http://sigport.org/2572.
Yuzhong WU, Tan LEE. (2018). "REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION." Web.
1. Yuzhong WU, Tan LEE. REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2572

END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER

Paper Details

Authors:
Dilek Hakkani-Tür, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng
Submitted On:
12 April 2018 - 11:05pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

0308_icassp07.pptx

(20 downloads)

Keywords

Subscribe

[1] Dilek Hakkani-Tür, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng, "END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2566. Accessed: May. 26, 2018.
@article{2566-18,
url = {http://sigport.org/2566},
author = {Dilek Hakkani-Tür; Paul Crook; Xiujun Li; Jianfeng Gao; Li Deng },
publisher = {IEEE SigPort},
title = {END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER},
year = {2018} }
TY - EJOUR
T1 - END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER
AU - Dilek Hakkani-Tür; Paul Crook; Xiujun Li; Jianfeng Gao; Li Deng
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2566
ER -
Dilek Hakkani-Tür, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng. (2018). END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER. IEEE SigPort. http://sigport.org/2566
Dilek Hakkani-Tür, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng, 2018. END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER. Available at: http://sigport.org/2566.
Dilek Hakkani-Tür, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng. (2018). "END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER." Web.
1. Dilek Hakkani-Tür, Paul Crook, Xiujun Li, Jianfeng Gao, Li Deng. END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2566

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition

Paper Details

Authors:
Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson
Submitted On:
12 April 2018 - 11:09pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

icassp18-poster (5).pdf

(20 downloads)

Keywords

Subscribe

[1] Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson, "Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2565. Accessed: May. 26, 2018.
@article{2565-18,
url = {http://sigport.org/2565},
author = {Andrew Rosenberg; Samuel Thomas; Bhuvana Ramabhadran; Mark Hasegawa-Johnson },
publisher = {IEEE SigPort},
title = {Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition},
year = {2018} }
TY - EJOUR
T1 - Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
AU - Andrew Rosenberg; Samuel Thomas; Bhuvana Ramabhadran; Mark Hasegawa-Johnson
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2565
ER -
Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson. (2018). Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition. IEEE SigPort. http://sigport.org/2565
Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson, 2018. Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition. Available at: http://sigport.org/2565.
Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson. (2018). "Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition." Web.
1. Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson. Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2565

INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION


Ambisonics is a series of flexible spatial sound systems based on spatial harmonics decomposition and each order approximation of sound field. Accuracy and complexity of system increase with order. Considering that the horizontal localization resolution of human hearing is higher than vertical resolution, mixed-order Ambisonics (MOA) reconstructs horizontal sound field with higher order spatial harmonics, while reconstructs vertical sound field with lower order spatial harmonics, and thereby reaches a compromise between the perceptual performance and the complexity of system.

Paper Details

Authors:
Haiming Mai, Bosun Xie, Jianliang Jiang
Submitted On:
12 April 2018 - 10:29pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP2018-Paper1435-Poster

(17 downloads)

Keywords

Subscribe

[1] Haiming Mai, Bosun Xie, Jianliang Jiang, "INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2560. Accessed: May. 26, 2018.
@article{2560-18,
url = {http://sigport.org/2560},
author = {Haiming Mai; Bosun Xie; Jianliang Jiang },
publisher = {IEEE SigPort},
title = {INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION},
year = {2018} }
TY - EJOUR
T1 - INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION
AU - Haiming Mai; Bosun Xie; Jianliang Jiang
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2560
ER -
Haiming Mai, Bosun Xie, Jianliang Jiang. (2018). INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION. IEEE SigPort. http://sigport.org/2560
Haiming Mai, Bosun Xie, Jianliang Jiang, 2018. INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION. Available at: http://sigport.org/2560.
Haiming Mai, Bosun Xie, Jianliang Jiang. (2018). "INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION." Web.
1. Haiming Mai, Bosun Xie, Jianliang Jiang. INFLUENCE OF THE NUMBER OF LOUDSPEAKERS ON THE TIMBRE IN MIXED-ORDER AMBISONICS REPRODUTION [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2560

AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR


This paper considers the target localization problem using the hybrid bistatic range and time difference of arrival (TDOA) measurements in multistatic radar. An algebraic closed-form solution to this nonlinear estimation problem is developed through two-stage processing, where the nuisance variables are introduced in the first stage and the localization error of first stage solution is estimated to improve the final target position estimate in the second stage.

Paper Details

Authors:
Submitted On:
12 April 2018 - 10:04pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP2018-Zhaotao Qin-2653-Poster.pdf

(23 downloads)

Keywords

Additional Categories

Subscribe

[1] , "AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2552. Accessed: May. 26, 2018.
@article{2552-18,
url = {http://sigport.org/2552},
author = { },
publisher = {IEEE SigPort},
title = {AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR},
year = {2018} }
TY - EJOUR
T1 - AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR
AU -
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2552
ER -
. (2018). AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR. IEEE SigPort. http://sigport.org/2552
, 2018. AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR. Available at: http://sigport.org/2552.
. (2018). "AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR." Web.
1. . AN EFFICIENT TARGET LOCALIZATION ESTIMATOR FROM BISTATIC RANGE AND TDOA MEASUREMENTS IN MULTISTATIC RADAR [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2552

Pages