Speech Production (SPE-SPRD)

Voice Impersonation Using Generative Adversarial Networks

Paper Details

Authors:
Rita Singh, Bhiksha Raj
Submitted On:
14 April 2018 - 8:39pm

Document Files

YG_poster.pdf

(92 downloads)

[1] Rita Singh, Bhiksha Raj, "Voice Impersonation Using Generative Adversarial Networks", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2862. Accessed: Nov. 19, 2018.

DIRECT, NEAR REAL TIME ANIMATION OF A 3D TONGUE MODEL USING NON-INVASIVE ULTRASOUND IMAGES


A new technique for representing speech articulation with
an ultrasound-driven finite element model of the tongue is
presented. By using a snake contour extraction algorithm
with anatomically motivated constraints and a common
coordinate system between the ultrasound and the tongue
model, it is possible for the first time to obtain a realistic 3D
simulation of the tongue directly from a non-invasive sensor
(ultrasound), without mapping through any intermediate
sensor modalities, and at near real-time frame rates.

Paper Details

Authors:
Submitted On:
19 April 2018 - 9:35pm

Document Files

ICASSP.pptx

(92 downloads)

ICASSP.pptx

(172 downloads)

[1] "DIRECT, NEAR REAL TIME ANIMATION OF A 3D TONGUE MODEL USING NON-INVASIVE ULTRASOUND IMAGES", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2846. Accessed: Nov. 19, 2018.

Production and Perception of Focus in L2 Mandarin of Qiang Speakers


The present study investigated the production and perception of focus in the L2 Mandarin of Qiang speakers. Ten Qiang-Mandarin speakers uttered three target sentences under four focus conditions: initial, medial, final, and neutral focus. Systematic acoustic analysis showed that: (1) In Qiang-Mandarin, on-focus words exhibit significantly raised F0, increased intensity, and lengthened duration. There is no Post-focus Compression (PFC). The duration of pre-focus and post-focus words remains largely intact.
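
On-focus F0 raising is usually easier to compare across speakers when it is expressed in semitones rather than Hz. The sketch below shows that standard conversion. It is an illustration only: the function name and the 200 Hz and 240 Hz values are hypothetical and do not come from the study.

```python
import math

def hz_to_semitones(f0_hz, ref_hz):
    """Return the interval from ref_hz up to f0_hz in semitones (12 per octave)."""
    if f0_hz <= 0 or ref_hz <= 0:
        raise ValueError("F0 values must be positive")
    return 12.0 * math.log2(f0_hz / ref_hz)

# Hypothetical means for one speaker: neutral focus 200 Hz, on-focus 240 Hz.
rise = hz_to_semitones(240.0, 200.0)
print(f"on-focus F0 rise: {rise:.2f} semitones")
```

With these made-up values the rise is roughly 3.2 semitones. A log scale keeps the measure comparable between speakers with different pitch ranges.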

Paper Details

Authors:
Bei Wang
Submitted On:
15 October 2016 - 5:27am

Document Files

Qiang-Mandarin_ID174.pdf

(277 downloads)

[1] Bei Wang, "Production and Perception of Focus in L2 Mandarin of Qiang Speakers", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1224. Accessed: Nov. 19, 2018.

An Interface Research on Rhetorical Structure and Prosody Features in Chinese Reading Texts


This paper studies the interface between the rhetorical and prosodic aspects of Chinese read discourse within the Rhetorical Structure Theory (RST) framework. Ten discourses in three genres (Commentary, Narrative, and Descriptive) from the Annotated Speech Corpus of Chinese Discourse (ASCCD) were diagrammed in RST. Recordings from 5 male and 5 female speakers were annotated and then analyzed acoustically and statistically using Praat and R.

Paper Details

Authors:
Liang Zhang, Yuan Jia, Aijun Li
Submitted On:
18 October 2016 - 7:52am

Document Files

Liang_O10-4-1016.pptx

(301 downloads)

Liang_O10-4-1016.pptx

(279 downloads)

Liang_O10-4-1018.pptx

(287 downloads)

[1] Liang Zhang, Yuan Jia, Aijun Li, "An Interface Research on Rhetorical Structure and Prosody Features in Chinese Reading Texts", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1217. Accessed: Nov. 19, 2018.

Contributions of the Piriform Fossa of Female Speakers to Vowel Spectra


The bilateral cavities of the piriform fossa are side branches of the vocal tract and produce anti-resonance(s) in the transfer function. This effect is well known for male vocal tracts, but data from female speakers have been scarce. This study investigates the contributions of the piriform fossa to vowel spectra in female vocal tracts by means of MRI-based vocal-tract modeling and acoustic experiments with the water-filling technique. Results from three female subjects indicate that the piriform fossa generates one or two dips in the frequency region of 4-6 kHz.
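
As a rough plausibility check on the 4-6 kHz range, a side branch can be idealized as a uniform tube closed at its far end, which places anti-resonances at odd multiples of c/(4L). The sketch below applies that textbook quarter-wavelength relation. It is not the MRI-based model used in the study; the speed of sound (35000 cm/s) and the function names are assumptions made for illustration.

```python
SPEED_OF_SOUND_CM_S = 35000.0  # assumed value for warm, humid air

def antiresonance_hz(branch_length_cm, mode=0, c=SPEED_OF_SOUND_CM_S):
    """Anti-resonance of an idealized closed side branch: (2*mode + 1) * c / (4 * L)."""
    return (2 * mode + 1) * c / (4.0 * branch_length_cm)

def branch_length_cm(dip_hz, c=SPEED_OF_SOUND_CM_S):
    """Branch length whose lowest anti-resonance falls at dip_hz."""
    return c / (4.0 * dip_hz)

# Effective branch lengths that would put the first dip at 4 kHz and at 6 kHz.
for dip in (4000.0, 6000.0):
    print(f"{dip:.0f} Hz -> {branch_length_cm(dip):.2f} cm")
```

Under these assumptions, dips between 4 and 6 kHz correspond to effective branch lengths of roughly 1.5 to 2.2 cm. Real cavities are not uniform tubes, so this is only an order-of-magnitude estimate.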

Paper Details

Authors:
Congcong Zhang, Kiyoshi Honda, Ju Zhang, Jianguo Wei
Submitted On:
15 October 2016 - 12:24am

Document Files

zcc_ISCSLP2016.pdf

(280 downloads)

[1] Congcong Zhang, Kiyoshi Honda, Ju Zhang, Jianguo Wei, "Contributions of the Piriform Fossa of Female Speakers to Vowel Spectra", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1203. Accessed: Nov. 19, 2018.

Individual difference and acoustic effect of female laryngeal cavities


This study examines the acoustic effect of the laryngeal cavity of female speakers on the higher vowel spectra. To do so, MRI data of vowels /a/ and /i/ obtained from three female speakers were analyzed with data from a male speaker as reference. 3D vocal-tract shapes were extracted from the MRI data and printed as solid mechanical models. Transfer functions of the models' vocal tracts were estimated by a transmission line model. Individual variations of the laryngeal cavity were described by the area functions of the cavity.
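
To illustrate how a transmission line model turns an area function into a transfer function, the sketch below chains lossless tube sections and evaluates the volume-velocity gain from glottis to lips. It is a minimal textbook version, not the authors' implementation: losses, wall vibration, and lip radiation are ignored (the lips are treated as an ideal open end), and the constants, function names, and the uniform 17.5 cm test tube are assumptions.

```python
import math

C = 35000.0    # speed of sound in cm/s (assumed)
RHO = 0.00114  # air density in g/cm^3 (assumed)

def section_matrix(area, length, freq):
    """Chain (ABCD) matrix of one lossless uniform tube section."""
    k = 2.0 * math.pi * freq / C   # wavenumber
    z = RHO * C / area             # characteristic impedance of the section
    c, s = math.cos(k * length), math.sin(k * length)
    return ((c, 1j * z * s), (1j * s / z, c))

def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def gain(areas, section_len, freq):
    """|U_lips / U_glottis| for an area function listed from glottis to lips."""
    m = ((1.0, 0.0), (0.0, 1.0))
    for area in areas:
        m = matmul(m, section_matrix(area, section_len, freq))
    d = abs(m[1][1])  # with zero pressure at the lips, U_glottis = D * U_lips
    return math.inf if d == 0 else 1.0 / d

# A uniform 17.5 cm tube should peak near c / (4 * L) = 500 Hz.
freqs = list(range(100, 1001, 50))
peak = max(freqs, key=lambda f: gain([3.0] * 35, 0.5, f))
print(peak)
```

Replacing the constant list of areas with a measured area function gives the corresponding lossless transfer function. Narrowing the first few sections (the laryngeal cavity) is the kind of change whose spectral effect such a model can expose.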

Paper Details

Authors:
Jing Li, Kiyoshi Honda, Ju Zhang, Jianguo Wei
Submitted On:
14 October 2016 - 10:43am

Document Files

[ISCSLP2016] ID82 Oral.PDF

(223 downloads)

[1] Jing Li, Kiyoshi Honda, Ju Zhang, Jianguo Wei, "Individual difference and acoustic effect of female laryngeal cavities", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1202. Accessed: Nov. 19, 2018.

L1/L2 Difference in Phonological Sensitivity and Information Planning - Evidence from F0 Pattern


Assuming that linguistic specifications and information
planning contribute to different levels of prosodic organization
that cumulatively constitute output prosody, quantitative
analysis of respective contributions can be derived through
normalization procedures that remove levels of interactions
involved. The current study attempts to account for how L2
prosody departs from the L1 norm in the two levels mentioned
and whether an account can be offered. F0 patterns of English
word stress categories (primary, secondary and tertiary) and

Paper Details

Authors:
Chao-yu Su, Chiu-yu Tseng
Submitted On:
12 October 2016 - 2:03am

Document Files

Final ISCSLP16_Poster.pdf

(230 downloads)

[1] Chao-yu Su, Chiu-yu Tseng, "L1/L2 Difference in Phonological Sensitivity and Information Planning - Evidence from F0 Pattern", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1157. Accessed: Nov. 19, 2018.