Publications

Towards large-scale cross-speaker articulatory modeling of vowels

Abstract

While previous studies have attempted to decompose vowel articulatory data into a set of basis factors, these studies have often been limited in both scale and the data being sparsely sampled, limiting interpretability and generalizability of the results (Nix et al. 1996 and Serrurier et al. 2019). In this study, the data were analyzed from 36 (23F, 13M) American English speakers producing 13 vowels in bVt sequences obtained using real-time MRI. Midsagittal tongue contours were obtained during vowel productions for all speakers using a semi-automated segmentation algorithm (Jain et al. 2024). Frames corresponding to the vowel articulation were segmented using MFA and simultaneously recorded audio. A combination of Procrustes analysis for cross-speaker normalization and guided PCA were employed to decompose the pooled articulatory space into a set of vowel “primitives.” 71% of the variation within the …

Metadata

publication
The Journal of the Acoustical Society of America 156 (4_Supplement), A49-A49, 2024
year
2024
publication date
2024/10/1
authors
Sean Foley, Shrikanth Narayanan
link
https://pubs.aip.org/asa/jasa/article-abstract/156/4_Supplement/A49/3331075
journal
The Journal of the Acoustical Society of America
volume
156
issue
4_Supplement
pages
A49-A49
publisher
Acoustical Society of America