Publications
Towards large-scale cross-speaker articulatory modeling of vowels
Abstract
While previous studies have attempted to decompose vowel articulatory data into a set of basis factors, these studies have often been limited in both scale and the data being sparsely sampled, limiting interpretability and generalizability of the results (Nix et al. 1996 and Serrurier et al. 2019). In this study, the data were analyzed from 36 (23F, 13M) American English speakers producing 13 vowels in bVt sequences obtained using real-time MRI. Midsagittal tongue contours were obtained during vowel productions for all speakers using a semi-automated segmentation algorithm (Jain et al. 2024). Frames corresponding to the vowel articulation were segmented using MFA and simultaneously recorded audio. A combination of Procrustes analysis for cross-speaker normalization and guided PCA were employed to decompose the pooled articulatory space into a set of vowel “primitives.” 71% of the variation within the …
Metadata
- publication
- The Journal of the Acoustical Society of America 156 (4_Supplement), A49-A49, 2024
- year
- 2024
- publication date
- 2024/10/1
- authors
- Sean Foley, Shrikanth Narayanan
- link
- https://pubs.aip.org/asa/jasa/article-abstract/156/4_Supplement/A49/3331075
- journal
- The Journal of the Acoustical Society of America
- volume
- 156
- issue
- 4_Supplement
- pages
- A49-A49
- publisher
- Acoustical Society of America