Publications

Phone duration modeling for speaker age estimation in children

Abstract

Automatic inference of paralinguistic information from speech, such as age, is an important area of research with many technological applications. Speaker age estimation can help with age-appropriate curation of information content and personalized interactive experiences. However, automatic speaker age estimation in children is challenging due to the paucity of speech data representing the developmental spectrum, and the large signal variability including within a given age group. Most prior approaches in child speaker age estimation adopt methods directly drawn from research on adult speech. In this paper, we propose a novel technique that exploits temporal variability present in children's speech for estimation of children's age. We focus on phone durations as biomarker of children's age. Phone duration distributions are derived by forced-aligning children's speech with transcripts. Regression models are …

Metadata

publication
The Journal of the Acoustical Society of America 152 (5), 3000-3009, 2022
year
2022
publication date
2022/11/1
authors
Prashanth Gurunath Shivakumar, Somer Bishop, Catherine Lord, Shrikanth Narayanan
link
https://pubs.aip.org/asa/jasa/article/152/5/3000/2840136
resource_link
https://arxiv.org/pdf/2109.01568
journal
The Journal of the Acoustical Society of America
volume
152
issue
5
pages
3000-3009
publisher
AIP Publishing