Publications
This section lists work published by ISI over the past several decades.
2024
Can a Machine Distinguish High and Low Amount of Social Creak in Speech?
Anne-Maria Laukkanen, Sudarsana Reddy Kadiri, Shrikanth Narayanan, Paavo Alku
Journal of Voice, 2024
Scaling wearable foundation models
Girish Narayanswamy, Xin Liu, Kumar Ayush, Yuzhe Yang, Xuhai Xu, Shun Liao, Jake Garrison, Shyam Tailor, Jake Sunshine, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Jiening Zhan, Mark Malhotra, Shwetak Patel, Samy Abdel-Ghaffar, Daniel McDuff
arXiv preprint arXiv:2410.13638, 2024
Towards large-scale cross-speaker articulatory modeling of vowels
Sean Foley, Shrikanth Narayanan
The Journal of the Acoustical Society of America 156 (4_Supplement), A49-A49, 2024
Evaluation of state-of-the-art ASR Models in Child-Adult Interactions
Aditya Ashvin, Rimita Lahiri, Aditya Kommineni, Somer Bishop, Catherine Lord, Sudarsana Reddy Kadiri, Shrikanth Narayanan
arXiv preprint arXiv:2409.16135, 2024
Speech2rtMRI: Speech-Guided Diffusion Model for Real-time MRI Video of the Vocal Tract during Speech
Hong Nguyen, Sean Foley, Kevin Huang, Xuan Shi, Tiantian Feng, Shrikanth Narayanan
arXiv preprint arXiv:2409.15525, 2024
Towards child-inclusive clinical video understanding for autism spectrum disorder
Aditya Kommineni, Digbalay Bose, Tiantian Feng, So Hyun Kim, Helen Tager-Flusberg, Somer Bishop, Catherine Lord, Sudarsana Kadiri, Shrikanth Narayanan
arXiv preprint arXiv:2409.13606, 2024
Personalized Speech Recognition for Children with Test-Time Adaptation
Zhonghao Shi, Harshvardhan Srivastava, Xuan Shi, Shrikanth Narayanan, Maja J Matarić
arXiv preprint arXiv:2409.13095, 2024
Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling
Tiantian Feng, Anfeng Xu, Xuan Shi, Somer Bishop, Shrikanth Narayanan
arXiv preprint arXiv:2409.09340, 2024
Data efficient child-adult speaker diarization with simulated conversations
Anfeng Xu, Tiantian Feng, Helen Tager-Flusberg, Catherine Lord, Shrikanth Narayanan
arXiv preprint arXiv:2409.08881, 2024
ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation
Tiantian Feng, Tuo Zhang, Salman Avestimehr, Shrikanth S Narayanan
arXiv preprint arXiv:2408.15803, 2024
Early detection of coffee leaf rust through convolutional neural networks trained on low-resolution images
Angelly Cabrera, Kleanthis Avramidis, Shrikanth Narayanan
arXiv preprint arXiv:2407.14737, 2024
Artificial intelligence to differentiate pediatric pseudopapilledema and true papilledema on fundus photographs
Melinda Y Chang, Gena Heidary, Shannon Beres, Stacy L Pineles, Eric D Gaier, Ryan Gise, Mark Reid, Kleanthis Avramidis, Mohammad Rostami, Shrikanth Narayanan, Pediatric Optic Nerve Investigator Group
Ophthalmology Science 4 (4), 100496, 2024
Scaling representation learning from ubiquitous ECG with state-space models
Kleanthis Avramidis, Dominika Kunc, Bartosz Perz, Kranti Adsul, Tiantian Feng, Przemysław Kazienko, Stanisław Saganowski, Shrikanth Narayanan
IEEE Journal of Biomedical and Health Informatics, 2024
Saliency analysis of eye tracking in children with cortical/cerebral visual impairment (CVI) enabled by machine learning
Melinda Chang, Kleanthis Avramidis, Rahul Sharma, Mark Borchert, Shrikanth Narayanan
Investigative Ophthalmology & Visual Science 65 (7), 1501-1501, 2024
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding
Tuo Zhang, Tiantian Feng, Yibin Ni, Mengqin Cao, Ruying Liu, Katharine Butler, Yanjun Weng, Mi Zhang, Shrikanth S Narayanan, Salman Avestimehr
arXiv preprint arXiv:2406.10318, 2024
Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?
Tiantian Feng, Dimitrios Dimitriadis, Shrikanth Narayanan
arXiv preprint arXiv:2406.08800, 2024
Exploring speech foundation models for speaker diarization in child-adult dyadic interactions
Anfeng Xu, Kevin Huang, Tiantian Feng, Lue Shen, Helen Tager-Flusberg, Shrikanth Narayanan
arXiv preprint arXiv:2406.07890, 2024
Toward fully-end-to-end listened speech decoding from EEG signals
Jihwan Lee, Aditya Kommineni, Tiantian Feng, Kleanthis Avramidis, Xuan Shi, Sudarsana Kadiri, Shrikanth Narayanan
arXiv preprint arXiv:2406.08644, 2024
Machine-learning-based prediction of client distress from session recordings
Patty B Kuo, Michael J Tanana, Simon B Goldberg, Derek D Caperton, Shrikanth Narayanan, David C Atkins, Zac E Imel
Clinical Psychological Science 12 (3), 435-446, 2024
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality
Tiantian Feng, Xuan Shi, Rahul Gupta, Shrikanth S Narayanan
arXiv preprint arXiv:2404.17983, 2024