Publications

This section lists work published by ISI over the past several decades.

2024

Can a Machine Distinguish High and Low Amount of Social Creak in Speech?

Anne-Maria Laukkanen, Sudarsana Reddy Kadiri, Shrikanth Narayanan, Paavo Alku
Journal of Voice, 2024

Scaling wearable foundation models

Girish Narayanswamy, Xin Liu, Kumar Ayush, Yuzhe Yang, Xuhai Xu, Shun Liao, Jake Garrison, Shyam Tailor, Jake Sunshine, Yun Liu, Tim Althoff, Shrikanth Narayanan, Pushmeet Kohli, Jiening Zhan, Mark Malhotra, Shwetak Patel, Samy Abdel-Ghaffar, Daniel McDuff
arXiv preprint arXiv:2410.13638, 2024

Towards large-scale cross-speaker articulatory modeling of vowels

Sean Foley, Shrikanth Narayanan
The Journal of the Acoustical Society of America 156 (4_Supplement), A49-A49, 2024

Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Aditya Ashvin, Rimita Lahiri, Aditya Kommineni, Somer Bishop, Catherine Lord, Sudarsana Reddy Kadiri, Shrikanth Narayanan
arXiv preprint arXiv:2409.16135, 2024

Speech2rtMRI: Speech-Guided Diffusion Model for Real-time MRI Video of the Vocal Tract during Speech

Hong Nguyen, Sean Foley, Kevin Huang, Xuan Shi, Tiantian Feng, Shrikanth Narayanan
arXiv preprint arXiv:2409.15525, 2024

Towards child-inclusive clinical video understanding for autism spectrum disorder

Aditya Kommineni, Digbalay Bose, Tiantian Feng, So Hyun Kim, Helen Tager-Flusberg, Somer Bishop, Catherine Lord, Sudarsana Kadiri, Shrikanth Narayanan
arXiv preprint arXiv:2409.13606, 2024

Personalized Speech Recognition for Children with Test-Time Adaptation

Zhonghao Shi, Harshvardhan Srivastava, Xuan Shi, Shrikanth Narayanan, Maja J Matarić
arXiv preprint arXiv:2409.13095, 2024

Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling

Tiantian Feng, Anfeng Xu, Xuan Shi, Somer Bishop, Shrikanth Narayanan
arXiv preprint arXiv:2409.09340, 2024

Data efficient child-adult speaker diarization with simulated conversations

Anfeng Xu, Tiantian Feng, Helen Tager-Flusberg, Catherine Lord, Shrikanth Narayanan
arXiv preprint arXiv:2409.08881, 2024

ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation

Tiantian Feng, Tuo Zhang, Salman Avestimehr, Shrikanth S Narayanan
arXiv preprint arXiv:2408.15803, 2024

Early detection of coffee leaf rust through convolutional neural networks trained on low-resolution images

Angelly Cabrera, Kleanthis Avramidis, Shrikanth Narayanan
arXiv preprint arXiv:2407.14737, 2024

Artificial intelligence to differentiate pediatric pseudopapilledema and true papilledema on fundus photographs

Melinda Y Chang, Gena Heidary, Shannon Beres, Stacy L Pineles, Eric D Gaier, Ryan Gise, Mark Reid, Kleanthis Avramidis, Mohammad Rostami, Shrikanth Narayanan, Pediatric Optic Nerve Investigator Group
Ophthalmology Science 4 (4), 100496, 2024

Scaling representation learning from ubiquitous ECG with state-space models

Kleanthis Avramidis, Dominika Kunc, Bartosz Perz, Kranti Adsul, Tiantian Feng, Przemysław Kazienko, Stanisław Saganowski, Shrikanth Narayanan
IEEE Journal of Biomedical and Health Informatics, 2024

Saliency analysis of eye tracking in children with cortical/cerebral visual impairment (CVI) enabled by machine learning

Melinda Chang, Kleanthis Avramidis, Rahul Sharma, Mark Borchert, Shrikanth Narayanan
Investigative Ophthalmology & Visual Science 65 (7), 1501-1501, 2024

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Tuo Zhang, Tiantian Feng, Yibin Ni, Mengqin Cao, Ruying Liu, Katharine Butler, Yanjun Weng, Mi Zhang, Shrikanth S Narayanan, Salman Avestimehr
arXiv preprint arXiv:2406.10318, 2024

Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling?

Tiantian Feng, Dimitrios Dimitriadis, Shrikanth Narayanan
arXiv preprint arXiv:2406.08800, 2024

Exploring speech foundation models for speaker diarization in child-adult dyadic interactions

Anfeng Xu, Kevin Huang, Tiantian Feng, Lue Shen, Helen Tager-Flusberg, Shrikanth Narayanan
arXiv preprint arXiv:2406.07890, 2024

Toward fully-end-to-end listened speech decoding from EEG signals

Jihwan Lee, Aditya Kommineni, Tiantian Feng, Kleanthis Avramidis, Xuan Shi, Sudarsana Kadiri, Shrikanth Narayanan
arXiv preprint arXiv:2406.08644, 2024

Machine-learning-based prediction of client distress from session recordings

Patty B Kuo, Michael J Tanana, Simon B Goldberg, Derek D Caperton, Shrikanth Narayanan, David C Atkins, Zac E Imel
Clinical Psychological Science 12 (3), 435-446, 2024

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality

Tiantian Feng, Xuan Shi, Rahul Gupta, Shrikanth S Narayanan
arXiv preprint arXiv:2404.17983, 2024