Publications

This section contains publications published by ISI over the past several decades.

2024

Political-LLM: Large Language Models in Political Science

Lincan Li, Jiaqi Li, Catherine Chen, Fred Gui, Hongjia Yang, Chenxiao Yu, Zhengguang Wang, Jianing Cai, Junlong Aaron Zhou, Bolin Shen, Alex Qian, Weixin Chen, Zhongkai Xue, Lichao Sun, Lifang He, Hanjie Chen, Kaize Ding, Zijian Du, Fangzhou Mu, Jiaxin Pei, Jieyu Zhao, Swabha Swayamdipta, Willie Neiswanger, Hua Wei, Xiyang Hu, Shixiang Zhu, Tianlong Chen, Yingzhou Lu, Yang Shi, Lianhui Qin, Tianfan Fu, Zhengzhong Tu, Yuzhe Yang, Jaemin Yoo, Jiaheng Zhang, Ryan Rossi, Liang Zhan, Liang Zhao, Emilio Ferrara, Yan Liu, Furong Huang, Xiangliang Zhang, Lawrence Rothenberg, Shuiwang Ji, Philip S Yu, Yue Zhao, Yushun Dong
arXiv preprint arXiv:2412.06864,  2024

Word embedding for social sciences: An interdisciplinary survey

Akira Matsui, Emilio Ferrara
PeerJ Computer Science 10 (e2562),  2024

Phishing Email Detection Using Inputs From Artificial Intelligence

Mithün Paul, Genevieve Bartlett, Jelena Mirkovic, Marjorie Freedman
arXiv preprint arXiv:2405.12494,  2024

Synthetic data generation for machine learning models

R Gupta, N Mehrabi, P Goyal, K Chang, A Galstyan
US Patent App. 18/216,271,  2024

Learning Morphisms with Gauss-Newton Approximation for Growing Networks

Neal Lawton, Aram Galstyan, Greg Ver Steeg
arXiv preprint arXiv:2411.05855,  2024

Adaptive Video Understanding Agent: Enhancing efficiency with dynamic frame sampling and feedback-driven reasoning

Sullam Jeoung, Goeric Huybrechts, Bhavana Ganesh, Aram Galstyan, Sravan Bodapati
arXiv preprint arXiv:2410.20252,  2024

SeRA: Self-Reviewing and Alignment of Large Language Models using Implicit Reward Margins

Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh, Sailik Sengupta, Sravan Bodapati, Aram Galstyan
arXiv preprint arXiv:2410.09362,  2024

QuAILoRA: Quantization-Aware Initialization for LoRA

Neal Lawton, Aishwarya Padmakumar, Judith Gaspers, Jack FitzGerald, Anoop Kumar, Greg Ver Steeg, Aram Galstyan
arXiv preprint arXiv:2410.14713,  2024

Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification

Tao Meng, Ninareh Mehrabi, Palash Goyal, Anil Ramakrishna, Aram Galstyan, Richard Zemel, Kai-Wei Chang, Rahul Gupta, Charith Peris
arXiv preprint arXiv:2410.05559,  2024

Data advisor: Dynamic data curation for safety alignment of large language models

Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan
arXiv preprint arXiv:2410.05269,  2024

Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs

Elan Markowitz, Anil Ramakrishna, Jwala Dhamala, Ninareh Mehrabi, Charith Peris, Rahul Gupta, Kai-Wei Chang, Aram Galstyan
ACL'24,  2024

Proceedings of the 4th Workshop on Trustworthy Natural Language Processing (TrustNLP 2024)

Kai-Wei Chang, Anaelia Ovalle, Jieyu Zhao, Yang Trista Cao, Ninareh Mehrabi, Aram Galstyan, Jwala Dhamala, Anoop Kumar, Rahul Gupta
Proceedings of the 4th Workshop on Trustworthy Natural Language Processing …,  2024

Tokenization matters: Navigating data-scarce tokenization for gender inclusive language technologies

Anaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta
Findings of the Association for Computational Linguistics: NAACL 2024, 1739-1756,  2024

MICo: Preventative detoxification of large language models through inhibition control

Roy Siegelmann, Ninareh Mehrabi, Palash Goyal, Prasoon Goyal, Lisa Bauer, Jwala Dhamala, Aram Galstyan, Rahul Gupta, Reza Ghanadan
Findings of the Association for Computational Linguistics: NAACL 2024, 1696-1703,  2024

Agenda-Driven Question Generation: A Case Study in the Courtroom Domain

Yi Fung, Anoop Kumar, Aram Galstyan, Heng Ji, Prem Natarajan
Proceedings of the 2024 Joint International Conference on Computational …,  2024

q-Diffusion leverages the full dimensionality of gene coexpression in single-cell transcriptomics

Myrl G Marmarelis, Russell Littman, Francesca Battaglin, Donna Niedzwiecki, Alan Venook, Jose-Luis Ambite, Aram Galstyan, Heinz-Josef Lenz, Greg Ver Steeg
Communications Biology 7 (1), 400,  2024

Correcting Language Model Outputs by Editing Salient Layers

Kshitij Mishra, Tamer Soliman, Anil Ramakrishna, Aram Galstyan, Anoop Kumar
Findings of the Association for Computational Linguistics: EACL 2024, 1295-1305,  2024