Publications
This section contains publications published by ISI over the past several decades.
2024
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Fei Wang, Xingyu Fu, James Y Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen
ICLR 2025, 2024
Contrastive Instruction Tuning
Tianyi Yan, Fei Wang, James Y Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen
ACL 2024 (Findings), 2024
DeAL: Decoding-time Alignment for Large Language Models
James Y Huang, Sailik Sengupta, Daniele Bonadiman, Yi-an Lai, Arshit Gupta, Nikolaos Pappas, Saab Mansour, Katrin Kirchoff, Dan Roth
arXiv preprint arXiv:2402.06147, 2024
Modeling and Simulating Agent-Based City Migration Using Conway's Game of Life
Bruce Deng, Mayank Kejriwal
arXiv preprint arXiv:2412.20691, 2024
Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models
Zhisheng Tang, Mayank Kejriwal
arXiv preprint arXiv:2412.15501, 2024
The plausibility machine commonsense (PMC) dataset: A massively crowdsourced human-annotated dataset for studying plausibility in large language models
Navapat Nananukul, Ke Shen, Mayank Kejriwal
Data in Brief 57, 110869, 2024
Can AI have common sense? Finding out will be key to achieving machine intelligence
Mayank Kejriwal, Henrique Santos, Alice M Mulvehill, Ke Shen, Deborah L McGuinness, Henry Lieberman
Nature 634 (8033), 291-294, 2024
A Q-learning Novelty Search Strategy for Evaluating Robustness of Deep Reinforcement Learning in Open-world Environments
Shafkat Islam, Min-Hsueh Chiu, Trevor Bonjour, Ruy de Oliveira, Bharat Bhargava, Mayank Kejriwal
IEEE Intelligent Systems, 2024
SelECT-SQL: Self-correcting ensemble Chain-of-Thought for Text-to-SQL
Ke Shen, Mayank Kejriwal
arXiv preprint arXiv:2409.10007, 2024
Defining and Evaluating Decision and Composite Risk in Language Models Applied to Natural Language Inference
Ke Shen, Mayank Kejriwal
arXiv preprint arXiv:2408.01935, 2024
PokerOWL: A Multi-Agent Poker Environment for Implementing and Evaluating Open-World Learning
Min-Hsueh Chiu, Mayank Kejriwal
2024
An Investigation of Marxist Alienation in the Postmodern Workplace in Apple TV's Severance
Mayank Kejriwal
Reintegrating Severance: Interdisciplinary Insights on Apple TV’s Dystopian …, 2024
A Semantic Search Engine for Helping Patients Find Doctors and Locations in a Large Healthcare Organization
Mayank Kejriwal, Hamid Haidarian, Min-Hsueh Chiu, Andy Xiang, Deep Shrestha, Faizan Javed
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning
Zhisheng Tang, Mayank Kejriwal
arXiv preprint arXiv:2407.01892, 2024
Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions
Yongyi Ji, Zhisheng Tang, Mayank Kejriwal
arXiv preprint arXiv:2406.12216, 2024
Advancing computational sustainability in higher education
Mayank Kejriwal, Victoria Petryshyn
Nature Computational Science, 1-2, 2024
HALO: an ontology for representing and categorizing hallucinations in large language models
Navapat Nananukul, Mayank Kejriwal
Disruptive Technologies in Information Sciences VIII 13058, 86-100, 2024
Challenges, evaluation and opportunities for open-world learning
Mayank Kejriwal, Eric Kildebeck, Robert Steininger, Abhinav Shrivastava
Nature Machine Intelligence 6 (6), 580-588, 2024
An evaluation of estimative uncertainty in large language models
Zhisheng Tang, Ke Shen, Mayank Kejriwal
arXiv preprint arXiv:2405.15185, 2024
Cost-efficient prompt engineering for unsupervised entity resolution
Navapat Nananukul, Khanin Sisaengsuwanchai, Mayank Kejriwal
2024