Publications

Multilingual Meta-Distillation Alignment for Semantic Retrieval

Abstract

Multilingual semantic retrieval involves retrieving semantically relevant content to a query irrespective of the language. Compared to monolingual and bilingual semantic retrieval, multilingual semantic retrieval requires a stronger alignment approach to pull the contents to be retrieved close to the representation of their corresponding queries, no matter their language combinations. Traditionally, this is achieved through more supervision in the form of multilingual parallel resources, which are expensive to obtain, especially for low-resource languages. In this work, on top of an optimization-based Model-Agnostic Meta-Learner (MAML), we propose a data-efficient meta-distillation approach: MAML-Align,1 specifically for low-resource multilingual semantic retrieval. Our approach simulates a gradual feedback loop from monolingual to bilingual and from bilingual to multilingual semantic retrieval. We systematically …

Metadata

publication
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
year
2024
publication date
2024/7/10
authors
Meryem M'hamdi, Jonathan May, Franck Dernoncourt, Trung Bui, Seunghyun Yoon
link
https://dl.acm.org/doi/abs/10.1145/3626772.3657812
resource_link
https://dl.acm.org/doi/pdf/10.1145/3626772.3657812
book
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
pages
597-607