This is the official repository contains the code, data, and models of the paper titled "XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags", accepted for publication in Findings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL’24).
Millions of news articles published online daily can overwhelm readers. Headlines and entity (topic) tags are essential for guiding readers to decide if the content is worth their time. While headline generation has been extensively studied, tag generation remains largely unexplored, yet it offers readers better access to topics of interest. The need for conciseness in capturing readers' attention necessitates improved content selection strategies for identifying salient and relevant segments within lengthy articles, thereby guiding language models effectively. To address this, we propose to leverage auxiliary information such as images and captions embedded in the articles to retrieve relevant sentences and utilize instruction tuning with variations to generate both headlines and tags for news articles in a multilingual context. To make use of the auxiliary information, we have compiled a dataset named XL-HeadTags, which includes 20 languages across 6 diverse language families. Through extensive evaluation, we demonstrate the effectiveness of our plug-and-play multimodal-multilingual retrievers for both tasks. Additionally, we have developed a suite of tools for processing and evaluating multilingual texts, significantly contributing to the research community by enabling more accurate and efficient analysis across languages.
Dataset used in this work is available here: XL-HeadTags
Models with Caption Retrieved K=5 (Top-K) can be found here:
Multilingual tools used in this work can be found here:
Code for this work will be added soon
Contents of this repository are restricted to only non-commercial research purposes under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0). Copyright of the dataset contents belongs to the original copyright holders.
If you find this work useful for your research, please consider citing:
@inproceedings{shohan-etal-2024-xl,
title = "{XL}-{H}ead{T}ags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags",
author = "Shohan, Faisal and
Nayeem, Mir Tafseer and
Islam, Samsul and
Akash, Abu Ubaida and
Joty, Shafiq",
editor = "Ku, Lun-Wei and
Martins, Andre and
Srikumar, Vivek",
booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
month = aug,
year = "2024",
address = "Bangkok, Thailand and virtual meeting",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.findings-acl.771",
pages = "12991--13024"
}
- Faisal Tareque Shohan ([email protected])
- Mir Tafseer Nayeem ([email protected])
- Samsul Islam ([email protected])
- Abu Ubaida Akash ([email protected])
- Shafiq Joty ([email protected])
- Mir Tafseer Nayeem is supported by Huawei Doctoral Fellowship.