Integrating Natural Language Processing
(NLP) and computer vision is a promising
effort. However, the applicability of these
methods directly depends on the availability
of specific multimodal data that includes
both images and text. In this paper, we
present a multimodal corpus
of comparable documents and their images
in 9 languages, collected from web news articles
on the Euronews website.1 This corpus
has found widespread use in the NLP community
for multilingual and multimodal
tasks. Here, we focus on the acquisition
of the image and text data and their multilingual
alignment.