Data

The Parallel Meaning Bank (PMB) is under continuous development. Stable versions are provided as the releases listed below. We provide three types of annotated data: Discourse Representation Structures, Semantic Tags, and Semantic Derivations (coming soon).


Discourse Representation Structures

version  # en    # de   # it   # nl   silver  bronze  release date  full    sample
4.0.0    10,715  2,844  1,686  1,467  yes     yes     22-10-2021    2.8 GB  1 MB
3.0.0    8,403   1,979  1,062  1,012  yes     yes     12-02-2020    1.9 GB  1 MB
2.2.0    5,929   1,419  724    633    yes     yes     20-12-2018    1.4 GB  NA
2.1.0    4,555   1,175  635    586    yes     no      07-06-2018    314 MB  NA
2.0.0    3,925   1,048  568    527    yes     no      25-04-2018    282 MB  NA
1.0.0    2,049   641    387    394    no      no      22-12-2017    11 MB   NA

Note: If you only need the input (raw) and output (clf, sbn) files for DRS parsing, you do not need to download the full release. Go to the release page and select the exp_data zip file for the release you want, e.g. exp_data_4.0.0.zip for the latest release.
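Once an exp_data zip file has been downloaded, it can help to check which raw, clf, and sbn files it contains before unpacking anything. The following is a minimal sketch in Python using only the standard library. The function name group_members_by_suffix is hypothetical, and the assumption that the relevant files can be recognised by names ending in .raw, .clf, or .sbn is ours; the actual layout inside the archive may differ per release, so adjust the suffixes as needed.

```python
import zipfile
from collections import defaultdict


def group_members_by_suffix(zip_path, suffixes=(".raw", ".clf", ".sbn")):
    """Return {suffix: [member names]} for archive members ending in a given suffix.

    The suffixes are an assumption about how the files are named; members
    that match none of them (and directory entries) are ignored.
    """
    groups = defaultdict(list)
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                # Directory entry, not a file.
                continue
            for suffix in suffixes:
                if name.endswith(suffix):
                    groups[suffix].append(name)
                    break
    return dict(groups)
```

For example, calling group_members_by_suffix("exp_data_4.0.0.zip") on a locally saved copy of the archive returns a dictionary whose keys are the suffixes that were actually found, which gives a quick overview before extracting selected members with zipfile.ZipFile.extract.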

If you have used the DRS data, please cite the following paper:

Lasha Abzianidze, Johannes Bjerva, Kilian Evang, Hessel Haagsma, Rik van Noord, Pierre Ludmann, Duc-Duy Nguyen, Johan Bos (2017): The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp 242–247, Valencia, Spain. [PDF] [BibTeX]


Universal Semantic Tags

version  # en    # de   # it   # nl   silver inc.  release date  download
0.2.0    14,129  2,924  1,741  1,354  yes          16-01-2024    36 MB ZIP file
0.1.0    5,438   0      0      0      yes          01-05-2018    19 MB ZIP file

If you have used the data labelled with the semantic tags, please cite the following paper:

Lasha Abzianidze, Johan Bos (2017): Towards Universal Semantic Tagging. Proceedings of the 12th International Conference on Computational Semantics (IWCS 2017) -- Short Papers, pp 1–6, Montpellier, France. [PDF] [BibTeX]