WikiBank: Using Wikidata to improve multilingual frame-semantic parsing
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
Frame-semantic annotations exist for only a tiny fraction of the world's languages. Wikidata, however, links knowledge base triples to texts in many languages, providing a common, distant supervision signal for semantic parsers. We present WIKIBANK, a multilingual resource of partial semantic structures that can be used to extend pre-existing resources rather than creating new man-made resources from scratch. We also integrate this form of supervision into an off-the-shelf frame-semantic parser and allow cross-lingual transfer. Using Google's SLING architecture, we show significant improvements on the English and Spanish CoNLL 2009 datasets, whether training on the full available datasets or on small subsamples thereof.
Original language | English |
---|---|
Title of host publication | LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings |
Editors | Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis |
Publisher | European Language Resources Association (ELRA) |
Publication date | 2020 |
Pages | 4183-4189 |
ISBN (Electronic) | 9791095546344 |
Publication status | Published - 2020 |
Event | 12th International Conference on Language Resources and Evaluation, LREC 2020 - Marseille, France. Duration: 11 May 2020 → 16 May 2020 |
Conference
Conference | 12th International Conference on Language Resources and Evaluation, LREC 2020 |
---|---|
Country | France |
City | Marseille |
Period | 11/05/2020 → 16/05/2020 |
Sponsor | Amazon AWS, Bertin, Lenovo, Ontotex, Vecsys, Vocapia |
Research areas
- Cross-lingual frame semantic parsing, Data augmentation, Multilinguality