WikiBank: Using Wikidata to improve multilingual frame-semantic parsing
Publication: Contribution to book/anthology/report › Conference contribution in proceedings › Research › peer-reviewed
Frame-semantic annotations exist for a tiny fraction of the world's languages. Wikidata, however, links knowledge base triples to texts in many languages, providing a common, distant supervision signal for semantic parsers. We present WIKIBANK, a multilingual resource of partial semantic structures that can be used to extend pre-existing resources rather than creating new man-made resources from scratch. We also integrate this form of supervision into an off-the-shelf frame-semantic parser and allow cross-lingual transfer. Using Google's SLING architecture, we show significant improvements on the English and Spanish CoNLL 2009 datasets, whether training on the full available datasets or small subsamples thereof.
Original language | English |
---|---|
Title | LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings |
Editors | Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis |
Publisher | European Language Resources Association (ELRA) |
Publication date | 2020 |
Pages | 4183-4189 |
ISBN (Electronic) | 9791095546344 |
Status | Published - 2020 |
Event | 12th International Conference on Language Resources and Evaluation, LREC 2020 - Marseille, France. Duration: 11 May 2020 → 16 May 2020 |
Conference
Conference | 12th International Conference on Language Resources and Evaluation, LREC 2020 |
---|---|
Country | France |
City | Marseille |
Period | 11/05/2020 → 16/05/2020 |
Sponsor | Amazon AWS, Bertin, Lenovo, Ontotex, Vecsys, Vocapia |
ID: 258332560