PD3: Better Low-Resource Cross-Lingual Transfer By Combining Direct Transfer and Annotation Projection

Abstract

We consider unsupervised cross-lingual transfer on two tasks, viz., sentence-level argumentation mining and standard POS tagging. We combine direct transfer using bilingual embeddings with annotation projection, which projects labels across unlabeled parallel data. We do so by either merging respective source and target language datasets or alternatively by using multi-task learning. Our combination strategy considerably improves upon both direct transfer and projection with few available parallel sentences, the most realistic scenario for many low-resource target languages.

Bibtex

@inproceedings{eger-etal-2018-pd3,
    title = "{PD}3: Better Low-Resource Cross-Lingual Transfer By Combining Direct Transfer and Annotation Projection",
    author = {Eger, Steffen  and
      R{\"u}ckl{\'e}, Andreas  and
      Gurevych, Iryna},
    booktitle = "Proceedings of the 5th Workshop on Argument Mining",
    month = nov,
    year = "2018",
    address = "Brussels, Belgium",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/W18-5216",
    doi = "10.18653/v1/W18-5216",
    pages = "131--143"
}