Semi-supervised Autoencoding Projective Dependency Parsing

Xiao Zhang     Dan Goldwasser    
International Conference on Computational Linguistics (COLING), 2020


We describe two end-to-end autoencoding models for semi-supervised graph-based projective dependency parsing. The first model is a Locally Autoencoding Parser (LAP), which encodes the input sequentially using continuous latent variables; the second is a Globally Autoencoding Parser (GAP), which encodes the input into dependency trees as latent variables, with exact inference. Both models consist of two parts: an encoder, enhanced by deep neural networks (DNNs), that uses contextual information to encode the input into latent variables, and a generative decoder that reconstructs the input. LAP and GAP share a unified structure in which labeled and unlabeled data use different loss functions but share parameters. We conducted experiments on the WSJ and UD dependency parsing data sets, showing that our models can exploit unlabeled data to improve performance when only a limited amount of labeled data is available, and that they outperform a previously proposed semi-supervised model.
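
For readers unfamiliar with the term, the "projective" constraint mentioned in the abstract can be illustrated with a small check. This is only a sketch of the standard definition (no two arcs cross when the words are laid out in order, with an artificial root at position 0); it is not taken from the paper, and the function name `is_projective` and the head-list encoding are assumptions made for illustration.

```python
from typing import List


def is_projective(heads: List[int]) -> bool:
    """Return True if no two dependency arcs cross.

    Assumed encoding (hypothetical, for illustration only):
    heads[i] is the head of token i + 1, tokens are numbered from 1,
    and 0 denotes the artificial root placed before the sentence.
    The function does not verify that `heads` forms a valid tree.
    """
    # Represent each arc by its left and right endpoints.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]

    for l1, r1 in arcs:
        for l2, r2 in arcs:
            # Two arcs cross when one starts strictly inside the other
            # and ends strictly outside it.
            if l1 < l2 < r1 < r2:
                return False
    return True


# Token 2 is the root; tokens 1 and 3 attach to it: no crossing arcs.
print(is_projective([2, 0, 2]))

# Arc 1-3 crosses arc 2-4, so this tree is non-projective.
print(is_projective([3, 4, 0, 3]))
```

The quadratic pairwise comparison is chosen for readability; it is adequate for sentence-length inputs.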

Bib Entry

    author = "Xiao Zhang and Dan Goldwasser",
    title = "Semi-supervised Autoencoding Projective Dependency Parsing",
    booktitle = "International Conference on Computational Linguistics (COLING)",
    year = "2020"