An efficient algorithm for easy-first non-directional dependency parsing

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    150 Scopus citations

    Abstract

    We present a novel deterministic dependency parsing algorithm that attempts to create the easiest arcs in the dependency structure first in a non-directional manner. Traditional deterministic parsing algorithms are based on a shift-reduce framework: they traverse the sentence from left-to-right and, at each step, perform one of a possible set of actions, until a complete tree is built. A drawback of this approach is that it is extremely local: while decisions can be based on complex structures on the left, they can look only at a few words to the right. In contrast, our algorithm builds a dependency tree by iteratively selecting the best pair of neighbours to connect at each parsing step. This allows incorporation of features from already built structures both to the left and to the right of the attachment point. The parser learns both the attachment preferences and the order in which they should be performed. The result is a deterministic, best-first, O(nlogn) parser, which is significantly more accurate than best-first transition based parsers, and nears the performance of globally optimized parsing models.

    Original languageEnglish
    Title of host publicationNAACL HLT 2010 - Human Language Technologies
    Subtitle of host publicationThe 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Main Conference
    Pages742-750
    Number of pages9
    StatePublished - 1 Dec 2010
    Event2010 Human Language Technologies Conference ofthe North American Chapter of the Association for Computational Linguistics, NAACL HLT 2010 - Los Angeles, CA, United States
    Duration: 2 Jun 20104 Jun 2010

    Publication series

    NameNAACL HLT 2010 - Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Main Conference

    Conference

    Conference2010 Human Language Technologies Conference ofthe North American Chapter of the Association for Computational Linguistics, NAACL HLT 2010
    Country/TerritoryUnited States
    CityLos Angeles, CA
    Period2/06/104/06/10

    ASJC Scopus subject areas

    • Language and Linguistics
    • Linguistics and Language

    Fingerprint

    Dive into the research topics of 'An efficient algorithm for easy-first non-directional dependency parsing'. Together they form a unique fingerprint.

    Cite this