OPT: Oslo—Potsdam—Teesside Pipelining Rules, Rankers, and Classifier Ensembles for Shallow Discourse Parsing

Stephan Oepen, Jonathon Read, Tatjana Scheffler, Uladzimir Sidarenka, Manfed Stede, Erik Velldal, Lilja Øvrelid

    Research output: Contribution to conferencePaperpeer-review

    135 Downloads (Pure)

    Abstract

    The OPT submission to the Shared Task of the 2016 Conference on Natural Language Learning (CoNLL) implements a ‘classic’ pipeline architecture, combining binary classification of (candidate) explicit connectives, heuristic rules for non-explicit discourse relations, ranking and ‘editing’ of syntactic constituents for argument identification, and an ensemble of classifiers to assign discourse senses. With an end-to-end performance of 27.77 F1 on the English ‘blind’ test data, our system advances the previous state of the art (Wang & Lan, 2015) by close to four F1 points, with particularly good results for the argument identification sub-tasks. OPT system results appear more competitive on the new, ‘blind’ test data than on the ‘test’ and ‘development’ sections of the Penn Discourse Treebank (PDTB; Prasad et al., 2008), which may indicate reduced over-fitting to specific properties of the venerable Wall Street Journal (WSJ) text underlying the PDTB.
    Original languageEnglish
    Publication statusPublished - 11 Aug 2016
    Event20th Conference on Computational Natural Language Learning - Berlin, Germany
    Duration: 11 Aug 201612 Aug 2016

    Conference

    Conference20th Conference on Computational Natural Language Learning
    Country/TerritoryGermany
    CityBerlin
    Period11/08/1612/08/16

    Fingerprint

    Dive into the research topics of 'OPT: Oslo—Potsdam—Teesside Pipelining Rules, Rankers, and Classifier Ensembles for Shallow Discourse Parsing'. Together they form a unique fingerprint.

    Cite this