figshare
Browse
file.pdf (246.75 kB)

A Corpus and Model Integrating Multiword Expressions and Supersenses

Download (246.75 kB)
journal contribution
posted on 2015-06-01, 00:00 authored by Nathan Schneider, Noah A. Smith

This paper introduces a task of identifying and semantically classifying lexical expressions in running text. We investigate the online reviews genre, adding semantic supersense annotations to a 55,000 word English corpus that was previously annotated for multiword expressions. The noun and verb supersenses apply to full lexical expressions, whether single- or multiword. We then present a sequence tagging model that jointly infers lexical expressions and their supersenses. Results show that even with our relatively small training corpus in a noisy domain, the joint task can be performed to attain 70% class labeling F1.

History

Publisher Statement

Copyright 2015 Association for Computational Linguistics

Date

2015-06-01

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC