Abstract
In this paper we present the Corpus of REcommendation STrength (CREST), a collection of HTML-formatted clinical guidelines
annotated with the location of recommendations. Recommendations are labelled with an author-provided indicator of their strength of
importance. As data was drawn from many disparate authors, we define a unified scheme of importance labels, and provide a mapping
for each guideline.
We demonstrate the utility of the corpus and its annotations in some initial measurements investigating the type of language construc-
tions associated with strong and weak recommendations, and experiments into promising features for recommendation classification,
both with respect to strong and weak labels, and to all labels of the unified scheme. An error analysis indicates that, while there is a
strong relationship between lexical choices and strength labels, there can be substantial variance in the choices made by different authors.
Original language | English |
---|---|
Publication status | Published - 23 May 2016 |
Event | 10th Language Resources and Evaluation Conference - Portorož, Slovenia Duration: 23 May 2016 → 28 May 2016 |
Conference
Conference | 10th Language Resources and Evaluation Conference |
---|---|
Country/Territory | Slovenia |
City | Portorož |
Period | 23/05/16 → 28/05/16 |