In this paper we present the Corpus of REcommendation STrength (CREST), a collection of HTML-formatted clinical guidelines annotated with the location of recommendations. Recommendations are labelled with an author-provided indicator of their strength of importance. As data was drawn from many disparate authors, we define a unified scheme of importance labels, and provide a mapping for each guideline. We demonstrate the utility of the corpus and its annotations in some initial measurements investigating the type of language construc- tions associated with strong and weak recommendations, and experiments into promising features for recommendation classification, both with respect to strong and weak labels, and to all labels of the unified scheme. An error analysis indicates that, while there is a strong relationship between lexical choices and strength labels, there can be substantial variance in the choices made by different authors.
|Publication status||Published - 23 May 2016|
|Event||10th Language Resources and Evaluation Conference - Portorož, Slovenia|
Duration: 23 May 2016 → 28 May 2016
|Conference||10th Language Resources and Evaluation Conference|
|Period||23/05/16 → 28/05/16|