subject predicate object context
20919 Creator 2515c15e5a8e5ef71a6e3a3c05d159fc
20919 Creator c0c66ab72293744b73fed6799b0e2a58
20919 Date 2010-05-18
20919 Is Part Of repository
20919 abstract We describe the construction of the CODA corpus, a parallel corpus of monologues and expository dialogues. The dialogue part of the corpus consists of expository, i.e., information-delivering rather than dramatic, dialogues written by several acclaimed authors. The monologue part of the corpus is a paraphrase in monologue form of these dialogues by a human annotator. The corpus was constructed as a resource for extracting rules for automated generation of dialogue from monologue. Using authored dialogues allows us to analyse the techniques used by accomplished writers for presenting information in the form of dialogue. The dialogues are annotated with dialogue acts and the monologues with rhetorical structure. We developed annotation and translation guidelines together with a custom-developed tool for carrying out translation, alignment and annotation.
20919 authorList authors
20919 presentedAt ext-0de059c06d2b3ae09936685407512d9b
20919 status peerReviewed
20919 uri http://data.open.ac.uk/oro/document/10402
20919 uri http://data.open.ac.uk/oro/document/12197
20919 uri http://data.open.ac.uk/oro/document/12444
20919 uri http://data.open.ac.uk/oro/document/4644
20919 type AcademicArticle
20919 type Article
20919 label Stoyanchev, Svetlana and Piwek, Paul (2010). Constructing the CODA corpus: A parallel corpus of monologues and expository dialogues. In: The seventh international conference on Language Resources and Evaluation (LREC) (Forthcoming), 18-21 May 2010, Malta.
20919 Title Constructing the CODA corpus: A parallel corpus of monologues and expository dialogues
20919 in dataset oro