20919 |
Creator |
2515c15e5a8e5ef71a6e3a3c05d159fc |
20919 |
Creator |
c0c66ab72293744b73fed6799b0e2a58 |
20919 |
Date |
2010-05-18 |
20919 |
Is Part Of |
repository |
20919 |
abstract |
We describe the construction of the CODA corpus, a parallel corpus of monologues and
expository dialogues. The dialogue part of the corpus consists of expository, i.e.,
information-delivering rather than dramatic, dialogues written by several acclaimed
authors. The monologue part of the corpus is a paraphrase in monologue form of these
dialogues by a human annotator. The corpus was constructed as a resource for extracting
rules for automated generation of dialogue from monologue. Using authored dialogues
allows us to analyse the techniques used by accomplished writers for presenting information
in the form of dialogue. The dialogues are annotated with dialogue acts and the monologues
with rhetorical structure. We developed annotation and translation guidelines together
with a custom-developed tool for carrying out translation, alignment and annotation. |
20919 |
authorList |
authors |
20919 |
presentedAt |
ext-0de059c06d2b3ae09936685407512d9b |
20919 |
status |
peerReviewed |
20919 |
uri |
http://data.open.ac.uk/oro/document/10402 |
20919 |
uri |
http://data.open.ac.uk/oro/document/12197 |
20919 |
uri |
http://data.open.ac.uk/oro/document/12444 |
20919 |
uri |
http://data.open.ac.uk/oro/document/4644 |
20919 |
type |
AcademicArticle |
20919 |
type |
Article |
20919 |
label |
Stoyanchev, Svetlana and Piwek, Paul (2010). Constructing the CODA corpus: A parallel
corpus ofmonologues and expository dialogues. In: The seventh international conference
on Language Resources and Evaluation (LREC) (Forthcoming), 18-21 May 2010, Malta. |
20919 |
label |
Stoyanchev, Svetlana and Piwek, Paul (2010). Constructing the CODA corpus: A
parallel corpus ofmonologues and expository dialogues. In: The seventh international
conference on Language Resources and Evaluation (LREC) (Forthcoming), 18-21 May 2010,
Malta. |
20919 |
Title |
Constructing the CODA corpus: A parallel corpus ofmonologues and expository dialogues |
20919 |
in dataset |
oro |