subject predicate object context
47284 Creator 2515c15e5a8e5ef71a6e3a3c05d159fc
47284 Creator ext-8f576a85fed2b79f8e55a2f918653196
47284 Date 2016-09-08
47284 Is Part Of repository
47284 abstract Question generation (QG) is the problem of automatically generating questions from inputs such as declarative sentences. The Shared Evaluation Task Challenge (QG-STEC) Task B that took place in 2010 evaluated several state-of-the-art QG systems. However, analysis of the evaluation results was affected by low inter-rater reliability. We adapted Nonaka & Takeuchi’s knowledge creation cycle to the task of improving the evaluation annotation guidelines with a preliminary test showing clearly improved inter-rater reliability.
47284 authorList authors
47284 editorList editors
47284 presentedAt ext-8b45b3a35ce79f146fba6707d2aec01a
47284 status peerReviewed
47284 uri http://data.open.ac.uk/oro/document/503002
47284 uri http://data.open.ac.uk/oro/document/503003
47284 uri http://data.open.ac.uk/oro/document/503008
47284 uri http://data.open.ac.uk/oro/document/503009
47284 uri http://data.open.ac.uk/oro/document/503010
47284 uri http://data.open.ac.uk/oro/document/503011
47284 uri http://data.open.ac.uk/oro/document/514016
47284 type AcademicArticle
47284 type Article
47284 label Godwin, Keith and Piwek, Paul (2016). Collecting Reliable Human Judgements on Machine-Generated Language: The Case of the QG-STEC Data. In: Proceedings of the 9th International Natural Language Generation Conference (Isard, Amy; Rieser, Verena and Gkatzia, Dimitra eds.), Association for Computational Linguistics, Edinburgh, pp. 212–216.
47284 Publisher ext-485e271a88ed871bf71e8d3433784f79
47284 Title Collecting Reliable Human Judgements on Machine-Generated Language: The Case of the QG-STEC Data
47284 in dataset oro