subject predicate object context
59244 Creator 20fbc73bc6c0afc98372cf072a54acc4
59244 Creator 2482a533b100c51b082644502f2b86e0
59244 Creator 6a5620b7e8134436ca39e792b4432945
59244 Creator 0ab97f84b0b21f6aab20f88012180fe1
59244 Creator ext-168d529095a262289df68cb7419a6aa3
59244 Date 2014-06-23
59244 Is Part Of repository
59244 abstract This paper addresses the problem of determining the best answer in Community-based Question Answering websites by focussing on the content. Previous research on this topic relies on the exploitation of community feedback on the answers, which involves rating of either users (e.g., reputation) or answers (e.g. scores manually assigned to answers). We propose a new technique that leverages the content/textual features of answers in a novel way. Our approach delivers better results than related linguistics-based solutions and manages to match rating-based approaches. More specifically, the gain in performance is achieved by rendering the values of these features into a discretised form. We also show how our technique manages to deliver equally good results in real-time settings, as opposed to having to rely on information not always readily available, such as user ratings and answer scores. We ran an evaluation on 21 StackExchange websites covering around 4 million questions and more than 8 million answers. We obtain 84% average precision and 70% recall, which shows that our technique is robust, effective, and widely applicable.
59244 authorList authors
59244 presentedAt ext-b7b2b7098400752b24e012d49c832895
59244 status peerReviewed
59244 type AcademicArticle
59244 type Article
59244 label Gkotsis, George ; Stepanyan, Karen ; Pedrinaci, Carlos ; Domingue, John and Liakata, Maria (2014). It's all in the content: state of the art best answer prediction based on discretisation of shallow linguistic features. In: ACM Web Science, 23-26 Jun 2014, Bloomington, Indiana, USA, pp. 202–210.
59244 label Gkotsis, George ; Stepanyan, Karen ; Pedrinaci, Carlos ; Domingue, John and Liakata, Maria (2014). It's all in the content: state of the art best answer prediction based on discretisation of shallow linguistic features. In: ACM Web Science, 23-26 Jun 2014, Bloomington, Indiana, USA, pp. 202–210.
59244 Title It's all in the content: state of the art best answer prediction based on discretisation of shallow linguistic features
59244 in dataset oro