subject predicate object context
44998 Creator 56412434212aaec1af1a675d5f017f1e
44998 Creator a057dc830901e1d749423ba53785e145
44998 Creator b718ee3619ebe0ed07678edb8b16f80f
44998 Creator ext-4f79cf26d4b73801ce458ce9dffe9267
44998 Creator ext-d78da3531a4878c0b17a803ac4c4bdb2
44998 Date 2016-01-22
44998 Is Part Of p03029743
44998 Is Part Of repository
44998 abstract In domain-specific information retrieval (IR), an emerging problem is how to provide different users with documents that are both relevant and readable, especially for the lay users. In this paper, we propose a novel document readability model to enhance the domain-specific IR. Our model incorporates the coverage and sequential dependency of latent topics in a document. Accordingly, two topical readability indicators, namely Topic Scope and Topic Trace are developed. These indicators, combined with the classical Surface-level indicator, can be used to rerank the initial list of documents returned by a conventional search engine. In order to extract the structured latent topics without supervision, the hierarchical Latent Dirichlet Allocation (hLDA) is used. We have evaluated our model from the user-oriented and system-oriented perspectives, in the medical domain. The user-oriented evaluation shows a good correlation between the readability scores given by our model and human judgments. Furthermore, our model also gains significant improvement in the system-oriented evaluation in comparison with one of the state-of-the-art readability methods.
44998 authorList authors
44998 presentedAt ext-f4ee057a13de83412d3e3c03334d7def
44998 status peerReviewed
44998 uri http://data.open.ac.uk/oro/document/385743
44998 uri http://data.open.ac.uk/oro/document/385744
44998 uri http://data.open.ac.uk/oro/document/385745
44998 uri http://data.open.ac.uk/oro/document/385746
44998 uri http://data.open.ac.uk/oro/document/385747
44998 uri http://data.open.ac.uk/oro/document/385748
44998 uri http://data.open.ac.uk/oro/document/400426
44998 volume 9460
44998 type AcademicArticle
44998 type Article
44998 label Zhang, Wenya; Song, Dawei ; Zhang, Peng ; Zhao, Xiaozhao and Hou, Yuexian (2016). A Sequential Latent Topic-based Readability Model for Domain-Specific Information Retrieval. In: Information Retrieval Technology, Springer, pp. 241–252.
44998 label Zhang, Wenya; Song, Dawei ; Zhang, Peng ; Zhao, Xiaozhao and Hou, Yuexian (2016). A Sequential Latent Topic-based Readability Model for Domain-Specific Information Retrieval. In: Information Retrieval Technology, Springer, pp. 241–252.
44998 Publisher ext-1c5ddec173ca8cdfba8b274309638579
44998 Title A Sequential Latent Topic-based Readability Model for Domain-Specific Information Retrieval.
44998 in dataset oro