23402 |
Creator |
03d5fecfafde0654fe3259b85bdc30c6 |
23402 |
Creator |
3c9bbcc5b9dec5879be1be1348991a7e |
23402 |
Creator |
c79cc1129e553cdca30fa856443ac46e |
23402 |
Creator |
ext-6acd29960d32bdf43b46b612cc160e38 |
23402 |
Date |
2010 |
23402 |
Is Part Of |
repository |
23402 |
abstract |
The objective of Eurogene is to collect a critical mass of educational content in
the field of human genetics in nine European languages and to build a platform that
will support the retrieval, sharing and navigation of the learning content. The
Eurogene platform is already operational and is being used by the genetics community.
In this paper, a part of the Eurogene platform related to the retrieval and machine
translation of domain-specific content is described. Our contribution lies in an approach
for domain-specific adaptation of cross-language information retrieval (CLIR) and machine
translation (MT). The CLIR system is based on a multilingual domain ontology which
is also used as a synchronization component between CLIR and MT. The MT system is
adapted to the target domain using the terminology represented in the ontology and
using statistical training performed on a collection of parallel texts. In the statistical
training phase, new translations of a term can be discovered and used to update the
ontology. The paper is organized as follows. First, we describe the motivation for
our approach and the multilingual domain ontology. Then we discuss the CLIR and MT
components, their domain adaptation and their synchronization. |
23402 |
authorList |
authors |
23402 |
editorList |
editors |
23402 |
presentedAt |
ext-fdb9c6645bbf277daa4f2a5f8f86c1e8 |
23402 |
status |
peerReviewed |
23402 |
uri |
http://data.open.ac.uk/oro/document/11037 |
23402 |
uri |
http://data.open.ac.uk/oro/document/14896 |
23402 |
uri |
http://data.open.ac.uk/oro/document/15876 |
23402 |
uri |
http://data.open.ac.uk/oro/document/6027 |
23402 |
type |
AcademicArticle |
23402 |
type |
Article |
23402 |
label |
Knoth, Petr ; Collins, Trevor ; Sklavounou, Elsa and Zdrahal, Zdenek (2010). EUROGENE:
multilingual retrieval and machine translation applied to human genetics. In: Advances
in Information Retrieval: 32nd European Conference on IR Research, ECIR 2010, Milton
Keynes, UK, March 28-31, 2010. Proceedings (Gurrin, Cathal; He, Yulan; Kazai, Gabriella;
Kruschwitz, Udo; Little, Suzanne; Roelleke, Thomas; Rüger, Stefan and van Rijsbergen,
Keith eds.), Lecture Notes in Computer Science, Springer, Berlin, pp. 670–671. |
23402 |
Publisher |
ext-1c5ddec173ca8cdfba8b274309638579 |
23402 |
Title |
EUROGENE: multilingual retrieval and machine translation applied to human genetics |
23402 |
in dataset |
oro |