(43) Automatic extraction of bilingual word pairs using inductive chain learning
in various languages
Information Processing and Management, Elsevier, Vol. 42, No. 5, pp. 1294-1315, 2006-9
In this paper, we propose a new learning method for extracting bilingual
word pairs from parallel corpora in various languages. In cross-language
information retrieval, the system must deal with various languages. Therefore,
automatic extraction of bilingual word pairs from parallel corpora in various
languages is important. However, previous work based on statistical methods
is insufficient because of the sparse data problem. Our learning method
automatically acquires rules that are effective in solving the sparse data
problem, using only parallel corpora and without any prior preparation of a bilingual
resource. We call this learning method Inductive Chain Learning (ICL). Evaluation
experiments demonstrated that the recall of systems based on several statistical
approaches was improved through the use of ICL.