Repository | Book | Chapter

184229

Identifying verb-preposition multi-category words in Chinese-English patent machine translation

Hongzheng Li , Yun Zhu , Yaohong Jin

pp. 409-421

Multi-category words are widely distributed in Chinese patent documents, and identification of them has been one of the difficulties in machine translation (MT). This paper proposes a rule-based method for identifying verb and preposition multi-category words in Chinese-English patent machine translation. Based on principles of boundary perception and according to syntactic and semantic information of multi-category words as well as context information, some reliable disambiguation rules are designed to help the MT system analyze proper categories of words, then proposes adverbial and predicate identification rules to determine and identify the words further. Related experiments and BLEU evaluations show that the method is efficient to recognize verbs and prepositions better, and is also helpful to improve final translation quality.

Publication details

DOI: 10.1007/978-3-319-14803-8_32

Full citation:

Li, H. , Zhu, Y. , Jin, Y. (2015)., Identifying verb-preposition multi-category words in Chinese-English patent machine translation, in M. Randall (ed.), Artificial life and computational intelligence, Dordrecht, Springer, pp. 409-421.

This document is unfortunately not available for download at the moment.