A Boundary-Oriented Chinese Segmentation Method Using NGram Mutual Information

Tang, Ling-Xiang, Geva, Shlomo, Trotman, Andrew, & Xu, Yue (2010) A Boundary-Oriented Chinese Segmentation Method Using NGram Mutual Information. In Sun, L & Chen, K.J. (Eds.) Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, Chinese Information Processing Society of China, China.

[img] Published Version (PDF 485kB)
Administrators only | Request a copy from author

Abstract

This paper describes our participation in the Chinese word segmentation task of CIPS-SIGHAN 2010. We implemented an n-gram mutual information (NGMI) based segmentation algorithm with the mixed-up features from unsupervised, supervised and dictionarybased segmentation methods. This algorithm is also combined with a simple strategy for out-of-vocabulary (OOV) word recognition. The evaluation for both open and closed training shows encouraging results of our system. The results for OOV word recognition in closed training evaluation were however found unsatisfactory.

Impact and interest:

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 80001
Item Type: Conference Paper
Refereed: Yes
Keywords: Translation; Chinese Segmentation, Boundary-Oriented Segmentation, Chinese language, N-Gram Mutual Information
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100)
Divisions: Current > Schools > School of Electrical Engineering & Computer Science
Current > QUT Faculties and Divisions > Science & Engineering Faculty
Deposited On: 12 Jan 2015 04:44
Last Modified: 27 Mar 2015 03:42

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page