Toward a unified approach to statistical language modeling for Chinese
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu Lee
This article presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) there is no standard definition of words in Chinese; (2) word...
Meaningful term extraction and discriminative term selection in text categorization via unknown-word methodology
Yu-Sheng Lai, Chung-Hsien Wu
In this article, an approach based on unknown words is proposed for meaningful term extraction and discriminative term selection in text categorization. For meaningful term extraction, a phrase-like unit (PLU)-based likelihood ratio is proposed to...
Morpheme-based grapheme to phoneme conversion using phonetic patterns and morphophonemic connectivity information
Byeongchang Kim, Gary Geunbae Lee, Jong-Hyeok Lee
Both dictionary-based and rule-based methods for grapheme-to-phoneme conversion have their own advantages and limitations. For example, a large-sized phonetic dictionary and complex morphophonemic rules are required for the dictionary-based method and...
Using tone information in Cantonese continuous speech recognition
Tan Lee, Wai Lau, Y. W. Wong, P. C. Ching
In Chinese languages, tones carry important information at various linguistic levels. This research is based on the belief that tone information, if acquired accurately and utilized effectively, contributes to the automatic speech recognition of...