ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)


ACM Transactions on Asian Language Information Processing (TALIP), Volume 1 Issue 1, March 2002

Kam-Fai Wong, Jun'ichi Tsujii
Pages: 1-2
DOI: 10.1145/595576.595577

Toward a unified approach to statistical language modeling for Chinese
Jianfeng Gao, Joshua Goodman, Mingjing Li, Kai-Fu Lee
Pages: 3-33
DOI: 10.1145/595576.595578
This article presents a unified approach to Chinese statistical language modeling (SLM). Applying SLM techniques like trigram language models to Chinese is challenging because (1) there is no standard definition of words in Chinese; (2) word...

Meaningful term extraction and discriminative term selection in text categorization via unknown-word methodology
Yu-Sheng Lai, Chung-Hsien Wu
Pages: 34-64
DOI: 10.1145/595576.595579
In this article, an approach based on unknown words is proposed for meaningful term extraction and discriminative term selection in text categorization. For meaningful term extraction, a phrase-like unit (PLU)-based likelihood ratio is proposed to...

Morpheme-based grapheme to phoneme conversion using phonetic patterns and morphophonemic connectivity information
Byeongchang Kim, Gary Geunbae Lee, Jong-Hyeok Lee
Pages: 65-82
DOI: 10.1145/595576.595580
Both dictionary-based and rule-based methods for grapheme-to-phoneme conversion have their own advantages and limitations. For example, a large-sized phonetic dictionary and complex morphophonemic rules are required for the dictionary-based method and...

Using tone information in Cantonese continuous speech recognition
Tan Lee, Wai Lau, Y. W. Wong, P. C. Ching
Pages: 83-102
DOI: 10.1145/595576.595581
In Chinese languages, tones carry important information at various linguistic levels. This research is based on the belief that tone information, if acquired accurately and utilized effectively, contributes to the automatic speech recognition of...