ACM Transactions on Asian Language Information Processing (TALIP), Volume 9 Issue 1, March 2010

Mining Synonymous Transliterations from the World Wide Web
Chung-Chian Hsu, Chien-Hsing Chen
Article No.: 1
DOI: 10.1145/1731035.1731036

The World Wide Web is considered an important source of information. Search engines can retrieve Web pages containing a great deal of information, including foreign information. However, to be better understood by local readers,...

Identification of Soundbite and Its Speaker Name Using Transcripts of Broadcast News Speech
Feifan Liu, Yang Liu
Article No.: 2
DOI: 10.1145/1731035.1731037

This article presents a pipeline framework for identifying soundbites and their speaker names from Mandarin broadcast news transcripts. Both modules, soundbite segment detection and soundbite speaker name recognition, are based on a...

Inducing Morphemes Using Light Knowledge
Michael Tepper, Fei Xia
Article No.: 3
DOI: 10.1145/1731035.1731038

Allomorphic variation, or form variation among morphs with the same meaning, is a stumbling block to morphological induction (MI). To address this problem, we present a hybrid approach that uses a small amount of linguistic knowledge in the form...

A Reexamination of MRD-Based Word Sense Disambiguation
Timothy Baldwin, Sunam Kim, Francis Bond, Sanae Fujita, David Martinez, Takaaki Tanaka
Article No.: 4
DOI: 10.1145/1731035.1731039

This article reconsiders the task of MRD-based word sense disambiguation, extending the basic Lesk algorithm to investigate the impact on WSD performance of different tokenization schemes and methods of definition extension. In experimentation...