Asian and Low-Resource Language Information Processing (TALLIP)


Search Issue
enter search term and/or author name


ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) - Special Issue on Chinese Spell Checking, Volume 14 Issue 4, October 2015

Introduction to the Special Issue on Chinese Spell Checking
Lung-Hao Lee, Gina-Anne Levow, Shih-Hung Wu, Chao-Lin Liu
Article No.: 14
DOI: 10.1145/2818354

This special issue contains four articles based on and expanded from systems presented at the SIGHAN-7 Chinese Spelling Check Bakeoff. We provide an overview of the approaches and designs for Chinese spelling checkers presented in these articles....

A Probabilistic Framework for Chinese Spelling Check
Kuan-Yu Chen, Hsin-Min Wang, Hsin-Hsi Chen
Article No.: 15
DOI: 10.1145/2826234

Chinese spelling check (CSC) is still an unsolved problem today since there are many homonymous or homomorphous characters. Recently, more and more CSC systems have been proposed. To the best of our knowledge, language modeling is one of...

A Hybrid Ranking Approach to Chinese Spelling Check
Xiaodong Liu, Fei Cheng, Kevin Duh, Yuji Matsumoto
Article No.: 16
DOI: 10.1145/2822264

We propose a novel framework for Chinese Spelling Check (CSC), which is an automatic algorithm to detect and correct Chinese spelling errors. Our framework contains two key components: candidate generation and candidate ranking. Our...

Chinese Spelling Checker Based on an Inverted Index List with a Rescoring Mechanism
Jui-Feng Yeh, Wen-Yi Chen, Mao-Chuan Su
Article No.: 17
DOI: 10.1145/2826235

An approach is proposed for Chinese spelling error detection and correction, in which an inverted index list with a rescoring mechanism is used. The inverted index list is a structure for mapping from word to desired sentence, and for representing...

Correcting Chinese Spelling Errors with Word Lattice Decoding
Yu-Ming Hsieh, Ming-Hong Bai, Shu-Ling Huang, Keh-Jiann Chen
Article No.: 18
DOI: 10.1145/2791389

Chinese spell checkers are more difficult to develop because of two language features: 1) there are no word boundaries, and a character may function as a word or a word morpheme; and 2) the Chinese character set contains more than ten thousand...

Section: TALLIP Perspectives, Editorial Commentary

TALLIP Perspectives: Editorial Commentary: The State of the Journal

Article No.: 19
DOI: 10.1145/2823512