ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)


ACM Transactions on Asian Language Information Processing (TALIP), Volume 13 Issue 1, February 2014

Time for More Languages: Temporal Tagging of Arabic, Italian, Spanish, and Vietnamese
Jannik Strötgen, Ayser Armiti, Tran Van Canh, Julian Zell, Michael Gertz
Article No.: 1
DOI: 10.1145/2540989

Most of the research on temporal tagging so far is done for processing English text documents. There are hardly any multilingual temporal taggers supporting more than two languages. Recently, the temporal tagger HeidelTime has been made publicly...

Distortion Model Based on Word Sequence Labeling for Statistical Machine Translation
Isao Goto, Masao Utiyama, Eiichiro Sumita, Akihiro Tamura, Sadao Kurohashi
Article No.: 2
DOI: 10.1145/2537128

This article proposes a new distortion model for phrase-based statistical machine translation. In decoding, a distortion model estimates the source word position to be translated next (subsequent position; SP) given the last translated source word...

Cross-Lingual Annotation Projection for Weakly-Supervised Relation Extraction
Seokhwan Kim, Minwoo Jeong, Jonghoon Lee, Gary Geunbae Lee
Article No.: 3
DOI: 10.1145/2529994

Although researchers have conducted extensive studies on relation extraction in the last decade, statistical systems based on supervised learning are still limited, because they require large amounts of training data to achieve high performance...

Arabic Text Categorization Based on Arabic Wikipedia
Adnan Yahya, Ali Salhi
Article No.: 4
DOI: 10.1145/2537129

This article describes an algorithm for categorizing Arabic text, relying on highly categorized corpus-based datasets obtained from the Arabic Wikipedia by using manual and automated processes to build and customize categories. The categorization...

TALIP Perspectives, Guest Editorial Commentary: What Counts (and What Ought to Count)?
Kenneth Church
Article No.: 5
DOI: 10.1145/2559789