ACM DL

Asian and Low-Resource Language Information Processing (TALLIP)

Menu

Search Issue
enter search term and/or author name

Archive


ACM Transactions on Asian Language Information Processing (TALIP), Volume 3 Issue 4, December 2004

Introduction to the special issue on computer processing of oriental languages
Sung Hyon Myaeng
Pages: 213-213
DOI: 10.1145/1039621.1039622

An adaptive k-nearest neighbor text categorization strategy
Li Baoli, Lu Qin, Yu Shiwen
Pages: 215-226
DOI: 10.1145/1039621.1039623
k is the most important parameter in a text categorization system based on the k-nearest neighbor algorithm (kNN). To classify a new document, the k-nearest documents in the training set are determined first. The...

Usefulness of temporal information automatically extracted from news articles for topic tracking
Pyung Kim, Sung Hyon Myaeng
Pages: 227-242
DOI: 10.1145/1039621.1039624
Temporal information plays an important role in natural language processing (NLP) applications such as information extraction, discourse analysis, automatic summarization, and question-answering. In the topic detection and tracking (TDT) area, the...

An evaluation of statistical spam filtering techniques
Le Zhang, Jingbo Zhu, Tianshun Yao
Pages: 243-269
DOI: 10.1145/1039621.1039625
This paper evaluates five supervised learning methods in the context of statistical spam filtering. We study the impact of different feature pruning methods and feature set sizes on each learner's performance using cost-sensitive measures. It is...