enter search term and/or author name
Introduction to the special issue on computer processing of oriental languages
Sung Hyon Myaeng
An adaptive k-nearest neighbor text categorization strategy
Li Baoli, Lu Qin, Yu Shiwen
k is the most important parameter in a text categorization system based on the k-nearest neighbor algorithm (kNN). To classify a new document, the k-nearest documents in the training set are determined first. The...
Usefulness of temporal information automatically extracted from news articles for topic tracking
Pyung Kim, Sung Hyon Myaeng
Temporal information plays an important role in natural language processing (NLP) applications such as information extraction, discourse analysis, automatic summarization, and question-answering. In the topic detection and tracking (TDT) area, the...
An evaluation of statistical spam filtering techniques
Le Zhang, Jingbo Zhu, Tianshun Yao
This paper evaluates five supervised learning methods in the context of statistical spam filtering. We study the impact of different feature pruning methods and feature set sizes on each learner's performance using cost-sensitive measures. It is...