IJCATR Volume 3 Issue 9

An Evaluation of Two-Step Techniques for Positive-Unlabeled Learning in Text Classification

Azam Kaboutari Jamshid Bagherzadeh Fatemeh Kheradmand
10.7753/IJCATR0309.1012
keywords : PU Learning; positive-unlabeled learning; one-class classification; text classification; partially supervised learning

PDF
Positive-unlabeled (PU) learning is a learning problem which uses a semi-supervised method for learning. In PU learning problem, the aim is to build an accurate binary classifier without the need to collect negative examples for training. Two-step approach is a solution for PU learning problem that consists of tow steps: (1) Identifying a set of reliable negative documents. (2) Building a classifier iteratively. In this paper we evaluate five combinations of techniques for two-step strategy. We found that using Rocchio method in step 1 and Expectation-Maximization method in step 2 is most effective combination in our experiments.
@artical{a392014ijcatr03091012,
Title = "An Evaluation of Two-Step Techniques for Positive-Unlabeled Learning in Text Classification",
Journal ="International Journal of Computer Applications Technology and Research(IJCATR)",
Volume = "3",
Issue ="9",
Pages ="592 - 594",
Year = "2014",
Authors ="Azam Kaboutari Jamshid Bagherzadeh Fatemeh Kheradmand"}
  • null