CRFTagger: CRF English POS Tagger

(built upon FlexCRFs)


Copyright (c) 2006 by

Xuan-Hieu Phan (pxhieu at gmail dot com), Graduate School of Information Sciences, Tohoku University

CRFTagger is a Java-based Conditional Random Fields (CRF) part-of-speech (POS) tagger for English, built upon FlexCRFs. The model was trained on sections 01..24 of the WSJ corpus, with section 00 used as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences per second.


Researchers using this tool to run experiments should include the following citation:

Xuan-Hieu Phan, "CRFTagger: CRF English POS Tagger", 2006.

We would like to thank Professor Tu-Bao Ho for providing us with the Penn Treebank data used to train the POS tagging model. We would also like to thank for hosting this project.

Last updated: March 24, 2007