Copyright (c) 2006 by
Xuan-Hieu Phan (pxhieu at gmail dot com), Graduate School of Information Sciences, Tohoku University
CRFTagger: A Java-based Conditional Random Fields Part-of-Speech (POS) Tagger for English that was built upon FlexCRFs. The model was trained on sections 01..24 of WSJ corpus and using section 00 as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences / second.
Researches using this tool for running experiments should include the following citation:
Xuan-Hieu Phan, "CRFTagger: CRF English POS Tagger", http://crftagger.sourceforge.net/, 2006.
We would like to thank professor Tu-Bao Ho for providing us Penn Treebank data for training the POS tagging model. We would also like to thank Sourceforge.net for hosting this project.
Last updated: March 24, 2007