Professional Documents
Culture Documents
0 - 2017-06-09
Copyright (c) 2002-2012 The Board of Trustees of
The Leland Stanford Junior University. All Rights Reserved.
If you really want to use this software under Java 1.4, look into RetroWeaver:
http://retroweaver.sourceforge.net/
QUICKSTART
-----------------------------------------------
The Stanford POS Tagger is designed to be used from the command line or
programmatically via its API.
1)
java -mx200m -classpath stanford-postagger.jar
edu.stanford.nlp.tagger.maxent.MaxentTaggerGUI models/wsj-0-18-left3words-
distsim.tagger
2) Running the appropriate script for your operating system:
stanford-postagger-gui.bat
./stanford-postagger-gui.sh
To run the tagger from the command line, you can start with the provided
script appropriate for you operating system:
./stanford-postagger.sh models/wsj-0-18-left3words-distsim.tagger sample-
input.txt
stanford-postagger models\wsj-0-18-left3words-distsim.tagger sample-input.txt
The output should match what is found in sample-output.txt
The tagger has three modes: tagging, training, and testing. Tagging
allows you to use a pretrained model (two English models are included)
to assign part of speech tags to unlabeled text. Training allows you to
save a new model based on a set of tagged data that you provide.
Testing allows you to see how well a tagger performs by tagging labeled
data and evaluating the results against the correct tags.
Many options are available for training, tagging, and testing. These
options can be set using a properties file. To start, you can generate a
default properties file by:
Tagged output will be printed to standard out, which you can redirect
as above. Note that the bidirectional model is slightly more accurate
but significantly slower than the left3words model.
To test a model
===============
This file.
LICENSE.txt
Stanford POS Tagger is licensed under the GNU General Public License (v2+).
stanford-postagger.jar
stanford-postagger-YYYY-MM-DD.jar
src
A directory containing the Java 1.8 source code for the Stanford POS
Tagger distribution.
build.xml, Makefile
Files for building the distribution (with ant and make, respectively)
models
sample-input.txt
A sample text file that you can tag to demonstrate the tagger.
sample-output.txt
stanford-postagger-gui.sh
stanford-postagger-gui.bat
stanford-postagger.sh
stanford-postagger.bat
TaggerDemo.java
A sample file for how to call the tagger in your own program. You
should be able to compile and run it with:
(If you are on Windows, you need to replace the ":" with a ";" in the
-cp argument, and should use a "\" in place of the "/" in the filename....)
THANKS
-----------------------------------------------
http://nlp.stanford.edu/javanlp/
CHANGES
-----------------------------------------------
LICENSE
-----------------------------------------------
You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.
CONTACT
-----------------------------------------------
For questions about the Stanford POS tagger, please feel free to contact
the Stanford JavaNLP user community at the mailing list
java-nlp-user@lists.stanford.edu. You need to be a member of this
mailing list to be able to post to it. Join the list either by emailing
java-nlp-user-join@lists.stanford.edu (leave the subject and message
body empty) or by using the web interface at:
https://mailman.stanford.edu/mailman/listinfo/java-nlp-user