Conversion
Histogram Analysis
Hidden Markov Model (HMM)
Maximum Entropy Model (ME)
Conditional Random Field (CRF)
Memory Based Learning (MBL)
POS Tagger : Basic Requirements
Tag Set
Corpus
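As a rough illustration of how the two requirements fit together, the sketch below pairs a tag set with a small hand-tagged corpus and trains a most-frequent-tag baseline from it. The tag names, the toy sentences, and the helper names (`unknown_tags`, `train_baseline`, `tag`) are all assumptions made up for this example, not part of any published tag set or tool.

```python
from collections import Counter, defaultdict

# Hypothetical tag set: the closed inventory of labels the tagger may assign.
TAG_SET = {"DET", "NN", "VB", "PREP"}

# Hypothetical corpus: sentences annotated as (word, tag) pairs.
CORPUS = [
    [("the", "DET"), ("dog", "NN"), ("barks", "VB")],
    [("the", "DET"), ("cat", "NN"), ("sleeps", "VB")],
    [("dog", "NN"), ("on", "PREP"), ("the", "DET"), ("mat", "NN")],
]


def unknown_tags(corpus, tag_set):
    """Return tags that occur in the corpus but are missing from the tag set."""
    return {tag for sentence in corpus for _, tag in sentence} - tag_set


def train_baseline(corpus):
    """Map each word to the tag it carries most often in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        for word, tag in sentence:
            counts[word][tag] += 1
    return {word: c.most_common(1)[0][0] for word, c in counts.items()}


def tag(words, model, default="NN"):
    """Tag words with the baseline model; unseen words get the default tag."""
    return [(w, model.get(w, default)) for w in words]


if __name__ == "__main__":
    print(unknown_tags(CORPUS, TAG_SET))
    print(tag(["the", "dog", "runs"], train_baseline(CORPUS)))
```

A baseline like this is only a starting point; the statistical models listed above (HMM, ME, CRF, MBL) additionally use the surrounding context, but they depend on the same two resources.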
Indian Languages : Tag Sets Available
Very few tag sets are publicly available
IL Tagset - IIT Hyderabad : very coarse linguistic
analysis, resulting in a very flat structure
e.g. the tag “PREP” is also used for postpositions (POSTP).
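The effect of such a flat tag set can be sketched in a few lines: once prepositions and postpositions share one label, the distinction cannot be recovered from the tagged output. The fine-grained tag names, the mapping, and the romanized Hindi example ("ghar mein", "in the house", where "mein" is a postposition) are illustrative assumptions, not entries taken from the IL tag set itself.

```python
# Hypothetical mapping from a finer tag inventory to a coarse, flat one.
FINE_TO_COARSE = {
    "NN": "NN",
    "PREP": "PREP",
    "POSTP": "PREP",  # postpositions are folded into the PREP tag
}


def coarsen(tagged, mapping):
    """Replace each fine-grained tag with its coarse counterpart."""
    return [(word, mapping[tag]) for word, tag in tagged]


def tags_used(tagged):
    """Return the set of distinct tags in a tagged sequence."""
    return {tag for _, tag in tagged}


if __name__ == "__main__":
    # Illustrative phrase tagged with the finer inventory.
    fine = [("ghar", "NN"), ("mein", "POSTP")]
    coarse = coarsen(fine, FINE_TO_COARSE)
    print(coarse)
    print(tags_used(fine), tags_used(coarse))
```

After coarsening, a postposition carries the same label a preposition would, so any later processing step that needs the word-order distinction has to rediscover it by other means.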
Corpus Generation