AILab@IMCS web service: POS tagger

category LRT (Language Resources and Technology)
language Latvian
access rights The web service is primarily intended for testing and evaluation purposes, and for experimental/academic/non-commercial usage.
Different terms of use can be discussed individually (see contacts below).
Please, let us know how and where you are using or would like to use this service.
description The morphological or POS (part of speech) tagger service provides two methods: one for preprocessing (tokenization and sentence splitting) of a given text, and the other for the actual tagging.
The tagger is available also as a standalone application.
last update 2011-07-01
service type REST
interface

http://tagger.ailab.lv/tokenize/ — parameters: request=<plain-text>, output format: MAF
http://tagger.ailab.lv/tag/ — parameters: request=<MAF>, output format: MAF

http://tagger.ailab.lv/tokenizetcf/ — parameters: request=<plain-text>, output format: TCF
http://tagger.ailab.lv/tagtcf/ — parameters: request=<TCF>, output format: TCF

HTTP method POST
MIME type request: application/x-www-form-urlencoded
response: text/xml
encoding UTF-8
annotation standards MAF (ISO/DIS 24611), using a slight extension: a sentence marker.
TCF (vers. 0.4)
ISOcat (ISO 12620); data categories used by the analyzer/synthesizer are listed here.
MULTEXT-East (vers. 4); slightly extended for Latvian.
contact us
demo




   

(c) IMCS UL, 2011