| category | LRT (Language Resources and Technology) | ||
| language | Latvian | ||
| access rights |
The web service is intended primarily for testing and evaluation, and for experimental, academic, and non-commercial use. Other terms of use can be discussed individually (see the contacts below). Please let us know how and where you use, or would like to use, this service. |
||
| description |
The morphological, or part-of-speech (POS), tagger service provides two methods: one for preprocessing a given text (tokenization and sentence splitting) and one for the actual tagging. The tagger is also available as a standalone application. |
||
| last update | 2011-07-01 | ||
| service type | REST | ||
| interface |
http://tagger.ailab.lv/tokenize/ (parameters: request=<plain-text>; output format: MAF)
http://tagger.ailab.lv/tokenizetcf/ (parameters: request=<plain-text>; output format: TCF) |
||
| HTTP method | POST | ||
| MIME type |
request: application/x-www-form-urlencoded
response: text/xml |
||
| encoding | UTF-8 | ||
| annotation standards |
MAF (ISO/DIS 24611), using a slight extension: a sentence marker.
TCF (vers. 0.4).
ISOcat (ISO 12620); data categories used by the analyzer/synthesizer are listed here.
MULTEXT-East (vers. 4); slightly extended for Latvian. |
||
| contact us | |||
| demo |
|
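The interface, HTTP method, MIME type, and encoding rows above can be combined into a client call. The following is a minimal sketch, not an official client: it assumes the service is still reachable at the listed URLs, and the names `ENDPOINTS`, `build_request`, and `tokenize` are hypothetical helpers introduced only for illustration. It covers the two preprocessing endpoints listed in the interface row; no URL for the tagging method is given there, so none is assumed here.

```python
import urllib.parse
import urllib.request

# URLs are taken from the interface row; the keys "maf" and "tcf" are our own labels.
ENDPOINTS = {
    "maf": "http://tagger.ailab.lv/tokenize/",
    "tcf": "http://tagger.ailab.lv/tokenizetcf/",
}


def build_request(text, output="maf"):
    """Build (but do not send) a POST request for the preprocessing method."""
    # The service expects one form field named `request`; the text is
    # percent-encoded as UTF-8, matching the encoding row.
    body = urllib.parse.urlencode({"request": text}, encoding="utf-8").encode("ascii")
    return urllib.request.Request(
        ENDPOINTS[output],
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


def tokenize(text, output="maf", timeout=30):
    """Send the request and return the XML response (text/xml) as a string."""
    # Assumption: the response body is UTF-8, as stated in the encoding row.
    with urllib.request.urlopen(build_request(text, output), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `tokenize("Vīrs ar cirvi.", output="tcf")` would send the text to the TCF endpoint and return whatever XML the service produces; the sketch does not parse or validate that XML.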
(c) IMCS UL, 2011