resources

This commit is contained in:
Daniel Hládek 2020-03-02 14:49:10 +01:00
parent f0cb995f59
commit f12a8b0e33
2 changed files with 56 additions and 0 deletions

View File

@ -0,0 +1,9 @@
# Morfologická analýza slovenského jayzka
Identifikácia morfologických jednotiek
- Hunspell Leamtizátor
Byte Pair Encoding

View File

@ -0,0 +1,47 @@
# Slovenské jazykové zdroje
### POS
[Multext East](http://nl.ijs.si/ME/)
### Parsing-POS
[Slovak Dependency Treebank](https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-1822)
https://github.com/UniversalDependencies/UD_Slovak-SNK
[Artificial Treebank with Ellipsis](https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-2616)
### Wordnet
[Slovak Word Net](https://korpus.sk/WordNet.html)
### Parellel Corpus
Europarlament
[Czech-Slovak Parallel Corpus](https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-AADF-0)
[English-Slovak Parallel Corpus](https://lindat.mff.cuni.cz/repository/xmlui/handle/11858/00-097C-0000-0006-AAE0-A)
[Multext East](http://nl.ijs.si/ME/)
### Other
[Twitter sentiment for 15 European languages](https://www.clarin.si/repository/xmlui/handle/11356/1054)
### Web
[Aranea](http://ucts.uniba.sk/aranea_about/)
### Databázy zdrojov
https://www.clarin.eu/portal
https://www.clarin.eu/resource-families/manually-annotated-corpora
http://www.meta-share.org/
https://korpus.sk/res.html