zpwiki/pages/topics/resources/README.md

1.9 KiB

Slovenské jazykové zdroje

POS

Multext East Anotovaný román George Orwell 1984 v 15 európskych jazykoch

NER

Parsing-POS

Slovak Dependency Treebank

https://github.com/UniversalDependencies/UD_Slovak-SNK

Artificial Treebank with Ellipsis

Wordnet

Slovak Word Net

Parallel Corpus

Europarlament

Czech-Slovak Parallel Corpus

English-Slovak Parallel Corpus

Multext East

Sentiment

Twitter sentiment for 15 European languages

Web

Wikipedia

Wikipedia vo formáte JSON Elasticsearch Bulk

Word Embedding

FastText Word Embedding from Common Crawl

Databázy zdrojov

https://www.clarin.eu/portal

https://www.clarin.eu/resource-families/manually-annotated-corpora

http://www.meta-share.org/

https://korpus.sk/res.html