dmytro_ushatenko/pages/topics/resources
2020-04-14 15:14:32 +00:00
..
README.md Update 'pages/topics/resources/README.md' 2020-04-14 15:14:32 +00:00

Slovenské jazykové zdroje

POS

Multext East Anotovaný román George Orwell 1984 v 15 európskych jazykoch

NER

Parsing-POS

Slovak Dependency Treebank

https://github.com/UniversalDependencies/UD_Slovak-SNK

Artificial Treebank with Ellipsis

Wordnet

Slovak Word Net

Parellel Corpus

Europarlament

Czech-Slovak Parallel Corpus

English-Slovak Parallel Corpus

Multext East

Sentiment

Twitter sentiment for 15 European languages

Web

  • Aranea
  • SkTenTen automaticky POS anotovaný, prístup cez web rozhranie

Wikipedia

Wikipedia vo formáte JSON Elasticsearch Bulk

Word Embedding

FastText Word Embedding from Common Crawl

Databázy zdrojov

https://www.clarin.eu/portal

https://www.clarin.eu/resource-families/manually-annotated-corpora

http://www.meta-share.org/

https://korpus.sk/res.html