forked from KEMT/zpwiki
Update 'pages/topics/resources/README.md'
parent 9e8b0aeb96
commit 72c65b5d15
@@ -50,7 +50,9 @@ Europarlament
 
 ### Word Embedding
 
-[FastText Word Embedding from Common Crawl](https://fasttext.cc/docs/en/crawl-vectors.html)
+- [FastText Word Embedding from Common Crawl](https://fasttext.cc/docs/en/crawl-vectors.html)
+- [FastText Word Embedding from Wikipedia](https://fasttext.cc/docs/en/pretrained-vectors.html)
+
 
 ### Databázy zdrojov
 
@@ -62,3 +64,11 @@ http://www.meta-share.org/
 
 https://korpus.sk/res.html
 
+Slovak Stemming https://www.mediawiki.org/wiki/User:TJones_(WMF)/Notes/Slovak_Stemmer_Analysis
+
+### Tools
+
+- [Spacy](https://spacy.io/), tokenizer, stopwords, custom model
+- [Slovak Lexer](https://github.com/hladek/slovak-lexer) / tokenizer
+- [Slovak Elasticsearch](https://github.com/essential-data/elasticsearch-sk) - stopwords, stemmer
+- [Slovak Hunspell](https://github.com/essential-data/hunspell-sk) - stemmer, spelling