diff --git a/pages/topics/hatespeech/README.md b/pages/topics/hatespeech/README.md index db7bbd71..aa4ebd8e 100644 --- a/pages/topics/hatespeech/README.md +++ b/pages/topics/hatespeech/README.md @@ -26,6 +26,18 @@ Plan: - Annotate a bigger Slovak Corpus - Recognize and publish scientific contribution +Tasks: + +- Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi +- Translate existing English dataset into Slovak. Use OPUS English Slovak Marian NMT model. Train Slovak munolingual model. + +Future tasks: + +- Annotate a Twitter Dataset. Possible guidelines are: https://developers.perspectiveapi.com/s/about-the-api-training-data?language=en_US +- Annotate a Facebook Dataset. Use some other guidelines. e.g. sentence-level annotation, for context sensitive hate. +- Prepare existing Slovak Twitter dataaset, trainm evaluate a model. + + People: