From 9feade3a56ec72e679d73f57c59619a10ecdb6bd Mon Sep 17 00:00:00 2001 From: dano Date: Tue, 22 Aug 2023 08:32:37 +0000 Subject: [PATCH] Update 'pages/interns/sevval_bulburu/README.md' --- pages/interns/sevval_bulburu/README.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/pages/interns/sevval_bulburu/README.md b/pages/interns/sevval_bulburu/README.md index d9178027..0e534b02 100644 --- a/pages/interns/sevval_bulburu/README.md +++ b/pages/interns/sevval_bulburu/README.md @@ -23,13 +23,14 @@ State: Tasks: - Get familiar with the task of Hate speech detection. Find out how can we use Transformer neural networks to detect and categorize hate speech in internet comments created by random people. -- Get familiar with the basic tools: Huggingface Transformers, Prodi.gy +- Get familiar with the basic tools: Huggingface Transformers, Learn how to use - Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi in Python script. Learn something about Transformer neural networks. +- get familiar with Prodi.gy annotation tool. - Set up web-based annotation environment for students (open, cooperation with [/students/2021/vladimir_ferko](Vladimir Ferko) ). Future tasks (to be decided): -- Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi +- Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi with slovak data - Translate existing English dataset into Slovak. Use OPUS English Slovak Marian NMT model. Train Slovak munolingual model. - Prepare existing Slovak Twitter dataaset, trainm evaluate a model.