From 431ecad829a21002a39358c59630ef31d5b1a966 Mon Sep 17 00:00:00 2001 From: dano Date: Tue, 22 Aug 2023 08:19:19 +0000 Subject: [PATCH] Update 'pages/interns/sevval_bulburu/README.md' --- pages/interns/sevval_bulburu/README.md | 27 ++++++++++++++++++++++++++ 1 file changed, 27 insertions(+) diff --git a/pages/interns/sevval_bulburu/README.md b/pages/interns/sevval_bulburu/README.md index 5ece202f..d9178027 100644 --- a/pages/interns/sevval_bulburu/README.md +++ b/pages/interns/sevval_bulburu/README.md @@ -7,4 +7,31 @@ taxonomy: author: Daniel Hladek --- +Sevval Bulburu + + +IAESTE Intern Summer 2023, two months + +Goal: Help with the [Hate Speech Project](/topics/hatespeech) + +Meeting 22.8.2023 + +State: + +- Familiar with Python, Anaconda, Tensorflow, AI projects + +Tasks: + +- Get familiar with the task of Hate speech detection. Find out how can we use Transformer neural networks to detect and categorize hate speech in internet comments created by random people. +- Get familiar with the basic tools: Huggingface Transformers, Prodi.gy +- Set up web-based annotation environment for students (open, cooperation with [/students/2021/vladimir_ferko](Vladimir Ferko) ). + + +Future tasks (to be decided): + +- Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi +- Translate existing English dataset into Slovak. Use OPUS English Slovak Marian NMT model. Train Slovak munolingual model. +- Prepare existing Slovak Twitter dataaset, trainm evaluate a model. + +