From 738c5220aa1ef13e657ea5d27379665d9207aa73 Mon Sep 17 00:00:00 2001 From: dano Date: Thu, 12 Oct 2023 14:32:32 +0000 Subject: [PATCH] Update 'pages/interns/sevval_bulburu/README.md' --- pages/interns/sevval_bulburu/README.md | 18 ++++++++++++++---- 1 file changed, 14 insertions(+), 4 deletions(-) diff --git a/pages/interns/sevval_bulburu/README.md b/pages/interns/sevval_bulburu/README.md index 89a5ca62..7067910c 100644 --- a/pages/interns/sevval_bulburu/README.md +++ b/pages/interns/sevval_bulburu/README.md @@ -29,10 +29,20 @@ State: Tasks: -- Please send me your work report. -- Please upload your scripts and notebooks on git and send me a link. git is git.kemt.fei.tuke.sk or github. -- Prepare some short comment about scripts. -- You can also upload the slovak dataset - there is some work done on it. +- Please send me your work report. Please upload your scripts and notebooks on git and send me a link. git is git.kemt.fei.tuke.sk or github. Prepare some short comment about scripts.You can also upload the slovak datasetthere is some work done on it. + +Ideas for a paper: + +- "Data set balancing for Multilingual Hate Speech Detection" +- "BERT embeddings for HS Detection in Low Resource Languages" (Turkish and Slovak). +- Try 2 or 3 class Softmax Layer for neural network. +- Change the dataset for 3 class classification. +- Prepare classifier for Slovak, English, Turkish and for multiple BERT models. Try to use multilingual BERT model for baseline embeddings. +- Measure the effect of balancing the dataset by generation of additional examples. +- Summarize experiments in tables. + + + Meeting 5.9.2023