|
|
@ -24,16 +24,17 @@ Plan:
|
|
|
|
- Create small evaluation set in Slovak
|
|
|
|
- Create small evaluation set in Slovak
|
|
|
|
- Try multilingual/crosslingual approach. Possibility of machine translation.
|
|
|
|
- Try multilingual/crosslingual approach. Possibility of machine translation.
|
|
|
|
- Annotate a bigger Slovak Corpus
|
|
|
|
- Annotate a bigger Slovak Corpus
|
|
|
|
- Recognize and publish scientific contribution
|
|
|
|
- Recognize and publish scientific contribution
|
|
|
|
|
|
|
|
|
|
|
|
Futire Tasks:
|
|
|
|
Future Tasks:
|
|
|
|
|
|
|
|
|
|
|
|
- Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi
|
|
|
|
- Evaluate existing multilingual model. E.G. https://huggingface.co/Andrazp/multilingual-hate-speech-robacofi
|
|
|
|
- Translate existing English dataset into Slovak. Use OPUS English Slovak Marian NMT model. Train Slovak munolingual model.
|
|
|
|
- Translate existing English dataset into Slovak. Use OPUS English Slovak Marian NMT model. Train Slovak munolingual model.
|
|
|
|
- Annotate a Twitter Dataset. Possible guidelines are: https://developers.perspectiveapi.com/s/about-the-api-training-data?language=en_US
|
|
|
|
- Train or finetune or prompt a large langauge model.
|
|
|
|
|
|
|
|
|
|
|
|
In progress tasks:
|
|
|
|
In progress tasks:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
- Annotate a Twitter Dataset. Possible guidelines are: https://developers.perspectiveapi.com/s/about-the-api-training-data?language=en_US
|
|
|
|
- Annotate a Facebook Dataset. Use some other guidelines. e.g. sentence-level annotation, for context sensitive hate.
|
|
|
|
- Annotate a Facebook Dataset. Use some other guidelines. e.g. sentence-level annotation, for context sensitive hate.
|
|
|
|
- Prepare existing Slovak Twitter dataaset, train evaluate a model.
|
|
|
|
- Prepare existing Slovak Twitter dataaset, train evaluate a model.
|
|
|
|
|
|
|
|
|
|
|
@ -50,10 +51,13 @@ People:
|
|
|
|
- Daniel Hládek
|
|
|
|
- Daniel Hládek
|
|
|
|
- Zuzana Sokolová
|
|
|
|
- Zuzana Sokolová
|
|
|
|
- [Vladimír Ferko](/students/2021/vladimir_ferko)
|
|
|
|
- [Vladimír Ferko](/students/2021/vladimir_ferko)
|
|
|
|
- [Sevval Bulburu](/interns/sevval_bulburu)
|
|
|
|
- [Tetiana Mohorian](/students/2022/tetiana_mohorian)
|
|
|
|
|
|
|
|
- [Patrik Pokrivčák](/students/2019/patrik_pokrivcak)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Former participants:
|
|
|
|
Former participants:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
- [Sevval Bulburu](/interns/sevval_bulburu)
|
|
|
|
- [Manohar Gowdru Shridharu](/students/2021/manohar_gowdru_shridharu)
|
|
|
|
- [Manohar Gowdru Shridharu](/students/2021/manohar_gowdru_shridharu)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|