forked from KEMT/zpwiki
28 lines
626 B
Markdown
28 lines
626 B
Markdown
# Hatespeech Scientific Project
|
|
|
|
Goal:
|
|
|
|
- To be able to recognize text that contains hate.
|
|
|
|
|
|
Plan:
|
|
|
|
- Perform a review of the state-of-the-art
|
|
- Pick established (english) corpora
|
|
- Formalize the problem - classification of sentiment, recognition of topic, keyword selection,
|
|
- Propose a preliminary system, repeat existing approach.
|
|
- Create small evaluation set in Slovak
|
|
- Try multilingual/crosslingual approach. Possibility of machine translation.
|
|
- Annotate a bigger Sloval Corpus
|
|
- Fiund scientific contribution
|
|
|
|
People:
|
|
|
|
- Ján Staš
|
|
- Daniel Hládek
|
|
- Zuzana Sokolová
|
|
- Manuhar
|
|
|
|
Sources:
|
|
https://hatespeechdata.com/
|