Spolupráca na záverečných prácach https://zp.kemt.fei.tuke.sk
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 

2.1 KiB

title category tag
Hate Speech [project] [hatespeech nlp nlm]

Hate Speech Scientific Project

Goal:

  • To be able to recognize parts of text that contains hate or vulgarisms.

Possible applications:

  • Management of discussion forums / detection of spam or abuse.
  • "Postprocessing" for biased generative language models - preventing to generate inapropriate responses.

Plan:

  • Perform a review of the state-of-the-art
  • Pick established (english) corpora
  • Formalize the problem - classification of sentiment, recognition of topic, keyword selection,
  • Propose a preliminary system, repeat existing approach.
  • Create small evaluation set in Slovak
  • Try multilingual/crosslingual approach. Possibility of machine translation.
  • Annotate a bigger Slovak Corpus
  • Recognize and publish scientific contribution

Future Tasks:

In progress tasks:

Finished tasks:

  • Perform preliminary experiments with HS detection (Bulburu)
  • Prepare an anotation infrastructure for Facebook data annotation (Ferko)
  • Gather Facebook data and prepare for annotation. (Ferko)

People:

Former participants:

Links: