diff --git a/pages/interns/yussef_ressaissi/README.md b/pages/interns/yussef_ressaissi/README.md index 5b2d049e..cbec97ab 100644 --- a/pages/interns/yussef_ressaissi/README.md +++ b/pages/interns/yussef_ressaissi/README.md @@ -30,4 +30,29 @@ Tasks: - Prepare a final report with analysis, experiments and conclusions. - Publish the fine-tuned models in HF HUB. Publish the paper from the project. +Meeting 17.7.2025: + +State: + +- Studying of the task, metrics (ROUGE,BLEU) +- Loaded a model. preprocessed a dataset, evaluated a model +- loaded more models, used SlovakSum, generated summarization with four model and comapre them with ROUGE and BLEU (TUKE-KEMT/slovak-t5-base, google/mt5-small, google/mt5-base, facebook/mbart-large-50) +- the comparisin is without fine tuning (zero shot), for far, the best is MBART-large +- working on legal dataset "dennlinger/eur-lex-sum", +- notebooks are on the kemt git + +Tasks: + +- Prepare "mango.kemt.fei.tuke.sk" workflow +- Finetune an existing models and evaluate it. Use News and Legal datasets +- Try mbart-large, flan-t5-large, slovak-t5-base, google/t5-v1_1-large +- Describe the experimental setup, prepare tables with results. + + +Future tasks: + +- Try prompting LLM and evaluation of the results. We need to pick LLM with SLovak Support +- Finetune an LLM to summarize +- Use medical data (after they are ready). +- Prepare a detailed report (to be converted into a paper).