zz
This commit is contained in:
parent
a1b4a3d2f8
commit
c065ab523d
@ -30,4 +30,29 @@ Tasks:
|
|||||||
- Prepare a final report with analysis, experiments and conclusions.
|
- Prepare a final report with analysis, experiments and conclusions.
|
||||||
- Publish the fine-tuned models in HF HUB. Publish the paper from the project.
|
- Publish the fine-tuned models in HF HUB. Publish the paper from the project.
|
||||||
|
|
||||||
|
Meeting 17.7.2025:
|
||||||
|
|
||||||
|
State:
|
||||||
|
|
||||||
|
- Studying of the task, metrics (ROUGE,BLEU)
|
||||||
|
- Loaded a model. preprocessed a dataset, evaluated a model
|
||||||
|
- loaded more models, used SlovakSum, generated summarization with four model and comapre them with ROUGE and BLEU (TUKE-KEMT/slovak-t5-base, google/mt5-small, google/mt5-base, facebook/mbart-large-50)
|
||||||
|
- the comparisin is without fine tuning (zero shot), for far, the best is MBART-large
|
||||||
|
- working on legal dataset "dennlinger/eur-lex-sum",
|
||||||
|
- notebooks are on the kemt git
|
||||||
|
|
||||||
|
Tasks:
|
||||||
|
|
||||||
|
- Prepare "mango.kemt.fei.tuke.sk" workflow
|
||||||
|
- Finetune an existing models and evaluate it. Use News and Legal datasets
|
||||||
|
- Try mbart-large, flan-t5-large, slovak-t5-base, google/t5-v1_1-large
|
||||||
|
- Describe the experimental setup, prepare tables with results.
|
||||||
|
|
||||||
|
|
||||||
|
Future tasks:
|
||||||
|
|
||||||
|
- Try prompting LLM and evaluation of the results. We need to pick LLM with SLovak Support
|
||||||
|
- Finetune an LLM to summarize
|
||||||
|
- Use medical data (after they are ready).
|
||||||
|
- Prepare a detailed report (to be converted into a paper).
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user