Update pages/interns/yussef_ressaissi/README.md
This commit is contained in:
parent
020dc8ca83
commit
77aba37f85
@ -15,13 +15,18 @@ Goal: Evaluate and improve language models for summarization in Slovak medical o
|
||||
|
||||
Tasks:
|
||||
|
||||
- Get familiar with basic tools and prepare working environment: HF transformers, datasets, lm-evaluation-harness, HF trl
|
||||
1. Get familiar with basic tools
|
||||
- and prepare working environment: HF transformers, datasets, lm-evaluation-harness, HF trl
|
||||
- Read several recent papers about summarization using LLM and write a report.
|
||||
- Get familiar how to perform and evaluate document summarization using language models in Slovak.
|
||||
2. Make a comparison experiment
|
||||
- Pick summarization datasets and models. Evaluate several models for evaluation using ROUGE and BLEU metrics.
|
||||
- Describe the experiments. Summarize results in a table. Describe the results.
|
||||
- Improve performance of a languge model. Use more data. Prepare a domain-oriented dataset and finetune a model. Maybe generate artificial data to imporve summarization.
|
||||
3. Improve performance of a languge model.
|
||||
- Use more data. Prepare a domain-oriented dataset and finetune a model. Maybe generate artificial data to imporve summarization.
|
||||
- Run new expriments and write down the results.
|
||||
4. Report and disseminate
|
||||
- Prepare a final report with analysis, experiments and conclusions.
|
||||
- Publish the fine-tuned models in HF HUB. Publish the paper from the project.
|
||||
|
||||
|
||||
|
Loading…
Reference in New Issue
Block a user