forked from KEMT/zpwiki
		
	Update pages/interns/yussef_ressaissi/README.md
This commit is contained in:
		
							parent
							
								
									020dc8ca83
								
							
						
					
					
						commit
						77aba37f85
					
				| @ -15,13 +15,18 @@ Goal: Evaluate and improve language models for summarization in Slovak medical o | ||||
| 
 | ||||
| Tasks: | ||||
| 
 | ||||
| - Get familiar with basic tools and prepare working environment: HF transformers, datasets, lm-evaluation-harness, HF trl | ||||
| 1. Get familiar with basic tools  | ||||
|   -  and prepare working environment: HF transformers, datasets, lm-evaluation-harness, HF trl | ||||
|   - Read several recent papers about summarization using LLM and write a report. | ||||
|   - Get familiar how to perform and evaluate document summarization using language models in Slovak. | ||||
| 2. Make a comparison experiment | ||||
|   - Pick summarization datasets and models. Evaluate several models for evaluation using ROUGE and BLEU metrics. | ||||
|   - Describe the experiments. Summarize results in a table. Describe the results.  | ||||
| - Improve performance of a languge model. Use more data. Prepare a domain-oriented dataset and finetune a model. Maybe generate artificial data to imporve summarization. | ||||
| 3. Improve performance of a languge model.  | ||||
|   - Use more data. Prepare a domain-oriented dataset and finetune a model. Maybe generate artificial data to imporve summarization. | ||||
|   - Run new expriments and write down the results. | ||||
| 4. Report and disseminate | ||||
|   - Prepare a final report with analysis, experiments and conclusions. | ||||
|   - Publish the fine-tuned models in HF HUB. Publish the paper from the project. | ||||
| 
 | ||||
| 
 | ||||
|  | ||||
		Loading…
	
		Reference in New Issue
	
	Block a user