forked from KEMT/zpwiki
		
	zz
This commit is contained in:
		
							parent
							
								
									b06eb9c21c
								
							
						
					
					
						commit
						946bc7f9f1
					
				| @ -1,5 +1,16 @@ | ||||
| --- | ||||
| title: Cesar Abascal Gutierrez | ||||
| published: true | ||||
| taxonomy: | ||||
|     category: [iaeste] | ||||
|     tag: [ner,nlp] | ||||
|     author: Daniel Hladek | ||||
| --- | ||||
| 
 | ||||
| ## Named entity annotations | ||||
| 
 | ||||
| Intern, probably summer 2019 | ||||
| 
 | ||||
| Cesar Abascal Gutierrez <cesarbielva1994@gmail.com> | ||||
| 
 | ||||
| ## Goals | ||||
|  | ||||
							
								
								
									
										34
									
								
								pages/interns/oliver_pejic/README.md
									
									
									
									
									
										Normal file
									
								
							
							
						
						
									
										34
									
								
								pages/interns/oliver_pejic/README.md
									
									
									
									
									
										Normal file
									
								
							| @ -0,0 +1,34 @@ | ||||
| --- | ||||
| title: Oliver Pejic | ||||
| published: true | ||||
| taxonomy: | ||||
|     category: [iaeste] | ||||
|     tag: [hatespeech,nlp] | ||||
|     author: Daniel Hladek | ||||
| --- | ||||
| 
 | ||||
| Oliver Pejic | ||||
| 
 | ||||
| IAESTE Intern Summer 2024, six weeks in August and September | ||||
| 
 | ||||
| Goal: | ||||
|   | ||||
| - Help with the [Hate Speech Project](/topics/hatespeech) | ||||
| - Help with evaluation of sentence transformer models using toolkit [MTEB](https://github.com/embeddings-benchmark/mteb)  | ||||
| 
 | ||||
| Final Tasks: | ||||
| 
 | ||||
| - Prepare an MTEB evaluation task for [Slovak HATE speech](https://huggingface.co/datasets/TUKE-KEMT/hate_speech_slovak). | ||||
| - Prepare an MTEB evaluation task for [Slovak question answering](https://huggingface.co/datasets/TUKE-KEMT/retrieval-skquad). | ||||
| - [Machine translate](https://huggingface.co/google/madlad400-3b-mt) an SBERT evaluation set for multiple slavic languages. | ||||
| - Write a short scientific paper with results. | ||||
| 
 | ||||
| Preparation: | ||||
| 
 | ||||
| - Get familiar with [SentenceTransformer](https://sbert.net/) framework, study fundamental papers and write down notes. | ||||
| - Get familiar with [MTEB](https://github.com/embeddings-benchmark/mteb) evaluation framework. | ||||
| - Prepare a working  environment on Google Colab or on school server or Anaconda. | ||||
| - Get familiar with [existing finetuning scripts](https://git.kemt.fei.tuke.sk/dano/slovakretrieval). | ||||
| 
 | ||||
| 
 | ||||
| 
 | ||||
| @ -2,7 +2,7 @@ | ||||
| title: Sevval Bulburu | ||||
| published: true | ||||
| taxonomy: | ||||
|     category: [iaeste2023] | ||||
|     category: [iaeste] | ||||
|     tag: [hatespeech,nlp] | ||||
|     author: Daniel Hladek | ||||
| --- | ||||
|  | ||||
		Loading…
	
		Reference in New Issue
	
	Block a user