forked from KEMT/zpwiki
		
	zz
This commit is contained in:
		
							parent
							
								
									b06eb9c21c
								
							
						
					
					
						commit
						946bc7f9f1
					
				| @ -1,5 +1,16 @@ | |||||||
|  | --- | ||||||
|  | title: Cesar Abascal Gutierrez | ||||||
|  | published: true | ||||||
|  | taxonomy: | ||||||
|  |     category: [iaeste] | ||||||
|  |     tag: [ner,nlp] | ||||||
|  |     author: Daniel Hladek | ||||||
|  | --- | ||||||
|  | 
 | ||||||
| ## Named entity annotations | ## Named entity annotations | ||||||
| 
 | 
 | ||||||
|  | Intern, probably summer 2019 | ||||||
|  | 
 | ||||||
| Cesar Abascal Gutierrez <cesarbielva1994@gmail.com> | Cesar Abascal Gutierrez <cesarbielva1994@gmail.com> | ||||||
| 
 | 
 | ||||||
| ## Goals | ## Goals | ||||||
|  | |||||||
							
								
								
									
										34
									
								
								pages/interns/oliver_pejic/README.md
									
									
									
									
									
										Normal file
									
								
							
							
						
						
									
										34
									
								
								pages/interns/oliver_pejic/README.md
									
									
									
									
									
										Normal file
									
								
							| @ -0,0 +1,34 @@ | |||||||
|  | --- | ||||||
|  | title: Oliver Pejic | ||||||
|  | published: true | ||||||
|  | taxonomy: | ||||||
|  |     category: [iaeste] | ||||||
|  |     tag: [hatespeech,nlp] | ||||||
|  |     author: Daniel Hladek | ||||||
|  | --- | ||||||
|  | 
 | ||||||
|  | Oliver Pejic | ||||||
|  | 
 | ||||||
|  | IAESTE Intern Summer 2024, six weeks in August and September | ||||||
|  | 
 | ||||||
|  | Goal: | ||||||
|  |   | ||||||
|  | - Help with the [Hate Speech Project](/topics/hatespeech) | ||||||
|  | - Help with evaluation of sentence transformer models using toolkit [MTEB](https://github.com/embeddings-benchmark/mteb)  | ||||||
|  | 
 | ||||||
|  | Final Tasks: | ||||||
|  | 
 | ||||||
|  | - Prepare an MTEB evaluation task for [Slovak HATE speech](https://huggingface.co/datasets/TUKE-KEMT/hate_speech_slovak). | ||||||
|  | - Prepare an MTEB evaluation task for [Slovak question answering](https://huggingface.co/datasets/TUKE-KEMT/retrieval-skquad). | ||||||
|  | - [Machine translate](https://huggingface.co/google/madlad400-3b-mt) an SBERT evaluation set for multiple slavic languages. | ||||||
|  | - Write a short scientific paper with results. | ||||||
|  | 
 | ||||||
|  | Preparation: | ||||||
|  | 
 | ||||||
|  | - Get familiar with [SentenceTransformer](https://sbert.net/) framework, study fundamental papers and write down notes. | ||||||
|  | - Get familiar with [MTEB](https://github.com/embeddings-benchmark/mteb) evaluation framework. | ||||||
|  | - Prepare a working  environment on Google Colab or on school server or Anaconda. | ||||||
|  | - Get familiar with [existing finetuning scripts](https://git.kemt.fei.tuke.sk/dano/slovakretrieval). | ||||||
|  | 
 | ||||||
|  | 
 | ||||||
|  | 
 | ||||||
| @ -2,7 +2,7 @@ | |||||||
| title: Sevval Bulburu | title: Sevval Bulburu | ||||||
| published: true | published: true | ||||||
| taxonomy: | taxonomy: | ||||||
|     category: [iaeste2023] |     category: [iaeste] | ||||||
|     tag: [hatespeech,nlp] |     tag: [hatespeech,nlp] | ||||||
|     author: Daniel Hladek |     author: Daniel Hladek | ||||||
| --- | --- | ||||||
|  | |||||||
		Loading…
	
		Reference in New Issue
	
	Block a user