--- title: Bogdan Paul ChiČ™ published: true taxonomy: category: [erasmus] tag: [nlp, ie, rag, medical] author: Daniel Hladek --- ERASMUS Intern Spring 2026, 20 March - 21 May (62 days) Topic: (multilingual) Triplet extraction from medical data Goal: - Construct a knowledge graph from medical package inserts in multiple languages - Utilize the graph in an intelligent agent that recommends medication. Tasks: - Learn intelligent agents and generative models - OpenAI API, Agent frameworks, RAG systems. - Learn about knowledge graphs and GraphRAG. Read several research papers. - Find several existing drug knowledge databases. Identify possible entities and relations. - Prepare a Python based workflow, use git code repository - Try Light RAG - Simple RAG. - Scrape package inserts and parse the data. - Index the data and write a script that extracts a knowledge graph from data. - Visualize the graph - Prepare an agent that utilizes the unstructured data and graph-data. - Evaluate the agent using DeepEval or RAGAS. - Write a report - Put all code to GIT