
Dr Oscar Perez Concha
PhD Artificial Intelligence, Computer Science, Universidad Carlos III de Madrid, Spain, 2008
I am a health data scientist focused on improving healthcare processes and contributing to better patient outcomes. I specialise in analysing and extracting meaningful information from large unstructured health datasets using artificial intelligence (AI) and natural language processing (NLP). I currently lead research applying large language models (LLMs) and explainable AI to clinical text, with a focus on three key applications: improving the accuracy and efficiency of clinical coding, generating concise clinical summaries, and extracting clinically relevant information to support patient care.
I have a strong interest in teaching, learning, and the dissemination of AI in health. I founded and currently co-host the , (established in 2018) and co-organise the , both initiatives aimed at fostering interdisciplinary collaboration.聽I also teach and convene three postgraduate courses on applied AI in health, as part of the Master of Health Data Science and Clinical AI programs. I designed and created two of these courses: Health Data Analytics: Machine Learning I (HDAT9500), which introduces machine learning techniques for health data science research questions, and Health Data Analytics: Machine Learning II (HDAT9510), which focuses on deep learning methods for solving data science problems in healthcare.
- Publications
- Media
- Grants
- Awards
- Research Activities
- Engagement
- Teaching and Supervision
My Teaching
As part of of the MSc Health Data Science Program and the Clinical AI Program, I currently convene and teach the courses:
1. Health Data Analytics: Machine Learning I (HDAT9500) [Link]
Course Summary: This course provides an introduction to machine learning methods for addressing health research questions. You will learn how to formulate relevant questions and frame them as machine learning problems within a healthcare context. The course covers the fundamentals of key methods, including supervised learning (classification and regression) and unsupervised learning (dimensionality reduction and clustering). With a strong hands-on focus, the course equips you with the foundational skills and understanding needed to apply these methods to real-world healthcare data challenges.
2. Health Data Analytics: Machine Learning II (HDAT9510) [Link闭听
Course Summary: In this course, which builds upon a foundational understanding of machine learning, you will learn the core theory and practical application of deep learning algorithms in the area of health. Adopting a hands-on approach, the course is tailored to provide you with the essential skills and knowledge needed to effectively tackle and resolve healthcare data challenges using the deep learning algorithms covered in the course.
3. Health Data Science: Capstone (HDAT9910)聽[Link]
Course Summary: The capstone project involves completing extensive, desk-based, independent research tasks, requiring the use of the R and/or Python programming languages.聽 An entire health data science project聽has been constructed and sliced into the respective stages of the health data science pipeline, which you are required to complete.