
Mr Nic Kuo
Dr. Nicholas I-Hsien Kuo is a Research Fellow at the , 黑料网大事记 Sydney. His research focuses on developing machine learning methods to generate realistic synthetic health data, improve predictive model calibration, and support evidence-based clinical decision-making.
Nic is a founding member of , a global platform for open synthetic clinical datasets. His flagship synthetic dataset, the ART for HIV Dataset v2, is available via . More resources can be found at his .
Education
- PhD in Computer Science, Australian National University (2017鈥2021)
- Master of Science in Applied Mathematics (First Class Honours), University of Auckland (2016鈥2017)
Research Interests
- Machine Learning and Artificial Intelligence Applications in Health
- Synthetic Data Generation (, )
- and
- and
Invited Talks
- Plenary Speaker 鈥 National Big Data Health Science Conference (Columbia, SC, USA, 2025)
- Programme Chair & Tutorial Presenter 鈥 ALTA Conference (Canberra, AU, 2024)
- Plenary Speaker & Panelist 鈥 ASCOFAME International Conference (Cartagena, Colombia, 2023)
- Speaker 鈥 ANU AIxBIO Symposium (Canberra, AU, 2024)
- Speaker 鈥 Masterclass on AI for Healthcare, Northern Sydney LHD (Sydney, AU, 2024)
- Keynote Speaker 鈥 黑料网大事记 AI Symposium (Sydney, AU, 2023)
- Poster Presentation 鈥 AI for Learning Health Systems Symposium (Puerto Rico, USA, 2023)
Profiles
- 黑料网大事记 work email: n.kuo@unsw.edu.au
- Publications
- Media
- Grants
- Awards
- Research Activities
- Engagement
- Teaching and Supervision
Studied at the ANU under the Australian Government Research Training Program Domestic Scholarship.
Hello and Welcome
Hi there 鈥 I鈥檓 Nic! I鈥檓 generally interested in the cross-section of machine learning, mathematics, and statistics 鈥 especially when these come together to help us reason about complex systems like healthcare, language, or the brain. I enjoy asking questions that sit somewhere between theory and application: How should we represent uncertainty? Can machines generate meaningful data? What makes a model interpretable, or even trustworthy?
Much of my recent work explores how synthetic data can support these kinds of questions 鈥 both technically and ethically. To me, synthetic data isn鈥檛 just about creating 鈥渇ake鈥 records 鈥 it鈥檚 about designing usable, testable, and often elegant approximations of real-world phenomena. That opens up exciting possibilities across three main areas:
- Synthesis 鈥 Using synthetic data to make healthcare datasets more publicly accessible, while protecting privacy and confidentiality.
- Augmentation 鈥 Generating synthetic examples to address inherent biases or imbalances in downstream models.
- Imputation 鈥 Understanding the role of missing data handling and how different practices influence the trajectory of research and results.
Each of these areas has both practical and philosophical implications 鈥 and that鈥檚 what keeps me coming back.
On Memory, Latent Spaces, and Representation
Beyond data generation, I鈥檓 also fascinated by questions of memory and latent space representations in machine learning. I often find myself drawn to how models store, compress, and recall information across time and tasks.
What does it really mean for a model to 鈥渞emember鈥? Can we think of latent spaces as blueprints of knowledge 鈥 or maps of the world? And how do we design models that not only perform well, but also help us understand the systems they鈥檙e trained on?
This is the space where theory meets intuition 鈥 and where I find some of the most inspiring challenges in modern AI.
My Teaching
Teaching & Educational Engagements
- Guest Lecturer 鈥 黑料网大事记 Master of Science in Health Data Science
- Guest Lecturer 鈥 ANU Document Analysis and ANU School of Cybernetics
- Australian Competition Data Provider 鈥 (Sydney, AU, 2023)
- International Competition Data Provider 鈥 (Columbia, SC, USA, 2025)