黑料网大事记

Mr Nic Kuo

Mr Nic Kuo

Research Fellow
Medicine & Health
Centre for Big Data Research in Health

Dr. Nicholas I-Hsien Kuo is a Research Fellow at the , 黑料网大事记 Sydney. His research focuses on developing machine learning methods to generate realistic synthetic health data, improve predictive model calibration, and support evidence-based clinical decision-making.

Nic is a founding member of , a global platform for open synthetic clinical datasets. His flagship synthetic dataset, the ART for HIV Dataset v2, is available via . More resources can be found at his .

Education

  • PhD in Computer Science, Australian National University (2017鈥2021)
  • Master of Science in Applied Mathematics (First Class Honours), University of Auckland (2016鈥2017)

Research Interests

  • Machine Learning and Artificial Intelligence Applications in Health
  • Synthetic Data Generation (, )
  • and
  • and

Invited Talks

  • Plenary Speaker 鈥 National Big Data Health Science Conference (Columbia, SC, USA, 2025)
  • Programme Chair & Tutorial Presenter 鈥 ALTA Conference (Canberra, AU, 2024)
  • Plenary Speaker & Panelist 鈥 ASCOFAME International Conference (Cartagena, Colombia, 2023)
  • Speaker 鈥 ANU AIxBIO Symposium (Canberra, AU, 2024)
  • Speaker 鈥 Masterclass on AI for Healthcare, Northern Sydney LHD (Sydney, AU, 2024)
  • Keynote Speaker 鈥 黑料网大事记 AI Symposium (Sydney, AU, 2023)
  • Poster Presentation 鈥 AI for Learning Health Systems Symposium (Puerto Rico, USA, 2023)

Profiles

Location
Level 2, AGSM Building (G27), Gate 11, Botany St, 黑料网大事记 Sydney Campus, Botany St, Kensington NSW 2052
  • Book Chapters | 2024
    Wong ZSY; Waters N; Agchbayar A; Batsaikhan B; Enkhbold T; Batzorig K; Ganzorig O; Kuo NIH; Liu J, 2024, 'Rule-Based Natural Language Processing Pipeline to Detect Medication-Related Named Entities: Insights for Transfer Learning', in , pp. 584 - 588,
  • Journal articles | 2024
    Kuo NIH; Perez-Concha O; Hanly M; Mnatzaganian E; Hao B; Di Sipio M; Yu G; Vanjara J; Valerie IC; de Oliveira Costa J; Churches T; Lujic S; Hegarty J; Jorm L; Barbieri S, 2024, 'Enriching Data Science and Health Care Education: Application and Impact of Synthetic Data Sets Through the Health Gym Project', JMIR Medical Education, 10,
    Journal articles | 2023
    Kuo NIH; Garcia F; S枚nnerborg A; B枚hm M; Kaiser R; Zazzi M; Polizzotto M; Jorm L; Barbieri S, 2023, 'Generating synthetic clinical data that capture class imbalanced distributions with generative adversarial networks: Example using antiretroviral therapy for HIV', Journal of Biomedical Informatics, 144,
    Journal articles | 2022
    Kuo NIH; Polizzotto MN; Finfer S; Garcia F; S枚nnerborg A; Zazzi M; B枚hm M; Kaiser R; Jorm L; Barbieri S, 2022, 'The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms', Scientific Data, 9,
  • Preprints | 2025
    Kuo NI-H; Gallego B; Jorm L, 2025, Attention-Based Synthetic Data Generation for Calibration-Enhanced Survival Analysis: A Case Study for Chronic Kidney Disease Using Electronic Health Records,
    Preprints | 2023
    Kuo NI-H; Jorm L; Barbieri S, 2023, Synthetic Health-related Longitudinal Data with Mixed-type Variables Generated using Diffusion Models,
    Preprints | 2023
    Kuo NI-H; Perez-Concha O; Hanly M; Mnatzaganian E; Hao B; Di Sipio M; Yu G; Vanjara J; Valerie IC; de Oliveira Costa J; Churches T; Lujic S; Hegarty J; Jorm L; Barbieri S, 2023, Enriching Data Science and Health Care Education: Application and Impact of Synthetic Data Sets Through the Health Gym Project (Preprint),
    Preprints | 2023
    Marchesi R; Micheletti N; Kuo NI-H; Barbieri S; Jurman G; Osmani V, 2023, Generative AI Mitigates Representation Bias and Improves Model Fairness Through Synthetic Health Data,
    Preprints | 2022
    Kuo NI-H; Garcia F; S枚nnerborg A; Zazzi M; B枚hm M; Kaiser R; Polizzotto M; Jorm L; Barbieri S, 2022, Generating Synthetic Clinical Data that Capture Class Imbalanced Distributions with Generative Adversarial Networks: Example using Antiretroviral Therapy for HIV,
    Preprints | 2022
    Kuo NI-H; Polizzotto MN; Finfer S; Garcia F; S枚nnerborg A; Zazzi M; B枚hm M; Jorm L; Barbieri S, 2022, The Health Gym: Synthetic Health-Related Datasets for the Development of Reinforcement Learning Algorithms,
    Conference Papers | 2021
    I-Hsien Kuo N; Harandi M; Fourrier N; Walder C; Ferraro G; Suominen H, 2021, 'Learning to Continually Learn Rapidly from Few and Noisy Data', in Proceedings of Machine Learning Research, pp. 65 - 76
    Preprints | 2021
    Kuo NI-H; Harandi M; Fourrier N; Walder C; Ferraro G; Suominen H, 2021, Learning to Continually Learn Rapidly from Few and Noisy Data,
    Preprints | 2021
    Kuo NI-H; Polizzotto M; Finfer S; Jorm L; Barbieri S, 2021, Synthetic Acute Hypotension and Sepsis Datasets Based on MIMIC-III and Published as Part of the Health Gym Project,
    Conference Papers | 2021
    Kuo NIH; Harandi M; Fourrier N; Walder C; Ferraro G; Suominen H, 2021, 'Plastic and stable gated classifiers for continual learning', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 3548 - 3553,
    Preprints | 2020
    Kuo NI-H; Harandi M; Fourrier N; Walder C; Ferraro G; Suominen H, 2020, MTL2L: A Context Aware Neural Optimiser,
    Conference Papers | 2020
    Kuo NIH; Harandi M; Fourrier N; Walder C; Ferraro G; Suominen H, 2020, 'An Input Residual Connection for Simplifying Gated Recurrent Neural Networks', in Proceedings of the International Joint Conference on Neural Networks,
    Conference Papers | 2020
    Kuo NIH; Harandi M; Fourrier N; Walder C; Ferraro G; Suominen H, 2020, 'M2SGD: Learning to learn important weights', in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 957 - 964,

Studied at the ANU under the Australian Government Research Training Program Domestic Scholarship.

Hello and Welcome

Hi there 鈥 I鈥檓 Nic! I鈥檓 generally interested in the cross-section of machine learning, mathematics, and statistics 鈥 especially when these come together to help us reason about complex systems like healthcare, language, or the brain. I enjoy asking questions that sit somewhere between theory and application: How should we represent uncertainty? Can machines generate meaningful data? What makes a model interpretable, or even trustworthy?

Designing new GAN architecture

Much of my recent work explores how synthetic data can support these kinds of questions 鈥 both technically and ethically. To me, synthetic data isn鈥檛 just about creating 鈥渇ake鈥 records 鈥 it鈥檚 about designing usable, testable, and often elegant approximations of real-world phenomena. That opens up exciting possibilities across three main areas:

  • Synthesis 鈥 Using synthetic data to make healthcare datasets more publicly accessible, while protecting privacy and confidentiality.
  • Augmentation 鈥 Generating synthetic examples to address inherent biases or imbalances in downstream models.
  • Imputation 鈥 Understanding the role of missing data handling and how different practices influence the trajectory of research and results.

Each of these areas has both practical and philosophical implications 鈥 and that鈥檚 what keeps me coming back.

On Memory, Latent Spaces, and Representation

Learning to learn

Beyond data generation, I鈥檓 also fascinated by questions of memory and latent space representations in machine learning. I often find myself drawn to how models store, compress, and recall information across time and tasks.

What does it really mean for a model to 鈥渞emember鈥? Can we think of latent spaces as blueprints of knowledge 鈥 or maps of the world? And how do we design models that not only perform well, but also help us understand the systems they鈥檙e trained on?

This is the space where theory meets intuition 鈥 and where I find some of the most inspiring challenges in modern AI.

My Teaching

Teaching & Educational Engagements

  • Guest Lecturer 鈥 黑料网大事记 Master of Science in Health Data Science
  • Guest Lecturer 鈥 ANU Document Analysis and ANU School of Cybernetics
  • Australian Competition Data Provider 鈥 (Sydney, AU, 2023)
  • International Competition Data Provider 鈥 (Columbia, SC, USA, 2025)