In its first year, SYNTHIA set out not only to encourage dialogue and collaboration across scientific and clinical communities, but also to deliver concrete results and evidence supporting progress in synthetic data in healthcare innovation. Our scientific publications span core challenges in synthetic data generation, evaluation and application in real-world biomedical contexts. From foundational explorations of privacy and utility, to innovative models, each piece of collaborative work reflects both the span and depth of SYNTHIA’s activities with cutting-edge research questions.

All SYNTHIA publications are the result of close collaboration across our consortium, involving multiple partners.


The following SYNTHIA publications focus on one of the most critical barriers to synthetic data adoption: trust. They examine how privacy risks, data utility and evaluation metrics are currently defined, measured and interpreted in healthcare settings. By systematically reviewing existing approaches and proposing unified perspectives on re-identification, inference and reconstruction risks, these works aim to clarify how synthetic data can be assessed transparently and responsibly.  


 

These SYNTHIA publications advance the technical foundations of synthetic data generation for complex biomedical data. The focus is on developing and evaluating advanced generative models capable of capturing high-dimensional, multimodal and clinically relevant data structures. These studies demonstrate how synthetic data can support demanding downstream tasks, such as image segmentation and survival prediction, while maintaining fidelity and analytical usefulness. 

 


 

This SYNTHIA publication addresses the limitations of purely correlational models in healthcare AI. The featured publication focuses on causal generative modelling as a way to mitigate bias and hidden confounding, supporting more reliable inference from complex health data. This line of research strengthens the methodological robustness of both real and synthetic data applications, particularly in decision-support contexts.  


 

These SYNTHIA publications capture our work at the intersection of advanced AI architectures, privacy-preserving computation and regulation. They examine how federated learning and distributed synthetic data generation can be aligned with existing medical device regulations, addressing challenges related to validation, accountability, traceability and trust in decentralized AI systems.  


 

These SYNTHIA publications demonstrate how synthetic data methods and advanced analytics can be applied in concrete biomedical research scenarios. Spanning oncology and hematology, the work shows how data-driven approaches can support survival analysis, genomic benchmarking and disease understanding, even in settings where data access is constrained by sensitivity, scale or privacy concerns.   


 

These SYNTHIA publications focus on the practical enablers of synthetic data generation and reuse. The publications address how datasets can be made more interoperable, discoverable and machine-readable through improved metadata, semantic harmonization and benchmarking approaches. Together, they contribute to making both real and synthetic data easier to integrate across studies, institutions and research domains. 

 


Across 2024–2025, SYNTHIA’s first and partly-year publications delivered a strong body of scientific results, connecting foundational reviews with methodological innovation, practical modelling and clinical application. They lay important groundwork for the further development and practical use of synthetic data in health research. 

Explore all SYNTHIA publications and read them in full on our Publications page.