Articles Archives - GenoMed4All

The GenoMed4All Journey: Interview with Marilena Bicchieri (Humanitas Research Hospital)

Marilena Bicchieri is a Project Manager at Humanitas Research Hospital, where she oversees the institution’s participation in European and international projects. Her scientific background combines a degree in chemistry with a PhD in molecular oncology, and a research focus mostly dedicated towards investigating the therapeutic use of microRNAs for breast tumours.

As part of our series of interviews with GenoMed4All partners, Marilena reflects on the role she’s fulfilled during the project and her hopes for its impact on advancing precision medicine for hematology after the project ends in June 2025.

Work package leadership and interdisciplinary collaborations

In GenoMed4All, I’ve been working as part of the team responsible for use case validation. Our focus was to assess the feasibility and clinical value of the GenoMed4All platform and its AI-driven tools in real-world clinical settings. This meant piloting the platform across various institutions and seeing how it held up, not just in terms of data sharing through federated learning, but also how applicable and effective the AI models were in supporting clinicians through different phases of the patient journey, from diagnosis to prognosis and treatment decisions.

We addressed three specific use cases: sickle cell disease (SCD), myelodysplastic syndromes (MDS) and multiple myeloma (MM). Each came with very different clinical and technological challenges; for example, sickle cell disease is hereditary, whereas the other two are not. All three are rare diseases, so data scarcity was a recurring issue.

To tackle these complexities, we worked hand in hand with clinicians. We started by identifying the unmet clinical needs for each use case — such as diagnosis, risk prediction and treatment decisions — and built our methodology around those needs. A big part of our role was supporting the standardization of data collection across sites, drafting protocols, harmonizing data and defining clinical outcomes in collaboration with both clinicians and technical partners.

At Humanitas, we were especially focused on bridging the clinical and technical worlds. It was important to ensure that the AI solutions developed were actually usable and relevant in a clinical context. For instance, we put a lot of effort into creating a unified, FAIR-aligned methodology for data standardization. We also activated new collaborations with European registries like ENROL and RADeep. Ultimately, we evaluated the performance of the federated AI models through real pilot trials, always with an eye on their clinical utility.

Impact on precision medicine for hematology

What excites me most about the work we’ve done as part of GenoMed4All is that it shows how technological innovation can genuinely make an impact in clinical practice. This isn’t just a research exercise, we were able to demonstrate that AI models can be integrated into actual clinical workflows, offering tangible benefits for patients.

By using a federated learning approach and building a privacy-preserving infrastructure, we were able to bring data-driven, personalised care to the forefront while fully respecting patient data privacy. The tools we developed enable a much more tailored and effective way to support clinical decision-making, and I believe that this kind of integration is key for the future of precision medicine.

Learnings from this 4-year journey

One of the biggest takeaways for me has been the importance of a truly multidisciplinary approach. The project brought together a diverse range of stakeholders with different backgrounds, and that diversity was fundamental to its success.

Another major lesson is that technological innovation should always be guided by clinical needs, not the other way around. For AI tools to be trusted and adopted, clinicians need to be involved right from the start. At the same time, technical partners need to understand the real-world clinical challenges they’re aiming to solve.

The legacy of GenoMed4All going forward

Looking ahead, I believe GenoMed4All leaves behind a strong and lasting legacy. We’ve laid the foundation for a federated, privacy-preserving infrastructure that connects hospitals and research centres across borders. We’ve also built a tight-knit community of institutions, researchers and clinicians who are motivated to keep pushing this work forward.

Thanks to the support of ERN-EuroBloodNet and other key networks, we’re already thinking about how to expand. We want to bring more centres into this effort, and we’re putting a strategy in place to grow this community. Ultimately, GenoMed4All is not just a project — it’s the beginning of something much bigger.

by admin

The GenoMed4All project: Advancing precision medicine in blood disorders through AI

Abstract

GenoMed4All is building an open, federated data hub to explore new AI models and services for clinical support in blood disorders.

Haematological diseases comprise a large group of up to 450 disorders resulting from abnormalities of blood cells, lymphoid organs and coagulation factors, generally categorised as either oncological or non-oncological. Most have a genetic background, and they represent a significant public health challenge: haematological malignancies account for about 5% of cancers, most can cause chronic health problems, and many are life-threatening conditions. In 2016, the European Haematology Association (EHA) estimated the financial burden of blood disorders on European society to be approximately €22.5bn per year. Moreover, data scarcity remains a pressing issue: the number of available samples for blood disorders remains small and is characterised by a high level of fragmentation, which mostly stems from the sensitive nature of data.

GenoMed4All is the European response to this scenario: an EU-funded initiative to radically transform the way we approach diagnosis, prognosis and treatment in haematological diseases through the use of Artificial intelligence (AI).

by adminDARIO

The GenoMed4All Journey: Interview with Gastone Castellani (Università di Bologna)

Gastone Castellani is Full Professor of Physics and Biophysics at University of Bologna, where he combines his extensive background in physics of complex systems and biology. He is also a renowned figure in the field of Artificial Intelligence (AI) applied to medicine and healthcare, having spent 20 years at Brown University in the US during the 70s collaborating with other prominent AI researchers, including Leon Cooper and John Hopfield (both recipients of a Nobel Prize in Physics).

As part of our new series of interviews with GenoMed4All partners, Gastone reflects on the role he’s fulfilled during the project and his expectations for its impact on advancing precision medicine for hematology after the project ends in June 2025.

Work package leadership and interdisciplinary collaborations

In GenoMed4All, I’ve been leading Work Package 6, which is dedicated to developing artificial intelligence algorithms for the project. This work package has brought together a diverse and highly collaborative group, and our focus has been on designing AI models for both tabular data — like genomic information — and medical imaging.

We’ve worked across three use cases: sickle cell disease, multiple myeloma and myelodysplastic syndromes. Alongside our partners, we’ve developed a range of algorithms, and we’ve made our software openly available through a shared web repository for the benefit of GenoMed4All collaborators and the wider research community.

One of our most significant achievements has been the implementation of federated versions of these algorithms. Not all models are suitable for federated learning, but for those that are, we’ve successfully deployed and trained them within a federated network of GenoMed4All partners. This decentralized approach — where algorithms move instead of sensitive patient data — is one of the hallmarks of the project and a critical step forward in privacy-conscious AI development.

Impact on precision medicine for hematology

Precision medicine is an ambitious and evolving field, and while no single project can address all its challenges, I believe our contributions offer tangible progress.

For example, in the context of Sickle Cell Disease, which can be considered as a relatively underexplored condition, we developed a novel image analysis algorithm to detect “silent infarcts,” or white matter hyperintensities, using MRI data. These are very small lesions that are often hard to detect and easy to confuse with other brain regions. Our model, validated on over 500 MRI scans, has been well received by neuroradiologists, and we’re now submitting our results to a major journal.

In the area of Myelodysplastic syndromes, we’ve collaborated closely with other GenoMed4All partners to identify a genomic signature that can predict the risk of progression to acute myeloid leukemia, marking an important clinical milestone.

And in the case of Multiple Myeloma, we’ve combined genomic data with imaging techniques like PET and CT scans. This integration has led to more accurate predictions of overall and leukemia-free survival, and we’re currently preparing a publication on these findings.

It’s this kind of multimodal, integrative approach that I believe holds great promise for the future of personalized medicine.

The legacy of GenoMed4All going forward

One of the biggest challenges we faced was accessing data. In my experience, this is a recurring issue in many EU projects: it often takes two years or more — nearly half the project’s lifetime — to secure the data necessary for analysis.

That said, I’m hopeful that one of the lasting legacies of GenoMed4All will be the federated infrastructure we’ve established, which should significantly reduce these delays in future projects. This setup allows us to train models where the data resides, without needing to physically move sensitive information, something that aligns well with modern data governance principles.

Another important takeaway is that the approaches we’ve developed for hematological diseases can be extended to other areas of medicine, such as lung cancer, diabetes and neurodegenerative disorders. Hematology, much like neuroscience, is at the frontier of precision medicine, and the tools we’ve built here are widely applicable beyond the original scope of the project.

Looking ahead, I’m also excited about emerging fields such as synthetic data generation and digital twins. Through initiatives like the EU-funded project SYNTHEMA, we’re already exploring how to generate and validate synthetic datasets, which could provide an alternative when real patient data is hard to access; and digital twin technologies could enable us to model individual patient trajectories in-silico, not just in hematology, but across multiple disease areas. Thanks to GenoMed4All, we already have the federated software and infrastructure in place to support these next steps.

by adminDARIO

The GenoMed4All Journey: Interview with Silvia Uribe (UPM)

Silvia Uribe Mayoral is an Associate Professor at Universidad Politécnica de Madrid (UPM), where she’s been involved in various EU-funded projects focused on ICT applied to healthcare innovation, including GenoMed4All and SYNTHEMA among many others. With an academic and professional background in Telecommunications Engineering, she also holds a PhD in Technology and Communication Systems.

As part of our new series of interviews with GenoMed4All partners, Silvia reflects on the role she’s fulfilled during the project, and her expectations for its impact on fostering Federated Learning systems in clinical contexts after the project ends in June 2025.

Work package leadership and interdisciplinary collaborations

At UPM, my team and I have been leading efforts within Work Package 4, where our primary focus has been the development of the Federated Learning Platform. Our goal has been to address one of the key challenges in clinical AI: how to access and utilize sensitive clinical data for algorithm development while respecting the strict privacy requirements inherent in healthcare.

We began by engaging closely with various stakeholders, both from the clinical side and the software development community, to understand their specific needs. Based on this input, we designed and implemented a fully operational federated learning platform.

This platform enables data scientists and other users to develop AI algorithms directly on distributed datasets across different clinical nodes, without the need to centralize the data; in this architecture, the data stays where it is, and only the algorithms travel. This approach significantly enhances data privacy and compliance, especially in a clinical environment where patient confidentiality is paramount.

Impact on precision medicine for hematology

I believe our contribution is particularly important for the advancement of precision medicine in hematology.

By allowing stakeholders to access and analyze distributed clinical data without physically moving it, our platform opens up new possibilities for developing AI-driven diagnostic and decision-support tools. It respects privacy regulations while maximizing the value of data, which is crucial when working with highly sensitive and personal information.

The legacy of GenoMed4All going forward

Reflecting on my experience with GenoMed4All, one key takeaway is that despite all our advancements, there are still many unmet needs within the healthcare system. Technology, especially in the field of AI, has tremendous potential to address these gaps, but it requires a deep understanding of the context and a commitment to designing solutions that are both effective and ethically sound.

Looking ahead, I see the federated learning platform we’ve developed as a strong foundation for future work; it’s already been deployed across several nodes and has demonstrated real functionality. I hope it will serve as a springboard for new initiatives, helping us to tackle further challenges in clinical AI and continue building solutions that meet the evolving needs of healthcare.

by adminDARIO

The GenoMed4All Journey: Interview with Anna Rizzo (Datawizard)

Anna Rizzo is a Research Project Manager at Datawizard in Italy. Her current expertise in dissemination and exploitation for EU-funded projects, particularly related to e-health, draws from a combined academic and professional background in biology, data science and science journalism.

As part of our new series of interviews with GenoMed4All partners, Anna reflects on the role she’s had during the project, her contributions and main learnings, and her expectations for the project’s impact after it’s officially completed in June 2025.

Work package leadership and interdisciplinary collaborations

As a healthcare IT company with a strong focus on telemedicine applications and data standardisation, at Datawizard we’ve had the privilege of contributing to both the early and final phases of the project.

Our involvement began with data modelling, gathering requirements from clinicians and technical partners and translating those needs into functional and technical specifications for the GenoMed4All platform. One of the core challenges was selecting a coherent data model that could support the complex needs of the project, alongside harmonising and standardising diverse data sources.

We also worked closely with the clinical partners to support the characterisation of data flows across the project; this was essential for ensuring ethical and legal compliance, particularly under the GDPR, and towards the final phase we’ve also been focused on aligning our work with emerging frameworks like the EU AI Act and the technical standards being developed by bodies such as CEN-CENELEC.

Another key contribution was supporting the exploitation strategy. At Datawizard, we operate at the intersection of research and market implementation. We’re not only committed to advancing innovation at the European level but also to turning those innovations into usable products for hospitals and industry, particularly in telemedicine. In GenoMed4All, we took on the role of bridging the gap between research and the market; we’ve worked with partners to review project outcomes, identify potential use cases, and support efforts to give those results a life beyond the project — whether through further research, new funding opportunities, or early-stage market pathways.

Impact on precision medicine for hematology

One of the most common challenges with EU-funded research is that excellent work often remains confined within project boundaries. We’ve tried to counter that by helping bring GenoMed4All innovations closer to real-world application — or, as we like to say, closer to the patient’s bedside.

One major advantage has been our strong connection to ERN-EuroBloodNet, the European Reference Network for Rare Hematological Diseases. Through this collaboration, we’ve been able to ensure wider dissemination, support training programmes and, most importantly, lay the groundwork for the GenoMed4All platform’s sustainability.

Two new networks are currently being developed as part of the project’s legacy. The first, led by Vall d’Hebron Research Institute, focuses on non-oncological hematological conditions like Sickle Cell Disease; the second, led by Humanitas Research Hospital, is dedicated to hematological cancers. These networks will act as central nodes, connecting clinical centres with academic and private-sector partners who are developing AI-driven decision support tools.

The goal is to enable those partners to refine and validate their solutions using high-quality clinical data from the network. If we succeed, this will represent a major step forward in advancing precision medicine in hematology.

The legacy of GenoMed4All going forward

From the beginning, we understood that the GenoMed4All platform wasn’t meant to become a commercial product. Its value lies in its structure, its collaborative ecosystem, and the quality of its data. That’s why we’ve agreed within the consortium that the platform’s long-term stewardship should rest with ERN-EuroBloodNet and the clinical centres involved. They are best positioned to ensure continued research, support solution development, and train the next generation of tools and professionals.

This handover represents both an honour and a responsibility for the clinical centres. But with the support of the consortium and the broader ERN-EuroBloodNet community, it also marks the beginning of a federated research network in clinical AI. That, to me, is one of GenoMed4All’s most important achievements.

Main takeaways from this 4-year journey

There are always lessons to be learned from working on exploitation. One recurring challenge is getting partners, especially those from academia, to engage with the idea of exploitation. Many are focused on publishing results and advancing research, which is entirely valid, but they often have little experience or comfort with the concept of valorising those outcomes beyond academia.

This disconnect extends to relationships between academia, clinical centres, and industry. These groups often operate with very different goals and expectations. But for exploitation to succeed, we need to understand who the future users and buyers of our solutions might be — whether hospitals, companies or funding agencies.

Even when solutions are still at an early technology readiness level, we should think ahead: What does success look like beyond the lab or the pilot? Who might benefit from this innovation, and how can we bring them into the conversation early? That kind of forward-thinking is essential if we want our research to have a real impact, and I hope it’s a mindset we continue to cultivate across EU-funded projects.

by adminDARIO

The GenoMed4All Journey: Interview with Nathan Lea (i~HD)

Nathan Lea is the Information Governance Lead at The European Institute for Innovation through Health Data (i~HD). Drawing from his 25+ years of experience in Digital Health, backed by a masters in Computer Science and a PhD in Information Governance and Security, Nathan has become a widely renowned figure in regulatory affairs for clinical data and patients’ privacy rights.

As part of our new series of interviews with GenoMed4All partners, Nathan reflects on the role he’s had during the project, his contributions and main learnings, and his expectations for the project’s impact after it’s officially completed in June 2025.

Work package leadership and interdisciplinary collaborations

Over the past few years, I’ve been deeply involved in leading Work Package 2, looking after compliance, data governance and ethics.

My main focus has been to help the consortium develop a complete understanding of what we’re trying to achieve, how we’re doing it, and why it matters. That starts with understanding how data flows through the project, but it goes far beyond that. It’s about building a shared sense of purpose, ensuring that everyone, regardless of their role, understands what’s being done, with what data, and toward what end.

In multi-partner projects like this one, it’s easy for teams to get siloed. But GenoMed4All really brought together people who might never have collaborated otherwise, and we managed to create something cohesive, something greater than the sum of its parts. Even those not directly involved in data governance came to appreciate the importance of, for example, how informed consent at the beginning of a process affects tool development further down the line.

That kind of mutual understanding doesn’t come from compliance checklists or rigid directives. It comes from dialogue. From saying, “Let’s figure this out together.” That’s the approach we’ve taken throughout WP2 — not as enforcers, but as facilitators of shared responsibility and practical alignment.

Building a shared understanding to protect clinical data

I’ve been constantly impressed by the humility and collaboration within this consortium. Our partners brought extraordinary experience — deep knowledge of diseases, research protocols, patient engagement — but never once used it to dominate a conversation. Instead, they offered their insights in ways that strengthened what we were doing together. It created an environment where it felt safe to explore uncertainties and work toward clarity as a group; that’s not something I take for granted.

A lot of our work came down to experience and practicality. We didn’t start from scratch; instead, we used proven tools like data protection impact assessment (DPIA) templates that have been refined over years of working with clinical and health data. We also leaned on existing resources from the European Commission, such as their templates for joint controller agreements.

The goal was never to burden partners with extra processes, but to help legal, regulatory and data protection teams do their work more effectively — having worked in one of those teams myself, I know how overstretched they can be. Offering well-prepared templates, asking who their data protection officers are, and building relationships early on made it easier to move quickly while still doing things properly.

Ultimately, we were able to support a governance model that respected both innovation and institutional due process. We made it clear that while we would conduct a project-wide DPIA, each partner still had to do their own. We weren’t dictating from the top, we were building a toolkit for everyone to adapt and use.

Main takeaways from this 4-year journey

Personally and professionally, GenoMed4All has left a lasting impression on me. It’s deepened my appreciation for due process — not as red tape, but as the structure that enables meaningful innovation. In the rush to develop AI and data-driven tools, it’s easy to dismiss procedure as a barrier. But in truth, those processes are what make responsible, impactful work possible.

This project has also reinforced the importance of humility and collaboration. Though I’ve often said “I lead this” or “I lead that,” the truth is that leadership in GenoMed4All has always been shared. Every success we’ve had has come from people stepping up with their own expertise and contributing to something bigger. It’s a model I’ll carry with me, and one I hope to replicate in future work.

And perhaps most importantly, this project has reminded me that governance itself must evolve. We can’t apply outdated, intrusive models to fast-moving, collaborative, digital health environments; we need innovation in ethics and data protection too. GenoMed4All has shown that it’s possible, and I’m proud to have been a part of this journey.

The legacy of GenoMed4All going forward

As the GenoMed4All project draws to a close, I’ve found myself reflecting on what we’ve built together and what we’re leaving behind. Now, as we finalise our deliverables and recommendations, I hope the tools and insights we’ve developed will serve as a foundation for others navigating this space, particularly as new regulations like the AI Act begin to take shape.

We may not fully understand the impact of that legislation for another few years, once implementing acts are in place and case law starts to form, but the work we’ve done here gives people a chance to begin thinking critically about how AI can and should be used in healthcare — and just as importantly, what its limitations are.

by adminDARIO

The GenoMed4All Journey: Interview with Teresa García Lezana (CRG)

Teresa García Lezana is a Scientific Project Manager at European Genome-phenome Archive, which is part of the Centre for Genomic Regulation (CRG) based in Barcelona. Among her many duties, she provides support for the implementation of fair principles for genomic data management in European projects, especially in healthcare and medical R&I initiatives like GenoMed4All. Her background in translational medicine is supported by a degree in Biology, a PhD focused on liver diseases, and her current postdoctoral research in liver cancer genetics.

As part of our new series of interviews with GenoMed4All partners, Teresa reflects on the role she’s had during the project, her contributions and main learnings, and her expectations for the project’s impact after it’s officially completed in June 2025.

Involvement across work packages and interdisciplinary collaborations

In Genomed4All, the CRG team leads the Work Package 8, which aims to enhance the value of what has been done in the rest of the technical work packages by other project partners.

Our core function revolves around genomic standardization, and the final outcome is the development of the genomic standardization guidelines. For this, we have been interviewing and collecting information from the bioinformaticians involved in the project, to really understand. So we really need what kind of data analysis and formats they’ve been working with.

Through a process of sequencing and standardization, we have created this guide so other peers outside GenoMed4All can understand what has been done in the project, and can follow the same procedures, so eventually they could integrate their data with the data from GenoMed4All. Essentially, this activity allows for our data to be more interoperable and reusable.

Impact on precision medicine for hematology

As part of the CRG team, our main contribution has been towards achieving the fair usage of the data. By working together with other partners in our work package, we’ve created protocols for data access so it can be ethically found and used by other researchers.

Achieving the interoperability of the data – so it can be understood, reused and integrated by others – is a big advancement towards achieving a well-established system for precision medicine in hematology in Europe. Our hope is that the genomic standardization guidelines we’ve created will be a useful resource for the healthcare and medical community in general.

Main takeaways from this 4-year journey

GenoMed4All has made me more aware of how heterogeneous genomic data is, and how complex it is to use and reuse data that has been generated in a clinical context.

When working with different hospitals, different infrastructures and formats, the integration of this data is really complex and there are many challenges that we still need to address; so through this experience I’ve come to realize about the state in which we are in as a community in terms of data standardization.

Also, on a personal level, I’ve learned about the importance of multidisciplinary collaboration. Without the constant cooperation between researchers, clinicians and other key stakeholders, we wouldn’t have been able to achieve what we have done in GenoMed4All.

The legacy of GenoMed4All going forward

One of main legacies of GenoMed4All will be the data that have been created and curated, and that will be accessible to others.

I think the project is also setting some ground for the development of AI and precision medicine that will be used by other projects; all of us together, as a consortium, have created everlasting knowledge that can inspire others and pave the way for future developments in this area.

by adminDARIO

The GenoMed4All Journey: Interview with Catalina González (Dedalus)

Catalina González Martín is an International Project Manager at Dedalus. With a background in biomedical engineering, she now manages several projects focused on the design and integration of software solutions to aid the digitalization of healthcare systems worldwide.

As part of our new series of interviews with GenoMed4All partners, Catalina reflects on the role she’s had during the project, her contributions and main learnings, and her expectations for the project’s impact after it’s officially completed in June 2025.

Involvement across work packages and interdisciplinary collaborations

Dedalus has been working mostly across two work packages (WPs): as a leader for WP5 and as a collaborator on WP4.

These two were closely related and required direct collaboration with some of our partners, especially Humanitas Research Hospital and Universidad Politecnica de Madrid (UPM), and of course all the rest of partners that have been providing the data — which has been crucial for the success of both work packages.

Our main work has been related to the development, the deployment and the validation of the federated learning platform and the homogenization platform. These two platforms serve as the foundation for the implementation, the anonymization and the training of the data models and the algorithms that have been developed in the other WPs that are part of GenoMed4All.

We have also worked very extensively in the treatment of the different data that the hospitals and the other clinical partners have been providing throughout the project, to make sure it could be integrated with the platform while following careful procedures to standardize and anonymize the data.

Impact on precision medicine for hematology

Both work packages have significantly helped the advancement of the federated learning platform, whose ultimate goal is to enhance personalized medicine strategies for hematological diseases. For example, the anonymization of the data is crucial to safeguard the patient’s privacy and for the clinical partners that have provided this data to feed the algorithms.

Even though we have already seen some positive results before the end of the project, I believe the true impact of our work will likely be perceived over time, especially if the platform continues to be trained with new data, or with new models being implemented.

Main takeaways from this 4-year journey

GenoMed4All has been one of the largest and most heterogeneous projects I’ve been involved in. I have certainly learned a lot, both technically and functionally.

As a Project Manager, for me it’s all about organization and coordination. In such a large project as this one, I have learned that your greatest strength can sometimes also be your greatest weakness if you don’t manage things correctly. So I have realized that when you have many different partners from many different countries, languages, background, different professional profiles, the biggest challenge is to know how to manage all this potential together

The main highlight for me has been taking on this challenge, and I think that we have managed to overcome it and to get the best out of each collaborator.

The legacy of GenoMed4All going forward

Through the collaboration and the connection of all the partners in Europe, in GenoMed4All we have created a very powerful tool, and it’s really important to take care of this legacy. Scientifically and functionally, we’ve brought together so many different expertises, and I believe the involvement of such a diverse group of professionals has enriched the evolution of the project from the very beginning.

Also, thinking about the formation of the algorithms and the joint work we’ve done to create the platform, makes this a tool with great potential, one that can be further explored and tested even after the project has finished. So I definitely would like to see that it continues being implemented in the hospital nodes where it has already been deployed, and they continue training it with different new data models, different algorithms and taking advantage of all its full potential.

by adminDARIO

GenoMed4All's Women In Science - A chat with Marilena Bicchieri

On the occasion of International Women Day and the ongoing #WomenInScience campaign, we want to celebrate the amazing women of GenoMed4All.
And that's why we sat down with our colleague Marilena Bicchieri, GenoMed4All's Scientific Coordinator and Healthcare Project Manager at Humanitas Research Hospital. Here's what she had to say about her role in our project, together with her experience building a successful and meaningful career and navigating the highs and lows of STEM as a woman.

What is your role as a scientific coordinator: from both a personal and professional point of view?

As scientific coordinator, my role is both stimulating and challenging. From a professional perspective, I am responsible for keeping up-to-date with the progress of the project and identifying any gaps or needs of each involved partner to ensure the project moves forward smoothly and efficiently. I serve as a bridge between the technical experts and the clinicians, who often have different perspectives. Therefore, I must be able to understand both points of view and effectively communicate shared information.

From a personal perspective, I understand the importance of constantly learning and improving to excel in my role. By continually enhancing my knowledge and skills, I can provide valuable feedback, advice, and insights to my colleagues, helping to steer the project in the right direction. I am also driven by my ambition to achieve outstanding results, which motivates me to stay proactive and engaged every day.

What is your experience in GenoMed4All? (As part of a team? Your vision of the project as a whole?)

GenoMed4All is an ambitious project that requires a high level of expertise and coordination across a diverse group of partners. As a member of the team, I feel privileged to be part of such a dedicated and talented group of individuals. The consortium is composed of experts from various fields, including researchers, clinicians, technical experts, and industry partners, who all bring unique perspectives and skills to the table.

Working together as a team is essential to the success of the project, and I believe that everyone is committed to this shared goal. While it can be challenging to stay on the same track, the sense of unity and purpose within the team makes the difference. I am continually impressed by the dedication and professionalism of everyone involved in the project.

I believe that GenoMed4All has the potential to be a game-changer in the field of personalized medicine, thanks to the exploitation of -omics data, and will provide one of the first federated platform implementation in the healthcare sector. By leveraging the latest advances in AI technology and applying them to the study of haematological diseases, we can gain new insights on diagnosis and prognosis while developing more effective treatments with the ultimate goal to improve patient outcomes and quality of life. This is incredibly motivating for everyone involved and I am excited to be part of this initiative and look forward to seeing the impact that it will have on the field of haematology and technology.

Could you share some of the main challenges (and highlights!) you have experienced in your career?

As women in STEM, I feel incredibly fortunate to have had a positive, supportive and respectful environment throughout my career. However, it's important to consider that being a woman in a male-oriented society can come with additional challenges, such as breaking stereotypes and biases.

I've sometimes had to work hard to prove myself which, as positive effect, has helped me to grow stronger and more resilient. Indeed, it is important to focus on the rewards that come with overcoming these challenges rather than feeling defeated. Many steps forward have been made in our society and what I have experienced is the result of many years of battles for woman emancipation. I still believe that pursuing a career in STEM can be demanding and require a great deal of responsibility, but with the right approach and a commitment, it can also be incredibly rewarding.

To succeed in STEM, it is essential to be motivated. Motivation will lead you to be open to learning, taking on new challenges, and importantly being willing to take risks. Self-improvement is crucial, as it allows you to grow and develop your skills and expertise. Lastly, setting clear goals and working towards them is necessary to stay focused and driven.

What would be your inspiring words to encourage girls and women to pursue the STEM path?

One of the important advice I would give to girls and women committed to a STEM career is to be brave. Don't be afraid to take on new challenges, even if they seem daunting at first and never be afraid to drive important decisions. Surround yourself with supportive colleagues and mentors, who can help guide you along the way. And remember that the work you do in STEM can have a significant impact on the world around you: by pursuing a career in STEM, you have the opportunity to make a real difference and drive important advancements in science and technology.

I encourage all girls and women interested in STEM to follow their passions and believe in themselves. By doing so, we can keep up to break down the barriers that still exist in the field and pave the way for a more diverse and inclusive STEM community, especially in the most impactful apical roles, where the presence of women is significantly statistically underrepresented.

The world needs more female scientists, engineers, and innovators at the top of the society and I am confident that the next generation of girls will continue to make important contributions to the field.

Interview courtesy of Marilena Bicchieri, PhD - Healthcare Project Manager at Humanitas Research Hospital

by genomEditor4All

A conversation on Federated Learning - Part 2

Welcome to Part 2 in our miniseries on Federated Learning!

You can find all the details of our conversation so far in Part 1. In the first installment, we traveled through the continuum of machine data, learned about the different flavors of Federated Learning and pondered on its added value in healthcare. Ultimately, we were left with a question: what is holding back the adoption of Federated Learning applications?

Now it’s the time to address the still untapped potential of Federated Learning.

How to build a Federated Learning framework?

The challenges ahead

At first glance, we might be quick to assume that a cross-silo approach for Federated Learning seems to be the easiest path to take implementation-wise: after all, we are dealing with a limited number of well-known, addressable edge systems, which are more powerful and reliable overall. However, this apparent ‘simplicity’ conceals another wide spectrum of issues to account for, either from a business, data integration, security or platform perspective. Let’s dive right in.

Business challenges

In a Federated Learning network, there is a risk that edge nodes may behave ‘selfishly’ in order to compromise between model accuracy and cost [1]. This delicate balance of risk-reward is intimately tied to the governance of the network itself and has many implications on what is commonly known as ‘health justice’. In the context of GenoMed4All, this theme crystalizes into how we define and enforce ‘equity’ among nodes in the network and our capability to properly adjust for discrepancies in overall performance and model accuracy when onboarding a ‘dissonant’ node. Anticipating these ‘dissonances’ in an FL network is key, since participants may not be evenly matched in terms of the resources –human and material alike– they are able to commit to this joint enterprise. On this point, the research community has already dedicated quite a lot of effort to find out how we can maximize benefit for each node with limited engagement: the answer seems to lie in the way we estimate both the motivation and contribution of our network nodes.

On the topic of motivation, we may ask ourselves: how do I reward participation for each edge node in a way that ensures that the central server can maintain optimal quality? Putting in place incentive mechanisms that work for all participants involved is key, especially in such a highly heterogenous environment. These incentives or rewards may take multiple forms, like accessing specific central services, benefitting from models without contributing to their training, the opportunity to launch a new training plan… you name it. For the estimation of a node’s contribution, however, a reward can only be fixed if the ‘value’ each node brings to the network can be adequately quantified, and this is not a straightforward exercise: it has to consider both dataset size and quality, and the computation needs to then be correlated to the accuracy of the final model and updated with each training iteration [2].

Data integration challenges

When considering data usage, we must be mindful of how to onboard organizations operating across multiple geographic, political and regulatory scenarios –especially those dictating data protection regulations– to this FL network. The first barrier we must be aware of in terms of data integration is the minimum anonymized dataset that needs to be shared for the initial FL model to be correctly tested, developed and bootstrapped. Another significant roadblock are the access policies that govern dataset extraction at the edge and define what is and is not allowed in terms of data science operations on metadata and model alike. As of today, there is a marked interest on how to strike the right integration between authorization policy language to encode these access policies and the technology required to enforce them.

Additionally, there is the ever-present matter of data quality, which the distributed nature of Federated Learning only aggravates [3]. From the qualifying and onboarding phases to integration to monitoring, it permeates the whole FL lifecycle. Well before onboarding new edge nodes –another hospital, for example– to our network, we should have in place a clear, auditable set of qualifying criteria (e.g. incentive model, hosting capabilities, training resources, available datasets…) that potential candidates are expected to meet in order to officially become nodes. This pre-selection step, though critical to the whole network performance, does not usually get the recognition it deserves, due to either monetary or time constraints.

Immediately after, in the onboarding per se, data quality must be assessed again. It also comes into play when using and integrating a Common Data Model (CDM), since training algorithms with datasets from heterogeneous sources –like Electronic Health Records (EHRs)– has a negative impact in network maintenance and scalability, which can only be mitigated by enforcing a single CDM for the central and edge nodes [4]. For GenoMed4All, our CDM pick is FHIR (Fast Health Interoperable Resources), a standard that defines how healthcare data may be exchanged between nodes regardless of how it is actually stored in those nodes. Compared to other standards alternatives, FHIR shows large (and growing) adoption rates among care providers and has sufficient support for genomic data representation, two key and decisive arguments in the context of GenoMed4All. However, the future healthcare industry seems to be slowing but surely edging towards more fluid scenarios that favor the co-existence of a wide variety of CDM standards – for instance, the emerging OpenEHR standard. This trend would be especially relevant in federated ecosystems like GenoMed4All’s, since they intend to amalgamate an ever-growing, wildly heterogenous landscape of hospitals under a unique distributed umbrella.

Monitoring data quality during training is also tricky: every time datasets are added to the network, there is a need to evaluate whether they are really up-to-standard, mainly to avoid entering a new training loop that ultimately pushes back an updated, poorer quality model to the central server.

On top of these sizeable pile of issues to consider lies the unescapable fact that we are operating in the healthcare realm, where challenges in data integration are always multi-faceted. New social determinants –linked to decision support, care pathways, medication…– and unconventional sources of information –social media, the Internet of Things– have started to permeate the way we look at and make sense of healthcare processes … and in turn, this heightened understanding has exposed a pressing need to outline and regulate data subject rights. As a result, the concept of ‘digital sovereignty’ has been coined to protect the individual’s right for autonomy in a predominantly digital world, and the EU has embraced this notion as the cornerstone of its strategy to usher in a new area of European digital leadership centered around ensuring citizens retain control over their personal data.

Security challenges

In a cross-silo FL scenario, one of the most pressing issues currently under the spotlight is linked to data and client system security, or how to prevent information leaks during the multiple update iterations. Even if no data is exchanged between edge nodes and the central server, the model may still contain some patient-sensitive information in its parameters. The server is normally the one exploiting this vulnerability, since it centralizes client updates and has more control on the FL process as a whole. Solutions to this problem rely on Secure Multiparty Computation (SMC) to aggregate updates or Differential Privacy (DP) to distort client updates locally. Additionally, we might need to also protect the central server against potential malicious client attacks from the edge nodes: those aiming to compromise the convergence of the global model by either disrupting the training process or providing false updates [5]. For GenoMed4All this is not as relevant an issue since all partners participating as clients are considered trusted nodes in the network.

Platform challenges

The current landscape of Federated Learning platforms –and their features– paints a picture of highly heterogenous, research-specific and not yet mature alternatives (e.g. Flower, Fedbiomed, Fate, TensorFlow Federated, PySyft, Paddle) that are emerging as an unequivocal sign of all the excitement and interest surrounding FL. Key platform capabilities like configurability, robustness, scalability, performance, user experience... are non-negotiable for GenoMed4All’s ambition. After all, we are working on a production environment that intends to serve an ever-growing community with an increasing number of algorithms and use cases.

But how to make this vision a reality? The problem is, modern workspace software environments are sorely missing a model federation dimension. Nowadays, AI platforms are mature enough to handle everything from data exploration, testing, pre-processing and transformation and feature engineering to model validation and deployment… and yet, they still have not figured out how to support a model federation approach. As a result, we are missing out on several fronts: first, on metadata exploration tools for data scientists to build their models and features on; and second, on workspaces with adequate debug, development and testing capabilities to handle models with longer lifecycles and incremental contributions from edge nodes [6]. The inevitable conclusion? Team productivity and efficiency are greatly impacted.

Another contender for top platform challenge in FL is data extraction. Data scientists follow complex workflows for model development and data extraction plays a major role in the selection, (cohort) transformation and feature extraction steps. These operations must be first formalized by the platform so they can then be automatically reproduced on the edge nodes. For data scientists, a platform that can provide easy-to-use tools to step away from manual configuration before jumping to model deployment is certainly a bonus. That is why we are taking care to integrate a flexible ETL (Extract, Transform, Load) tool –containing data cohort definition linked to the target model– to configure data extraction and transformation steps from the CDM to the algorithms in GenoMed4All’s platform.

All these challenges are represented in the scorecard below, described in the context of GenoMed4All and ranked in order of priority (i.e. we have marked with 3 stars those that we consider to be core challenges in the project).

The GenoMed4All project or why Federated Learning will serve rare disease research

At GenoMed4All, we are building a Federated Learning platform where clinicians and researchers can work together in the definition, development, testing and validation of AI models to improve the way we currently diagnose and treat hematological diseases in the EU. We envision two complementary operational modes for this platform: a clinical mode, catering to the needs of healthcare professionals and patients in their daily practice; and a research mode, where data scientists can train and benchmark AI models from available data on hematological diseases.

For clinicians, GenoMed4All’s platform will act as a local decision support system to input new prospective and retrospective patient data, extracting insights from an ever-learning model. For researchers, GenoMed4All offers an AI sandbox to benchmark and train new AI models on real-world data and to ensure their clinical usability, a critical point that has so far hampered the real-world integration of AI applications in healthcare.

We believe that a radical shift in how we introduce these kind of tools to a clinical setting is sorely needed to ensure their accountability, transparency and usefulness among healthcare professionals. Drawing a parallel to how we rely on solid pharmaco-vigilance processes to monitor adverse reactions and ultimately confirm a certain drug is safe for use, we can certainly envision a similar clinical validation flow for these tools that successfully undergoes the same level of scrutiny and meets the required standards for performance excellence in a clinical setting.

All in all, we have seen that Federated Learning is indeed an emerging technology that is still finding its footing within the research field. The cross-silo approach we have followed does provide a number of unquestionably attractive capabilities for AI applications in the clinical research space, namely those in the data privacy domain. However, several challenges lurk in the horizon… and must be addressed before this approach can finally become mainstream practice in the healthcare industry, so that Federated Learning can effectively deliver on all the promises we have navigated through in this miniseries.

In this research space, GenoMed4All plays a pioneer role as it explores the large spectrum of issues raised by Federated Learning in healthcare: form platform technology selection and development all the way to defining the full data flow and Common Data Model, security, privacy and an end-to-end operational model. This close collaboration environment, spearheaded by multiple care providers in Europe, leading edge research institutions and recognized industrial partners (meet our stellar team here!) is our core strength to pave the way forward and deliver on new innovation opportunities.

If you enjoyed this miniseries on Federated Learning, stay tuned for future Knowledge Pills!

Missed anything? Check out these references!

[1] Zhang, Ning, Qian Ma, and Xu Chen. "Enabling Long-Term Cooperation in Cross-Silo Federated Learning: A Repeated Game Perspective". IEEE Transactions on Mobile Computing (2022)
[2] Tu, Xuezhen, et al. "Incentive Mechanisms for Federated Learning: From Economic and Game Theoretic Perspective". arXiv preprint arXiv:2111.11850 (2021)
[3] Rieke, Nicola, et al. "The future of digital health with federated learning." NPJ digital medicine1 (2020): 1-7
[4] Pfitzner, Bjarne, Nico Steckhan, and Bert Arnrich. "Federated learning in a medical context: A systematic literature review". ACM Transactions on Internet Technology (TOIT) 21.2 (2021): 1-31
[5] Zhang, Kaiyue, et al. "Challenges and future directions of secure federated learning: a survey". Frontiers of computer science5 (2022): 1-8.
[6] Ungersböck, Michael; Hiessl, Thomas; Schall, Daniel; Michahelles, Florian (2022): Explainable Federated Learning: A Lifecycle Dashboard for Industrial Settings. TechRxiv. Preprint. https://doi.org/10.36227/techrxiv.19410398.v1

This knowledge pill was created by Vincent Planat (DEDALUS), Francesco Cremonesi (DATAWIZARD) and Diana López (AUSTRALO) from the GenoMed4All consortium

Photo by Milad Fakurian on Unsplash

by genomEditor4All