2021-05-26Zeitschriftenartikel
Swarm Learning for decentralized and confidential clinical machine learning
Warnat-Herresthal, Stefanie
Schultze, Hartmut
Shastry, Krishnaprasad Lingadahalli
Manamohan, Sathyanarayanan
Mukherjee, Saikat
Garg, Vishesh
Sarveswara, Ravi
Händler, Kristian
Pickkers, Peter
Aziz, N. Ahmad
Ktena, Sofia
Tran, Florian
Bitzer, Michael
Ossowski, Stephan
Casadei, Nicolas
Herr, Christian
Petersheim, Daniel
Behrends, Uta
Kern, Fabian
Fehlmann, Tobias
Schommers, Philipp
Lehmann, Clara
Augustin, Max
Rybniker, Jan
Altmüller, Janine
Mishra, Neha
Bernardes, Joana P.
Krämer, Benjamin
Bonaguro, Lorenzo
Schulte-Schrepping, Jonas
de Domenico, Elena
Siever, Christian
Kraut, Michael
Desai, Milind
Monnet, Bruno
Saridaki, Maria
Siegel, Charles Martin
Drews, Anna
Nuesch-Germano, Melanie
Theis, Heidi
Heyckendorf, Jan
Schreiber, Stefan
Kim-Hellmuth, Sarah
COVID-19 Aachen Study (COVAS)
Nattermann, Jacb
Skowasch, Dirk
Kurth, Ingo
Keller, Andreas
Bals, Robert
Nürnberg, Peter
Rieß, Olaf
Rosenstiel, Philip
Netea, Mihai G.
Theis, Fabian
Mukherjee, Sach
Backes, Michael
Aschenbrenner, Anna C.
Ulas, Thomas
Deutsche COVID-19 Omics Initiative (DeCOI)
Breteler, Monique M.B.
Giamarellos-Bouboulis, Evangelos J.
Kox, Matthijs
Becker, Matthias
Cheran, Sorin
Woodcare, Michael S.
Goh, Eng Lim
Schultze, Joachim L.
Fast and reliable detection of patients with severe and heterogeneous illnesses is a major goal of precision medicine1,2. Patients with leukaemia can be identified using machine learning on the basis of their blood transcriptomes3. However, there is an increasing divide between what is technically possible and what is allowed, because of privacy legislation4,5. Here, to facilitate the integration of any medical data from any data owner worldwide without violating privacy laws, we introduce Swarm Learning—a decentralized machine-learning approach that unites edge computing, blockchain-based peer-to-peer networking and coordination while maintaining confidentiality without the need for a central coordinator, thereby going beyond federated learning. To illustrate the feasibility of using Swarm Learning to develop disease classifiers using distributed data, we chose four use cases of heterogeneous diseases (COVID-19, tuberculosis, leukaemia and lung pathologies). With more than 16,400 blood transcriptomes derived from 127 clinical studies with non-uniform distributions of cases and controls and substantial study biases, as well as more than 95,000 chest X-ray images, we show that Swarm Learning classifiers outperform those developed at individual sites. In addition, Swarm Learning completely fulfils local confidentiality regulations by design. We believe that this approach will notably accelerate the introduction of precision medicine.
Files in this item