Federated Multilingual Models for Medical Transcript Analysis

Andre Manoel, Mirian del Carmen Hipólito Garcia, Tal Baumel, Shize Su, Jialei Chen, Robert Sim, Dan Miller, Danny Karmon, Dimitrios Dimitriadis

Research output: Contribution to journalConference articlepeer-review

6 Scopus citations

Abstract

Federated Learning (FL) is a machine learning approach that allows the model trainer to access more data samples by training across multiple decentralized data sources while enforcing data access constraints. Such trained models can achieve significantly higher performance beyond what can be done when trained on a single data source. In a FL setting, none of the training data is ever transmitted to any central location; i.e. sensitive data remains local and private. These characteristics make FL perfectly suited for applications in healthcare, where a variety of compliance constraints restrict how data may be handled. Despite these apparent benefits in compliance and privacy, certain scenarios such as heterogeneity of the local data distributions pose significant challenges for FL. Such challenges are even more pronounced in the case of a multilingual setting. This paper presents a FL system for pre-training a large-scale multilingual model suitable for fine-tuning on downstream tasks such as medical entity tagging. Our work represents one of the first such production-scale systems, capable of training across multiple highly heterogeneous data providers, and achieving levels of accuracy that could not be otherwise achieved by using central training with public data only. We also show that the global model performance can be further improved by a local training step.

Original languageEnglish
Pages (from-to)147-162
Number of pages16
JournalProceedings of Machine Learning Research
Volume209
StatePublished - 1 Jan 2023
Externally publishedYes
Event2nd Conference on Health, Inference, and Learning, CHIL 2023 - Cambridge, United States
Duration: 22 Jun 202324 Jun 2023

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Federated Multilingual Models for Medical Transcript Analysis'. Together they form a unique fingerprint.

Cite this