Characterized by weakened or damaged heart musculature, heart failure results in the gradual buildup of fluid in a patientâs lungs, legs, feet, and other parts of the body. The condition is chronic and incurable, often leading to arrhythmias or sudden cardiac arrest. For many centuries, bloodletting and leeches were the treatment of choice, famously practiced by barber surgeons in Europe, during a time when physicians rarely operated on patients.Â
In the 21st century, the management of heart failure has become decidedly less medieval: Today, patients undergo a combination of healthy lifestyle changes, prescription of medications, and sometimes use pacemakers. Yet heart failure remains one of the leading causes of morbidity and mortality, placing a substantial burden on health-care systems across the globe.Â
âAbout half of the people diagnosed with heart failure will die within five years of diagnosis,â says Teya Bergamaschi, an MIT PhD student in the lab of Nina T. and Robert H. Rubin Professor Collin Stultz and the co-first author of a new paper introducing a deep learning model for predicting heart failure. âUnderstanding how a patient will fare after hospitalization is really important in allocating finite resources.â
The paper, published in Lancet eClinical Medicine by a team of researchers at MIT, Mass General Brigham, and Harvard Medical School, shares results from developing and testing PULSE-HF, which stands loosely for âPredict changes in left ventricULar Systolic function from ECGs of patients who have Heart Failure.â The project was conducted in Stultzâs lab, which is affiliated with the MIT Abdul Latif Jameel Clinic for Machine Learning in Health. Developed and retrospectively tested across three different patient cohorts from Massachusetts General Hospital, Brigham and Womenâs Hospital, and MIMIC-IV (a publicly available dataset), the deep learning model accurately predicts changes in the left ventricular ejection fraction (LVEF), which is the percentage of blood being pumped out of the left ventricle of the heart.
A healthy human heart pumps out about 50 to 70 percent of blood from the left ventricle with each beat â anything less is considered a sign of a potential problem. âThe model takes an [electrocardiogram] and outputs a prediction of whether or not there will be an ejection fraction within the next year that falls below 40 percent,â says Tiffany Yau, an MIT PhD student in Stultzâs lab who is also co-first author of the PULSE-HF paper. âThat is the most severe subgroup of heart failure.âÂ
If PULSE-HF predicts that a patientâs ejection fraction is likely to worsen within a year, the clinician can prioritize the patient for follow-up. Subsequently, lower-risk patients can reduce their number of hospital visits and the amount of time spent getting 10 electrodes adhered to their body for a 12-lead ECG. The model can also be deployed in low-resource clinical settings, including doctors offices in rural areas that donât typically have a cardiac sonographer employed to run ultrasounds on a daily basis.
âThe biggest thing that distinguishes [PULSE-HF] from other heart failure ECG methods is instead of detection, it does forecasting,â says Yau. The paper notes that to date, no other methods exist for predicting future LVEF decline among patients with heart failure.
During the testing and validation process, the researchers used a metric known as “area under the receiver operating characteristic curve” (AUROC) to measure PULSE-HFâs performance. AUROC is typically used to measure a modelâs ability to discriminate between classes on a scale from 0 to 1, with 0.5 being random and 1 being perfect. PULSE-HF achieved AUROCs ranging from 0.87 to 0.91 across all three patient cohorts.
Notably, the researchers also built a version of PULSE-HF for single-lead ECGs, meaning only one electrode needs to be placed on the body. While 12-lead ECGs are generally considered superior for being more comprehensive and accurate, the performance of the single-lead version of PULSE-HF was just as strong as the 12-lead version.
Despite the elegant simplicity behind the idea of PULSE-HF, like most clinical AI research, it belies a laborious execution. âItâs taken years [to complete this project],â Bergamaschi recalls. âItâs gone through many iterations.âÂ
One of the teamâs biggest challenges was collecting, processing, and cleaning the ECG and echocardiogram datasets. While the model aims to forecast a patientâs ejection fraction, the labels for the training data werenât always readily available. Much like a student learning from a textbook with an answer key, labeling is critical for helping machine-learning models correctly identify patterns in data.
Clean, linear text in the form of TXT files typically works best when training models. But echocardiogram files typically come in the form of PDFs, and when PDFs are converted to TXT files, the text (which gets broken up by line breaks and formatting) becomes difficult for the model to read. The unpredictable nature of real-life scenarios, like a restless patient or a loose lead, also marred the data. âThere are a lot of signal artifacts that need to be cleaned,â Bergamaschi says. âItâs kind of a never-ending rabbit hole.â
While Bergamaschi and Yau acknowledge that more complicated methods could help filter the data for better signals, there is a limit to the usefulness of these approaches. âAt what point do you stop?â Yau asks. âYou have to think about the use case â is it easiest to have this model that works on data that is slightly messy? Because it probably will be.â
The researchers anticipate that the next step for PULSE-HF will be testing the model in a prospective study on real patients, whose future ejection fraction is unknown.
Despite the challenges inherent to bringing clinical AI tools like PULSE-HF over the finish line, including the possible risk of prolonging a PhD by another year, the students feel that the years of hard work were worthwhile.Â
âI think things are rewarding partially because theyâre challenging,â Bergamaschi says. âA friend said to me, âIf you think you will find your calling after graduation, if your calling is truly calling, it will be there in the one additional year it takes you to graduate.â ⊠The way weâre measured as researchers in [the ML and health] space is different from other researchers in ML space. Everyone in this community understands the unique challenges that exist here.â
âThereâs too much suffering in the world,â says Yau, who joined Stultzâs lab after a health event made her realize the importance of machine learning in health care. âAnything that tries to ease suffering is something that I would consider a valuable use of my time.âÂ
