Machine Learning Models Rank Predictive Risks for Alzheimer's Disease

Once adults reach age 65, the threshold age for the onset of Alzheimer's disease, the extent of their genetic risk may outweigh age as a predictor of whether they will develop the fatal brain disorder, a new study suggests.

The study, published recently in the journal Scientific Reports, is the first to construct machine learning models with genetic risk scores, non-genetic information and electronic health record data from nearly half a million individuals to rank risk factors in order of how strong their association is with eventual development of Alzheimer's disease.

Researchers used the models to rank predictive risk factors for two populations from the UK Biobank: White individuals aged 40 and older, and a subset of those adults who were 65 or older.

Results showed that age - which constitutes one-third of total risk by age 85, according to the Alzheimer's Association - was the biggest risk factor for Alzheimer’s in the entire population, but for the older adults, genetic risk as determined by a polygenic risk score was more predictive.

"We all know Alzheimer’s disease is a later-onset disease, so we know age is an important risk factor. But when we consider risk only for people age 65 or older, then genetic information captured by a polygenic risk score ranks higher than age," said lead study author Xiaoyi Raymond Gao, associate professor of ophthalmology and visual sciences and of biomedical informatics in The Ohio State University College of Medicine. "That means it's really important to consider genetic information when we work on Alzheimer's disease."

A low household income also emerged as an important risk factor, ranking either third or fourth after the effects of age and genetics.

"The finding related to income is very, very interesting," said Gao, also a member of Ohio State’s Division of Human Genetics faculty, whose lab uses biomedical big data and artificial intelligence to study the genetics behind Alzheimer’s and ocular diseases. "We all want to have a healthy life, and income can be such an important factor to decide what you can afford to eat, where you can afford to live, education level, access to care - and all of these possibly contribute to Alzheimer's disease."

Of the 457,936 UK Biobank participants in the sample, 2,177 individuals had developed Alzheimer's disease and 455,759 had not, and 88,309 were 65 or older.

A few non-genetic risk factors that differed between people with and without Alzheimer's disease (AD) stood out: Results showed that in people with AD, higher systolic and lower diastolic blood pressure were more common, diabetes was more prevalent, household income and education were lower, and recent falls, hearing difficulty and a mother’s history of having AD were higher.

The top-20 list of risk factors for the full sample of adults also included diagnoses of high blood pressure, urinary tract infection, depressive episodes, fainting, unspecified chest pain, disorientation and abnormal weight loss. Other risk factors in the top 20 for people 65 and older included high cholesterol and gait abnormalities. These findings showed the power of adding condition codes from electronic health records to the models.

"Machine learning can explore relationships among all of those features, or variables, pick the important features and rank certain features at the top that contribute much more to Alzheimer's disease risk than the rest of the features," Gao said. "Typically, it's not good to be highly obese, but we also see here that a lower body mass index is not good. High blood pressure is typically not good, but here we see lower diastolic blood pressure is not good. The models revealed some interesting patterns."

Building the models was a two-step process. The team first conducted genome-wide association studies using data from the Alzheimer’s Disease Genetics Consortium to identify genetic variants linked to overall risk of developing Alzheimer's disease and to development of the disease after a specific age. The separate collections of variants were used to establish two polygenic risk scores, which aggregate genetic effects across the genome into a single measure of risk for each individual.

Those scores were applied to DNA data from the UK Biobank participants and combined with biobank information on conventional risk factors such as sex, education, body mass index and blood pressure, and more than 11,000 electronic health record condition codes that had been cited in individuals' records.

The team also used an algorithm in interpreting the model's output to ensure risk factor variables were weighted objectively in the analysis.

We are born with our genetic risk for disease already established, but information about how other health and socioeconomic factors affect our risk for Alzheimer's - as well as glaucoma, which Gao also studies - gives us power to take preventive measures, he said.

"If people know more about risk factors, they can possibly adjust their lifestyle. For both Alzheimer's and glaucoma, there is no cure, so prevention can help a lot," Gao said. 2I also hope constructing models to make these predictions could help with drug development and effective and low-cost screening programs."

Gao XR, Chiariglione M, Qin K, Nuytemans K, Scharre DW, Li YJ, Martin ER.
Explainable machine learning aggregates polygenic risk scores and electronic health records for Alzheimer's disease prediction.
Sci Rep. 2023 Jan 9;13(1):450. doi: 10.1038/s41598-023-27551-1

Most Popular Now

SPARK TSL Acquires Sentean Group

SPARK TSL is acquiring Sentean Group, a Dutch company with a complementary background in hospital entertainment and communication, and bringing its Fusion Bedside platform for clinical and patient apps to...

GPT-4 Matches Radiologists in Detecting …

Large language model GPT-4 matched the performance of radiologists in detecting errors in radiology reports, according to research published in Radiology, a journal of the Radiological Society of North America...

ChatGPT Extracts Data for Ischaemic Stro…

In an ischaemic stroke, an artery in the brain is blocked by blood clots and the brain cells can no longer be supplied with blood as a result. Doctors must...

Herefordshire and Worcestershire Health …

Herefordshire and Worcestershire Health and Care NHS Trust has successfully implemented Alcidion's Miya Precision platform to streamline bed management workflow across seven community hospitals in Worcestershire. The trust delivers community...

A Shortcut for Drug Discovery

For most human proteins, there are no small molecules known to bind them chemically (so called "ligands"). Ligands frequently represent important starting points for drug development but this knowledge gap...

New Horizon Europe Funding Boosts Europe…

The European Commission has announced the launch of new Horizon Europe calls, with a substantial funding pool of over €112 million. These calls are aimed primarily at pioneering projects in...

Cleveland Clinic Study Finds AI can Deve…

Cleveland Clinic researchers developed an artficial intelligence (AI) model that can determine the best combination and timeline to use when prescribing drugs to treat a bacterial infection, based solely on...

New AI-Technology Estimates Brain Age Us…

As people age, their brains do, too. But if a brain ages prematurely, there is potential for age-related diseases such as mild-cognitive impairment, dementia, or Parkinson's disease. If "brain age...

Radboud University Medical Center and Ph…

Royal Philips (NYSE: PHG, AEX: PHIA), a global leader in health technology, and Radboud University Medical Center have signed a hospital-wide, long-term strategic partnership that delivers the latest patient monitoring...

With Huge Patient Dataset, AI Accurately…

Scientists have designed a new artificial intelligence (AI) model that emulates randomized clinical trials at determining the treatment options most effective at preventing stroke in people with heart disease. The model...

GPT-4, Google Gemini Fall Short in Breas…

Use of publicly available large language models (LLMs) resulted in changes in breast imaging reports classification that could have a negative effect on patient management, according to a new international...

ChatGPT fails at heart risk assessment

Despite ChatGPT's reported ability to pass medical exams, new research indicates it would be unwise to rely on it for some health assessments, such as whether a patient with chest...