Insilico Medicine Scientists Propose Stricter Standards for Evaluating Generative AI-Produced Molecules

Insilico MedicineA new microperspective in the ACS journal Medicinal Chemistry Letters evaluates recent research on artificial intelligence (AI)-generated molecular structures from the point of view of the medicinal chemist and recommends guidelines for assessing the novelty and validity of these compounds. The perspective, published as part of the journal's virtual special issue "New Enabling Drug Discovery Technologies - Recent Progress," provides an analysis of eight molecular structures produced from generative chemistry published in the past two years to reveal the impact of AI and machine learning (ML) methods on modern-day drug discovery. In total, the authors found 55 recent publications covering generative chemistry efforts.

Designing synthetically feasible molecular structures that are novel and experimentally valid in the context of the disease is a challenge for generative chemistry algorithms. "We hoped to provide an in-depth analysis of the strengths of certain AI and ML generative chemistry approaches to produce truly novel and synthetically feasible molecular structures," says Alex Aliper, Ph.D., President of Insilico Medicine, who co-authored the study.

Rather than simply focusing on AI-generated structures, the authors examine the validity of these structures from the medicinal chemist's perspective - including synthesis and biological assessment.

Ultimately, say the Insilico scientists, as terms like "generative AI" and "generative chemistry" become more widespread, it’s essential to define relevant terms better and demonstrate the validity of generated structures across various measures. Their recommendations include:

  • Thoroughly inspecting generated structures in regards to their novelty and patentability.
  • Using rationally balanced preprocessing rules and medicinal chemistry filters adapted for generative pipelines.
  • Avoiding misleading statements, especially “novel drug candidate” and “novel lead compounds,” which must be supported with exhaustive biological data. In many cases, “primarily hit compound” is the only term that can be reasonably applied for active compounds of generative origin.
  • Employing severe similarity metrics.
  • Providing medicinal chemists with all generated structures besides those presented by authors as the most promising ones.
  • Evaluating active molecules of AI origin at least using standard MTS or MTT assays to avoid nonspecific action and cytotoxicity.
  • Assessing synthetic accessibility.
  • Improving the generative engine, with more attention to the training set, the test set, and similarity metrics.
  • Paying more attention to reinforcement learning with advanced systems and processes intended to rapidly evaluate the generated molecules for desired properties.

"We are encouraged by the increasing use of generative AI in chemistry which can help speed and expand drug discovery efforts," says Alex Zhavoronkov, PhD, founder and CEO of Insilico Medicine and co-author of the paper. "But we believe that publications in generative chemistry should always include experimental validation and rigorous evaluation and review by medicinal chemists. We think the process can be further improved by introducing new techniques to generate and evaluate the novel molecular structures from a medicinal chemistry perspective to produce the next generation of novel AI-generated drugs."

About Insilico Medicine

Insilico Medicine, a clinical-stage end-to-end artificial intelligence (AI)-driven drug discovery company, is connecting biology, chemistry, and clinical trials analysis using next-generation AI systems. The company has developed AI platforms that utilize deep generative models, reinforcement learning, transformers, and other modern machine learning techniques to discover novel targets and to design novel molecular structures with desired properties. Insilico Medicine is delivering breakthrough solutions to discover and develop innovative drugs for cancer, fibrosis, immunity, central nervous system (CNS) diseases and aging-related diseases.

Ivanenkov Y, Zagribelnyy B, Malyshev A, Evteev S, Terentiev V, Kamya P, Bezrukov D, Aliper A, Ren F, Zhavoronkov A.
The Hitchhiker's Guide to Deep Learning Driven Generative Chemistry.
ACS Med Chem Lett. 2023 Jun 30;14(7):901-915. doi: 10.1021/acsmedchemlett.3c00041

Most Popular Now

ChatGPT can Produce Medical Record Notes…

The AI model ChatGPT can write administrative medical notes up to ten times faster than doctors without compromising quality. This is according to a new study conducted by researchers at...

Can Language Models Read the Genome? Thi…

The same class of artificial intelligence that made headlines coding software and passing the bar exam has learned to read a different kind of text - the genetic code. That code...

Study Shows Human Medical Professionals …

When looking for medical information, people can use web search engines or large language models (LLMs) like ChatGPT-4 or Google Bard. However, these artificial intelligence (AI) tools have their limitations...

Bayer and Google Cloud to Accelerate Dev…

Bayer and Google Cloud announced a collaboration on the development of artificial intelligence (AI) solutions to support radiologists and ultimately better serve patients. As part of the collaboration, Bayer will...

Advancing Drug Discovery with AI: Introd…

A transformative study published in Health Data Science, a Science Partner Journal, introduces a groundbreaking end-to-end deep learning framework, known as Knowledge-Empowered Drug Discovery (KEDD), aimed at revolutionizing the field...

Shared Digital NHS Prescribing Record co…

Implementing a single shared digital prescribing record across the NHS in England could avoid nearly 1 million drug errors every year, stopping up to 16,000 fewer patients from being harmed...

Ask Chat GPT about Your Radiation Oncolo…

Cancer patients about to undergo radiation oncology treatment have lots of questions. Could ChatGPT be the best way to get answers? A new Northwestern Medicine study tested a specially designed ChatGPT...

North West Anglia Works with Clinisys to…

North West Anglia NHS Foundation Trust has replaced two, legacy laboratory information systems with a single instance of Clinisys WinPath. The trust, which serves a catchment of 800,000 patients in North...

Can AI Techniques Help Clinicians Assess…

Investigators have applied artificial intelligence (AI) techniques to gait analyses and medical records data to provide insights about individuals with leg fractures and aspects of their recovery. The study, published in...

AI Makes Retinal Imaging 100 Times Faste…

Researchers at the National Institutes of Health applied artificial intelligence (AI) to a technique that produces high-resolution images of cells in the eye. They report that with AI, imaging is...

GPT-4 Matches Radiologists in Detecting …

Large language model GPT-4 matched the performance of radiologists in detecting errors in radiology reports, according to research published in Radiology, a journal of the Radiological Society of North America...

SPARK TSL Acquires Sentean Group

SPARK TSL is acquiring Sentean Group, a Dutch company with a complementary background in hospital entertainment and communication, and bringing its Fusion Bedside platform for clinical and patient apps to...