Artigo 1

This article introduces a novel method for predicting protein conformational distributions using subsampled AlphaFold 2, which enhances its ability to predict relative populations of different protein conformations. The approach demonstrated over 80% accuracy in predicting changes in conformational states for proteins like Abl1 kinase and granulocyte-macrophage colony-stimulating factor, even with limited sequence data. This method offers a rapid and cost-effective tool for applications in pharmacology and evolutionary predictions, potentially transforming drug discovery processes.

Uploaded by

tacianyss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views13 pages

Artigo 1

Uploaded by

tacianyss

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Article https://doi.org/10.

1038/s41467-024-46715-9

High-throughput prediction of protein

conformational distributions with
subsampled AlphaFold2
Received: 3 August 2023 Gabriel Monteiro da Silva 1, Jennifer Y. Cui 1
, David C. Dalgarno 2
,
George P. Lisi 1,3 & Brenda M. Rubenstein 1,3
Accepted: 28 February 2024

This paper presents an innovative approach for predicting the relative popu-
Check for updates lations of protein conformations using AlphaFold 2, an AI-powered method
1234567890():,;
1234567890():,;

that has revolutionized biology by enabling the accurate prediction of protein

structures. While AlphaFold 2 has shown exceptional accuracy and speed, it is
designed to predict proteins’ ground state conformations and is limited in its
ability to predict conformational landscapes. Here, we demonstrate how
AlphaFold 2 can directly predict the relative populations of different protein
conformations by subsampling multiple sequence alignments. We tested our
method against nuclear magnetic resonance experiments on two proteins with
drastically different amounts of available sequence data, Abl1 kinase and the
granulocyte-macrophage colony-stimulating factor, and predicted changes in
their relative state populations with more than 80% accuracy. Our subsampling
approach worked best when used to qualitatively predict the effects of
mutations or evolution on the conformational landscape and well-populated
states of proteins. It thus offers a fast and cost-effective way to predict the
relative populations of protein conformations at even single-point mutation
resolution, making it a useful tool for pharmacology, analysis of experimental
results, and predicting evolution.

Proteins are essential biomolecules that carry out a wide range of The recent development of machine learning algorithms has
functions in living organisms. Understanding their three-dimensional signiﬁcantly improved the speed of protein structure prediction9,10.
structures is critical for elucidating their functions and designing drugs One of the most remarkable achievements in this area is the AlphaFold
that target them1. Historically, experimental techniques such as X-ray 2 (AF2) engine developed by DeepMind, which uses a deep neural
crystallography, nuclear magnetic resonance (NMR) spectroscopy, network to predict ground state protein structures from amino acid
and electron microscopy have been used to determine protein sequences11,12. AlphaFold 2 was trained using large amounts of
structures2–4. However, these methods can be time-consuming, tech- experimental data and incorporates co-evolutionary information from
nically challenging, and expensive, and may not work for all proteins5. massive metagenomic databases11. Its accuracy has revolutionized the
To meet this challenge, ab initio structure prediction methods, which ﬁeld of protein structure prediction11,13,14, opening up new possibilities
use computational algorithms to predict protein structures from their for drug discovery and basic research with clear consequences for
amino acid sequences, have been developed6. For many years, ab initio human health15,16.
structure prediction methods have relied on physics-based algorithms However, a series of studies have found that the default AF2
to predict stable protein structures7. Although successful, these algorithm is limited in its capacity to predict alternative protein con-
methods are challenged by larger and more complex proteins8. formations and the effects of sequence variants17,18. Although AF2’s

1
Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA. 2Dalgarno Scientiﬁc LLC, Brookline, MA, USA. 3Brown
University Department of Chemistry, Providence, RI, USA. e-mail: [email protected]