New algorithm helps scientists reduce protein aggregation

Ostrava, 27 May 2026 – Researchers from the IT4Innovations National Supercomputing Center helped develop a unique tool that identifies aggregation-prone regions in proteins, opening up new possibilities for their real-world application. In collaboration with Masaryk University and the International Clinical Research Centre, the researchers developed AggreProt, a predictor based on deep neural networks. Its accuracy was validated experimentally, and the results were published in the prestigious scientific journal Communications Chemistry.

Proteins are the silent engines of modern life. They help make life-saving medicines, break down environmental pollutants, improve our food, and power industrial processes. But there is a catch: many proteins have a stubborn tendency to stick together, forming useless clumps that render them inactive. Until now, this frustrating behaviour has been holding back science.

To address this challenge, researchers from the International Clinical Research Centre and Masaryk University, in collaboration with the IT4Innovations National Supercomputing Center at VSB – Technical University of Ostrava, developed a machine-learning-based algorithm for fast and reliable detection of the sticky regions that drive protein aggregation. Identifying these regions allows researchers to design changes to them that prevent proteins from sticking together, enabling their more efficient use in real-world applications. To demonstrate the approach, scientists from the Loschmidt Laboratories dramatically improved the production quality and yield of an enzyme that degrades toxic man-made chemicals in the environment. The method is described in a recent article published in the leading scientific journal Communications Chemistry: Experimentally validated deep learning control of protein aggregations.

Using their software, the researchers also identified and experimentally confirmed errors in widely used databases that serve to train similar algorithms. “Computer algorithms are increasingly accelerating research. However, their efficiency depends on the quality of the data used for their training. Our study significantly contributes to improving the reliability of these datasets and, consequently, the accuracy of future predictive tools,” says Antonin Kunka, one of the authors leading the experimental validation of the software.

“It was a pleasure to be part of the team that developed and experimentally validated the deep neural network-based predictor AggreProt, which can help researchers identify aggregation-prone regions in proteins and design mutations that suppress protein aggregation,” says Jan Martinovic from IT4Innovations. “The study demonstrated that the approach can substantially improve protein solubility and significantly increase production yields, opening new possibilities for biotechnology, environmental applications, and medicine. The project also revealed inaccuracies in existing aggregation databases, contributing to the development of more reliable AI-driven predictive methods in protein science.”

Figure: The AggreProt tool for analysing protein aggregation, showing how certain sequences or regions of a protein relate to aggregation propensity and solvent exposure.

“The experimental validation demonstrates the high accuracy of our tool, AggreProt, in identifying the aggregation-prone regions in proteins,” adds Joan Planas-Iglesias from the Loschmidt Laboratories and St. Anne's University Hospital in Brno, and the Faculty of Medicine of Masaryk University, who led the development of the software algorithm and coordinated the collaboration between biologists and computer scientists. “AggreProt is now accessible to the wider scientific community, enabling researchers to improve the production of proteins important for biotechnology, environmental applications, and medicine.”

The partnership between computational and experimental biologists highlights the importance of cross-institutional, interdisciplinary collaboration in driving world-class research.

Availability:

Publication:

https://www.nature.com/articles/s42004-026-02007-5

https://academic.oup.com/nar/article/52/W1/W159/7683054

AggreProt web server: https://loschmidt.chemi.muni.cz/aggreprot/

This work was supported by the CLARA project, funded by the European Union’s Horizon Europe research and innovation programme under grant agreement No. 101136607. It was also co-funded by the European Union through the Center for Artificial Intelligence and Quantum Computing in System Brain Research project (CZ.02.01.01/00/23_029/0008437) under the OP JAC.

Created on: 27. 5. 2026