Exscalate4Cov: Difference between revisions
Gianaccordi (talk | contribs) m Gianaccordi moved page User:Gianaccordi/sandbox to User:Exscalate4Cov: Publication of the page |
Gianaccordi (talk | contribs) m Gianaccordi moved page User:Exscalate4Cov to Exscalate4Cov: Publication of the page |
(No difference)
|
Revision as of 13:17, 11 July 2024
Exscalate4Cov (E4C) | |
---|---|
Country | European Union |
Launched | 1st April 2020[1] |
Closed | 30th September 2021[1] |
Funding | 2 970 875 €[1] |
Status | Project Closed |
Website | https://www.exscalate4cov.eu |
Exscalate4Cov was a public-private consortium supported by the Horizon Europe program from the European Union, aimed at leveraging high-performance computing (HPC) as a response to the coronavirus pandemic. The project utilized high-throughput, extreme-scale, computer-aided drug design software to conduct experiments[2].
The Exsclate4Cov project, which stands for EXaSCale smArt pLatform Against paThogEns for Corona Virus[1], was coordinated by Dompé Farmaceutici and involved 17 participants[1]. It was part of the Horizon 2020 SOCIETAL CHALLENGES - Health, demographic change and well-being founding[3] funding.
The project executed of one of the largest virtual screening experiments[4], identifying a potentially effective molecule against SARS-CoV-2[5].
Context
Background
Drug discovery can be a long and costly process, often taking years and requiring substantial financial investment[1]. Pharmaceutical companies have large datasets of chemical compounds, which they test against a drug target, often a protein receptor. The goal is to find compounds that interact with the targets, leading to potential therapeutic effects[6].
Therefore, the process of finding new drugs usually involves high-throughput screening(HTS). HTS enables the rapid identification of active compounds[7]. For example, virtual screening can be used as an early stage of the drug discovery pipeline to evaluate the interactions between large datasets of small molecules and a drug target, identifying potential hit candidates. This approach helps in identifying potential hit candidates by predicting how different compounds will bind to the target protein, which will go further in the experimental validation[7].
In an urgent computing scenario, such as a pandemic, where time to solution is critical, virtual screening is used to identify hit molecules for the latter stages of the drug discovery pipeline, such as lead optimization and clinical trial[8]. The Exscalate4Cov project was initiated after the COVID-19 pandemic outbreak. The aim of this project was to leverage the computational power of EU supercomputers to accelerate the discovery of effective treatments for the coronavirus[9]. By utilizing high-throughput virtual screening, Exscalate4Cov aimed to find faster solutions to the crisis.
Scope
Exscalate4Cov's approach involved screening billions of compounds against various protein targets of the SARS-CoV-2 virus, identifying those with a higher binding affinity with the target. The project's objectives were:
- Identify potential drug candidates against the coronavirus to combat the COVID-19 pandemic[2].
- Conduct a large-scale experiment as an example for future pandemic scenarios[2].
- Develop a computer-aided drug design platform that leverages supercomputer capabilities[10].
- Fast sharing of data and scientific discoveries with the community[11] to work in an urgent computing scenario.
Previous projects
The Exscalate4Cov project followed the ANTAREX4ZIKA[12] project, both of which aimed to leverage HPC for drug discovery, albeit targeting different viruses. While Exscalate4Cov focused on the SARS-CoV-2 virus responsible for COVID-19, ANTAREX4ZIKA was dedicated to addressing the Zika virus. The ANTAREX4ZIKA project concluded at the end of 2018 and involved a virtual screening campaign on the CINECA Marconi machine, with a total of 10 PetaFLOPS[12]. The ANTAREX project[13], which stands for AutoTuning and Adaptivity appRoach for Energy efficient eXascale HPC systems, emphasized auto-tuning and energy efficiency of HPC applications, making them more effective in various research scenarios, including drug discovery.
Consortium
The E4C consortium of public-private entities has been coordinated by Dompè, and it involved 17 other institutions, from research centers to universities[1].
Organization | Type | Industry | Country |
---|---|---|---|
Dompé Farmaceutici | Private | Pharmaceutical industry | Italy |
CINECA | Public research center | Supercomputing | Italy |
Politecnico di milano | Public university | Scientific and technological research, education | Italy |
University of Milan | Public university | Scientific and technological research, education | Italy |
Katholieke Universiteit, Leuven | Public university | Scientific and technological research, education | Belgium |
International Institute of Molecular and Cell Biology | Public research center | Research center | Poland |
Elettra Sincrotrone Trieste | Research Organisations | Research center | Italy |
Fraunhofer-Gesellschaft | Research Organisations | Research center | Germany |
Barcelona Supercomputing Center | Public research center | Supercomputing | Spain |
Forschungszentrum Jülich | Public research center | Supercomputing | Germany |
University of Naples Federico II | Public university | Scientific and technological research, education | Italy |
University of Cagliari | Public university | Scientific and technological research, education | Italy |
SIB Swiss Institute of Bioinformatics | Public research center | Research center | Switzerland |
KTH Royal Institute of Technology | Public university | Scientific and technological research, education | Sweden |
Lazzaro Spallanzani National Institute for Infectious Diseases | Research Organisations | Hospital | Italy |
Associtazione Big Data | Company | Other | Italy |
Istituto Nazionale di Fisica Nucleare | Public research center | Research center | Italy |
Chelonia SA | Company | Other | Switzerland |
Pipeline
Here, we describe the docking pipeline developed and used in the project's primary virtual screening campaign.
Inputs at the application level consist of ligands from the chemical space and the protein target of the virtual screening campaign, specifically the spike protein in the case of E4C[9]. Following a molecular docking stage that generates potential ligand conformations, a scoring stage assesses the interaction strength between each ligand's pose and the protein[4]. The pipeline ultimately produces a ranking of hit compounds as its output, indicating the most promising candidates for further investigation[4].
At the software level, the project utilizes the EXSCALATE docking platform[4][12]. LiGen (Ligand Generator) is one of the main components of the platform, and it is used to perform molecular docking and scoring simulations. LiGen is responsible for generating and evaluating the conformations of ligands. Another relevant component at the same level is the libdpipe library, which facilitates scaling across multi-node and cores[4].
To hinge the computational power offered by HPC centers, the docking platform uses MPI[14] to scale multi-node and CUDA acceleration to take advantage of supercomputer GPUs. The CUDA version has undergone various optimizations, including OpenACC, OpenMP, and other techniques[15][16][17], to enhance performance and efficiency.
Virtual screening campaign
The project's main experiment evaluated the interactions between 12 viral proteins of SARS-CoV-2 against 70 billion molecules from the EXSCALATE[10] chemical library. In November 2020, consortium members coordinated one of the largest virtual screening campaigns, harnessing the combined computational power of two supercomputers totaling 81 PFLOPS.[18].
The supercomputers used are:
- Marconi100: Operated by CINECA, each node consists of 1 IBM POWER9 AC922 CPU (32 cores, 128 threads) and 4 NVIDIA V100 GPUs with 16 GB of VRAM. The machine consists of 970 nodes, providing a total of 29.3 PFLOPS[19].
- HPC5: Operated by Eni, each node consists of 1 Intel Xeon Gold 6252 24C CPU (24 cores, 48 threads) and 4 NVIDIA V100 GPUs with 16 GB of VRAM. The machine consists of 1820 nodes, providing a total of 51.7 PFLOPS[20].
Throughput
The large-scale campaign used a reservation of 800 Marconi100 nodes and 1500 HP5 nodes for 60 hours[4]. Achieving an average throughput was 2400 ligands per second (lig/s) on Marconi100 and 2000 lig/s on HPC5[4].
Data storage
Another critical aspect of the experiment was data storage management. The platform leveraged efficient MPI I/O[14] operations to handle multi-node computations. The input data required 3.3 TB of space in SMILES format[4]. However, SMILES data needed to be expanded in a pre-processing step involving 100 nodes over five days[4]. Similarly, the post-processing step involved 19 nodes over five days.
Output data
The final output consisted of CSV files containing scores for each input ligand, occupying 69 TB[4]. The resulting dataset, containing 570 million hit compounds, is freely available[4].
Results
Mediate
The project's large-scale campaign results are available through the MEDIATE (MolEcular DockIng AT homE) platform[21]. The objective of MEDIATE[22] is to collect a chemical library of Sars-COV-2 inhibitors. The MEDIATE portal provides access to a set of small molecules that research can use to start de-novo drug design from a reduced set of molecules.
Raloxifene
Raloxifene is a known chemical compound used to treat osteoporosis. The E4C large-scale experiment produced 570 million hit compounds, and among them, the project identified raloxifene as a possible candidate to treat early-stage COVID-19 patients[5][23], aiming to prevent clinical progression[24]. In October 2020, AIFA authorized clinical trials to treat COVID-19 patients[25], and it is currently undergoing testing for approval[26].
Public Interest
The experiments, including the discovery of raloxifene as a possible drug candidate against COVID-19, gained significant interest from the scientific community, as documented in several scientific articles[4][5][23].
The project's results also captured national interest in Italy, highlighted by various newspaper articles[27][28][29], due to the use of Italian supercomputers during the pandemic. Additionally, the large-scale campaign results gained attention from international journals[30][31].
See also
- CINECA
- COVID-19 pandemic
- Drug discovery
- EuroHPC
- Horizon 2020
- HPC5
- Raloxifene
- Supercomputing
- Virtual screening
References
- ^ a b c d e f "EXaSCale smArt pLatform Against paThogEns for Corona Virus | EXSCALATE4CoV Project | Fact Sheet | H2020". CORDIS | European Commission. doi:10.3030/101003551. Retrieved 2024-07-09.
- ^ a b c "Science". www.exscalate4cov.eu. Retrieved 2024-07-09.
- ^ "SOCIETAL CHALLENGES - Health, demographic change and well-being | Programme | H2020". CORDIS | European Commission. Retrieved 2024-07-09.
- ^ a b c d e f g h i j k l Gadioli, Davide; Vitali, Emanuele; Ficarelli, Federico; Latini, Chiara; Manelfi, Candida; Talarico, Carmine; Silvano, Cristina; Cavazzoni, Carlo; Palermo, Gianluca; Beccari, Andrea Rosario (2023-01-01). "EXSCALATE: An Extreme-Scale Virtual Screening Platform for Drug Discovery Targeting Polypharmacology to Fight SARS-CoV-2". IEEE Transactions on Emerging Topics in Computing. 11 (1): 170–181. doi:10.1109/TETC.2022.3187134. ISSN 2168-6750.
- ^ a b c Iaconis, Daniela; Bordi, Licia; Matusali, Giulia; Talarico, Carmine; Manelfi, Candida; Cesta, Maria Candida; Zippoli, Mara; Caccuri, Francesca; Bugatti, Antonella; Zani, Alberto; Filippini, Federica; Scorzolini, Laura; Gobbi, Marco; Beeg, Marten; Piotti, Arianna (2022-05-25). "Characterization of raloxifene as a potential pharmacological agent against SARS-CoV-2 and its variants". Cell Death & Disease. 13 (5): 1–9. doi:10.1038/s41419-022-04961-z. ISSN 2041-4889. PMC 9130985. PMID 35614039.
{{cite journal}}
: CS1 maint: PMC format (link) - ^ Kulkarni, V. S.; Alagarsamy, V.; Solomon, V. R.; Jose, P. A.; Murugesan, S. (2023-04-01). "Drug Repurposing: An Effective Tool in Modern Drug Discovery". Russian Journal of Bioorganic Chemistry. 49 (2): 157–166. doi:10.1134/S1068162023020139. ISSN 1608-330X. PMC 9945820. PMID 36852389.
{{cite journal}}
: CS1 maint: PMC format (link) - ^ a b Wildey, Mary Jo; Haunso, Anders; Tudor, Matthew; Webb, Maria; Connick, Jonathan H. (2017), "High-Throughput Screening", Annual Reports in Medicinal Chemistry, Elsevier, pp. 149–195, doi:10.1016/bs.armc.2017.08.004, retrieved 2024-07-11
- ^ Yang, Yanqing; Zhu, Zhengdan; Wang, Xiaoyu; Zhang, Xinben; Mu, Kaijie; Shi, Yulong; Peng, Cheng; Xu, Zhijian; Zhu, Weiliang (2021-01-18). "Ligand-based approach for predicting drug targets and for virtual screening against COVID-19". Briefings in Bioinformatics. 22 (2): 1053–1064. doi:10.1093/bib/bbaa422. ISSN 1467-5463. PMC 7929377. PMID 33461215.
{{cite journal}}
: CS1 maint: PMC format (link) - ^ a b Beccari, Andrea R.; Vistoli, Giulio (January 2022). "Exscalate4CoV: Innovative High Performing Computing (HPC) Strategies to Tackle Pandemic Crisis". International Journal of Molecular Sciences. 23 (19): 11576. doi:10.3390/ijms231911576. ISSN 1422-0067. PMC 9569893. PMID 36232873.
{{cite journal}}
: CS1 maint: PMC format (link) CS1 maint: unflagged free DOI (link) - ^ a b "Exscalate | AI Drug Discovery Platform". exscalate.com. Retrieved 2024-07-09.
- ^ "Home". mediate.exscalate4cov.eu. Retrieved 2024-07-09.
- ^ a b c "Exscalate Projects". www.exscalate.eu. Retrieved 2024-07-09.
- ^ "ANTAREX: Project Description". www.up.pt. Retrieved 2024-07-10.
- ^ a b Markidis, Stefano; Gadioli, Davide; Vitali, Emanuele; Palermo, Gianluca (November 2021). "Understanding the I/O Impact on the Performance of High-Throughput Molecular Docking". 2021 IEEE/ACM Sixth International Parallel Data Systems Workshop (PDSW). IEEE: 9–14. doi:10.1109/PDSW54622.2021.00007. ISBN 978-1-6654-1837-9.
- ^ Gadioli, Davide; Palermo, Gianluca; Cherubin, Stefano; Vitali, Emanuele; Agosta, Giovanni; Manelfi, Candida; Beccari, Andrea R.; Cavazzoni, Carlo; Sanna, Nico; Silvano, Cristina (January 2021). "Tunable approximations to control time-to-solution in an HPC molecular docking Mini-App". The Journal of Supercomputing. 77 (1): 841–869. doi:10.1007/s11227-020-03295-x. ISSN 0920-8542.
- ^ Vitali, Emanuele; Gadioli, Davide; Palermo, Gianluca; Beccari, Andrea; Cavazzoni, Carlo; Silvano, Cristina (July 2019). "Exploiting OpenMP and OpenACC to accelerate a geometric approach to molecular docking in heterogeneous HPC nodes". The Journal of Supercomputing. 75 (7): 3374–3396. doi:10.1007/s11227-019-02875-w. ISSN 0920-8542.
- ^ Vitali, Emanuele; Ficarelli, Federico; Bisson, Mauro; Gadioli, Davide; Accordi, Gianmarco; Fatica, Massimiliano; Beccari, Andrea R.; Palermo, Gianluca (2024-04-01). "GPU-optimized approaches to molecular docking-based virtual screening in drug discovery: A comparative analysis". Journal of Parallel and Distributed Computing. 186: 104819. doi:10.1016/j.jpdc.2023.104819. ISSN 0743-7315.
- ^ "EXSCALATE4COV: 60 ORE DI SUPERCALCOLO CONTRO IL CORONAVIRUS".
- ^ "HPC5 - PowerEdge C4140, Xeon Gold 6252 24C 2.1GHz, NVIDIA Tesla V100, Mellanox HDR Infiniband | TOP500". www.top500.org. Retrieved 2024-07-09.
- ^ "UG3.2: MARCONI100 UserGuide - SCAI - User Support - CINECA Technical Portal". wiki.u-gov.it. Retrieved 2024-07-09.
- ^ "Home". mediate.exscalate4cov.eu. Retrieved 2024-07-09.
- ^ Vistoli, Giulio; Manelfi, Candida; Talarico, Carmine; Fava, Anna; Warshel, Arieh; Tetko, Igor V.; Apostolov, Rossen; Ye, Yang; Latini, Chiara; Ficarelli, Federico; Palermo, Gianluca; Gadioli, Davide; Vitali, Emanuele; Varriale, Gaetano; Pisapia, Vincenzo (2023-08-03). "MEDIATE - Molecular DockIng at homE: Turning collaborative simulations into therapeutic solutions". Expert Opinion on Drug Discovery. 18 (8): 821–833. doi:10.1080/17460441.2023.2221025. ISSN 1746-0441.
- ^ a b Allegretti, Marcello; Cesta, Maria Candida; Zippoli, Mara; Beccari, Andrea; Talarico, Carmine; Mantelli, Flavio; Bucci, Enrico M.; Scorzolini, Laura; Nicastri, Emanuele (January 2022). "Repurposing the estrogen receptor modulator raloxifene to treat SARS-CoV-2 infection". Cell Death & Differentiation. 29 (1): 156–166. doi:10.1038/s41418-021-00844-6. ISSN 1476-5403. PMC 8370058. PMID 34404919.
{{cite journal}}
: CS1 maint: PMC format (link) - ^ Nicastri, Emanuele; Marinangeli, Franco; Pivetta, Emanuele; Torri, Elena; Reggiani, Francesco; Fiorentino, Giuseppe; Scorzolini, Laura; Vettori, Serena; Marsiglia, Carolina; Gavioli, Elizabeth Marie; Beccari, Andrea R.; Terpolilli, Giuseppe; De Pizzol, Maria; Goisis, Giovanni; Mantelli, Flavio (June 2022). "A phase 2 randomized, double-blinded, placebo-controlled, multicenter trial evaluating the efficacy and safety of raloxifene for patients with mild to moderate COVID-19". eClinicalMedicine. 48: 101450. doi:10.1016/j.eclinm.2022.101450. ISSN 2589-5370. PMC 9098200. PMID 35582123.
{{cite journal}}
: CS1 maint: PMC format (link) - ^ "EXSCALATE4COV: Italian Medicines Agency (AIFA) authorizes Raloxifene Clinical Trial for Paucisymptomatic Covid-19 Patients treated at Home and in Medical Facilities". www.dompe.com. Retrieved 2024-07-10.
- ^ Nicastri, Emanuele; Marinangeli, Franco; Pivetta, Emanuele; Torri, Elena; Reggiani, Francesco; Fiorentino, Giuseppe; Scorzolini, Laura; Vettori, Serena; Marsiglia, Carolina; Gavioli, Elizabeth Marie; Beccari, Andrea R.; Terpolilli, Giuseppe; De Pizzol, Maria; Goisis, Giovanni; Mantelli, Flavio (June 2022). "A phase 2 randomized, double-blinded, placebo-controlled, multicenter trial evaluating the efficacy and safety of raloxifene for patients with mild to moderate COVID-19". eClinicalMedicine. 48: 101450. doi:10.1016/j.eclinm.2022.101450. ISSN 2589-5370. PMC 9098200. PMID 35582123.
{{cite journal}}
: CS1 maint: PMC format (link) - ^ "Exscalate, il super software che scopre le molecole contro il Covid-19". Corriere della Sera (in Italian). 2021-10-08. Retrieved 2024-07-09.
- ^ "Covid: Aifa, ok a test su Raloxifene in casi lievi - Altre news - Ansa.it". Agenzia ANSA (in Italian). 2020-10-27. Retrieved 2024-07-09.
- ^ "Coronavirus, il supercomputer italiano scopre terapia con 'raloxifene'". la Repubblica (in Italian). 2020-06-19. Retrieved 2024-07-09.
- ^ Peckham, Oliver (2020-10-29). "Supercomputer Research Leads to Human Trial of Potential COVID-19 Therapeutic Raloxifene". HPCwire. Retrieved 2024-07-09.
- ^ Writer, Aila Slisco (2020-06-18). "Osteoporosis Drug Shows Promise in Fighting Coronavirus". Newsweek. Retrieved 2024-07-09.
Further readings
- Gadioli, D.; et al. (July 2022), "EXSCALATE: An Extreme-Scale Virtual Screening Platform for Drug Discovery Targeting Polypharmacology to Fight SARS-CoV-2", IEEE Transactions on Emerging Topics in Computing, doi:10.1109/TETC.2022.3187134
- Beccari, A.; et al. (September 2022), "Exscalate4CoV: Innovative High Performing Computing (HPC) Strategies to Tackle Pandemic Crisis", International Journal of Molecular Sciences, doi:10.3390/ijms231911576
{{citation}}
: CS1 maint: unflagged free DOI (link) - Emerson, A.; et al. (May 2023), "The High-Performance Computing Resources for the EXSCALATE4CoV Project", SpringerBriefs in Applied Sciences and Technology, doi:10.1007/978-3-031-30691-4_4
- Coletti, S.; et al. (2023), "Exscalate4CoV", SpringerBriefs in Applied Sciences and Technology, doi:10.1007/978-3-031-30691-4
- Beccari, A.; et al. (May 2023), "The Drug Repurposing Strategy in the Exscalate4CoV Project: Raloxifene Clinical Trials", SpringerBriefs in Applied Sciences and Technology, doi:10.1007/978-3-031-30691-4_3