Advanced Multi-dataset Methods for Macromolecular Structure Determination and Analysis
Title: Advanced Multi-dataset Methods for Macromolecular Structure Determination and Analysis
DNr: LiU-compute-2023-33
Project Type: LiU Compute
Principal Investigator: Nicholas Pearce <nicholas.pearce@liu.se>
Affiliation: Linköpings universitet
Duration: 2023-10-12 – 2024-11-01
Classification: 10601
Homepage: https://macromolecular.science
Keywords:

Abstract

Accurate macromolecular structures are the focus of research for thousands of researchers internationally and are foundational components of drug design and other therapeutic efforts. The overall workflow of macromolecular crystallography has changed little in the past 50 years, even as individual steps within this workflow have changed beyond recognition. Notwithstanding the widespread success and impact of macromolecular structural models on our understanding of the molecular basis of life, the fact remains that our crystallographic models remain fundamentally flawed and incomplete, despite huge technical leaps in the collection of experimental data. The overly simplistic representation of biological macromolecules as a single conformation loses all of the subtlety of molecular motions and dynamics, and cannot be used to understand complex structural transitions such as allostery. Thus, our current models only allow us to go so far in our understanding of molecular processes. Additionally, continuing manual subjective interpretation of the experimental data allows for misinterpretation and bias, especially of bound ligands. These problems will only become more acute with the growth of cryo-electron microscopy (cryoEM), where the average size of structures is much larger than those typical for macromolecular crystallography (MX), and the scale of the modelling challenge increases accordingly. The development of robust objective methods for modelling, validation and analysis therefore remains a significant challenge but also opportunity in structural biology. With increasing automation now enabling the routine collection of multiple datasets for each structural determination experiment, it is time to rethink how we determine molecular structures, by routinely using multiple datasets for structure determination and analysis.We aim to define new best practice approaches in the modelling, refinement and analysis of macromolecular structures in MX and cryoEM, and demonstrate the importance of improved models in real drug-discovery efforts.