论文标题

几何限制的折叠模型可以告诉我们现实世界中的蛋白接触图

What geometrically constrained folding models can tell us about real-world protein contact maps

论文作者

Molkenthin, Nora, Güven, J. J., Mühle, Steffen, Mey, Antonia S. J. S.

论文摘要

长期以来,可以根据其氨基酸序列确定蛋白质3D结构的机制一直是生物物理学的关键奥秘之一。通常,简单的模型,例如从几何约束中得出的模型,可以很好地捕获大量的现实世界3D蛋白质蛋白质特性。一种方法是使用蛋白质接触图更好地了解蛋白质的特性。在这里,我们研究了不同几何约束模型和现实世界蛋白质系统的接触图的紧急行为。我们通过平均场方法得出了模型氨基酸距离($ s $)分布的分析近似。然后,使用2D和3D随机相互作用模型以及实际蛋白质数据的触点图验证了该近似值以进行模拟验证。使用RCSB蛋白数据库(PDB)和Alphafold〜2数据库中的数据,分析近似拟合到$ l \ of100 $,$ l \ of200 $和$ l \ l \ of 300 $的蛋白质链长度。尽管无法推断出不同长度的蛋白质链的通用缩放行为,但我们提供证据表明,氨基酸距离分布可以归因于块状蛋白质链的几何约束,而在块状和氨基酸序列中仅起次要作用。

The mechanisms by which a protein's 3D structure can be determined based on its amino acid sequence have long been one of the key mysteries of biophysics. Often simplistic models, such as those derived from geometric constraints, capture bulk real-world 3D protein-protein properties well. One approach is using protein contact maps to better understand proteins' properties. Here, we investigate the emergent behaviour of contact maps for different geometrically constrained models and real-world protein systems. We derive an analytical approximation for the distribution of model amino acid distances, $s$, by means of a mean-field approach. This approximation is then validated for simulations using a 2D and 3D random interaction model, as well as from contact maps of real-world protein data. Using data from the RCSB Protein Data Bank (PDB) and AlphaFold~2 database, the analytical approximation is fitted to protein chain lengths of $L\approx100$, $L\approx200$, and $L\approx300$. While a universal scaling behaviour for protein chains of different lengths could not be deduced, we present evidence that the amino acid distance distributions can be attributed to geometric constraints of protein chains in bulk and amino acid sequences only play a secondary role.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源