Amber TUTORIAL B5: Simulating the Green Fluorescent Protein

时间 2019-11-19

标签 amber tutorial b5 simulating green fluorescent protein 繁體版

原文原文链接

Section 1: Preparing the PDB file

1EMA是本次教程所用的pdb，能够在PDB数据库下载。数据库

pdb4amber -i 1EMA.pdb -o gfp.pdb --dry --reduce

pdb4amber命令用于amber输入pdb格式文件的准备。 --dry会删除晶体结构中的水分子(WATER)，--reduce会对pdb加氢(H)。命令执行完成后须要对产生的gfp.pdb进行手工微调，1) 须要把文件中全部MSE更换为MET；2) 把全部SE 原子(Se)换为硫(SD)原子； 3)在pdb文件最后一列中，把SE元素，换乘S元素。优化

pdb4amber analyses PDB files and cleans them for further usage, especially with the LeaP programs of Amber. It does NOT use any information in the original PDB file other than that contained in ATOM and HETATM records. The final output files are stripped of everything not directly related to ATOM or HETATM records.ui

Section 2: Computing partial charges and atom types for CRO

gfp.pdb文件中的CRO残基为非标准氨基酸残基，不在标准amber残基库中，因此须要对此非标准氨基酸进行加电(deriving charges)和决定原子类型(determining the atom types of the CRO residue)操做;this

对于非标准残基CRO，咱们使用PDB数据库中的CIF文件对其进行定义，CRO.cif；atom

咱们使用antechamber程序读取CRO.cif (ccif)并对其进行加电（-c bcc）及分配原子类型 (-at amber)操做,(For more general organic molecules, it is usually better to use gaff atom types.)spa

运行下面程序前，咱们须要先将CRO.cif文件中第五行中的LINKING字符删掉，否则会报错。rest

antechamber -fi ccif -i CRO.cif -bk CRO -fo ac -o cro.ac -c bcc -at amber

antechamber: This is the most important program in the package. It can perform many file conversions, and can also assign atomic charges and atom types. As required by the input, antechamber executes the following programs: sqm (or,alternatively, mopac or divcon), atomtype, am1bcc, bondtype, espgen, respgen and prepgen. It typically produces many intermediate files; these may be recognized by their names, in which all letters are upper-case.code

Running Errors:orm

1. "Residue CRO has a type of LINKING. Quitting" 执行上述命令时，报错LINKING:blog

解决方法有两种：1) 删除文件CRO.cif中的LINKING字符。

　　　　　　　　2) 使用“//”注释掉$AMBERHOME/AmberTools/src/antechamber 下的mmcif.c文件的第68行，"// exit(1);"，而后make install，从新安装antechamber，便可解决问题。

2. 使用diff命令对比程序运行结果会和对照结果文件不一样，涉及到第一个原子NT和N，手动把NT改成N。

Section 3: Preparing the residue library and force field parameters for use with LEaP

PDB数据库中下载的CRO.cif在通过antechamber处理后，变成一个拥有完整的H完整的电荷的分子，可是咱们的gpf.pdb文件中的的CRO是以共价键方式与周围的氨基酸链接的，因此为了使通过加氢加电的CRO分子链接（CRO分子的NC端链接到pdb中）到gpf.pdb文件中，咱们须要对CRO分子进行处理，把CRO分子的NC端的H去掉，并和pdb分子中相应的位置链接，咱们使用prepgen程序处理：

prepgen -i cro.ac -o cro.prepin -m cro.mc -rn CRO

Prepgen: Prepgen generates the prep input file from an ac file. By default, the program generates a mainchain itself. However, you may also specify the main-chain atoms in the main chain file. From this file, you can also specify which atoms will be deleted, and whether to do charge correction or not. In order to generate the amino-acid-like residue (this kind of residue has one head atom and one tail atom to be connected to other residues), you need a main chain file. Sample main chain files are in $AMBERHOME/dat/antechamber.

使用prepgen程序及包含NC端原子，主链侧链信息的主链(mc)文件，对cro.ac进行处理，获得处理后的cro.prepin文件，至此，咱们便获得了非标准残基CRO的残基库及电荷信息；

接下来，咱们使用parmchk2程序检查CRO分子(cro.prepin)的共价键(bonds, angles, and dihedrals)参数化状况，

parmchk2 -i cro.prepin -f prepi -o frcmod.cro -a Y \
         -p $AMBERHOME/dat/leap/parm/parm10.dat

parmchk2: The parmchk2 program figures out what parameters will be needed and checks to see if they are in the standard files. If not, it tries to make educated guesses, and puts these new parameters into a file we are calling "frcmod.cro" here. -p flag specifies the parm10.dat file because it is the main parameter database for the force field we plan to use, ff14SB.

frcmod.cro文件中，有的行会标记“ATTN, need revision“，这种标记意味着，parmchk2程序不能在parm10数据库中找到类似的参数，咱们把这些标记的行在frcmod.cro文件中删除，而后再在gaff.dat数据库中去寻找合适的参数。

grep -v "ATTN" frcmod.cro > frcmod1.cro # Strip out ATTN lines
parmchk2 -i cro.prepin -f prepi -o frcmod2.cro

至此，咱们获得两个frcmod文件，即frcmod1.cro（parm10.dat，删除了ATTN）和frcmod2.cro（gaff.dat），用于下面的步骤。

Section 4: Creating the topology and coordinate files for simulation

咱们使用以前步骤中的文件，准备拓扑文件及坐标文件。咱们使用ff14SB力场，使用隐式溶剂模型(igb=8，并设置PBRadii默认值为mbondi3)；首先咱们加载cro.prepin文件，而后对于参数文件，咱们先load frcmod2.cro再load frcmod1.cro，以确保全部gaff参数被parm10参数替换，而后再导入gfp.pdb结构文件，输出gfp.parm7和gfp.rst7文件。

$$$ tleap.in

source leaprc.protein.ff14SB
set default PBRadii mbondi3
loadAmberPrep cro.prepin
loadAmberParams frcmod2.cro
loadAmberParams frcmod1.cro
x = loadPDB gfp.pdb
saveAmberParm x gfp.parm7 gfp.rst7
quit

tleap -f tleap.in

Section 5: Simulations; minimization, heating, equilibration, and production

Minimization

以初始坐标为起点，对整个体系优化，排除一些不利的构像

$$$ min.in

 simple generalized Born minimization script
 &cntrl
   imin=1, ntb=0, maxcyc=100, ntpr=10, cut=1000., igb=8, 
 /

sander -O -i min.in -p gfp.parm7 -c gfp.rst7 -o min1.out -r min1.rst7

Heating

以minimization的rst文件为起点，对整个体系加热,200ps加热时间，从10K到300K

$$$ heat.in


Implicit solvent initial heating mdin
 &cntrl
   imin=0, irest=0, ntx=1,
   ntpr=1000, ntwx=1000, nstlim=100000,
   dt=0.002, ntt=3, tempi=10,
   temp0=300, gamma_ln=1.0, ig=-1,
   ntp=0, ntc=2, ntf=2, cut=1000,
   ntb=0, igb=8, ioutfm=1, nmropt=1,
 /
 &wt
   TYPE='TEMP0', ISTEP1=1, ISTEP2=100000,
   VALUE1=10.0, VALUE2=300.0,
 /
 &wt TYPE='END' /

sander -O -i heat.in -p gfp.parm7 -c min1.rst7 -o heat.mdout \
       -x heat.nc -r heat.rst7

Production

以上一步坐标为起点，进行MD run 100ns

$$$ md.in

Implicit solvent molecular dynamics
 &cntrl
   imin=0, irest=1, ntx=5,
   ntpr=1000, ntwx=1000, nstlim=500000,
   dt=0.002, ntt=3, tempi=300,
   temp0=300, gamma_ln=1.0, ig=-1,
   ntp=0, ntc=2, ntf=2, cut=1000,
   ntb=0, igb=8, ioutfm=1,
 /

sander -O -i md.in -p gfp.parm7 -c heat.rst7 -o md1.mdout \
       -x md1.nc -r md1.rst7