next up previous
Next: Solution to Norleucine Problem Up: Topology Tutorial Previous: Examining the Topology File

Subsections

Glutathione

Now we will try something a bit harder. We are going to build a topology file for an unusual peptide.

Many enzymes utilize a tripeptide called glutathione ($\gamma$-Glu-Cys-Gly), or GSH for short. The tripeptide is unusual in that the first amino acid, glutamic acid, is linked to cysteine via its side-chain, rather than the protein backbone:

 NH3(+)          O
 |              //
HC-C(H2)-C(H2)-C
 |              \
 COO(-)         (CYS)

Creating the Topology File

Once again, by using existing topologies and parameters available within CHARMM22, we will be able to create a topology entry for the $\gamma$-Glu residue and run a simulation incorporating the peptide. Since $\gamma$-Glu is almost identical to standard glutamic acid, we will use the glutamic acid topology entry as a starting point from which to create out new topology.

1
Copy the topology file you created for ornithine to the 2-glutathione directory and open the file with your text editor:

2
Find the topology entry for glutamic acid using the Search or Find feature in your text editor by typing ``resi glu" in the search field.

3
The entry for glutamic acid is shown on the next page.

[frame=single, framerule=1.2mm, framesep=3mm, label=Glutamic Acid Topology Entry, fontsize=\scriptsize]
RESI GLU         -1.00
GROUP   
ATOM N    NH1    -0.47  !     |          
ATOM HN   H       0.31  !  HN-N          
ATOM CA   CT1     0.07  !     |   HB1 HG1   OE1
ATOM HA   HB      0.09  !     |   |   |    //
GROUP                   !  HA-CA--CB--CG--CD
ATOM CB   CT2    -0.18  !     |   |   |    \
ATOM HB1  HA      0.09  !     |   HB2 HG2   OE2(-)
ATOM HB2  HA      0.09  !   O=C          
GROUP                   !     |          
ATOM CG   CT2    -0.28
ATOM HG1  HA      0.09
ATOM HG2  HA      0.09
ATOM CD   CC      0.62
ATOM OE1  OC     -0.76
ATOM OE2  OC     -0.76
GROUP   
ATOM C    C       0.51
ATOM O    O      -0.51
BOND CB CA  CG CB  CD CG  OE2 CD   
BOND N  HN  N  CA C   CA   
BOND C  +N  CA HA  CB HB1 CB  HB2 CG  HG1   
BOND CG HG2  
DOUBLE O  C   CD  OE1 
IMPR N   -C CA  HN  C CA +N O   
!IMPR OE1 CG OE2 CD
IMPR CD CG OE2 OE1
DONOR HN N   
ACCEPTOR OE1 CD   
ACCEPTOR OE2 CD   
ACCEPTOR O C   
IC -C   CA   *N   HN    1.3471 124.4500  180.0000 113.9900  0.9961
IC -C   N    CA   C     1.3471 124.4500  180.0000 107.2700  1.5216
IC N    CA   C    +N    1.4512 107.2700  180.0000 117.2500  1.3501
IC +N   CA   *C   O     1.3501 117.2500  180.0000 121.0700  1.2306
IC CA   C    +N   +CA   1.5216 117.2500  180.0000 124.3000  1.4530
IC N    C    *CA  CB    1.4512 107.2700  121.9000 111.7100  1.5516
IC N    C    *CA  HA    1.4512 107.2700 -118.0600 107.2600  1.0828
IC N    CA   CB   CG    1.4512 111.0400  180.0000 115.6900  1.5557
IC CG   CA   *CB  HB1   1.5557 115.6900  121.2200 108.1600  1.1145
IC CG   CA   *CB  HB2   1.5557 115.6900 -123.6500 109.8100  1.1131
IC CA   CB   CG   CD    1.5516 115.6900  180.0000 115.7300  1.5307
IC CD   CB   *CG  HG1   1.5307 115.7300  117.3800 109.5000  1.1053
IC CD   CB   *CG  HG2   1.5307 115.7300 -121.9600 111.0000  1.1081
IC CB   CG   CD   OE1   1.5557 115.7300  180.0000 114.9900  1.2590
IC OE1  CG   *CD  OE2   1.2590 114.9900 -179.1000 120.0800  1.2532

4
Copy the existing glutamic acid entry. Use the mouse to highlight every entry line (from RESI GLU -1.00 to IC OE1 CG *CD OE2 1.2590 114.9900 -179.1000 120.0800 1.2532). Then click Edit $\rightarrow$ Copy.

5
Paste the copied entry below the original glutamic acid entry by using the Edit $\rightarrow$ Paste option.

You will edit your copied glutamic acid entry to make it an $\gamma$-glutamic acid entry by making the following changes:

6
Since our $\gamma$-Glu residue will no longer have a side chain negative charge, and the ``backbone" will have both a positive N-terminus and negative C-terminus, the total charge on the residue is 0.

Change:
to:
RESI GLU -1.00
RESI GGL 0.00

7
Let's begin by examining the ``backbone". Since both $\gamma$-Glu ``termini" resemble normal N- and C-termini, we should use atom types and charges from the terminal patches in our residue.

Change the first GROUP:

GROUP !  
ATOM N NH1 -0.47 !  
ATOM HN H 0.31 !  
ATOM CA CT1 0.07 !  
ATOM HA HB 0.09 !  

to its NTER counterpart:

GROUP !  
ATOM N NH3 -0.30 !  
ATOM HT1 HC 0.33 !  
ATOM HT2 HC 0.33 !  
ATOM HT3 HC 0.33 !  
ATOM CA CT1 0.21 !  
ATOM HA HB 0.10 !  

[frame=single, framerule=1.2mm, framesep=3mm, label=N-terminus Patch Topology Entry, fontsize=\scriptsize]
PRES NTER         1.00 ! standard N-terminus
GROUP                  ! use in generate statement
ATOM N    NH3    -0.30 !
ATOM HT1  HC      0.33 !         HT1	
ATOM HT2  HC      0.33 !     (+)/
ATOM HT3  HC      0.33 ! --CA--N--HT2
ATOM CA   CT1     0.21 !   |    \
ATOM HA   HB      0.10 !   HA    HT3
DELETE ATOM HN   
BOND HT1 N HT2 N HT3 N   
DONOR HT1 N   
DONOR HT2 N   
DONOR HT3 N   
IC HT1  N    CA   C     0.0000  0.0000  180.0000  0.0000  0.0000
IC HT2  CA   *N   HT1   0.0000  0.0000  120.0000  0.0000  0.0000
IC HT3  CA   *N   HT2   0.0000  0.0000  120.0000  0.0000  0.0000

[frame=single, framerule=1.2mm, framesep=3mm, label=C-terminus Patch Topology Entry, fontsize=\scriptsize]
PRES CTER        -1.00 ! standard C-terminus
GROUP                  ! use in generate statement
ATOM C    CC      0.34 !   OT2(-)
ATOM OT1  OC     -0.67 !  /
ATOM OT2  OC     -0.67 ! -C
DELETE ATOM O          !  \\
BOND C OT2             !   OT1
DOUBLE  C OT1
!IMPR OT1 CA OT2 C
IMPR C CA OT2 OT1
ACCEPTOR OT1 C   
ACCEPTOR OT2 C   
IC N    CA   C    OT2   0.0000  0.0000  180.0000  0.0000  0.0000
IC OT2  CA   *C   OT1   0.0000  0.0000  180.0000  0.0000  0.0000

Notice we have left out the ASCII drawing of our new residue. Try to construct it yourself!

8
Add the following GROUP for the C-terminus taken from the CTER patch:

GROUP  
ATOM C CC 0.34  
ATOM OT1 OC -0.67  
ATOM OT2 OC -0.67  

9
Now, let's look at the ``side chain" through which the peptide bond is formed. In the original glutamic acid residue, the side chain is quite electronegative, and draws electron density from the CH$_2$ atoms next to it, so the entire set C$_2$H$_2$O$_2$ is a single GROUP. In $\gamma$-Glu, however, since the ``side chain" more resembles a peptide bond, we separate it into three GROUPs: CH$_2$, CH$_2$, and CO.

10
We also need to account for the bonds which are created and destroyed in changing glutamic acid to $\gamma$-Glu, remembering new atom names:

Change:
to:
Change:
to:
Change:
to:
Change:
to:
Change:
to:
BOND CB CA CG CB CD CG OE2 CD
BOND CB CA CG CB CD CG
BOND N HN N CA C CA
BOND HT1 N HT2 N HT3 N CA N
BOND C +N CA HA CB HB1 CB HB2 CG HG1
BOND CD +N CA HA CB HB1 CB HB2 CG HG1 CG HG2 CA C
BOND CG HG2
BOND C OT2
DOUBLE O C CD OE1
DOUBLE CD O C OT1

11
In considering the improper entries, we should examine both the NTER and CTER patches for our ``backbone" and the GLU entry for the modified ``side chain". We find a planar structure near both double-bonded oxygens.

Change:
to:
Change:
to:
IMPR N -C CA HN C CA +N O
IMPR CD CG +N O
IMPR CD CG OE2 OE1
IMPR C CA OT1 OT2

You may delete the IMPR command for glutamic acid which is commented out.

Note the use of - and + signs in the topology entry. They indicate the peptide bonds to the preceding and next amino acids in the sequence, respectively.

12
Change the DONOR and ACCEPTOR commands to maintain consistency in atom names.

Change:
to:


Change:
to:
Change:
to:
Change:
to:
DONOR HN N
DONOR HT1 N
DONOR HT2 N
DONOR HT3 N
ACCEPTOR OE1 CD
ACCEPTOR O CD
ACCEPTOR OE2 CD
ACCEPTOR OT1 C
ACCEPTOR O C
ACCEPTOR OT2 C

13
The IC commands are the most complicated (as they will always be). We can construct them through a combination of those in patches NTER and CTER and residue GLU.

After making the changes, your topology definition for $\gamma$-Glu should look as shown on the next page. We have added comments to keep track of where certain commands came from, and also drawn the correct topology for the molecule in ASCII.

[frame=single, framerule=1.2mm, framesep=3mm, label= $\gamma$-Glutamic Acid Topology Entry, fontsize=\scriptsize]
RESI GGL          0.00
GROUP                   !     HT2
ATOM N    NH3    -0.30  !     |(+)          
ATOM HT1  HC      0.33  !     |          
ATOM HT2  HC      0.33  ! HT1-N-HT3   <-- from NTER
ATOM HT3  HC      0.33  !     |
ATOM CA   CT1     0.21  !     |   HB1 HG1    O   <-- from peptide bond
ATOM HA   HB      0.10  !     |   |   |    //
GROUP                   !  HA-CA--CB--CG-CD     
ATOM C    CC      0.34  !     |   |   |    \    
ATOM OT1  OC     -0.67  !     |   HB2 HG2   \    <-- from peptide bond
ATOM OT2  OC     -0.67  ! OT1=C          
GROUP                   !     |       <-- from CTER
ATOM CB   CT2    -0.18  !     OT2(-)
ATOM HB1  HA      0.09
ATOM HB2  HA      0.09
GROUP
ATOM CG   CT2    -0.18
ATOM HG1  HA      0.09
ATOM HG2  HA      0.09
GROUP
ATOM CD   C       0.51
ATOM O    O      -0.51
BOND CB  CA  CG  CB  CD  CG
BOND HT1 N   HT2 N   HT3 N   CA  N
BOND CD +N   CA  HA  CB  HB1 CB  HB2 CG  HG1 CG HG2  CA  C
BOND C   OT2
DOUBLE   CD  O   C   OT1
IMPR     CD  CG +N   O   !from GLU
IMPR     C   CA  OT1 OT2 !from CTER
DONOR HT1 N              !from NTER
DONOR HT2 N              !from NTER
DONOR HT3 N              !from NTER
ACCEPTOR O   CD          !from GLU
ACCEPTOR OT1 C           !from CTER
ACCEPTOR OT2 C           !from CTER
IC N    C    *CA  CB  1.4512 107.2700  121.9000 111.7100  1.5516 !from GLU
IC N    C    *CA  HA  1.4512 107.2700 -118.0600 107.2600  1.0828 !from GLU
IC N    CA   CB   CG  1.4512 111.0400  180.0000 115.6900  1.5557 !from GLU
IC CG   CA   *CB  HB1 1.5557 115.6900  121.2200 108.1600  1.1145 !from GLU
IC CG   CA   *CB  HB2 1.5557 115.6900 -123.6500 109.8100  1.1131 !from GLU
IC CA   CB   CG   CD  1.5516 115.6900  180.0000 115.7300  1.5307 !from GLU
IC CD   CB   *CG  HG1 1.5307 115.7300  117.3800 109.5000  1.1053 !from GLU
IC CD   CB   *CG  HG2 1.5307 115.7300 -121.9600 111.0000  1.1081 !from GLU
IC CB   CG   CD   O   1.5557 115.7300  180.0000 114.9900  1.2590 !from GLU
IC HT1  N    CA   C   0.0000  0.0000  180.0000  0.0000  0.0000   !from NTER
IC HT2  CA   *N   HT1 0.0000  0.0000  120.0000  0.0000  0.0000   !from NTER
IC HT3  CA   *N   HT2 0.0000  0.0000  120.0000  0.0000  0.0000   !from NTER
IC N    CA   C    OT2 0.0000  0.0000  180.0000  0.0000  0.0000   !from CTER
IC OT2  CA   *C   OT1 0.0000  0.0000  180.0000  0.0000  0.0000   !from CTER

The topology file is finished, you think! Time to run, you say! Wrong.

Although we have created the appropriate atomic topology for $\gamma$-Glu, we haven't considered the peptide bond between it and the next residue (Cys in the case of GSH). If we were to attempt to create a psf file at this point, psfgen would try to link the residues via a normal peptide bond and generate errors. In order to properly account for the ``side chain" peptide bond, we use a patch.

\framebox[\textwidth]{
\begin{minipage}{.2\textwidth}
\includegraphics[width=2...
...ine residues so that there is a disulfide bond between them. }
\end{minipage} }

In creating a patch to accurately represent the bond between GGL and CYS, we need to consider how both topologies will be affected.

14
Find the CYS topology entry in top_all27_prot_lipid_orn.inp.

There are two important things to note:

IMPR N -C CA HN  
IC -C CA *N HN 1.3479 123.9300 180.0000 114.7700 0.9982  

This is inappropriate for your new GGL residue. Remember that C and CA are no longer right next to the peptide bond. We need the patch to correct this. For our patch, we need to DELETE any inappropriate parameters and build new ones.

Your patch should look something like this:

[frame=single, framerule=1.2mm, framesep=3mm, label= Glutathione Patch, fontsize=\scriptsize]
PRES GLNK         0.00 ! linkage for IMAGES or for joining segments
                       ! 1 refers to GGLU (N terminal)
                       ! 2 refers to next (C terminal)
                       ! use in a patch statement
DELETE IMPR 2N  1C  2CA 2HN !Improper specified by IMPR N -C CA HN
DELETE IC 1C   2CA  *2N  2HN!Specified by IC -C   CA   *N   HN

IMPR 2N 1CD 2CA 2HN    !New improper
IC 1CD 2CA  *2N  2HN    1.3479 123.9300  180.0000 114.7700  0.9982 !new IC

The last IC is needed for building the peptide hydrogen at the GGL-CYS link.

15
Use your text editor to add the above patch by typing each line into the topology file you just edited.

16
Click File $\rightarrow$ Save As..., enter top_all27_prot_lipid_gsh.inp as your new filename, and save the file. You have now created a topology file with $\gamma$-Glu (and ornithine and D-alanine) included.

Windows Users: Make sure you save the file in .txt format.

Using the Topology File

Now you are ready to use your new topology file! We will use it to simulate a protein that binds glutathione: the human Pi-class Glutathione S-transferase (GST). GSTs are detoxifying enzymes that conjugate xenobiotic compounds with electrophilic centres to glutathione via the sulfur atom:

Image reaction
In the above equation, R is the xenobiotic compound, X is the electrophilic centre, and GS-R is the glutathione-xenobiotic conjugate.

The addition of glutathione to a xenobiotic increases its solubility and acts as a marker to indicate that the compound is to be excreted by the cell. After excretion, the compound is degraded by the mercapturic acid pathway and excreted by the kidneys. While normal functioning GSTs protect us against toxins in our food and environment, GSTs also attack drugs, and are implicated in cellular resistance to chemotherapy. The human Pi-class GST is a homodimer with 209 residues in each monomer. The monomers have two domains: an N-terminal domain which adopts the thioredoxin fold, found in many GSH-binding proteins, and an all-helical C-terminal domain unique to GSTs. One GSH molecule binds in each active site of GST.

1
Launch VMD by:

2
In the VMD Main window, click Extensions $\rightarrow$ Tk Console. Load the pdb structure for glutathione-bound GST, by typing in the VMD TkCon window,

mol new 6GSS.pdb  

Note: Windows users must make sure they are in the 2-glutathione directory. To change to it, type:
cd <path to topology-tutorial-files directory>

Since the molecule is a homodimer, we must create a separate pdb for each monomer. The monomers exist as separate chains in the pdb.

3
Create pdb files for each GST monomer without ligands. Type the following commands into the TK Console window of VMD:

set gstA [atomselect top "chain A and not resname MES GTT"]  
$gstA writepdb gst-a.pdb  
set gstB [atomselect top "chain B and not resname MES GTT"]  
$gstB writepdb gst-b.pdb  

These commands will write pdb's of the protein without the residues named MES (a buffer molecule from the crystallization mixture) and GTT (glutathione).

4
Create pdb's for the two glutathione peptides and the crystal waters. Type the following commands into the TK Console window of VMD:

set gshA [atomselect top "chain A and resname GTT"]  
$gshA writepdb gsh-a.pdb  
set gshB [atomselect top "chain B and resname GTT"]  
$gshB writepdb gsh-b.pdb  
set w [atomselect top "resname HOH"]  
$w writepdb water.pdb  

5
Edit the files gsh-a.pdb and gsh-b.pdb with your text editor. Convert the atom names, residue names, and sequence numbers for the glutathione chains so that they are consistent with CHARMM22 and the new GGL residue that you have constructed. It might be helpful to load the glutathione molecule into VMD and label all atoms so that you know which one is which. Furthermore, these changes could be made in VMD via the Tk Console and identifying atoms by their index, but this would be tedious. We will use the text editor since the molecule is only 3 residues long.

The new pdb file, gsh-a.pdb should look as follows. File gsh-b.pdb is analogous.

[frame=single, framerule=1.2mm, framesep=3mm, label= Glutathione PDB, fontsize=\scriptsize]
CRYST1   79.220   90.690   69.170  90.00  98.22  90.00 P 1           1
ATOM      1  N   GGL A   1      14.887  10.883  23.275  1.00 73.31          
ATOM      2  CA  GGL A   1      14.820  10.124  24.553  1.00 73.72          
ATOM      3  C   GGL A   1      15.860   9.001  24.508  1.00 73.51          
ATOM      4  OT1 GGL A   1      16.269   8.500  25.582  1.00 72.23          
ATOM      5  OT2 GGL A   1      16.281   8.656  23.381  1.00 73.55          
ATOM      6  CB  GGL A   1      13.422   9.543  24.740  1.00 74.00          
ATOM      7  CG  GGL A   1      13.237   8.843  26.061  1.00 74.50          
ATOM      8  CD  GGL A   1      11.904   8.151  26.176  1.00 75.17          
ATOM      9  O   GGL A   1      11.170   8.008  25.193  1.00 75.35          
ATOM     10  N   CYS A   2      11.589   7.729  27.397  1.00 75.35          
ATOM     11  CA  CYS A   2      10.345   7.036  27.698  1.00 75.19          
ATOM     12  C   CYS A   2       9.835   7.523  29.054  1.00 74.65          
ATOM     13  O   CYS A   2      10.623   7.766  29.973  1.00 73.83          
ATOM     14  CB  CYS A   2      10.585   5.521  27.753  1.00 75.74          
ATOM     15  SG  CYS A   2      11.440   4.813  26.310  1.00 76.23          
ATOM     16  N   GLY A   3       8.520   7.663  29.172  1.00 74.50          
ATOM     17  CA  GLY A   3       7.932   8.116  30.417  1.00 74.66          
ATOM     18  C   GLY A   3       7.340   9.505  30.290  1.00 74.96          
ATOM     19  OT1 GLY A   3       6.438   9.842  31.087  1.00 75.16          
ATOM     20  OT2 GLY A   3       7.761  10.251  29.379  1.00 74.82          
END

Now, to create a psf of the entire protein with glutathione chains bound, you will need to use the new topology file you created which has GGL and GLNK topology entries. The psfgen input file gen-gst.pgn has been provided for this. You should look at the file before using it. Notice how the GSH segment is handled:

segment GSHA {  
first none  
pdb gsh-a.pdb  
}  
patch GLNK GSHA:1 GSHA:2  

The command first none means that we apply no N-terminal or C-terminal generating patches to the GGL residue. The patch command is the means by which we apply our patch. It is followed by the name of the patch from the topology file (GLNK) and a list of residues and their segment names [segment:residue] to which the patch should be applied (GSHA:1 GSHA:2).

\fbox{
\begin{minipage}{.2\textwidth}
\includegraphics[width=2.3 cm, height=2....
... based on local chemical considerations (hydrogen bonds etc).}
\end{minipage} }

6
Use psfgen to create a psf for the system. In the VMD TkCon window, type:

source gen-gst.pgn  

Check the output to make sure no obvious errors occurred. If successful, you will have created files gst.pdb and gst.psf.

\fbox{
\begin{minipage}{.2\textwidth}
\includegraphics[width=2.3 cm, height=2....
...gen, as they could indicate a problem with the topology file.}
\end{minipage} }

7
Check the files visually to make sure they were created properly. In VMD, first delete molecule 6GSS.pdb. Then, load the files gst.psf and gst.pdb into VMD by typing in the VMD TkCon window:

mol new gst.psf  
mol addfile gst.pdb  

Have the hydrogen atoms on GSHA and GSHB been built properly? Unusual geometry could indicate a problem with your topology file.

8
Copy the parameter file to be used in your simulation to the current directory. (Alternatively, you could alter the NAMD configuration file to point to the original location of the parameter file.)

At this point, you are ready to run a simulation of your system in vacuum. If you have limited computational power (i.e. those performing this tutorial on a laptop), you should perform a vacuum run at this point and skip Section 2.3. If you have access to greater computational power (i.e. a small cluster), you may skip the vacuum run and continue on to Section 2.3.

\fbox{
\begin{minipage}{.2\textwidth}
\includegraphics[width=2.3 cm, height=2....
...\backslash$namd2 gst\_vac\_mineq.conf > gst\_vac\_mineq.log}
}
\end{minipage} }


Solvating and Ionizing

1
Place the protein in a water box. In VMD, use the solvate package by typing in the TkCon window:

package require solvate  
solvate gst.psf gst.pdb -t 5 -o gst_solv  

2
Add ions to model physiological conditions by using VMD's autoionize package. In the TkCon window, type:

package require autoionize  
autoionize -psf gst_solv.psf -pdb gst_solv.pdb -is 0.2 -o gst_solv_ion  

3
Now lets do some minimization and equilibration using Langevin dynamics. The NAMD configuration file gst_wb_mineq.conf has been provided for this. The cell basis vectors should match those output by solvate. The PMEGridSize dimensions should also be appropriate. You should look at the configuration file before running to make sure you understand what it will do.

\fbox{
\begin{minipage}{.2\textwidth}
\includegraphics[width=2.3 cm, height=2....
...D$\backslash$namd2 gst\_wb\_mineq.conf > gst\_wb\_mineq.log}
}
\end{minipage} }

As frames are added to the output trajectory (dcd) files, you can load them into VMD as a preliminary check that all is well with the topology file you created. Does the GSH molecule behave, or does the geometry distort? Unusual behaviour could indicate a problem in the topology file. If all appears well, you have succeeded!


next up previous
Next: Solution to Norleucine Problem Up: Topology Tutorial Previous: Examining the Topology File
tutorial-l@ks.uiuc.edu