next up previous
Next: AARSs in Methanococcus jannaschii Up: Bioinformatics Tutorial Previous: Introduction


Biology of class II aminoacyl-tRNA Synthetases

Translation in biological cells is the process of protein synthesis directed by a nucleic acid message, mRNA. In the ribosome, each set of three successive nucleotides in the mRNA is matched to a specific amino acid according to the code shown in Table 2. The translation machinery dedicated to interpreting this nucleic acid code operates in a two part process. Amino acids are covalently linked, or ``charged'', to their cognate transfer RNAs (tRNAs) via an aminoacylation reaction catalyzed by a diverse group of multi-domain proteins, the aminoacyl-tRNA synthetases. Charged tRNAs are then shuttled to the ribosome where the tRNA anti-codon is matched to the mRNA codon, and the tRNA is deacylated with the amino acid being added as the next residue of a nascent protein chain [2]. The RNA world hypothesis states that the modern biological world - which relies on DNA and RNA to store genetic information and on proteins to perform catalytic tasks - was pre-dated by and evolved from a form of life that was mostly RNA based, with RNA molecules serving not only to store information, but also to perform required catalytic functions. It is likely that among the first proteins to take over catalytic duties from ribozymes were the aminoacyl-tRNA synthetases (AARSs). These ancient proteins are found in all extant organisms, and their inception clearly pre-dates the root of the universal phylogenetic tree [3,4]. In this tutorial you will use several alignment methods to study and compare various AARSs.


Table 1: Amino acids names and letter codes
Amino Acid Single Letter Three Letter
Alanine A Ala
Arginine R Arg
Asparagine N Asn
Aspartic acid D Asp
Asparagine or aspartic acid B Asx
Cysteine C Cys
Glutamine Q Gln
Glutamic acid E Glu
Glutamine or glutamic acid Z Glx
Glycine G Gly
Histidine H His
Isoleucine I Ile
Leucine L Leu
Lysine K Lys
Methionine M Met
Phenylalanine F Phe
Proline P Pro
Serine S Ser
Threonine T Thr
Tryptophan W Trp
Tyrosine Y Tyr
Valine V Val



Table 2: The genetic code. Some species have slightly different codes. $^*$This codon also specifies the initiator tRNA $^{\textrm {fMet}}$.
           
  UUU Phe UCU Ser UAU Tyr UGU Cys U
U UUC Phe UCC Ser UAC Tory GU Cys C
  UUA Leu UCA Ser UAA stop UGA stop A
  UUG Leu UCG Ser UAG stop UGG Trp G
  CUU Leu CCU Pro CAU His CGU Arg U
C CUC Leu CCC Pro CAC His CGC Arg C
  CUA Leu CCA Pro CAA Gln CGA Arg A
  CUG Leu CCG Pro CAG Gln CGG Arg G
  AUU Ile ACU Thr AAU Asn AGU Ser U
A AUC Ile ACC Thr AAC Asn AGC Ser C
  AUA Ile ACA Thr AAA Lys AGA Arg A
  AUG$^*$ Met ACG Thr AAG Lys AGG Arg G
  GUU Val GCU Ala GAU Asp GGU Gly U
G GUC Val GCC Ala GAC Asp GGC Gly C
  GUA Val GCA Ala GAA Glu GGA Gly A
  GUG$^*$ Val GCG Ala GAG Glu GGG Gly G



next up previous
Next: AARSs in Methanococcus jannaschii Up: Bioinformatics Tutorial Previous: Introduction
zan@uiuc.edu