Analysis of the glutathione S-transferase (GST) gene family
© Henry Stewart Publications 2004
Received: 4 August 2004
Accepted: 4 August 2004
Published: 1 November 2004
The glutathione S-transferase (GST) gene family encodes genes that are critical for certain life processes, as well as for detoxication and toxification mechanisms, via conjugation of reduced glutathione (GSH) with numerous substrates such as pharmaceuticals and environmental pollutants. The GST genes are upregulated in response to oxidative stress and are inexplicably overexpressed in many tumours, leading to problems during cancer chemotherapy. An analysis of the GST gene family in the Human Genome Organization-sponsored Human Gene Nomenclature Committee database showed 21 putatively functional genes. Upon closer examination, however, GST-kappa 1 (GSTK1), prostaglandin E synthase (PTGES) and three microsomal GSTs (MGST1, MGST2, MGST3) were determined as encoding membrane-bound enzymes having GST-like activity, but these genes are not evolutionarily related to the GST gene family. It is concluded that the complete GST gene family comprises 16 genes in six subfamilies -- alpha (GSTA), mu (GSTM), omega (GSTO), pi (GSTP), theta (GSTT) and zeta (GSTZ).
One goal of this 'Update on Genome Completion and Annotations' series [1, 2] has been to select a gene, or gene family, check for accuracy in the databases, and then help to suggest ways to correct any nomenclature problems that might exist. The glutathione S-transferases (GSTs) represent an important group of enzymes which detoxify both endogenous compounds and foreign chemicals such as pharmaceuticals and environmental pollutants. Although a large number of reviews about this important enzyme family have appeared,[3–11] there continues to be considerable confusion in the field with regard to the naming and classification of these genes and gene products.
Homologous genes, having a common ancestral origin 2 billion years ago or more, can be identified more readily, if they are designated with a stem (or root) symbol. A root symbol is very much encouraged by the Human Gene Nomenclature Committee (HGNC) as the basis for a hierarchical series of genes (eg for the ABC family, subfamily A, ABCA1, ABCA2, ABCA3, ABCA4) that are either the result of evolutionary divergence of an ancient ancestral gene, or have conserved functions -- via pathways, interactions or protein domains. Such a root symbol allows the easy identification of other related members in both database searches and the literature.
Homologous regions of 15-25 per cent of nucleotides or amino acids can be detected by the various alignment programs, denoting divergence from an ancestral gene; a small almost-invariant DNA motif or protein domain -- functioning as an enzyme active-site, cofactor docking site or ligand-binding site -- is further evidence of divergence from an ancestral gene. One of the earliest examples of this nomenclature approach for homologous genes was the cytochrome P450 (CYP) gene superfamily, in which it was agreed that approximately 40 per cent or more amino acid similarity allows two members to be placed in the same family and about 55 per cent or greater similarity allows two members to be assigned to the same subfamily . These cut-off values follow the original recommendations by Margaret Dayhoff. At present, more than 130 additional gene superfamilies and large gene families have since followed this same format .
Biochemistry of the GST enzymes
The fundamental basis for all GST catalytic activities is the capacity of these enzymes to lower the pKa of the sulfhydryl group of reduced glutathione (GSH) from 9.0 in aqueous solution to about 6.5 when GSH is bound in the active site . GSH exists as the thiolate (GS-) anion at neutral pH when complexed with the GST enzyme. Catalysis by GSToccurs through the combined capacity of the enzyme to promote GS- formation and to bind hydrophobic electrophilic compounds at a closely adjacent site . The GSH-binding and the hydrophobic substrate-binding sites have been called the G- and H-sites, respectively . In the case of certain substrates (eg benzyl and phenethyl isothiocyanates, alkyl dihalides), GST can catalyse both the forward and reverse reactions, leading to increased toxicity rather than detoxication . The active cytosolic enzyme exists as a dimer of two subunits [3, 4].
Evolution of the GSTgenes
GSTs are widely distributed in nature -- from bacteria and yeast to plants and animals. Plant GSTs include the phi, tau, theta, zeta and lambda classes; the theta and zeta have counterparts in animals [4, 5]. The sigma and theta classes are abundant in non-vertebrate animals . There is significant homology between a class theta GST and a dichloromethane dehalogenase enzyme from the prokaryote Methylobacterium, suggesting that the ancestral progenitor for mammalian GSTs probably arose from the theta class.
The analysis in this review will focus only on human GST genes. Numerous polymorphisms exist in the human GST genes,[10, 11] including the complete absence of the GSTM1 or the GST theta 1 gene -- at frequencies as high as 20 per cent to 50 per cent in some populations. Given the absence in certain GST activities, one can see how this might lead to decreased detoxication of environmental carcinogens or chemotherapeutic agents and thus to clinical problems in patients lacking these genes. Evidence is also emerging that GST genes from some pathogens might exert immunomodulatory functions towards the immune system, involving separate profiles of cytokine gene transcription and different patterns of cell growth . Antioxidants, as well as oxidative stress, induce transcription of many of the GST genes,[8, 9] leading to increased protection of the cell against insult by environmental chemicals and drugs.
Cytosolic versus membrane-bound GSTs
Many of the GST reviews include membrane-bound as well as cytosolic enzymes [4, 7]. Microsomal GST  and leukotriene C4 synthase  have been described as members of the GST family, although it has been noted  that neither shares sequence identity with the cytosolic GSTs. It would therefore appear likely that these membrane-bound GST enzymes represent examples of convergent, rather than divergent, evolution; at a particular point in time during evolution, Mother Nature required an enzyme to carry out such a membrane-bound catalytic reaction and assigned that task to an enzyme class different from that of the cytosolic GSTs.
The real GSTs have the two domains GST_N  and GST_C . One or the other of these domains appears in a number of other proteins. This might explain why some other proteins exhibit GST-like activity.
HGNC search for GSTgenes
List of human GST putatively functional genes.
Approved gene symbol
Approved gene name
Glutathione S-transferase (alpha) A1
Glutathione S-transferase A2
Glutathione S-transferase A3
Glutathione S-transferase A4
Glutathione S-transferase A5
Glutathione S-transferase A pseudogene 1
Glutathione S-transferase A pseudogene 2
Glutathione S-transferase kappa 1
Glutathione S-transferase (mu) M1
Glutathione S-transferase 'M1-like' pseudogene
Glutathione S-transferase M2 (muscle)
Glutathione S-transferase M3 (brain)
Glutathione S-transferase M3 pseudogene
Glutathione S-transferase M4
Glutathione S-transferase M5
Glutathione S-transferase omega 1
Glutathione S-transferase omega 2
Glutathione S-transferase omega 3 pseudogene
Glutathione S-transferase (pi) P1
Glutathione S-transferase pi pseudogene
Glutathione S-transferase theta 1
Glutathione S-transferase theta 2
Glutathione S-transferase (zeta) Z1
Microsomal glutathione S-transferase 1
Microsomal glutathione S-transferase 2
Microsomal glutathione S-transferase 3
Prostaglandin E synthase
Greek-to-Latin alphabetic conversions
Finally, of the six GST subfamilies, two of these are misnamed in the HGNC database, according to its own guidelines (Table 1). The two functional genes and one pseudogene of the omega class should correctly be named GSTW1, GSTW2 and GSTW3P1, respectively; the symbol 'W' stands for 'omega', whereas the symbol 'O' stands for 'omicron'. Similarly, the two functional genes of the theta class should correctly be named GSTQ1 and GSTQ2, because the symbol 'Q' stands for 'theta', whereas the symbol 'T' stands for 'tau'. Plants contain GST tau genes [4, 5].
The HGNC addressed this 'Greek letter' issue in relation to the GST genes. GSTT1 and GSTT2 were approved in 1994, in line with a request from Board's laboratory  and have been widely used ever since, with GSTT1 especially appearing in hundreds of references listed in PubMed. Likewise, Board's group published work about the GSTO1, GSTO2 and GSTOP3 genes, which were approved by the HGNC in 2003. HGNC therefore concluded:
'Although we do indeed have guidelines for Greek letter conversions, we also aim to serve the community by providing a useable and used nomenclature. It would seem to us to be somewhat pedantic to change the symbols for these five genes, all of which are being widely used in publications, simply because they did not conform to a guidance conversion table. In a similar manner, we usually use 'G' for gamma, but sometimes 'C' has been used instead, because this is taken as the third-letter-of-the-alphabet equivalent, eg laminin gamma-2 encoded by the LAMC2 gene . Hence, we do not see a need to change these glutathione S-transferase symbols. We realize that people can become upset by nomenclature changes, and we believe that a working nomenclature system is more desirable than a perfect one'.
Since mouse nomenclature follows that of human, the Mouse Genomic Nomenclature Committee (MGNC) will similarly stay with these same symbols for the orthologous genes. Both HGNC and MGNC continue to work closely with experts in the field, and the committees certainly make changes to the nomenclature, based on information from the experts when necessary. In most instances, changes will be made if they are necessary in order to promote accuracy and consistency.
The GST gene family comprises 16 genes in six subfamilies. Several problems were found in the HGNC listings and nomenclature for the GST gene family. First, GSTM1L is a pseudogene. Secondly, there are five additional genes included (GSTK1, MGST1, MGST2, MGST3 and PTGES) that encode membrane-bound enzymes having GST-like activity but which are not evolutionarily related to the 16 true GST genes. Thirdly, according to the Human Genome Organization HGNC's own rules, the GST-omega subfamily should include 'W' for omega -- instead of 'O', which is reserved for omicron -- and the GST-theta subfamily should include 'Q' for theta -- instead of 'T', which is reserved for tau. And plants have a GST-tau subfamily. The present authors' analysis of the GST gene family simply underscores some of the problems encountered in the various databases. Similar nomenclature problems were seen with the mouse Gst genes (not shown). The authors estimate that it will take many years before all of the bumps and wrinkles can be ironed out of the nomenclature systems for human and mouse genes and gene families.
The writing of this article was funded, in part, by NIH grants P30 ES06096 (D.W.N.) and R01 EY11490 (V.V.).
- Nebert DW, Wain HM: Update on human genome completion and annotations: Gene nomenclature. Hum Genomics. 2003, 1: 66-71.PubMed CentralView ArticlePubMedGoogle Scholar
- Nebert DW, Sophos NA, Vasiliou V, Nelson DR: Update on human genome completion and annotations: Cyclophilin nomenclature problems, or a visit from the sequence police. Hum Genomics. 2004, 1: 381-388.PubMed CentralView ArticlePubMedGoogle Scholar
- Hayes JD, Pulford DJ: The glutathione S-transferase gene family: Regulation of GST and the contribution of the isoenzymes to cancer chemoprotection and drug resistance. Crit Rev Biochem Mol Biol. 1995, 30: 445-600. 10.3109/10409239509083491.View ArticlePubMedGoogle Scholar
- Sheehan D, Meade G, Foley VM, Dowd CA: Structure, function and evolution of glutathione transferases: Implications for classification of non-mammalian members of an ancient enzyme superfamily. Biochem J. 2001, 360: 1-16. 10.1042/0264-6021:3600001.PubMed CentralView ArticlePubMedGoogle Scholar
- Dixon DP, Lapthorn A, Edwards R: Plant glutathione transferases. Genome Biol. 2002, 3: S3004-View ArticleGoogle Scholar
- Ouaissi A, Ouaissi M, Sereno D: Glutathione S-transferases and related proteins from pathogenic human parasites behave as immunomodulatory factors. Immunol Lett. 2002, 81: 159-164. 10.1016/S0165-2478(02)00035-4.View ArticlePubMedGoogle Scholar
- Murakami M, Nakatani Y, Tanioka T, Kudo I: Prostaglandin E synthase. Prostaglandins Other Lipid Mediat. 2002, 68-69: 383-399.View ArticlePubMedGoogle Scholar
- Owuor ED, Kong AN: Antioxidants- and oxidants-regulated signal transduction pathways. Biochem Pharmacol. 2002, 64: 765-770. 10.1016/S0006-2952(02)01137-1.View ArticlePubMedGoogle Scholar
- Rinaldi R, Eliasson E, Swedmark S, Morgenstern R: Reactive intermediates and the dynamics of glutathione transferases. Drug Metab Dispos. 2002, 30: 1053-1058. 10.1124/dmd.30.10.1053.View ArticlePubMedGoogle Scholar
- Townsend D, Tew K: Cancer drugs, genetic variation, and the glutathione-S-transferase gene family. Am J Pharmacogenomics. 2003, 3: 157-172. 10.2165/00129785-200303030-00002.View ArticlePubMedGoogle Scholar
- Coles BF, Kadlubar FF: Detoxification of electrophilic compounds by glutathione S-transferase catalysis: Determinants of individual response to chemical carcinogens and chemotherapeutic drugs?. Biofactors. 2003, 17: 115-130. 10.1002/biof.5520170112.View ArticlePubMedGoogle Scholar
- Armstrong RN: Glutathione S-transferases: Structure and mechanism of an archaetypical detoxication enzyme. Adv Enzymol Relat Areas Mol Biol. 1994, 69: 1-44.PubMedGoogle Scholar
- Mannervik B: The isoenzymes of glutathione transferase. Adv Enzymol Relat Areas Mol Biol. 1985, 57: 357-417.PubMedGoogle Scholar
- La Roche SD, Leisinger T: Sequence analysis and expression of the bacterial dichloromethane dehalogenase structural gene, a member of the glutathione S-transferase gene superfamily. J Bacteriol. 1990, 172: 164-171.PubMed CentralPubMedGoogle Scholar
- Morgenstern R, DePierre JW: Microsomal glutathione transferase. Purification in unactivated form and further characterization of the activation process, substrate specificity, and amino acid composition. Eur J Biochem. 1983, 134: 591-597. 10.1111/j.1432-1033.1983.tb07607.x.View ArticlePubMedGoogle Scholar
- Nicholson DW, Ali A, Vaillancourt JP, et al: Purification to homogeneity and the N-terminal sequence of human leukotriene C4 synthase: A homodimeric glutathione S-transferase composed of 18-kDa subunits. Proc Natl Acad Sci USA. 1993, 90: 2015-2019. 10.1073/pnas.90.5.2015.PubMed CentralView ArticlePubMedGoogle Scholar
- Polekhina G, Board PG, Blackburn AC, Parker MW: Crystal structure of maleylacetoacetate isomerase/glutathione transferase zeta reveals the molecular basis for its remarkable catalytic promiscuity. Biochemistry. 2001, 40: 1567-1576. 10.1021/bi002249z.View ArticlePubMedGoogle Scholar
- CLUSTAL W alignment for DNA and proteins. [http://www.ebi.ac.uk/clustalw/]
- Tan KL, Webb GC, Baker RT, Board PG: Molecular cloning of a cDNA and chromosomal localization of a human theta-class glutathione S-transferase gene (GSTT2) to chromosome 22. Genomics. 1995, 25: 381-387. 10.1016/0888-7543(95)80037-M.View ArticlePubMedGoogle Scholar
- Board PG, Coggan M, Chelvanayagam G, et al: Identification, characterization, and crystal structure of the omega class glutathione transferases. J Biol Chem. 2000, 275: 24798-24806. 10.1074/jbc.M001706200.View ArticlePubMedGoogle Scholar
- Pulkkinen L, Christiano AM, Airenne T, et al: Mutations in the gamma-2 chain gene (LAMC2) of kalinin/laminin-5 in the junctional forms of epidermolysis bullosa. Nat Genet. 1994, 6: 293-297. 10.1038/ng0394-293.View ArticlePubMedGoogle Scholar