Update on the olfactory receptor (OR) gene superfamily
Human Genomics volume 3, Article number: 87 (2008)
The olfactory receptor gene (OR) superfamily is the largest in the human genome. The superfamily contains 390 putatively functional genes and 465 pseudogenes arranged into 18 gene families and 300 subfamilies. Even members within the same subfamily are often located on different chromosomes. OR genes are located on all autosomes except chromosome 20, plus the X chromosome but not the Y chromosome. The gene:pseudogene ratio is lowest in human, higher in chimpanzee and highest in rat and mouse -- most likely reflecting the greater need of olfaction for survival in the rodent than in the human. The OR genes undergo allelic exclusion, each sensory neurone expressing usually only one odourant receptor allele; the mechanism by which this phenomenon is regulated is not yet understood. The nomenclature system (based on evolutionary divergence of genes into families and subfamilies of the OR gene superfamily) has been designed similarly to that originally used for the CYP gene superfamily.
Before 1980, the names of genes and classification of their encoded proteins were highly variable and non-systematic -- especially to anyone slightly outside a particular field or to a new graduate student entering the field. Professor Margaret Oakley Dayhoff was a pioneer in attempting to create order out of chaos in the naming of genes and gene families by means of computerised protein alignments. She was widely recognised as the founder of this new field of gene/protein classification, before her untimely death in 1983.
Cytochrome P450 (CYP) genes are conveniently arranged into families and subfamilies based on the percentage amino acid sequence identity [2–7]. Enzymes that share approximately ≥ 40 per cent identity are assigned to a particular family designated by an Arabic numeral, whereas those sharing approximately ≥ 55 per cent identity are grouped into a particular subfamily designated by a letter. For example, the sterol 27-hydroxylase enzyme and the 25-hydroxy-vitamin D3 1α-hydroxylase enzyme are both assigned to the CYP27 family because they share > 40 per cent sequence identity. Furthermore, the sterol 27-hydroxylase is assigned to the CYP27 'A' subfamily and the 25-hydroxy-vitamin D3 1α-hydroxylase to the CYP27 'B' subfamily because their protein sequences are < 55 per cent identical. If an additional enzyme were to be discovered that shared > 55 per cent identity with the sterol 27-hydroxylase, then it would be named CYP27A2. If an additional enzyme were to be discovered that shared < 55 per cent but > 40 per cent identity with the sterol 27-hydroxylase as well as the 25-hydroxy-vitamin D3 1α-hydroxylase, then it would be named CYP27C1. The development and application of this delightfully logical system of nomenclature to the genes of many animals, plants and bacteria has eliminated the confusion that often had plagued the naming of gene families and superfamilies. Subsequently, this 'divergent evolution' nomenclature system was adopted for several hundred other gene families and superfamilies -- including the olfactory receptor superfamily.
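The naming rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the thresholds are the approximate 40 and 55 per cent cut-offs quoted in the text, the function names `relationship` and `assign_name` are hypothetical, and the identity values in the usage example are invented for illustration rather than measured.

```python
# Approximate cut-offs quoted in the text (percentage amino acid identity).
FAMILY_THRESHOLD = 40.0
SUBFAMILY_THRESHOLD = 55.0


def relationship(identity_percent: float) -> str:
    """Classify a pair of proteins by their percentage sequence identity."""
    if identity_percent >= SUBFAMILY_THRESHOLD:
        return "same subfamily"
    if identity_percent >= FAMILY_THRESHOLD:
        return "same family, different subfamily"
    return "different family"


def assign_name(family: str, existing: dict, identities: dict) -> str:
    """Suggest a name for a newly discovered member of `family`.

    existing   -- subfamily letter -> number of members already named
    identities -- subfamily letter -> best % identity of the new protein
                  to any member of that subfamily (hypothetical inputs)
    """
    best = max(identities, key=identities.get)
    if identities[best] >= SUBFAMILY_THRESHOLD:
        # Joins the closest existing subfamily as its next member.
        return f"{family}{best}{existing[best] + 1}"
    if identities[best] >= FAMILY_THRESHOLD:
        # Founds a new subfamily, using the next unused letter.
        next_letter = chr(ord(max(existing)) + 1)
        return f"{family}{next_letter}1"
    raise ValueError("identity below the family threshold; not a member")


# Illustrative identity values only.
current = {"A": 1, "B": 1}  # CYP27A1 and CYP27B1
print(assign_name("CYP27", current, {"A": 60.0, "B": 45.0}))  # CYP27A2
print(assign_name("CYP27", current, {"A": 50.0, "B": 48.0}))  # CYP27C1
```

Because the published cut-offs are approximate, a sketch like this would suggest a name only; borderline cases would still need expert review.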
Background and history
Vertebrate olfactory receptor (OR) genes represent a category of G-protein-coupled receptors (GPCRs) that contain seven transmembrane α-helical domains and function in the reception of innumerable odour molecules in the environment. The OR gene superfamily is the largest in vertebrate genomes [10–13]. The genomic architecture of mammalian OR gene clusters shows an ancient evolutionary origin, preceding the marsupial-eutherian split; species-specific evolution has further shaped the different OR gene families, by means of both gains and losses of complete clusters, as well as expansion and contraction of existing clusters.
This dynamic flexibility is also reflected among individual humans; an examination of 51 candidate OR genes on DNA chips in 189 ethnically diverse subjects found a striking amount of population diversity. Segregating pseudogenes (SPGs) are genes that segregate in populations between intact genes and pseudogenes -- due to a disruptive single nucleotide polymorphism (SNP). In this study alone, a range of 16-24 functional OR genes was found, indicating that the OR gene superfamily is among the most pronounced examples of functional population diversity in the human genome. Copy number variations (CNVs), another type of polymorphism, are also highly prevalent among human OR genes [15, 16]. All these genomic events are evidence of a relatively recent process, whereby the extreme diminution of a functional repertoire in humans has occurred -- a process which is presumably still ongoing.
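The definition of a segregating pseudogene can be made concrete with a small Python sketch. It assumes, purely for illustration, that every allele observed at a locus has already been labelled "intact" or "disrupted"; the function name `locus_status` and the sample data are hypothetical and do not reproduce the cited study's method.

```python
def locus_status(allele_states):
    """Classify a locus from the allele states observed in a population.

    allele_states -- iterable of "intact" / "disrupted" labels, one per
                     sampled allele (assumed to be pre-computed).
    """
    states = set(allele_states)
    if states == {"intact"}:
        return "intact gene"
    if states == {"disrupted"}:
        return "pseudogene"
    if states == {"intact", "disrupted"}:
        # Both forms coexist in the population: the locus segregates.
        return "segregating pseudogene"
    raise ValueError(f"unexpected allele labels: {sorted(states)}")


# Invented sample: three loci, four sampled alleles each.
sample = {
    "locus_1": ["intact", "intact", "intact", "intact"],
    "locus_2": ["disrupted", "disrupted", "disrupted", "disrupted"],
    "locus_3": ["intact", "disrupted", "intact", "disrupted"],
}
for name, alleles in sample.items():
    print(name, "->", locus_status(alleles))
```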
For most mammalian species, the ability to detect millions of different odourants is critical to their survival. Based on recent OR gene mining data in the platypus, opossum, cow and dog genomes -- compared with that in the rat, mouse, macaque and human genomes -- we are now certain that there has been a substantial expansion of the OR gene superfamily since the mammalian radiation ~ 100 million years ago.
The evolutionary change in the number of OR genes in insects is not nearly as extensive as that in mammals. Drosophila melanogaster has a relatively small receptor repertoire of 62 odourant receptors. A comparison of 12 Drosophila species, encompassing ~ 60 million years of divergence, shows that the number of functional OR genes has remained fairly stable. Caenorhabditis elegans has a highly developed chemosensory system, which enables it to detect a wide variety of volatile (olfactory) and water-soluble (gustatory) cues associated with food, danger or other animals; between 500 and 1,000 different GPCRs are expressed in chemosensory neurones, and these may be supplemented by alternative sensory pathways as well. The vertebrate OR gene repertoire has thus evolved from a subset of ancestral genes in the fly and worm.
There appear to be three important periods in the evolution of the vertebrate olfactory system, as evidenced by comparative genomics: (1) the adaptation to land in amphibian ancestors; (2) the decline of olfaction in primates; and (3) the delineation of putative pheromone receptors concurrent with rodent speciation. The gene:pseudogene ratio is lowest in human, higher in chimpanzee and highest in rat and mouse. This most likely reflects the necessity of olfaction for survival -- more so in the rodent than in the human.
Whereas the chicken, platypus and primate genomes carry <400 functional OR genes, the opossum and rodent genomes, not surprisingly, contain between 1,000 and 1,210 functional OR genes [11, 13]. Curiously, however, it is difficult to explain why the cow genome, with 970 functional OR genes, has more than the dog genome, with ~811 functional OR genes, when dogs are considered to have such a keen sense of olfaction. Thus, the number of OR genes in a species does not appear to be directly related to the environmental 'requirement' or to lifestyle.
Current bioinformatics about the OR gene superfamily
The OR gene superfamily comprises 18 gene families and 300 subfamilies (Table 1). Presently, there are 390 putatively functional (protein-coding) OR genes and 465 OR pseudogenes located in multiple clusters of varying sizes scattered throughout all autosomes except chromosome (Chr) 20, and on the X but not the Y chromosome [21–23]. The members of each subfamily have been placed therein because of divergent evolution, as described above. These subfamilies differ from CYP subfamilies, in that individual members within one subfamily are often located on two or more different chromosomes. The OR2T (Table 2) subfamily contains 16 functional genes -- more than in any other subfamily. Evolutionary divergence of each of the 18 gene families is illustrated in Figure 1.
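As a quick arithmetic check on the counts quoted above, the human gene:pseudogene ratio and the functional fraction can be computed directly. The snippet below is a minimal sketch using only the two totals given in the text; the variable names are illustrative.

```python
# Counts quoted in the text for the human OR gene superfamily.
functional_genes = 390
pseudogenes = 465

total_loci = functional_genes + pseudogenes
gene_to_pseudogene_ratio = functional_genes / pseudogenes
functional_fraction = functional_genes / total_loci

print(f"total OR loci:         {total_loci}")                     # 855
print(f"gene:pseudogene ratio: {gene_to_pseudogene_ratio:.2f}")   # 0.84
print(f"functional fraction:   {functional_fraction:.1%}")        # 45.6%
```

A ratio below 1 means that pseudogenes outnumber putatively functional genes in the human repertoire.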
Note that, in many instances, some subfamilies contain only a single gene or only a single pseudogene (Tables 2-5). In fact, the OR7E subfamily has only one functional gene, and all the other 85 members are pseudogenes (Table 3). The OR7E subfamily is the largest subfamily in the human OR gene repertoire, and probably has expanded in the human genome through a series of segmental gene duplication events. The newly described human OR14 gene family (Table 4) was realised after analysis of the platypus and opossum OR gene repertoires. This analysis revealed that six human OR functional genes and one OR pseudogene (which previously had been classified within the OR5 family) are actually derived from a distinct platypus OR gene family [11, 25]. The evolutionary divergence of the OR14 gene family is shown in Figure 2.
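The family/subfamily structure is encoded in the gene symbols themselves, so a symbol can be split into its parts mechanically. The Python sketch below assumes a pattern inferred from symbols used in this article (for example OR7D4 and OR11H7P): the root "OR", a family number, a subfamily label of one or two capital letters, a member number and an optional trailing "P" for pseudogenes. That pattern, and the names `ORSymbol` and `parse_or_symbol`, are assumptions for illustration, not an official specification.

```python
import re
from typing import NamedTuple


class ORSymbol(NamedTuple):
    family: int          # e.g. 7 in OR7D4
    subfamily: str       # e.g. "D" in OR7D4
    member: int          # e.g. 4 in OR7D4
    is_pseudogene: bool  # True when the symbol ends in "P"

    @property
    def subfamily_name(self) -> str:
        return f"OR{self.family}{self.subfamily}"


# Assumed symbol layout: OR + family digits + 1-2 letters + member digits + optional P.
_SYMBOL_PATTERN = re.compile(r"OR(\d+)([A-Z]{1,2})(\d+)(P?)")


def parse_or_symbol(symbol: str) -> ORSymbol:
    """Split an OR gene symbol into family, subfamily, member and status."""
    match = _SYMBOL_PATTERN.fullmatch(symbol)
    if match is None:
        raise ValueError(f"not a recognised OR gene symbol: {symbol!r}")
    family, subfamily, member, suffix = match.groups()
    return ORSymbol(int(family), subfamily, int(member), suffix == "P")


for symbol in ("OR7D4", "OR11H7P"):
    parsed = parse_or_symbol(symbol)
    print(symbol, "->", parsed.subfamily_name, parsed)
```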
The 'shotgun' splattering of OR genes throughout the human genome must have happened before speciation of Homo sapiens and the development of its 22 autosomes plus the X and Y chromosomes; this can be inferred from the high conservation of the OR genes' genomic organisation among marsupial and eutherian mammals, and the phylogenetic analysis of the platypus OR gene repertoire -- by comparison with that in other mammals [13, 25]. In contrast to this OR gene arrangement would be the establishment of the CYP gene subfamilies, which arose as syntenic clusters of members within a single chromosomal segment. This finding suggests that gene duplication events within CYP subfamilies occurred after mammalian speciation and development of the autosomes and sex chromosomes.
The two largest OR gene clusters are located on Chr 11, with 38 functional genes (51 per cent of total) on 11q (Cluster email@example.com) and 44 functional genes (45 per cent) on 11p (Cluster firstname.lastname@example.org). These genes are predominantly in OR families 51, 52, 55 and 56 (Table 5). Immersed within these two clusters are dozens of other non-OR-related genes. This intrusion of other non-OR-related genes can also be seen in all other OR gene clusters throughout the genome.
Future directions: Additional subsets of sensory reception genes and identification of ligands
A recently appreciated discovery in olfaction is the unique specialisation of sensory neurones, such that each individual sensory neurone is stochastically chosen to express usually only one odourant receptor allele. This mechanism of 'allelic exclusion', by which mutually exclusive expression of odourant receptor genes is regulated, remains unclear at present [20, 26, 27].
The vomeronasal-1 receptor genes (VN1R) also encode GPCRs and, while they encode odourant receptors, they are evolutionarily distinct from the very large OR gene superfamily. There are five VN1R genes and nine VN1R pseudogenes. The VN1R1, VN1R2 and VN1R4 genes and VN1R6P pseudogene are located at Chr 19q13.42; the VN1R10P, VN1R11P, VN1R12P, VN1R13P and VN1R14P pseudogenes are located on Chr 6p21; VN1R7P and VN1R8P are on Chr 21p11.2; VN1R3 is alone on Chr 16p11.2; VN1R5 is alone on Chr 1q44; VN1R9P is alone on Chr 22.
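The chromosomal locations listed above can be tabulated and cross-checked against the stated totals of five genes and nine pseudogenes. The Python sketch below transcribes the loci from the text into a dictionary; treating a trailing "P" in the symbol as the pseudogene marker is an assumption based on the names used here.

```python
# Chromosomal location -> VN1R symbols, transcribed from the text.
VN1R_LOCI = {
    "19q13.42": ["VN1R1", "VN1R2", "VN1R4", "VN1R6P"],
    "6p21": ["VN1R10P", "VN1R11P", "VN1R12P", "VN1R13P", "VN1R14P"],
    "21p11.2": ["VN1R7P", "VN1R8P"],
    "16p11.2": ["VN1R3"],
    "1q44": ["VN1R5"],
    "22": ["VN1R9P"],
}

all_symbols = [symbol for symbols in VN1R_LOCI.values() for symbol in symbols]

# Assumption: a trailing "P" marks a pseudogene.
pseudogenes = [symbol for symbol in all_symbols if symbol.endswith("P")]
genes = [symbol for symbol in all_symbols if not symbol.endswith("P")]

print(len(genes), "genes:", ", ".join(genes))
print(len(pseudogenes), "pseudogenes:", ", ".join(pseudogenes))
```

The tallies agree with the counts given in the paragraph: five genes and nine pseudogenes across six locations.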
At the present time, information about the ligands for mammalian OR genes is very limited. The smell of lemons (limonene), the perception of a floral or woody smell (acetophenone) and the ability to smell isovaleric acid have been mapped in the mouse to two specific genomic loci on Chr 4 (Iva1) and Chr 6 (Iva2). In humans, isovaleric acid was found to be highly associated with the OR11H7P segregating pseudogene, which is not syntenic with either Iva1 or Iva2. Another recent study found that human OR7D4 is selectively activated in vitro by androstenone; interestingly, this study found that two non-synonymous SNPs account for a significant proportion of the variance in smell perception of androstenone.
Members of the gustatory receptor (Gr) gene family in Drosophila are expressed in chemosensory neurones and are known to mediate the perception of sugars, bitter substrates, carbon dioxide and pheromones. Intriguingly, some of these Gr genes have now been shown to be expressed in abdominal multi-dendritic neurones, hygroreceptive neurones of the arista, peripheral proprioceptive neurones in the legs, neurones in the larval and adult brain, and oenocytes. Along these same lines, we and others have observed several OR genes being significantly up- or downregulated in the liver or kidney of knockout mouse lines -- that is, in tissues not normally known to be involved in olfaction. It is therefore tempting to speculate that the receptors encoded by OR genes, as well as by Gr genes, might participate in detecting endogenous ligands.
References
Dayhoff MO, Barker WC, Hunt LT: Establishing homologies in protein sequences. Methods Enzymol. 1983, 91: 524-545.
Nebert DW, Adesnik M, Coon MJ, et al: The P450 gene superfamily. Recommended nomenclature. DNA. 1987, 6: 1-11. 10.1089/dna.1987.6.1.
Nebert DW, Gonzalez FJ: P450 genes: Structure, evolution, and regulation. Annu Rev Biochem. 1987, 56: 945-993. 10.1146/annurev.bi.56.070187.004501.
Nebert DW, Nelson DR, Adesnik M, et al: The P450 superfamily: Updated listing of all genes and recommended nomenclature for the chromosomal loci. DNA. 1989, 8: 1-13. 10.1089/dna.1.1989.8.1.
Nebert DW, Nelson DR, Coon MJ, et al: The P450 superfamily: Update on new sequences, gene mapping, and recommended nomenclature. DNA Cell Biol. 1991, 10: 1-14. 10.1089/dna.1991.10.1.
Nelson DR, Kamataki T, Waxman DJ, et al: The P450 superfamily: Update on new sequences, gene mapping, accession numbers, early trivial names of enzymes, and nomenclature. DNA Cell Biol. 1993, 12: 1-51. 10.1089/dna.1993.12.1.
Nelson DR, Koymans L, Kamataki T, et al: P450 superfamily: Update on new sequences, gene mapping, accession numbers and nomenclature. Pharmacogenetics. 1996, 6: 1-42. 10.1097/00008571-199602000-00002.
Buck L, Axel R: A novel multigene family may encode odorant receptors: A molecular basis for odor recognition. Cell. 1991, 65: 175-187. 10.1016/0092-8674(91)90418-X.
Gaillard I, Rouquier S, Giorgi D: Olfactory receptors. Cell Mol Life Sci. 2004, 61: 456-469. 10.1007/s00018-003-3273-7.
Aloni R, Olender T, Lancet D: Ancient genomic architecture for mammalian olfactory receptor clusters. Genome Biol. 2006, 7: R88-10.1186/gb-2006-7-10-r88.
Henion TR, Schwarting GA: Patterning the developing and regenerating olfactory system. J Cell Physiol. 2007, 210: 290-297. 10.1002/jcp.20888.
Niimura Y, Nei M: Extensive gains and losses of olfactory receptor genes in mammalian evolution. PLoS ONE. 2007, 2: e708-10.1371/journal.pone.0000708.
Menashe I, Man O, Lancet D, Gilad Y: Different noses for different people. Nat Genet. 2003, 34: 143-144. 10.1038/ng1160.
Nguyen DQ, Webber C, Ponting CP: Bias of selection on human copy-number variants. PLoS Genet. 2006, 2: e20-10.1371/journal.pgen.0020020.
Hasin Y, Olender T, Khen M, et al: High resolution copy-number variation map reflects human olfactory receptor diversity and evolution. PLoS Genet in press. 2008
Smith DP: Odor and pheromone detection in Drosophila melanogaster. Pflüger's Arch. 2007, 454: 749-758.
Nozawa M, Nei M: Evolutionary dynamics of olfactory receptor genes in Drosophila species. Proc Natl Acad Sci USA. 2007, 104: 7122-7127. 10.1073/pnas.0702133104.
Bargmann CI: Chemosensation in C. elegans. Wormbook, October 25. 2006, 1-29.
Kambere MB, Lane RP: Co-regulation of a large and rapidly evolving repertoire of odorant receptor genes. BMC Neurosci. 2007, 8 (Suppl 3): S2-10.1186/1471-2202-8-S3-S2.
Safran M, Chalifa-Caspi V, Shmueli O, et al: Human gene-centric databases at the Weizmann Institute of Science: GeneCards, UDB, CroW 21 and HORDE. Nucleic Acids Res. 2003, 31: 142-146. 10.1093/nar/gkg050.
Olender T, Feldmesser E, Atarot T, et al: The olfactory receptor universe -- From whole genome analysis to structure and evolution. Genet Mol Res. 2004, 3: 545-553.
Yue Y, Haaf T: 7E olfactory receptor gene clusters and evolutionary chromosome rearrangements. Cytogenet Genome Res. 2006, 112: 6-10. 10.1159/000087507.
Warren WC, Hillier LW, Marshall-Graves JA, et al: Genome analysis of the platypus reveals unique signatures of evolution. Nature. 2008, 453: 175-183. 10.1038/nature06936.
Serizawa S, Miyamichi K, Nakatani H, et al: Negative feedback regulation ensures the one receptor-one olfactory neuron rule in mouse. Science. 2003, 302: 2088-2094. 10.1126/science.1089122.
Rodriguez I: Odorant and pheromone receptor gene regulation in vertebrates. Curr Opin Genet Dev. 2007, 17: 465-470. 10.1016/j.gde.2007.07.005.
Malnic B, Godfrey PA, Buck LB: The human olfactory receptor gene family. Proc Natl Acad Sci USA. 2004, 101: 2584-2589. 10.1073/pnas.0307882100.
Griff IC, Reed RR: The genetic basis for specific anosmia to isovaleric acid in the mouse. Cell. 1995, 83: 407-414. 10.1016/0092-8674(95)90118-3.
Menashe I, Abaffy T, Hasin Y, et al: Genetic elucidation of human hyperosmia to isovaleric acid. PLoS Biol. 2007, 5: e284-10.1371/journal.pbio.0050284.
Keller A, Zhuang H, Chi Q, et al: Genetic variation in a human odorant receptor alters odour perception. Nature. 2007, 449: 468-472. 10.1038/nature06162.
Thorne N, Amrein H: Atypical expression of Drosophila gustatory receptor genes in sensory and central neurons. J Comp Neurol. 2008, 506: 548-568. 10.1002/cne.21547.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.
Acknowledgements
We thank our colleagues for valuable discussions and a critical reading of this manuscript. The writing of this article was funded, in part, by NIH grant P30 ES06096 (D.W.N.).
Keywords
- classification of gene families and subfamilies
- OR gene superfamily
- CYP gene superfamily
- nasal olfactory neurone
- olfactory receptor gene superfamily
- allelic exclusion
- opossum genome
- platypus genome