Emerging strategies and applications of pharmacogenomics
Human Genomics volume 1, Article number: 444 (2004)
The rapid pace of genomic science advancements, including the completion of the human genome sequence, the extensive cataloguing of genetic variation and the acceleration of technologies to assess such variation, combined with clinical programmes with rich phenotypic data, serve as the foundation for the design and execution of pharmacogenomic studies which have an impact on the pharmaceutical pipeline from early discovery through to the marketplace. The authors discuss the required infrastructure to support pharmacogenomic studies and provide insight into the strategies and practical application to influence decision making in the pharmaceutical setting. Further, the influence of pharmacogenomics is currently affecting patient care in the oncology area and is highlighted as evident impact in the marketplace.
Pharmaceutical engagement of the pharmacogenomics discipline gained significant momentum in the mid-1990s as colleagues within the industry recognised the potential opportunity for the emerging field to have an impact on various phases of the drug discovery and development processes. While the optimism and excitement for this emerging discipline was high, the scientific and technological hurdles required to deliver pharmacogenomics seemed significant. At the time, the demonstration of pharmacogenomic impact as embodied by the drug metabolism field was evident, yet many hurdles remained to enable the delivery of comprehensive pharmacogenomic studies to explore disease state and drug response relationships. These hurdles included a complete draft sequence of the human genome, not expected until 2005. From this sequence would then emerge the catalogue of human genome variation, and this was not expected until several years after the completion of the human genome sequence. Speculation at the time suggested that more than 300,000 evenly spaced, single nucleotide polymorphisms (SNPs) would be needed to perform comprehensive pharmacogenomic studies -- a scale unimaginable at the time [1, 2]. How could it be possible to catalogue the SNPs needed to perform whole-genome association studies and conduct the genotyping -- the building blocks speculated to drive the scientific revolution needed to fully deliver pharmacogenomics?
Over the past ten years, significant technological progress to enable pharmacogenomics has been achieved. The International Human Genome project and Celera raced to publish the working draft genome sequence in 2001[3, 4] -- four years ahead of schedule, and the SNP Consortium, formed in 1999 through partnership between the academic and industrial community, developed the first comprehensive SNP map, published in November 2000, ahead of schedule, below cost and exceeding the initial goal of identifing 300,000 SNP markers of which 150,000 would be mapped by some 1.12 M, delivering 1.42M mapped SNPs for the scientific community at large to utilise for scientific research [5, 6]. This has now evolved into the International HapMap project, which will further refine the SNPs required for comprehensive association analysis by identifying critical patterns of SNPs that cosegregate from generation to generation in the form of haplotype tags and may provide the framework from which human whole-genome association studies are designed .
In parallel with the rapid advancements in the identification and mapping of SNPs across the human genome, technological discoveries and enhancements to high-throughput genotyping have also made significant strides. Over the course of the last five years, leading biotech firms have driven genotyping throughput to scales in the order of millions per day while significantly driving down costs. These technologies are now available to human genetics scientists to support large multi-gene association studies and the early application of human whole-genome association studies.
So, while in the mid-1990s many people heralded the revolution of 'personalised medicine' and 'the right drug for the right patient', the realists understood that, like any revolution, scientific research and technological advances would be required to shape the potential importance of this field to modern medicine. It is now understood that any two individuals differ from each other at the DNA sequence level by a mere 0.1 per cent. Yet it is this 0.1 per cent difference which makes each person unique and holds the key to differences in disease susceptibility and progression, as well as our response to therapeutic intervention . As of now, the foundation has been laid for the broadening application of pharmacogenomics; clear examples exist in medical practice, and scientific experiments can now commence at a pace and level which would have seemed unimaginable back in the mid-1990s. Furthermore, the face of the pharmaceutical industry has changed dramatically; with ever-increasing development costs and decreasing numbers of new drug approvals, the challenges facing the industry have never been more significant. This paper aims to highlight the infrastructure needed to integrate pharmacogenomics into the pharmaceutical business, as well as strategies for application, and discusses the potential applications and opportunities that this field offers to influence medical care.
Infrastructure for delivering pharmacogenomics into the pharmaceutical setting
The discipline of pharmacogenomics is currently being applied throughout the drug discovery and development process, as depicted in Figure 1. Highlights include: applying human genetics to ensure that the best therapeutic targets are prioritised for discovery investment; the comprehensive evaluation of the target gene sequence in multiple subjects to determine the genetic heterogeneity present in different ethnic populations; the use of genetics to select particular subpopulations and in early proof of concept studies; and, finally, the potential to predict efficacy and safety of new medicines more precisely. Prior to the effective integration and application of pharmacogenomics, however, significant investment and infrastructure was required to integrate pharmacogenomics into these processes. These areas are highlighted in Figure 2 and discussed in more detail below.
Appropriate informed consent
The ability to collect DNA, as well as other types of genomic samples, from study subjects is directly dependent on patient informed consent as to the risks and benefits associated with the sample collection. A separate informed consent, which states the purpose and the use of the samples, assurance of patient confidentiality through anonymisation or de-identification of unique patient identifiers and clinical data, as well as the optional participation in this donation, has been an integral part of patient acceptance in the clinical trial setting. Within current Pfizer clinical programmes it is not unusual for patient consent for genomic sample collection to exceed 80 per cent in clinical studies taking place in countries around the world.
Collection of phenotypic data
The ability to execute any genetic study requires accurate and comprehensive phenotypes, which at best are quantitative and reliable, to enable robust genetic analyses. Clinical trials serve as an excellent starting point for genetic research aimed at better understanding the underlying disease state and providing clinical results including outcome measures and drug response, both of which can drive pharmacogenomic investigations. Clinical trials are not genetic studies by design, however, and critical review of the completed study will often require careful epidemiological evaluation to ensure the appropriate study design. This often involves ensuring that those subjects donating DNA samples are reflective of the entire study population and are not biased in any way. If one is interrogating drug response phenotypes, care must be taken to examine the demographics of the responder/non-responder subgroups. Similarly, when safety events are investigated, subjects experiencing the adverse event must be closely matched to those subjects receiving the drug and exhibiting no such adverse event.
When considering the use of clinical trial samples for disease gene interrogations, the ascertainment of appropriately matched control populations to support such genetic studies is equally important. Often, clinical methodology studies to ascertain control populations matched on demographics as well as on certain clinical phenotypes present in the case population are required. In the authors' studies of osteoarthritis, critical care was taken to execute a study which identified control individuals who were radiographically free of joint space narrowing, to ensure accurate control phenotype status . These collections then allow the design and execution of case-control association studies in support of both target validation and genetic biomarker discovery to support clinical development programmes targeting the specific disease.
Technology for delivery of genetic analysis
The rapid pace of technology advancements enables the interrogation of the entire genome at the level of DNA, RNA or protein. Technology has moved rapidly from methods of restriction fragment polymorphism (RFLP) scoring of genetic variation to examining individual gene markers associated with various phenotypes, to SNP scoring to examine multiple markers in multiple candidate genes, to now having the ability to interrogate thousands of SNP markers using methods typified by Sequenom mass spec, Illumina bead-based methods, the Affymetrix high-density SNP chips  and Perlegen high-density wafers . It is now possible to examine hundreds of millions of SNPs within several weeks, as the authors are doing through their collaboration with Perlegen Sciences to study metabolic syndrome and depression phenotypes. It is anticipated that whole-genome sequencing in a timely and cost-effective manner may represent the next technological floodgate .
One of the single most important challenges facing the field of pharmacogenomics rests in the hands of the information technology experts. The vast amount of clinical data, combined with genomic data, requires a well designed information architecture to enable hypothesis generation at one end of the portfolio and validated systems to enable the use of the information in regulatory submissions at the other end. For example, one must be able to track clinical genomic samples from the patient from sample processing through to the sample repository, ensuring that there is integrated information regarding informed consent and potential restrictions on sample use, and that clinical data associated with the matched genomic sample is entered into a database. Moreover, often, a final step of anonymisation or de-identification must occur prior to initiation of a pharmacogenomic study. At this stage, the challenge lies in the ability to effectively query the clinical data in order to develop specific hypotheses that can be addressed by pharmacogenomic investigations, followed by data systems that will then house all of the newly generated genomic data, often derived from very different types of analysis platforms.
The ability to effectively integrate these data with clinical data and to mine such datasets remains elusive, and will require a comprehensive information technology solution to maximise the utility of research studies. This sophisticated knowledge base becomes a primary limitation in many cases, as the ability to integrate and effectively mine disparate data sources remains challenging and time intensive; however, this will have to be addressed in sophisticated systems to enable maximum return on investments in pharmacogenomics. In addition, when the importance of patient confidentiality in the future application of pharmacogenomics in medical care is considered, significant gaps exist. New business models have emerged that aim to focus on the integration and confidentiality of data sources. These include models developed by First Genetic Trust  and the Clinical Genomics business unit of IBM.
Carrying out pharmacogenomics studies to deliver value
Within the industry setting, the ability to drive pharmacogenomics into the fabric of the business is directly dependent on the potential to have an impact on business decisions. This therefore requires close partnership with colleagues in discovery, development and commercial marketing to work together to shape the scientific direction of research programmes. Given the challenges of the pharmaceutical industry, including the high attrition which occurs at every stage throughout the pipeline, combined with the long development time lines, excellent science alone is not sufficient. Pharmacogenomics has to drive impact and influence decisions aimed at increasing productivity within drug discovery and clinical development. A discussion focused on achieving such value follows.
Choosing the best target: Applying human genetics to add human relevance
There exist significant unmet medical needs of patients, and it is now recognised that common diseases effecting morbidity and mortality have a significant genetic component. Consider the diseases of diabetes, obesity and osteoarthritis, to name but a few. The degree of genetic contribution has been estimated for these diseases based on twin studies and sibling risk analysis [18–23]. Similarly, rare gene mutations have provided insight into complex biological processes involved in more common disease phenotypes. For example, the importance of cholesteryl ester transfer protein (CETP) in influencing an individual's high-density lipoprotein (HDL) levels was demonstrated when individuals identified as possessing extreme levels of HDL were shown to harbour a rare mutation in their CETP gene and to lack CETP activity [24, 25]. In another example, subjects harbouring inactivating mutations within the Janus kinase 3 gene (JAK3) express severe combined immunodeficiency syndrome (SCID), providing mechanistic evidence that JAK3 inhibition was likely to effect immune suppression and potentially serve as a therapy for transplant recipients . Most importantly, the insight gained from the human genetics findings described above has led to new investigational drugs for CETP inhibition and JAK3 inhibition, which, upon entry into the clinic, demonstrate efficacy consistent with the human genetics data [27, 28].
While not traditionally defined as pharmacogenomics, human genetic association studies may provide an important avenue for establishing relationships between human genes and disease state and thus influence the appropriate selection of therapeutic targets. Important new discoveries of genes associated with complex disease appear to occur on a regular basis now, as evidenced by disease associations of neuregulin and schizophrenia; PDE4D and stroke; and 5-lipoxgenase and atherosclerosis . These studies have been strengthened by additional scientific data supporting functional relationships between the genetic association and the disease.
Pharmaceutical companies, as well as many academic institutions, have been collecting DNA samples with appropriate informed consent to investigate the relationship between genetic variation and disease phenotypes to better classify disease. Through the collection of clinical phenotypes with linked DNA samples, the execution of clinical trials provides an opportunity to investigate the underlying genetic variation that exists in patients. This is typified in a study in which DNA from patients who participated in a lipid-lowering trial demonstrated a strong association between the phenotype for HDL levels and a novel lipase gene family member, endothelial lipase (LIPG) . While association studies have had mixed reviews in the scientific community, often due to conflicting study results, careful attention to study design can provide the first opportunity to add human data to known and novel genes. Study design issues include: careful selection of appropriate study subjects based on rigorous phenotypic criteria; utility of extreme phenotypes; comprehensive SNP selection which takes into account linkage disequilibrium (LD) in the gene under study to ensure a density of markers which capture the LD pattern of the gene; statistical methods allowing one to assess how likely any result may be due to chance alone; and, finally, planned replication to provide added validation of any gene association. These are critical to the successful characterisation of disease risk alleles.
While candidate gene studies have become commonplace in the literature, these studies may often involve a biological hypothesis for candidate gene selection. The ability to interrogate the genome based solely on phenotypic criteria with no a priori bias on gene selection has been dramatically enhanced by technologies which now interrogate SNPs broadly across the genome. These platforms have now replaced estimates of 300,000 SNPs evenly spaced across the genome, by utilising some 200,000-300,000 haplotype-defining SNPs across the genome. For the first time, genotyping technologies -- such as those developed by Perlegen Sciences -- have provided the ability to genotype hundreds of thousands of markers in an individual using high-density oligonucleotide arrays coupled with either highly multiplexed polymerase chain reactions or restriction enzyme-based genomic reduction. Even with these new technologies and the availability of millions of mapped SNPs, however, the exact number of haplotype-defining SNPs that will be required to detect genetic associations with common polymorphisms that are not typed is still uncertain. The appropriate LD measure to select these tagging SNPs for association studies is r2 . Recent reports evaluating common polymorphisms across candidate gene regions suggest that, in order to detect > 80 per cent of all haplotypes in a given region using haplotype-tagging SNPs, it is necessary to reach an r2 of > 0.8 . As the HapMap project progresses forward with defining the linkage LD patterns in the human genome, geneticists will have the tools to evaluate the extent of LD within a given region and within a given ethnicity to select the optimal SNPs to genotype when designing a study. These data will provide valuable information around SNP selection, irrespective of whether the study design is for a whole-genome association or candidate gene-based studies. Elucidating the genetic contributors to common complex diseases, such as cardiovascular disease and psychiatric disease, will be more efficient through a whole-genome approach, as it does not depend on the biased selection of candidate genes. This has been exemplified using family-based linkage studies in stroke, schizophrenia  and Crohn's disease, where these genes would not have been suspected a priori and thus not even investigated using a candidate gene approach. Whether the study design focuses on disease, drug response or safety, the primary driver for any successful study is the accurate clinical phenotyping of the study population. The present authors believe that this new information on SNP density and LD coverage of the human genome will allow the visualisation of a significant portion of the genomic regions associated with the phenotype of interest, as illustrated in Figure 3.
In an attempt to validate the haplotype tagging approach to identify genetic associations, as well as to evaluate the Perlegen Sciences array-based chip-based genotyping platform, 7,283 SNPs spanning 17.1 megabases (Mb) of DNA were genotyped to identify associations with HDL levels . SNPs associated with the 5' haploblock of the CETP gene were identified as the most significant association in the dataset . Readouts from several genome-wide studies are expected within the next year and will provide further insight into the potential value of these studies to identify genes involved in complex disease phenotypes, including metabolic syndrome and drug response. The public HapMap project  and companies like Perlegen, which recently announced its intention to provide its SNP markers into the public domain to further advance the foundation for such experiments, will further aid these whole-genome approaches for the scientific community.
Drug discovery and development: Understanding target variability
Early in the drug discovery phase, the assessment of comprehensive target variation achieved through gene resequencing in ethnically diverse DNA panels provides discovery scientists with information that is critical for screen design, functional assessment of variant alleles and animal model design. In partnership with companies like Genaissance Pharmaceuticals, the present authors will typically screen a panel of 92 DNA samples with representations from Caucasian, Asian, Hispanic and African-American DNA samples, which allow the detection of polymorphisms with a frequency greater than 1 per cent . For example, the CETP gene variation depicted in Figure 4 provides a comprehensive understanding of the variation within this gene locus. In addition, these comprehensive SNP data provide the basis for candidate gene-based studies to interrogate the relationship between putative therapeutic target genes and human disease phenotypes . This knowledge of target variation can then be utilised for hypothesis testing in early clinical programmes, with particular emphasis on putative functional variants which may have been defined during discovery studies exploring functional activities. This is exemplified in the following discussion.
Drug development: Traditional pharmacogenetics for clinical decision making
Most, if not all, pharmaceutical companies have actively engaged in the area of pharmacogenomics with differing levels of investment, strategies and application. It is the authors' belief, however, that application of this discipline is still in its infancy. Yet there are some very clear demonstrations of impact already in place, with particular emphasis on the oncology area, which are described in more detail in the next section.
One can also look to the area of drug metabolism, the more classically defined area of pharmacogenetics, and find clear demonstrations of the utility of genetic knowledge of the cytochrome P450 gene family, as well as other genes involved in the pharmacokinetics of drug disposition. These genes have been extensively studied, and examples such as cytochrome P450 CYP2D6, 2C19 and 2C9 provide a clear demonstration of the predictive use of genotype in clinical development. For example, the use of genotyping to predict poor metaboliser status for individuals deficient in CYP2D6 enzyme is used routinely when evaluating CYP2D6 substrates [39, 40]. Many excellent reviews have been published, providing detailed summaries of the area of drug metabolism and its importance for clinical development [41–45].
In the early stages of clinical development, CYP2D6 genotyping is routinely employed for prospective enrolment or retrospective analysis of subjects, to rapidly establish individual pharmacokinetic differences for those compounds suspected to be metabolised via the CYP2D6 pathway . Additionally, drug-drug interaction studies with experimental therapies and known CYP2D6 substrates, such as paroxetine, can benefit from the incorporation of CYP2D6 genotyping [46, 47]. CYP2D6 poor metabolisers may demonstrate altered paroxetine exposure, which would remain unexplained in the absence of CYP2D6 genotyping, and thus incorporation of the genotyping data removes the potential for uncertainty in those individuals who would be expected to demonstrate increased exposure due to the possession of two non-functional alleles of CYP2D6.
Drug development: Transitioning to proof of concept
The combined knowledge of genetic contributions to the underlying disease state and comprehensive data on target variation, provides new and valuable insight which can be applied in early clinical studies aimed at establishing proof of concept for novel therapeutic agents. Genetic biomarkers defining the risk of disease onset and/or progression can potentially be used to select optimal patient populations in early clinical development by targeting a genetically defined disease risk population relevant to the therapeutic intervention or the endpoint being studied. This is exemplified by recent data from deCODE Genetics, in which genetic variation within the 5-lipoxygenase activating protein (FLAP) gene locus was associated with an increased risk of myocardial infarction in the Icelandic population . Functional assessment of isolated neutrophils from individuals carrying the 'at-risk' four-SNP marker haplotype demonstrates increased production of leukotriene B4, a major leukotriene in this inflammatory pathway. This unique haplotype identifies a specific population in which to assess the potential therapeutic benefits of a FLAP inhibitor, and deCODE Genetics has initiated clinical studies to validate this approach, combining genetic enrichment with related biochemical measures of drug activity and inflammation to assess proof of concept. Should this study prove successful, the question will remain as to whether this mechanism will be beneficial in the genetically 'at risk' FLAP haplotype individuals alone, or whether other individuals will also benefit from therapeutic intervention directed at this specific mechanism. This is a bold new approach which has encouraged Merck to partner with deCODE Genetics, and will probably be watched carefully by other pharmaceutical companies.
The utility and importance of target variation data and knowledge about the functional relevance of this variation to execute pharmacogenomic studies in early development has also been applied in the development of novel therapies for HIV. Genetic studies first demonstrated that the CCR5 receptor was associated with susceptibility to HIV infection and the progression to AIDS . The CCR5 gene has been well characterised, with particular importance placed on the delta 32 allele, as individuals heterozygous for the allele exhibit significant reduction in receptor numbers on the cell surface while individuals homozygous for the allelle have no functional receptor present. Pharmaceutical companies have therefore rapidly moved CCR5 receptor antagonists from discovery into the clinic. In early clinical studies of a CCR5 antagonist, examination of the delta 32 genotype and receptor density as measured by fluorescence-activated cell sorting analysis, has shown that the CCR5 delta 32 was not significantly associated with any safety and toleration endpoints tested, nor with baseline viral load. Preliminary investigation suggests no apparent relationship between CCR5 delta 32, CCR5 expression and receptor saturation in healthy volunteers. Future studies will examine the CCR5 genotype and antiretroviral activity as the compound progresses through clinical development .
Utilisation in late-stage clinical development programmes and beyond
It is often difficult to ascertain individual pharmaceutical experience in late-stage clinical development, as much experience to date has involved retrospective analysis of clinical programmes and has not moved into true prospective application. In addition, there still exists a significant rate of late-stage attrition. A recent publication from the US Food and Drug Administration (FDA) provided some insight into the integration of pharmacogenomics into clinical development programmes, citing some 70 investigational new drug applications involving pharmacogenomics; of these 80 per cent involved cytochrome P450 genotyping and were related to drug metabolism . Recent issuance of the draft guidance for Voluntary Genomic Data Submissions by the FDA will also promote additional regulatory dialogue and information sharing related to the application of pharmacogenomics.
Roses recently provided an excellent example with Tranilast, a Phase III programme which was unsuccessful for restenosis, in which pharmacogenomics data were generated in real time during the Phase III development programme, demonstrating that a small percentage of patients developing hyperbilirubinaemia, during drug treatment contained a promoter variant in the UDG-glucuronosyltransferase I gene (UGT1); this was not seen in placebo-treated subjects [52, 53]. This initial Tranilast work has now been extended to model SNP scoring at a density expected for whole-genome scans and to serve as a validation of whole-genome analysis for identifying genomic regions associated with safety events. Some 76 haplotype-defining SNPs, spanning a 2.7 Mb region of DNA including the UGT1A1 gene, were examined in 1,054 patients who had been treated with Tranilast with no evident hyperbilirubinaemia, and 147 patients receiving therapy who did experience hyperbilirubinaemia. This analysis demonstrated an association in a 150 kilobase region that included the UGT1A1 gene locus, which upon further dense mapping confirmed the association with the UGT1A1 gene [54, 55].
Table 1 describes other examples of currently available drugs that have been examined for genetic associations with particular genetic pathways or gene markers. These references provide an early window onto the types of studies which are exploring the complex relationship between drug response and genetic heterogeneity in relatively small numbers of individuals receiving such therapies. These have relied on focused candidate gene-based investigations with limited ability to interrogate the entire genome; however, it is envisioned that several such genome-wide analyses will appear in the scientific community over the coming year or two.
Utilisation of pharmacogenomics in oncology to have an impact on patient care
Cancer is a genetic disease, both in terms of the germline (inherited genome) as well as the somatic genome, which is unique to the tumour itself. While mutations in genes such as BRCA1 and BRCA2 result in a significant increase in cancer risk, these types of mutations represent only a small percentage of all tumours . The vast majority of cancers result from somatic alterations within cells resulting in a selective growth advantage over the well-controlled mechanisms of normal cell growth [68–71]. There are currently technologies available for identifying and studying the molecular mechanisms defining various subtypes of common cancers and incorporating these data into the drug discovery and development process. The characterisation and understanding of genetic alterations driving tumourigenesis has already led to targeted oncology therapies with increased benefit to these molecularly defined patient populations. The approval of Herceptin in 1998 paralleled the explosion of new scientific data emerging from the Human Genome Project. Scientists at Genentech recognised that a certain percentage of women with breast cancer overexpressed the Her-2 neu protein and that their cancer was particularly aggressive . In cells overexpressing the Her-2 neu protein it was discovered that targeted reduction in the expression of the protein returned the cancerous cells to a normal state, leading to the development of a monoclonal antibody which attaches to the Her-2 neu receptor to prevent the cell growth response [73, 74]. The overexpression of the Her-2 neu gene is observed in approximately 30 per cent of breast cancers, and it is this 30 per cent of patients that appreciate the optimal benefit from this targeted therapy . Based on these data, current Herceptin therapy requires women with advanced breast cancer to be screened for Her-2 neu overexpression through fluorescence in situ hybridisation or quantitative protein measures.
Gleevec, approved in 2001 for chronic myeloid leukaemia, treats a subset of patients who possess a genetic rearrangement known as the Philadelphia chromosome, in which a translocation of chromosome 9q34 to 22q11 [t(9;22) (q34;q11)] occurs, resulting in a breakpoint cluster region that fuses two genes, BCR and the oncogene Abl, resulting in a constitutively active oncogenic product that plays an important role in cell growth. At least 90 per cent of patients with chronic myeloid leukaemia harbour this translocation, providing genetic evidence that it plays a role in disease risk . Once again, gaining a clear understanding of the molecular entity driving the disease, scientists at Novartis developed Gleevec, an inhibitor of BCR-ABL tyrosine kinase. A Phase II study reported in 2002 showed major cytogenetic responses in 60 per cent of subjects (n = 454) and complete haematological responses in 95 per cent of subjects who had previously failed on treatment with interferon-alpha .
Genetic studies have also been instrumental in elucidating mechanisms of resistance to cancer therapy. Several reports have shown that the cause of Gleevec resistance in advanced phase disease is due to identified somatic point mutations within the ABL kinase domain in a subset of patients, which could possibly interfere with Gleevec binding [78–80].
More recently, the elucidation of the response to the tyrosine kinase inhibitor Iressa has again highlighted the importance of characterising the somatic alterations contributing to tumourigenesis and the relationship between the mutations to increase the confidence in target mechanisms and drug response . In clinical development, some 10 per cent of patients with non-small cell lung cancer demonstrated a significant benefit with Iressa. When Lynch et al. examined the EGFR gene sequence -- the target for Iressa action -- in tumour DNA isolated from a subset of these responders, gain-of- function mutations were identified in eight out of nine responders, compared with zero out of seven non-responders . Paez et al. examined the EGFR gene in DNA obtained from non-small cell lung cancer tumour samples and found similar mutations in one out of 61 tumour DNAs from US-based patients, while in Japanese patients the incidence rose to 15 out of 58 tumour DNAs, providing data correlating with an observed increase in response rates to Iressa in Japan .
These data reinforce the importance of characterising underlying genetic heterogeneity within large tumour types. A priori knowledge of specific genetic markers indicative of drug response may influence the design of early proof-of-concept clinical trials. In the case mentioned above, screening for those 10 per cent of subjects with EGFR mutations for enrolment would have actually increased the power of the development studies substantially by improving the efficacy rates through reducing intersubject variability in tumour heterogeneity. Studies have also shown the power of utilising expression-profiling patterns of selected signature genes in defining prognostic criteria, as exemplified by a recent report defining prognostic criteria for metastatic disease across multiple tumour types . The incorporation of these molecular signatures into a drug discovery and clinical development programme may influence many different phases, including the selection of the optimal target based upon the role it plays in the molecular aetiology of the tumour subtype; the potential to allow enrichment of a patient population with molecularly defined tumours, as exemplified above and, finally, enrichment of the patient population most at risk for disease progression, to clearly test for drug effect.
As exemplified by the oncology examples highlighted in Table 2, pharmacogenomics will have an impact on optimal target selection and will increase efficacy in clinical development by targeting patient populations based on genetically defined disease phenotypes. It will also, ultimately, influence patient care when it provides a clear benefit to physicians' ability to deliver better medical care to patients. This will require a clear demonstration of benefit over standard assessments used in clinical practice, or when the molecular basis of the disease is so well understood that the clear path for development requires genomic knowledge to advance drug development. Advancements in genomic technologies, characterisation of human population genetic variation, enhancements in disease phenotyping methodologies, advances in informed consent processes and patient confidentiality technologies and the integration of molecular science with that of traditional clinical practice are paving the way for novel pharmacogenomic discoveries aimed at characterising disease risk, contributing to novel therapeutics and providing clues into the optimal treatment regimen based on a combination of clinically and genetically defined disease. While we are still in the early stages of this revolution, significant opportunities can be envisaged for scientific innovation to shape the future of medical care and patient health for many years to come.
Collins FS, Guyer MS, Chakravarti A: Variations on a theme: Cataloging human DNA sequence variation. Science. 1997, 278: 1580-1581. 10.1126/science.278.5343.1580.
Chakravarti A: It's raining SNPs, hallelujah?. Nat Genet. 1998, 19: 216-217. 10.1038/885.
International Human Genome Sequencing Consortium: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.
Venter JC, Adams MD, Myers EW, et al: The sequence of the human genome. Science. 2001, 291: 1304-1351. 10.1126/science.1058040.
The International SNP Working Group: A map of the human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature. 2001, 409: 928-933. 10.1038/35057149.
The SNP Consortium. 2002, (Accessed 21st October, 2004), [http://snp.cshl.org/index.html]
The International HapMap Consortium: The international HapMap project. Nature. 2003, 426: 789-796. 10.1038/nature02168.
Wang DG, Fan J, Siao C, et al: Large scale identification, mapping and genotyping of single-nucleotide polymorphisms in the human genome. Science. 1998, 280: 1077-1082. 10.1126/science.280.5366.1077.
Hellio Le Graverand-Gastineau MP, Pickering E, Chan G, et al: Prevalence of radiographic osteoarthritis in subjects who are clinically asymptomatic. Abstract OARSI World Congress. 2003, 12-15th October
Tamminen M, Kakko S, Kesaniemi A, Savolainen MJ: A polymorphic site in the 3' untranslated region of the cholesteryl ester transfer protein (CETP) gene is associated with low CETP activity. Atherosclerosis. 1996, 124: 237-247. 10.1016/0021-9150(96)05833-9.
McCarthy JJ, Parker A, Salem R, for the GeneQuest Investigators, et al: Large scale association analysis for identification of genes underlying premature coronary heart disease: Cumulative perspective from analysis of 111 candidate genes. J Med Genet. 2004, 41: 334-341. 10.1136/jmg.2003.016584.
Jurinke C, van den Boom D, Cantor CR, Koster H: The use of MassARRAY technology for high throughput genotyping. Methods Mol Biol. 2002, 187: 179-192.
Barker DL, Hansen M, Faruqi AF, et al: Two methods of whole-genome amplification enable accurate genotyping across a 2320-SNP linkage panel. Genome Res. 2004, 14: 901-907. 10.1101/gr.1949704.
Janne PA, Li C, Zhao X, et al: High-resolution single-nucleotide polymorphism array and clustering analysis of loss of heterozygosity in human lung cancer cell lines. Oncogene. 2004, 23: 2716-2726. 10.1038/sj.onc.1207329.
Patil N, Berno AJ, Hinds DA, et al: Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. Science. 2001, 294: 1719-1723. 10.1126/science.1065573.
MacNeil JS: (Still) looking for a few new sequencing technologies. Genome Technol. 2004, 46: 23-26.
Robertson JA: Consent and privacy in pharmacogenetic testing. Nat Genet. 2001, 28: 207-209. 10.1038/90032.
McIntyre EA, Walter M: Genetics of type 2 diabetes and insulin resistance: Knowledge from human studies. Clin Endocrinol. 2002, 57: 303-311. 10.1046/j.1365-2265.2002.01610.x.
Florez JC, Hirschhorn J, Altshuler D: The inherited basis of diabetes mellitus: Implications for the genetic analysis of complex traits. Annu Rev Genomics Hum Genet. 2003, 4: 257-291. 10.1146/annurev.genom.4.070802.110436.
Allison DB, Faith MS, Nathan JS: Risch's lambda values for human obesity. Int J Obes. 1996, 20: 990-999.
Maes HH, Neale MC, Eaves LJ: Genetic and environmental factors in relative body weight and human adiposity. Behav Genet. 1997, 27: 325-351. 10.1023/A:1025635913927.
Lanyon P, Muir K, Doherty S, Doherty M: Assessment of a genetic contribution to osteoarthritis of the hip: Sibling study. BMJ. 2000, 321: 1179-1183. 10.1136/bmj.321.7270.1179.
MacGregor AJ, Antoniades L, Matson M, et al: The genetic contribution to radiographic hip osteoarthritis in women. Arthritis Rheum. 2000, 43: 2410-2416. 10.1002/1529-0131(200011)43:11<2410::AID-ANR6>3.0.CO;2-E.
Inazu A, Brown ML, Hesler CB, et al: Increased high-density lipoprotein levels caused by a common cholesteryl-ester transfer protein gene mutation. N Engl J Med. 1990, 323: 1234-1238. 10.1056/NEJM199011013231803.
Yamashita S, Sprecher DL, Sakai N, et al: Accumulation of apolipoprotein E-rich high density lipoproteins in hyperalphalipoproteinemic human subjects with plasma cholesteryl ester transfer protein deficiency. J Clin Invest. 1990, 86: 688-695. 10.1172/JCI114764.
Macchi P, Villa A, Giliani S, et al: Mutations of Jak-3 gene in patients with autosomal severe combined immune deficiency (SCID). Nature. 1995, 377: 65-68. 10.1038/377065a0.
Clark RW, Sutfin TA, Ruggeri RB, et al: Raising high-density lipoprotein in humans through inhibition of cholesteryl ester transfer protein: An initial multidose study of torcetrapib. Arterioscler Thromb Vasc Biol. 2004, 24: 490-497. 10.1161/01.ATV.0000118278.21719.17.
Changelian PS, Flanagan ME, Ball DJ, et al: Prevention of organ allograft rejection by a specific Janus kinase 3 inhibitor. Science. 2003, 30: 875-881.
Steffansson H, Sarginson J, Kong A, et al: Neuregulin I and susceptibility to schizophrenia. Am J Hum Genet. 2002, 71: 877-892. 10.1086/342734.
Gretarsdottir S, Thorleifsson G, Reynisdottir ST, et al: The gene encoding phosphodiesterase 4D confers risk of ischemic stroke. Nat Genet. 2003, 35: 131-138. 10.1038/ng1245.
Dwyer JH, Allayee H, Dwyer KM, et al: Arachidonate 5-lipoxygenase promoter genotype, dietary arachidonic acid, and atherosclerosis. N Engl J Med. 2004, 350: 29-37. 10.1056/NEJMoa025079.
Mank-Seymour AR, Durham LK, Thompson JF, et al: Association between single-nucleotide polymorphisms in the endothelial lipase (LIPG) gene and high-density lipoprotein cholesterol levels. Biochim Biophys Acta. 2004, 1636: 40-46. 10.1016/j.bbalip.2003.12.001.
Devlin B, Risch N: A comparison of linkage disequilibrium measures for fine-scale mapping. Genomics. 1995, 29: 311-322. 10.1006/geno.1995.9003.
Carlson CS, Eberle MA, Rieder MJ, et al: Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet. 2004, 74: 106-120. 10.1086/381000.
Ogura Y, Bonen DK, Inohara N, et al: Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease. Nature. 2004, 411: 603-606.
Hinds DA, Seymour AB, Durham LK, et al: Application of pooled genotyping to scan candidate regions for association with HDL levels. Hum Genomics. 2004, 1: 421-434.
Stephens JC, Schneider JA, Tanguay DA, et al: Haplotype variation and linkage disequilibrium in 313 human genes. Science. 2001, 293: 489-493. 10.1126/science.1059431.
Thompson JF, Lira ME, Durham LK, et al: Polymorphisms in the CETP gene and association with CETP mass and HDL levels. Atherosclerosis. 2003, 167: 195-204. 10.1016/S0021-9150(03)00005-4.
Sachse C, Brockmoller J, Bauer S, Roots I: Cytochrome P450 2D6 variants in a Caucasian population: Allele frequencies and phenotypic consequences. Am J Hum Genet. 1997, 60: 284-295.
McElroy SM, Sachse C, Richmond J, et al: Utilization of CYP450 2D6 genotyping as an alternative to probe drug phenotyping for determination of metabolic status in a clinical setting. AAPS Pharmsci. 2000, 2 (33): (Accessed 21st October, 2004), [http://www.aapsharmsci.org/view.asp?art=ps020433]
Evans WE, Relling MV: Pharmacogenomics: Translating functional genomics into rational therapeutics. Science. 1999, 286: 487-491. 10.1126/science.286.5439.487.
Wolf CR, Smith G: Pharmacogenetics. Br Med Bull. 1999, 55: 366-386. 10.1258/0007142991902439.
Ingelman-Sundberg M, Oscarson M, McLellan RA: Polymorphic human cytochrome P450 enzymes: An opportunity for individualized drug treatment. Trends Pharmacol Sci. 1999, 20: 342-349. 10.1016/S0165-6147(99)01363-2.
Ingelman-Sundberg M: Pharmacogenetics: An opportunity for a safer and more efficient pharmacotherapy. J Intern Med. 2001, 250: 186-200. 10.1046/j.1365-2796.2001.00879.x.
Goldstein JA: Clinical relevance of genetic polymorphisms in the human CYP 2C subfamily. Br J Clin Pharmacol. 2001, 52: 349-355. 10.1046/j.0306-5251.2001.01499.x.
Ozdemir V, Tyndale RF, Reed K, et al: Paroxetine steady-state plasma concentration in relation to CYP2D6 genotype in extensive metabolizers. J Clin Psychopharmacol. 1999, 19: 472-475. 10.1097/00004714-199910000-00014.
Charlier C, Broly F, Lhermitte M, et al: Polymorphisms in the CYP 2D6 gene: Association with plasma concentrations of fluoxetine and paroxetine. Ther Drug Monit. 2003, 25: 738-742. 10.1097/00007691-200312000-00014.
Helgadottir A, Manolescu A, Thorleifsson G, et al: The gene encoding 5-lipoxygenase activating protein confers risk of myocardial infarction and stroke. Nature Genet. 2004, 36: 233-239. 10.1038/ng1311.
Dean M, Carrington M, Winkter C, et al: Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 structural gene. Science. 1996, 273: 1856-1861. 10.1126/science.273.5283.1856.
Penny M, Myrand S, Lin C, et al: Pharmacogenetic analysis of polymorphisms in the chemokine receptors CCR5 and CCR2 in the clinical development of a CCR5 antagonist (UK-427, 857) for the treatment of HIV/AIDS. 2004, FDA Science Forum, Washington, DC, USA
Lesko LJ, Salerno RA, Spear BB, et al: Pharmacogenetics and pharmacogenomics in drug development and regulatory decision-making: Report of the first FDA-PWG-PhRMA-DruSafe workshop. J Clin Pharmacol. 2003, 43: 342-358. 10.1177/0091270003252244.
Roses AD: Genome-based pharmacogenetics and the pharmaceutical industry. Nat Rev Genet. 2002, 1: 541-549. 10.1038/nrd840.
Danoff TM, Campbell DA, McCarthy LC, et al: A Gilbert's syndrome UGT1A1 variant confers susceptibility to Tranilast-induced hyperbilirubinemia. Pharmacogenomics J. 2004, 4: 49-53. 10.1038/sj.tpj.6500221.
Roses AD: Pharmacogenetics and drug development: The path to safer and more effective drugs. Nat Rev Genet. 2004, 5: 645-655.
Xu CF, Lewis KF, Yeo AJ, et al: Identification of a pharmacogenetic effect by linkage disequilibrium mapping. Pharmacogenomics J. 2004.
Chung W-H, Hung S-I, Hong H-S, et al: A marker for Stevens-Johnson syndrome. Nature. 2004, 428: 486-10.1038/428486a.
Johnson JA, Zineh I, Puckett BJ, et al: Beta-1 adrenergic receptor polymorphisms and antihypertensive response to metoprolol. Clin Pharmacol Ther. 2003, 74: 44-52. 10.1016/S0009-9236(03)00068-7.
Mallal S, Nolan D, Witt C, et al: Association between presence of HLA-B*5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir. Lancet. 2002, 359: 727-732. 10.1016/S0140-6736(02)07873-X.
Hetherington S, Hughes AR, Mosteller M, et al: Genetic variations in HLA-B region and hypersensitivity reactions to abacavir. Lancet. 2002, 359: 1121-1122. 10.1016/S0140-6736(02)08158-8.
Palumbo S, Giuliano F, Mossetti G, et al: Raloxifene administration in post-menopausal women with osteoporosis: Effect of different BsmI vitamin D receptor genotypes. Hum Reprod. 2003, 18: 192-198. 10.1093/humrep/deg031.
Acuna G, Foernzler D, Leong D, et al: Pharmacogenetic analysis of adverse drug effect reveals genetic variant for susceptibility to liver toxicity. Pharmacogenomics J. 2002, 2: 327-334. 10.1038/sj.tpj.6500123.
Krynetski EY, Evans WE: Genetic polymorphism of thiopurine S-methyltransferase: Molecular mechanisms and clinical importance. Pharmacology. 2000, 61: 136-146. 10.1159/000028394.
Drazen JM, Yandava CN, Dube L, et al: Pharmacogenetic association between ALOX5 promoter genotype and the response to anti-asthma treatment. Nat Genet. 1999, 22: 168-170. 10.1038/9680.
Smeraldi E, Zanardi R, Benedetti F, et al: Polymorphism within the promoter of the serotonin transporter gene and antidepressant efficacy of fluvoxamine. Mol Psychiatry. 1998, 3: 508-511. 10.1038/sj.mp.4000425.
Durham LK, Webb SM, Milos PM, et al: The serotonin transporter polymorphism, 5HTTLPR, is associated with a faster response time to sertraline in an elderly population with major depression. Psychoparmacol. 2004, 174: 525-529.
Ueda S, Meredith PA, Morton JJ, et al: ACE (I/D) genotype as a predictor or the magnitude and duration of the response to an ACE inhibitor drug (enalaprilat) in humans. Circulation. 1998, 98: 2148-2153. 10.1161/01.CIR.98.20.2148.
Iau PT, Macmillan RD, Blamey RW: Germline mutations associated with breast cancer susceptibility. Eur J Cancer. 2001, 37: 300-321. 10.1016/S0959-8049(00)00378-6.
Wang Z, Shen D, Parsons DW, et al: Mutational analysis of the tyrosine phosphatome in colorectal cancers. Science. 2004, 304: 1164-1166. 10.1126/science.1096096.
Paez JG, Psai A, Janne PA: EGFR mutations in lung cancer: Correlation with clinical response to gefitinib therapy. Science. 2004, 304: 1497-1500. 10.1126/science.1099314.
Bardelli A, Parsons DW, Silliman N, et al: Mutational analysis of the tyrosine kinome in colorectal cancers. Science. 2003, 300: 949-10.1126/science.1082596.
Pollock PM, Harper UL, Hansen KS, et al: High frequency of BRAF mutations in nevi. Nat Genet. 2002, 33: 19-20.
Slamon DJ, Clark GM, Wong SG, et al: Human breast cancer: Correlation of relapse and survival with amplification of the HER-2/neu oncogene. Science. 1987, 235: 177-182. 10.1126/science.3798106.
Shepard HM, Lewis GD, Sarup JC, et al: Monoclonal antibody therapy of human cancer: Taking the HER2 protooncogene to the clinic. J Clin Immunol. 1991, 11: 117-127. 10.1007/BF00918679.
Molina MA, Codony-Servat J, Albanell J, et al: Trastuzumab (herceptin), a humanized antiHer2 receptor monoclonal antibody, inhibits basal and activated Her2 ectodomain cleavage in breast cancer cells. Cancer Res. 2001, 61: 4744-4749.
Cobleigh MA, Vogel CL, Tripathy D, et al: Multinational study of the efficacy and safety of humanized anti-HER2 monoclonal antibody in women who have HER2-overexpressing metastatic breast cancer that has progressed after chemotherapy for metastatic disease. J Clin Oncol. 1999, 17: 2639-2648.
Rowley JD: A new consistent chromosomal abnormality in chronic myelogenous leukaemia identified by quinacrine fluorescence and Giesma staining. Nature. 1973, 243: 290-293. 10.1038/243290a0.
Kantarjian H, Sawyers C, Hochhaus A, et al: Hematologic and cytogenetic responses to imatinib mesylate in chronic myelogenous leukaemia. N Engl J Med. 2002, 346: 645-652. 10.1056/NEJMoa011573.
Gorre ME, Mohammed M, Ellwood K, et al: Clinical resistance to STI-571 cancer therapy caused by BCR-ABL gene mutation or amplification. Science. 2001, 293: 876-880. 10.1126/science.1062538.
Bubnoff NV, Schneller F, Peschel C, Duyster J: BCRABL gene mutations in relation to clinical resistance of Philadelphia-chromosome-positive leukaemia to STI571: A prospective study. Lancet. 2002, 359: 487-491. 10.1016/S0140-6736(02)07679-1.
Branford S, Rudzki Z, Walsh S, et al: High frequency of point mutations clustered within the adenosine triphosphate-binding region of BCR/ABL in patients with chronic myeloid leukemia or Ph-positive acute lymphoblastic leukemia who develop imatinib (STI571) resistance. Blood. 2002, 99: 3472-3475. 10.1182/blood.V99.9.3472.
Lynch TJ, Bell DW, Sordella R, et al: Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N Engl J Med. 2004, 350: 350-361.
Ramaswamy S, Ross KN, Lander ES, Golub TR: A molecular signature of metastasis in primary solid. Nat Genet. 2003, 33: 49-54. 10.1038/ng1060.
Special thanks go to Paul Feeney, John Thompson, Duncan McHale and Michelle Penny for contributions to this manuscript. We especially thank our colleagues in the Pharmacogenomics groups for their passion and dedication in pursuing scientific research directed at improving human health.
Rights and permissions
About this article
Cite this article
Milos, P.M., Seymour, A.B. Emerging strategies and applications of pharmacogenomics. Hum Genomics 1, 444 (2004). https://doi.org/10.1186/1479-7364-1-6-444
- single nucleotide polymorphisms (SNPs)
- genotype-phenotype association