Genome-wide characterisation of the binding repertoire of small molecule drugs

Most, if not all, drugs interact with multiple proteins. One or more of these interactions are responsible for carrying out the primary therapeutic effects of the drug. Others are involved in the transport or metabolic processing of the drug or in the mediation of side effects. Still others may be responsible for activities that correspond to alternate therapeutic applications. The potential clinical impact of a drug and its cost of development are affected by the sum of all these interactions. The drug development process includes the identification and characterisation of a drug's clinically relevant interactions. This characterisation is presently accomplished by a combination of experimental laboratory techniques and clinical trials, with increasing numbers of patient participants. Efficient methods for the identification of all the molecular targets of a drug prior to clinical trials could greatly expedite the drug development process. Combinatorial peptide and cDNA phage display have the potential for achieving a complete characterisation of the binding repertoire of a small molecule. This paper will discuss the current state of phage display technology, as applied to the identification of novel receptors for small molecules, using a successful application with the drug Taxol™ as an example of the technical and theoretical benefits and pitfalls of this method.


Introduction
The concept of a receptor as the component of an organism with which a chemical agent interacts was based originally on the independent work of Paul Ehrlich and J. N. Langley during the late nineteenth century. Pharmaceutical agents are generally believed to exert their therapeutic effect by binding to a particular protein or nucleic acid target. 1 Both natural product and synthetic chemical drugs, however, can and do have multiple macromolecule-binding partners (Table 1), 2 -48 such as metabolic breakdown enzymes (ie cytochrome P450 enzymes), side effect-mediating targets and, in some cases, secondary therapeutic targets (see aspirin and Taxole below as specific examples). Recognition of the binding promiscuity of drug molecules is reflected in the recent assembly of a publicly accessible web-based therapeutic target database. 49 For a drug to reach the market, an average input of 14 years and US$359 million is required. 50 The assessment of toxicity, side effects and metabolic breakdown pathways consumes a large percentage of this investment. 51 -53 The speed with which a drug passes through the various stages of the drug discovery process can be a critical factor for success. Novel strategies that provide information about the potential target proteins of a small molecule are badly needed to augment the current drug development process. A complete accounting of the repertoire of targets for a particular drug provides important information towards a complete understanding of the mode of action of a drug, as well as the prediction of potential complications from its use in the clinic. Detection of potential toxicity and side effect properties early on during drug development can translate into large financial savings and lowered patient risk, as efforts can be redirected towards drug leads that are more likely to be safe for patient use. Unfortunately, identifying macromolecules that bind to small molecule drugs can be a slow and laborious process, particularly if the target(s) are expressed at low levels in their respective tissue(s). Although computer methods for toxicity prediction offer a fast, attractive solution to this problem, 54 -56 even the best of this predictive software 57 generates hypotheses based upon crystal structure-based rigid protein docking studies. In many cases, small molecules (ie non-peptide molecules of less than about 1,000 Daltons) appear to interact with their receptors through regions that are disordered prior to the interaction. 58 This suggests that computer-based docking strategies may be limited in their success with a subset of potential drug/protein interactions. (continued ) In this paper, we outline a strategy for identifying the targets of small molecules based on the screening of protein or combinatorial peptide libraries displayed on phage particles. The ultimate goal is the complete delineation of all the targets for a small molecule. Since known receptors are mainly proteins, the discussion will be limited to the identification of protein targets. These strategies have been validated using libraries of cDNA expression products 59 -61 and libraries of combinatorial peptides. The latter have been used to identify a novel target for Taxole, 3 and carry out an in vitro approach to the identification of ATP-binding proteins within entire proteomes (Makowski et al., in preparation).

The number of targets of a drug
Most, if not all, drugs interact with multiple macromolecules. Some of these molecules conform to the concept of a 'receptor', in that they are responsible for mediating the primary therapeutic effects of the drug. Other target molecules would not classically be referred to as receptors because they are involved in transport, chemical processing or the side effects of the drug. Some of these interactions are non-specific, such as the interactions between hydrophobic drugs and serum proteins. These interactions are, however, critical elements in defining the clinical effect of the drug. Table 1 lists the known 'specific' macromolecular targets for a few representative drugs. Some of the best drugs may be more effective than related molecules because they bind two or more targets relevant to their clinical action. Aspirin, the most potent anti-inflammatory agent identified to date, has at least three targets that mediate antiinflammatory activity. The anti-cancer activity of Taxole is greater than that of other mitotic spindle poisons, presumably because it interacts with the anti-apoptotic protein Bcl-2, as well as with b-tubulin. 3 A small molecule that binds to no macromolecules (if such a thing exists) will be biologically inert. Conversely, a small molecule that binds to hundreds of proteins may possess numerous therapeutic applications but will have a very high chance of being toxic, as any one of these interactions might disrupt an essential molecular function. A good drug, therefore, should be expected to bind to somewhere between one and a dozen proteins. One, or possibly a few, of these interactions will mediate the primary activity of the drug. The necessity of specific action demands that small molecule drugs bind with high affinity and specificity to a relatively limited number of protein targets.
Between 3,000 to 10,000 new disease-related targets are expected to emerge out of the Human Genome Project and related sequencing projects such as the Microbial Genome Program of the US Department of Energy. 62 Parallel synthesis or combinatorial chemistry is similarly expected to generate a coordinate build-up of lead compounds. This concurrent accumulation of targets and drug candidates may not automatically translate into an increased introduction of Food and Drug Administration-approved pharmaceuticals into the medical marketplace. Although high-throughput combinatorial chemistry can create staggering molecular diversity, this diversity may be irrelevant or only semi relevant to the drug target population. Natural products and their derivatives have evolved to bind to a relatively limited number of proteins in order to be non-toxic to the organism from which they derive. The only exceptions to this rule are a few widely used metabolites, such as ATP, that bind to many different proteins. Synthetic molecules are not so limited. No evolutionary pressure has restricted the number of their biological targets on the basis of inactivity or toxicity. It is possible that the members of many classes of synthetic molecules will bind to many different macromolecules, thus increasing the percentage that will fail toxicity screening. This effect may be partially responsible for the difficulties that combinatorial chemistry has had in superseding natural product leads in the drug development pipeline. There has been a growing trend towards the synthesis of focused, natural product-like libraries 63 which 'partake of the "wisdom" of natural products'. 64 This focused library approach may limit the chemical diversity to a biologically relevant subpopulation.

Methods for identification of the molecular targets of a small molecule
Conceptually, it should be possible to develop methods that are capable of identifying all the targets of a small molecule drug or drug candidate. One could conceive of an ideal toola biological interaction chip containing an indexed set of all the molecular surfaces available to a drug molecule within the human body (and within human pathogens and animal models for human pathologies). A simple screen, allowing for very high sensitivity detection of a drug candidate bound to each of the proteins on this chip, could then be used to identify the dozen or so targets expected for a successful drug candidate and rule out promiscuous, potentially toxic drug candidates. An ideal, universal system of this kind is far beyond the relatively simple protein or peptide chips currently under development. 65 What is available now?
The most obvious path to a full accounting of the targets of a small molecule is to create an affinity column containing an immobilised form of the small molecule (preferably attached via tethers to at least two different locations so that every portion of the small molecule is free for binding in at least one of the tagged forms). A cell extract poured through the column will allow those molecules with high affinity to the drug to be bound to the column. Identification of those targets, either by gel electrophoresis and/or mass spectrometry, provides an accounting of the molecules with high affinity for the immobilised small molecule. This strategy has been successful for the identification of numerous molecular targets 66 and will continue to be used. It may be difficult to identify poorly expressed targets, targets that are membrane bound, or large complexes via this technique, as well as to distinguish true positives from false positives.

Bacteriophage display libraries
An efficient way of improving the signal-to-noise ratio in an affinity selection process is to physically link a phenotype to a biologically 'amplifiable' genotype. Several such 'surface display systems' are now in use (such as ribosome display and cell surface display) 67 -69 and some of these have been validated for novel drug-protein interaction identification (see McPherson et al., 70 where a tagged version of the drug FK506 was used as a probe molecule against libraries of mRNA-protein fusion molecules constructed from human liver, kidney and bone marrow transcripts). The simplest and most widespread cost-effective systems for carrying out this linkage, however, are bacteriophage display libraries. 71 In these libraries, the DNA encoding a protein or peptide is inserted into the gene of a structural protein of the phage in such a way that its expression product is displayed on the surface of the phage as a fusion to the structural protein, but without disrupting an essential function of that structural protein. A library of phages, each displaying a different expression product, is screened for members that exhibit affinity to a target (using an affinity column, as described above, or a related technique referred to as 'biopanning' 72 ). Even if this affinity selection results in only a handful of positive 'hits', because of the linkage of phenotype to genotype, it is possible to characterise these 'hits' because the affinity-selected phage can be grown and the sequence of the inserted DNA ascertained. This process circumvents the need to carry out highly sensitive chemical analysis of small amounts of material separated via binding to the drug target. The majority of bacteriophage display systems have been based upon the filamentous phage M13, 71 but the inherent limitations of a biological system based upon membrane extrusion for replication 73 has led to the development of phage display systems based upon such lytic phages as T7 and l. 74

cDNA expression product libraries
It is now possible to construct libraries of cDNA expression products displayed on the surface of phage particles and to screen them for members exhibiting affinity to small molecules. cDNA sequences are inserted into the gene for a structural protein in such a way that the expression product is displayed on the surface of a virion. Even if only a single phage particle is selected for affinity to a small molecule, it is possible to identify the gene product responsible because the virus can be grown in quantities sufficient for sequencing the viral DNA which contains an insert coding for the gene product displayed on the surface. A similar approach using genomic libraries is also possible, whereby whole genomes are chemically fragmented and cloned into the phage genome. 75 -79 A principal shortcoming of this method is that many gene products do not fold properly on a phage surface, particularly large proteins and membrane proteins. This drawback can be alleviated by the use of genomic DNA fragmented

Makowski and Rodi
Review REVIEW into epitope-sized coding pieces as mentioned above. A second problem with full-length protein display is that most phage display systems append the foreign protein onto the carboxy terminus of the phage structural protein, due to the structural requirements of the viral particle (such as a number of commercially available T7 cDNA display libraries). This results in the translation of many cDNA sequences in the incorrect reading frame, as well as the inclusion of 'nonsense' peptides translated from 3 0 non-coding regions. A number of systems are available which attach the foreign protein to the amino terminus of the phage structural protein, thus weeding out DNA insert sequences which do not fully translate in-frame to the beginning of the phage structural protein gene (for example, M13 gene IIIbased libraries). 74 Several successful uses of this strategy have been reported. Sche et al., used a cDNA phage display library to isolate clones of the FKBP12 protein from a human brain cDNA library using a biotinylated FK506 probe molecule. 59,60 During multiple rounds of affinity selection with the immobilised drug molecule, the FKBP12 gene emerged as the dominant library member with detection possible via polymerase chain reaction amplification after the second round of affinity selection. Six rounds of affinity selection with the modified drug were required for the FKBP gene to be the predominant clone within the selected population. A similar phage-based approach has been used to identify the protein hNopp140 as a binding partner for the drug doxorubicin. 61 DNA-binding proteins were successfully affinity-selected from a yeast genomic library displayed on a l phage display vector. 80 Carbohydrate-binding proteins have also been identified by similar techniques. 81 These representative results demonstrate that, in cases where the targets can be successfully displayed in their proper three-dimensional configuration on the surface of phage particles, the use of cDNA expression product display libraries can provide a powerful approach to the identification of the molecular targets of small molecule drugs. Many targets, such as membrane proteins, however, may require other strategies, such as the fragmented epitope approaches mentioned above or the use of combinatorial peptide libraries outlined below.

Combinatorial peptide libraries
An alternative to the use of cDNA-expression product display is combinatorial peptide display. In this method, random synthetic oligonucleotides are inserted into the gene for a structural protein at a site known to be presented on the surface of the phage particle; the resulting population is screened for members exhibiting affinity for the small molecule probe using multiple rounds of biopanning. The sequences of the affinity-selected peptides are then  . Structure of Hsp90 with bound geldamycin. Protein is rendered using colours to represent the similarity between the sequence of the protein and the sequences of peptides selected for affinity to Taxole. Red indicates a high level of similarity, blue a very low level. Three segments with relatively high similarity scores bracket the geldamycin-binding site, suggesting that Taxole may bind to the same site compared with the sequences of naturally occurring proteins and those proteins with segments having the greatest sequence similarity to the affinity-selected peptides are identified as potential targets for the small molecule. This process has been successfully used to identify novel targets for small molecules, as well as to map the binding sites of small molecules on proteins. 3,58 The principal limitation of this technique is that, for a successful outcome, the binding properties of peptides on the surface of the phage must mimic the binding properties of similar sequences in naturally occurring proteins. This will not always be the case. Our understanding of how a protein recognises a small molecule has changed little since 1894, when Emil Fischer first proposed that '. . .enzyme and glucoside have to fit to each other like a lock and key. . .'. This metaphor of protein structure conveys an image of rigidity that belies the complexity of the process of molecular recognition. It is becoming increasingly clear that many binding sites exhibit significant disorder prior to interaction with a ligand, particularly small molecule-binding sites, with the proteinligand complex moving through a series of progressively lower energy states towards the bottom of a 'binding funnel' (similar to the proposed 'folding funnel' model for protein folding). 82,83 The initial steps in the process of molecular recognition of a small molecule by a protein might well involve the interaction of a disordered segment of peptide with the small molecule, suggesting that the structural context of the interaction is far less important than implied by the 'lock and key' vision. 58 In this case, peptides on the surface of a phage might well reflect the binding properties of peptide segments of naturally occurring proteins, at least during the initial 'handshake' stage of the binding process.
Elucidation of three-dimensional protein structures via X-ray crystallography and nuclear magnetic resonance spectroscopy has provided extensive evidence that small molecule-binding sites are made of short stretches of peptide, usually arising from different discontinuous portions of the polypeptide chain. This would suggest that the sequence of any one portion of the protein would make an energetically trivial contribution to the total binding energy. But a survey of the structures of protein -small molecule interactions has shown that virtually all ligands larger than 300 Daltons interact with at least five contiguous amino acids in at least one peptide segment, 58 implying that there is an energetic 'hot spot' in the ligand-binding site similar to those identified in proteinprotein interactions via mutagenesis studies. 84 -86 A peptide segment, five amino acids in length, is adequate to define a binding motif identifiable on a genome-wide scan.
It may be possible to categorise small molecule-binding sites by their level of flexibility prior to ligand binding. Those binding sites that act most like the classical 'lock and key' are rigid prior to binding. Although these may be detected through the screening of cDNA display libraries, it is unlikely that they will be identified through affinity screening of peptide libraries. Detection of these receptors may be more amenable to cDNA display approaches as outlined above, or even to in silico docking approaches. Other sites may have rigid bases and flexible overlying peptides, as in examples presented in a survey by Freire. 87 Portions of these binding sites appear to be detectable using peptide phage display (see Taxole discussion below). Finally, there appear to be proteins that are significantly disordered prior to binding a ligand, such as the bacterial quorum-sensing pheromone from Agrobacterium tumefaciens. 88 These highly disordered binding sites may well be the most easily mimicked by the binding properties of peptides on the surface of phage particles and appear to be detectable through the use of peptide phage display (again, see Taxole discussion below). Initial successes with this approach suggest an important role for random peptide display as a drug target identification tool.

The case of Taxole
Taxole is a natural product with broad cytotoxic activity against a range of human tumours. It was first isolated from the bark of Pacific yew trees, and is now produced in a two-step semisynthesis from precursors extracted from the needles of farmed yew bushes. 89 The primary activity of Taxole is its cytotoxicity towards rapidly dividing cells, apparently brought on through p53-mediated apoptosis initiated by binding to b-tubulin, a process that induces microtubule bundling and halts mitosis.

Isolation of Taxole-selected peptides
Libraries of random 12-mer and disulfide-bond cyclised random 7-mer peptides displayed at the amino terminus of the mature pIII of M13 phage were selected for members that exhibited affinity to a biotinylated derivative of Taxole which had been immobilised on streptavidin-coated plates. 3,58 The sequences of Taxole-selected peptides were analysed in three ways: (i) The sequences of proteins known to bind to Taxole were compared with the Taxole-selected sequences as an internal control; (ii) the sequences of Taxole-selected peptides were analysed to identify motifs that occurred in multiple peptides and these motifs were used to search the human genome database for novel Taxole-binding proteins; and (iii) the full-length sequences of the Taxole-selected peptides were compared with the sequences of proteins on a genome-wide scale to identify those proteins with regions exhibiting sequences with high similarity to the Taxole-selected peptide population.
Only weak consensus sequences were found within either of the two selected peptide populations; however, the physicochemical properties of the selected phage populations were statistically evaluated and found to be substantially different from that of an unselected population. 3 This observation suggested that, in spite of the absence of a strong consensus sequence, the selection process had been successful. A comparison of the Taxole-binding peptide sequences to known Taxole-binding proteins was carried out to determine if any peptides were mimicking known Taxole-binding motifs.

Review REVIEW
Comparison of the Taxole-selected peptides to the sequence of b-tubulin Initial comparison of sequence motifs from the Taxoleselected populations with the sequence of human b-tubulin suggested no detectable similarity. 3 The development of algorithms which made it possible to compare the complete sequences of all the selected peptides with the sequences of naturally occurring proteins allowed for the detection of similarities between the sequences of the Taxole-binding peptides and the sequence of b-tubulin, however. 58 Figure 1 is a depiction of the three-dimensional structure of b-tubulin, in which the colour table has been chosen to reflect the similarity of the tubulin sequence to that of the random 12-mers selected for affinity to Taxole. Examination of this Figure indicates that the region of highest similarity is a surface loop in direct contact with the bound Taxole. This may be an example of the rigid base/flexible overlying loop scenario discussed above. Figure 1 demonstrates that the sequence of at least one segment of b-tubulin involved in Taxole binding is a weak motif within the sequences of the Taxole-selected peptides. Nevertheless, the representation of this putative Taxolebinding motif is relatively weak in the Taxole-selected peptide population. Although the surface loop in contact with the Taxole molecule exhibits the highest similarity to the sequences of the Taxole-selected peptides in b-tubulin, this similarity is not exceptionally high in comparison with that found when scanning the entire genome.
A similar comparison of the affinity-selected conformationally constrained 7-mer population with that of b-tubulin indicated that the protein segment of highest similarity to the peptides was remote from the Taxole-binding site, but that the second-highest similarity peak corresponded to a protein segment in direct contact with Taxole (results not shown). Again, the observed level of similarity was not exceptional in comparison with whole genome scanning.

Other Taxole-binding proteins
A comparison of the sequence of the human multi-drug transporter MDR1 (P-glycoprotein) to the sequences of Taxoleselected peptides resulted in the prediction of two putative Taxole-binding regions within the P-glycoprotein sequence. 58 Both of these regions fall within the putative drug-binding regions of the protein that have been identified previously using photo-affinity labelling, 4,5 demonstrating that the data are consistent with those peptides possessing Taxole-binding motifs.
Using biotin-labelled Taxole, Byrd et al. identified two Taxole targets from both mouse macrophages and brain as heat shock proteins of the 70-and 90-kDa families. 8 The structure of Hsp90 bound to geldamycin, a specific inhibitor of the Hsp90 family, has been elucidated. 90 The Hsp90 sequence has three segments exhibiting similarity to the sequences of the Taxole-selected peptides. Two of these are in alpha-helical structures that flank the geldamycin binding site; the third is at the end of a beta strand beneath the geldamycinbinding site, as shown in Figure 2. These results, when compared with the previous analyses, predict that Taxole will bind Hsp90 at the same site as geldamycin.
Experimental evidence from affinity chromatography suggests that Taxole interacts with murine CD18 in a manner mimicking that of lipopolysaccharide to produce an inflammatory response. 6 Although the three-dimensional structure of one subunit of CD18 has been elucidated and the Taxoleselected peptides have been used to predict regions with high probability of binding Taxole, there are no biochemical data available to evaluate the Taxole-binding site on CD18 predicted by this analysis.

Genome-wide survey of Taxole-binding motifs
An analysis of the Taxole-selected peptides identified several sequence motifs which appeared multiple times in the peptides. Sequences of all proteins in the human genome database were searched to identify proteins containing these sequences. This process led to the identification of the anti-apoptotic protein Bcl-2 as a candidate Taxole-binding protein containing the amino acid sequence motif HTPHP. Enzyme-linked immunosorbent assay binding studies and circular dichroism spectroscopy were used to show that the affinity of Taxole for Bcl-2 is in the 200 nM range. 3 Calculation of the similarity of the Taxole-selected peptides to the sequence of Bcl-2 indicated that the most likely site for the binding of Taxole to Bcl-2 was within a 50 amino acid unstructured loop near the amino terminus of the protein. A model of the three-dimensional structure of Bcl-2 is shown in Figure 3. This model is based upon the three-dimensional structure of Bcl-X L 91 and the more recent structures of Bcl-2, in which the unstructured loop was excised and replaced by the corresponding sequence from Bcl-X L . 92 The significance of the Taxole-Bcl-2 interaction Is the interaction of Taxole with Bcl-2 relevant to the clinical activity of Taxole? It has long been known that Bcl-2 is phosphorylated and inactivated in cells treated with Taxole. 93,94 This inactivation blocks the anti-apoptotic activity of Bcl-2, leading to apoptosis, a major mechanism of the cytotoxic activity of Taxole. It has been noted, however, that Taxole is a much more potent anti-cancer drug than other tubulin-binding mitotic spindle poisons. Experiments have pointed to the existence of a non-p53-dependent pathway towards apoptosis subsequent to the addition of Taxole to the cell culture. 95 -97 The direct interaction of Taxole with Bcl-2 may mediate a predicted alternative pathway to apoptosis, independent of mitotic block and the subsequent activity of p53. This hypothesis was reinforced when it was shown that application of Taxole to isolated mitochondria 98 induced the permeability transition that results in the release of cytochrome c, an obligate step in the apoptotic pathway. Bcl-2 is anchored in the outer membrane of mitochondria through a C-terminal tail, making it a potential mitochondrial target for mediating this activity. Although tubulin has also been shown to be a component of mitochondrial membranes, 99 making it possible that interaction with tubulin also induces this activity, the functional connection between tubulin and the molecular events involved in the mitochondrial permeability transition has been shown to involve Bcl-2. Removal of the flexible loop of Bcl-2 blocks the apoptotic action of Taxole, 100,101 as does phosphorylation. 102 It is a reasonable hypothesis that when this flexible regulatory loop is bound to Taxole, Bcl-2 is incapable of preventing the progression to apoptosis.

Whole-genome similarity calculations
In the genome-wide scan for potential Taxole-binding proteins, the authors identified a single protein, Bcl-2, that has since been shown to exhibit significant affinity for Taxole. 3 None of the previously known Taxole-targets were identified in this scan. There are several possible reasons for this, some technical and some fundamental. From a technical perspective, only 69 peptides selected for binding to Taxole were used for this novel ligand identification. In their more recent work, aimed at the identification of ATP-binding proteins (Makowski et al., 2003; in preparation), the authors routinely use the sequences of 500 affinity-selected peptides for genome-wide scans. While other Taxole-binding motifs were present in the Taxole-selected peptides, such as those matching the b-tubulin and P-glycoprotein sequences, these were less strongly represented than the motif found in Bcl-2. Sequencing of a larger number of peptide sequences makes it possible to observe multiple copies of the motifs that exhibit affinity for the target small molecule, greatly enhancing the likely success of a genome-wide search.
A more fundamental consideration is the fact that Taxole binding to Bcl-2 involves a 50 amino acid flexible loop, a structural context readily mimicked by the phage-displayed peptides. The Taxole-binding sites in the previously identified target structures (eg b-tubulin) appear to be more rigidly structured, involving several discontinuous segments of peptide. Any one peptide segment involved in a relatively rigid binding site may correspond to a motif that exhibits relatively low affinity on its own, leading to a relatively lower abundance in the affinity-selected peptides. These short motifs are observablethese loops were picked out in a scan of the individual proteins as the most likely site within the protein to be involved in binding of Taxole -but their observation in a genome-wide scan presents a formidable signal-to-noise problem. Careful analysis of the results of the authors' ongoing study of ATP-binding peptides, utilising the extensive body of knowledge available for ATP-binding proteins, is being used to provide a basis for further development of genome-wide methods.

Concluding remarks
The results from the Taxole affinity screening suggest that the utilisation of combinatorial peptide phage display to identify novel drug targets will be most successful for targets with binding sites that are highly flexible prior to ligand binding. Considerable technical improvements in the current methods are possible, both experimental and algorithmic, and the greatest potential is for identification of disordered binding sites that are difficult to find by other methods. cDNA and other display methods may be more appropriate for finding binding sites that involve relatively rigid constructs or multiple punctate interactions with short peptide segments. In silico docking methods for the prediction of binding sites are also more likely to be successful in the identification of rigid binding sites in target proteins. A complete accounting of the binding repertoire of a small molecule drug will require a combination of methods, each best suited to a subset of the potential targets.
The results outlined here indicate that utilisation of phage display of cDNA expression products and random peptides holds significant promise as a strategy for identification of all the protein targets of a small molecule, as well as identification of specific regions in those proteins that may be involved in the binding process. This model is based on the three-dimensional structure of Bcl-X L (pdb file 1LXL 103 ) and the more recent structures of Bcl-2 in which the unstructured loop was excised and replaced by the corresponding sequence from Bcl-X L (pdb file 1GJH 104 ). Protein is rendered using colours to represent the similarity between the sequence of the protein and the sequences of peptides selected for affinity to Taxole. Red indicates a high level of similarity, blue a very low level. The highest level of similarity is in the flexible 50 amino acid loop, which has no structure detectable either from X-ray crystallography or nuclear magnetic resonance spectroscopy, and is thus highly mobile in solution