Krüppel-like factors: Three fingers in control

Krüppel-like factors (KLFs), members of the zinc-finger family of transcription factors capable of binding GC-rich sequences, have emerged as critical regulators of important functions all over the body. They are characterised by a highly conserved C-terminal DNA-binding motif containing three C2H2 zinc-finger domains, with variable N-terminal regulatory domains. Currently, there are 17 KLFs annotated in the human genome. In spite of their structural similarity to one another, the genes encoding different KLFs are scattered all over the genome. By virtue of their ability to activate and/or repress the expression of a large number of genes, KLFs regulate a diverse array of developmental events and cellular processes, such as erythropoiesis, cardiac remodelling, adipogenesis, maintenance of stem cells, epithelial barrier formation, control of cell proliferation and neoplasia, flow-mediated endothelial gene expression, skeletal and smooth muscle development, gluconeogenesis, monocyte activation, intestinal and conjunctival goblet cell development, retinal neuronal regeneration and neonatal lung development. Characteristic features, nomenclature, evolution and functional diversities of the human KLFs are reviewed here.


Introduction
Krüppel-like factors (KLFs) are members of the zinc-finger family of transcription factors named after their similarity to the Drosophila gap gene Krüppel. 1 KLFs are characterised by a DNA-binding motif containing three well-conserved C2H2 zinc-finger domains located in the carboxy terminal of the protein capable of binding GC-rich sequences, such as CACCC elements present in the proximal promoters of many eukaryotic genes. 2 -7 The transcriptional regulatory domains located in the amino terminal of different KLFs are variable, resulting in their ability to interact with co-activators and/or co-repressors, culminating in the activation or repression of a given promoter activity. The presence of variable structural motifs outside of the DNA-binding domain of the KLF family members is reflected in their functional diversity. 3,8 Characteristic features, nomenclature, evolution and functions of the human KLFs are reviewed here.

Characteristic features of the zinc-finger domain in KLFs
The 81-amino acid DNA-binding zinc-finger domain is highly conserved among the members of the KLF family, with more than 65 per cent amino acid sequence identity among the family members. The specific amino acids critical for DNA binding are highly conserved, imparting an ability to different KLFs that interact with similar cis-elements, UPDATE ON GENE COMPLETIONS AND ANNOTATIONS such as GT boxes or GC-rich sequences like CACCC. The C 2 H 2 zinc finger present in the KLFs consists of two short beta strands followed by an alpha helix. In the classical C 2 H 2 zinc-finger domain, two conserved cysteines and histidines coordinate a zinc ion. The pattern of amino acid arrangement in a classical zinc finger is as follows: where C, H and X correspond to cysteine, histidine and any amino acid, respectively, and numbers indicate the number of residues separating the flanking amino acids. The amino acids that are important for the stable fold of the zinc finger are marked with the # symbol. The amino acid occupying the final position can be either histidine or cysteine. The linker sequence in between the zinc-finger domains (TGE(R/K)P(Y/F)X) is also highly conserved in KLF proteins. 9

Nomenclature of KLFs
The nomenclature of KLFs has evolved over the years. KLFs were initially named after the tissue in which they were detected or highly expressed, such as erythroid KLF (EKLF or KLF1), 10 Table 1. Several other related proteins, such as the members of the Sp family of proteins, GLI2, GLI3, and the pseudogene KLF7P, are not included in this list, for the sake of brevity.

Evolution of KLFs
KLFs are closely related to the Sp family of zinc-finger transcription factors, of which there are nine members in the human genome (Sp1-Sp9). Currently, there are 17 KLFs annotated in the human genome. The high level of conservation of structure and function of KLF proteins in different species is a reflection of their ancient evolutionary history. The 17 genes encoding different KLFs are scattered all over the human genome, and there are also 17 Klf genes in the mouse genome. This indicates that these genes are ancient and suggests the involvement of gene duplications and translocations in their evolution. The exon -intron organisation of human KLF genes is not well conserved. For example, while KLF12 has eight exons, KLF14 is encoded on a single exon (Table 1). Based on an extensive phylogenetic analysis with the amino acid sequences of KLF proteins from different species, it was proposed that the mammalian KLF genes have evolved in two phases -the first in the chordate Figure 1. Phylogenetic tree generated using the complete amino acid sequence of human KLF proteins by ClustalW2 web-based program (http://www.ebi.ac.uk/Tools/es/cgi-bin/clustalw2). Evolutionary distances are shown next to the corresponding names.
lineage, during the early emergence of vertebrates, and the second in the mammalian lineage. 23 This phylogenetic analysis also identified six different ascidian zinc-finger proteins as the ancestral genes for the distinct subgroups of vertebrate KLF genes. 23 In view of the intron-less nature of KLF14 and its homology with KLF16, it has been suggested that KLF14 is an ancient retrotransposed copy of KLF16. 24 Phylogenetic analysis of the 17 human KLF complete amino acid sequences by the neighbour-joining method using the ClustalW2 program (http://www.ebi.ac. uk/Tools/es/cgi-bin/clustalw2) indicated that KLFs 5, 17 and 8 are related more to each other than to the rest of the KLFs, which are further grouped into two major clades ( Figure 1). According to this analysis, KLFs 9 and 16 are the most recent KLFs to have diverged from each other, followed by KLFs 6 and 7 ( Figure 1). This is consistent with the similar expression pattern, common ability to interact with mSin3A (a core component of a large multiprotein co-repressor complex with associated histone deacetylase enzymatic activity) and shared cellular function of cell cycle regulation attributed to KLFs 9 and 16 ( Table 2).

Functions of KLFs
By virtue of their ability to activate and/or repress the expression of a large number of genes, KLFs regulate a diverse array of developmental events and cellular processes such as haematopoiesis, 75

Future directions
A large body of work over the past 25 years has established the KLFs as critical regulators of diverse functions in many parts of the body. In spite of this progress in our understanding of the properties of KLFs, much remains to be uncovered. In order fully to understand the properties of KLFs in diverse spatiotemporal contexts and physiological conditions, it is crucial to identify (a) the co-factors that they interact with; (b) their target genes; (c) the signal transduction pathways by which they are regulated; and (d) their unique tissue-specific roles using conditional knockouts. It is expected that these avenues of research will lead to exciting discoveries regarding the involvement of KLFs in human health and disease.