115:412/508                                                                                                                           spring 2002

Special problems of purifying recombinant proteins

I have already made some mention the production of recombinant pro­teins in E. coli and other hosts; here I want to address the special problems in­volved.  If your protein is present only in very low concentrations in its natural source, or the source is not readily available, such as humans, an alternative is to clone the gene and produce the protein in a bacterium, yeast, or mammalian or insect cells in tissue culture.  This is discussed in Rosenberg, pp. 335-350.  Pages 350-361 discuss use of eukaryotic cells, though much is on getting the foreign DNA into the host cell.

If a bacterial host is used, there are two big problems, whether necessary post-translational modifications occur, and whether the proteins accumulate as insoluble "inclusion bodies", which typically also include the 4 subunits of RNA polymerase, the outer membrane pro­teins OmpA, C and F, and ribosomal RNA.  It is not entirely clear why this happens, but accumulation of a large amount of a single protein at high concentration is probably a major factor.  The protein in inclusion bodies typically is misfolded, and will have either incorrect disul­fide bonds or intermolecular bonds rather than intramolecular.  The proteins are released from these inclusion bodies only by powerful denaturing agents, 6 m urea or 8 m guanidinium HCl; they must then be renatured by dilution and dialysis away of the denaturing agent, typi­cally in steps, followed by concentra­tion of the dilute protein.  During this process formation of incorrect disulfide bonds frequently occurs.  This problem can be minimized in principle by adding protein disulfide isomerase, which corrects such problems in the cell, or more practi­cally by passing through a column of thiol-Sephadex, which was shown almost 30 years ago to be very effective in reducing protein disulfides, and appears to act like a disulfide isomerase; or by carrying out the renatura­tion in the presence of an empirically established mixture of oxi­dized and reduced forms of a thiol compound - this was done beauti­fully by one of our under­graduates working at Schering.  Another problem is that if there are any proteases present, they just love chewing on unfolded pro­teins, and must be inhibited by inhibitors such as PMSF.  See Rosenberg pp. 339-340 for a sample procedure.

It is better to avoid forming the inclusion bodies in the first place.  This can be done by: producing the protein in eukaryotic cells - though even this doesn't guarantee soluble protein, it may be stuck in the Golgi apparatus, and in any case this is slower, more prone to contamin­ation, and yields lower trations of the protein.  Secreting the protein from cells, after genetically fusing a secretory signal peptide to the amino terminal of the protein, is better.  Bacillus subtilis  and streptomycetes are much better secretors than E. coli, if you clone the gene in an appro­priate vector.  Sometimes simply growing the cells at 30° rather than 37° helps a lot, for unknown reasons, not connected with incipient heat denaturation, as even heat-stable proteins such as RNAse A can form inclusion bodies at 37°.  It may be possible to improve solubility by specific mutations in the protein which do not affect activity but change the solubility; negative­ly charged and hydrophilic proteins are secreted better than positively charged, hydrophobic proteins.  It is hoped that co-cloning of 'fold­ases' will prevent IB formation in the cell, but they don't prevent it in lysates.

When a gene has been cloned, it can be fused with another gene, so that a fusion pro­tein is produced with properties of two pro­teins, or a bit of sequence can be added.  The first reason for this is that the fusion protein is induced with the inducer of the fused bacterial pro­tein - for instance isopropyl thiogal­actoside for b-galactosidase.  But it is now normal simply to clone the gene in a vector with the desired promoter, whether or not b-galactosidase is pro­duced.  The second is that the fusion protein can be recognized by either the activity or the antigenicity of the fused bacterial protein.  And fusion can be taken advantage of for purifica­tion of the cloned protein, especially if the other protein can be separated by a specific proteo­lytic cleavage.  For instance, the cloned gene can be coupled with that for a maltose-binding protein, and the fusion protein can then be isolated by adsorption on an amylose (starch) column.  The maltose-binding protein used for this has a C-terminal linker sequence which is cleaved specifically by the blood-clotting protease Factor Xa, to allow the cloned protein to be freed from the fusion.  Similarly, the cloned gene can be coupled with glutathione S-trans­ferase, the fusion protein isolated by adsorption on an immobilized glutathione column, eluted with free gluta­thione, and cleaved by thrombin, which like Factor Xa is fairly specific for cleav­age at exposed arginine residues, probably in specif­ic sequences.  However, there is always the possibility that the cleaving protease will also cleave the desired protein somewhere.

Another type of modification of a cloned protein is the attachment of a hexahistidine peptide.  This interacts tightly with immobilized Cu++, Ni++ or Co++ ions, chelated by iminodi­acetate or other chelator attached to a solid support), permitting separation from almost all other proteins, followed by elution with EDTA to compete for the Cu++ or imidazole to compete for Ni++.  Such a method can be used to purify the protein while still unfolded.  While I haven't worked with any of these methods myself, I think the hexahistidine method is probably best, because it usually doesn't interfere with the function of the protein and so the tag doesn't have to be removed and you don't have to take a chance with proteases.  It's my impression that this is winning out as a method for purifying cloned proteins.  See Rosenberg, pp. 345-6.  Qiagen sells expression vectors and Ni-NTA columns for the purpose.  I give you copies of their material, and of a competitor's.

However, one may not attach any tag, and purify the protein by conven­tional means, thus being sure that the tag does not affect the protein’s activity.

Yet another methiod mentioned in Rosenberg is the FLAG epitope, the sequence Asp-Tyr-Asp-Asp-Asp-Asp-Lys, which is bound by an antibody in presence of Ca++.  It is then eluted with EDTA, and the FLAG cleaved off with the protease enterokinase, which recognizes this sequence.

A final remark is that protein stability may be modified by site-specific mutagenesis, but "such mutations are rather easy to produce but time-consum­ing to characterize; even selective mutagenesis may deplete the graduate stu­dent supply long before all the possibilities are exhausted."