Article objectives

  • To define DNA technology, Human Genome Project, and PCR.
  • To describe the goals of the Human Genome Project.
  • To describe gene cloning and the processes involved.
  • To describe the processes involved in PCR.
  • Is it really possible to clone people? Another question is, should we clone people? Are scientific fantasies, such as depicted on TV shows such as Star Trek or in the movie GATTACA, actually a possibility? Who can really say? How, really, will science affect our future? The answers partially lie in the field of biotechnology.

    Biotechnology is technology based on biological applications. These applications are increasingly used in medicine, agriculture and food science. Biotechnology combines many features of biology, including genetics, molecular biology, biochemistry, embryology, and cell biology. Many aspects of biotechnology center around DNA and its applications, otherwise known as DNA technology. There are many current applications of biotechnology; however, we will focus on the applications towards medicine and the extension into the forensic sciences. First, though, we need to understand DNA technology.

    DNA Technology

    What is DNA technology? Is it using and manipulating DNA to help people? Is it using DNA to make better medicines and individualized treatments? Is it analyzing DNA to determine predispositions to genetic diseases? The answers to these questions, and many more, is yes. And the answers to many of these issues begin with the Human Genome Project.

    The Human Genome Project

    If we are all 99.9% genetically identical, what makes us different? How does that 0.1% make us tall or short, light or dark, develop cancer or not? To understand that 0.1%, we also need to understand the other 99.9%. Understanding the human genome is the goal of The Human Genome Project (HGP). This project, publicly funded by the United States Department of Energy (DOE) (Figure 1); and the National Human Genome Research Institute (NHGRI), part of the National Institutes of Health (NIH), may be one of the landmark scientific events of our lifetime.

    Figure 1: The Human Genome Project logo of the DOE.

    The goal of the HGP is to understand the genetic make-up of the human species by determining the DNA sequence of the human genome (Figure 2);and the genome of a few model organisms. However, it is not just determining the 3 billion bases; it is understanding what they mean. Today, all 3 billion base pairs have been sequenced, and the genes in that sequence are in the process of being identified and characterized. A preliminary estimate of the number of genes in the human genome is around 22,000 to 23,000.

    Figure 2: A depiction of DNA sequence analysis. Note the 4 colors utilized, each representing a separate base.

    The sequence of the human DNA is stored in databases available to anyone on the Internet. The U.S. National Center for Biotechnology Information (NCBI), part of the NIH, as well as comparable organizations in Europe and Japan, maintain the genomic sequences in a database known as Genbank. Protein sequences are also maintained in this database. The sequences in these databases are the combined sequences of anonymous donors, and as such do not yet address the individual differences that make us unique. However, the known sequence does lay the foundation to identify the unique differences among all of us. Most of the currently identified variations among individuals will be single nucleotide polymorphisms, or SNPs. A SNP (pronounced ”snip”) is a DNA sequence variation occurring at a single nucleotide in the genome. For example, two sequenced DNA fragments from different individuals, GGATCTA to GGATTTA, contain a difference in a single nucleotide. If this base change occurs in a gene, the base change then results in two alleles: the C allele and the T allele. Remember an allele is an alternative form of a gene. Almost all common SNPs have only two alleles. The effect of these SNPs on protein structure and function, and any effect on the resulting phenotype, is an extensive field of study.

    Gene Cloning

    You probably have heard of cloning. Whereas cloning of humans has many ethical issues associated with it, the cloning of genes has been ongoing for well over 30 years, with cloning of animals occurring more recently. Gene cloning, also known as molecular cloning, refers to the process of isolating a DNA sequence of interest for the purpose of making multiple copies of it. The identical copies are clones. In 1973, Stanley Cohen and Herbert Boyer developed techniques to make recombinant DNA, a form of artificial DNA.

    Recombinant DNA is engineered through the combination of two or more DNA strands, combining DNA sequences which would not normally occur together. In other words, selected DNA (or the DNA of ”interest”) is inserted into an existing organismal genome, such as a bacterial plasmid DNA or some other sort of vector. The recombinant DNA can then be inserted into another cell, such as a bacterial cell, for amplification and possibly production of the resulting protein. This process is called transformation, the genetic alteration of a cell resulting from the uptake, incorporation, and expression of foreign genetic material. Recombinant DNA technology was made possible by the discovery of restriction endonucleases.

    Restriction Enzyme Digestion and Ligation

    In the classical restriction enzyme digestion and ligation cloning protocols, cloning of any DNA fragment essentially involves four steps: 1. isolation of the DNA of interest (or target DNA)
    2. ligation
    3. transfection (or transformation)
    4. a screening/selection procedure.

    Isolation of DNA

    Initially, the DNA fragment to be cloned needs to be isolated. This DNA of interest may be a gene, part of a gene, a promoter, or another segment of DNA, and is frequently isolated by the Polymerase Chain Reaction (PCR) or restriction enzyme digestion. A restriction enzyme (or restriction endonuclease) is an enzyme that cuts double-stranded DNA at a specific sequence (Table 1). The enzyme makes two incisions, one through each strand of the double helix, without damaging the nitrogenous bases. This produces either overlapping ends (also known as sticky ends) or blunt ends.

    Table 1

    AG↓AATTCBCCC↓GGG
    CTTAA↑G GGG↑CCC

    A. EcoRI digestion produces overlapping ”sticky” ends: The enzyme cleaves between the G and A on both strands. B. SmaII restriction enzyme cleavage produces ”blunt” ends. The enzyme cleaves between the G and C on both strands.

    The 1978 Nobel Prize in Medicine was awarded to Daniel Nathans and Hamilton Smith for the discovery of restriction endonucleases. The first practical use of their work was the manipulation of E. coli bacteria to produce human insulin for diabetics.

    Ligation

    Once the DNA of interest is isolated, a ligation procedure is necessary to insert the amplified fragment into a vector to produce the recombinant DNA molecule. Restriction fragments (or a fragment and a plasmid/vector) can be spliced together, provided their ends are complementary. Blunt end ligation is also possible.

    The plasmid or vector (which is usually circular) is digested with restriction enzymes, opening up the vector to allow insertion of the target DNA. The two DNAs are then incubated with DNA ligase, an enzyme that can attach together strands of DNA with double strand breaks. This produces the recombinant DNA molecule. Figure 3 depicts a plasmid with two additional segments of DNA ligated into the plasmid, producing the recombinant DNA molecule. Figure 4 depicts DNA before and after ligation.

    Figure 3: This image shows a line drawing of a plasmid. The plasmid is drawn as two concentric circles that are very close together, with two large segments and one small segment depicted. The two large segments (1 and 2) indicate antibiotic resistances usually used in a screening procedure, and the small segment (3) indicates an origin of replication. The resulting DNA is a recombinant DNA molecule.

    Figure 4: Sticky ends produced by restriction enzyme digestion can be joined with the enzyme DNA ligase.

    Transfection and Selection

    Following ligation, the recombinant DNA is placed into a host cell, usually bacterial, in a process called transfection or transformation. Finally, the transfected cells are cultured. Many of these cultures may not contain a plasmid with the target DNA as the transfection process is not usually 100% successful, so the appropriate cultures with the DNA of interest must be selected. Many plasmids/vectors include selectable markers - usually some sort of antibiotic resistance (Figure 3). When cultures are grown in the presence of an antibiotic, only bacteria transfected with the vector containing resistance to that antibiotic should grow. However, these selection procedures do not guarantee that the DNA of insert is present in the cells. Further analysis of the resulting colonies is required to confirm that cloning was successful. This may be accomplished by means of a process known as PCR (see below) or restriction fragment analysis, both of which need to be followed by gel electrophoresis and/or DNA sequencing (DNA sequence analysis).

    DNA sequence analysis (the analysis of the order of the nitrogenous bases that make up the DNA), PCR, or restriction fragment analysis will all determine if the plasmid/vector contains the insert. Restriction fragment analysis is digestion of isolated plasmid/vector DNA with restriction enzymes. If the isolated DNA contains the target DNA, that fragment will be excised by the restriction enzyme digestion. Gel electrophoresis will separate DNA molecules based on size and charge. Examples are shown in Figure 5.

    Figure 5: Agarose gel following agarose gel electrophoresis on UV light box: In the gel with UV illumination (left), the ethidium bromide stained DNA glows pink; Right, photo of a gel. Far left: DNA ladder of fragments of known length. Lane 1: A PCR product of just over 500 bases. Lane 2: Restriction digest showing the 500 base fragment cut from a 4.5 kb plasmid vector.

    Gel Electrophoresis

    Gel electrophoresis is an analytical technique used to separate DNA fragments by size and charge. Notice in Figure 5 that the ”gels” are rectangular in shape. The gels are made of a gelatin-like material of either agarose or polyacrylamide. An electric field, with a positive charge applied at one end of the gel, and a negative charge at the other end, forces the fragments to migrate through the gel. DNA molecules migrate from negative to positive charges due to the net negative charge of the phosphate groups in the DNA backbone. Longer molecules migrate more slowly through the gel matrix. After the separation is completed, DNA fragments of different lengths can be visualized using a fluorescent dye specific for DNA, such as ethidium bromide. The resulting stained gel shows bands correspond to DNA molecules of different lengths, which also correspond to different molecular weights. Band size is usually determined by comparison to DNA ladders containing DNA fragments of known length. Gel electrophoresis can also be used to separate RNA molecules and proteins.

    The Polymerase Chain Reaction

    The Polymerase Chain Reaction (PCR) is used to amplify specific regions of a DNA strand millions of times. A region may be a number of loci, a single gene, a part of a gene, or a non-coding sequence. This technique produces a useful quantity of DNA for analysis, be it medical, forensic or some other form of analysis. Amplification of DNA from as little as a single cell is possible. Whole genome amplification is also possible.

    PCR utilizes a heat stable DNA polymerase, Taq polymerase, named after the thermophilic bacterium Thermus aquaticus, from which it was originally isolated. T. aquaticus is a bacterium that lives in hot springs and hydrothermal vents, and Taq polymerase is able to withstand the high temperatures required to denature DNA during PCR. Taq polymerase’s optimum temperature for activity is between 75°C and 80°C. Recently other DNA polymerases have also been used for PCR.

    A basic PCR involves a series of repeating cycles involving three main steps (see Figure 6):

    1. denaturation of the double stranded DNA
    2. annealing of specific oligonucleotide primers
    3. extension of the primers to amplify the region of DNA of interest

    These steps will be discussed in additional detail below.

    The oligonucleotide primers are single stranded pieces of DNA that correspond to the 5’ and 3’ ends of the DNA region to be amplified. These primers will anneal to the corresponding segment of denatured DNA. Taq Polymerase, in the presence of free deoxynucleotide triphosphates (dNTPs), will extend the primers to create double stranded DNA. After many cycles of denaturation, annealing and extension, the region between the two primers will be amplified.

    The PCR is commonly carried out in a thermal cycler, a machine that automatically allows heating and cooling of the reactions to control the temperature required at each reaction step (see below). The PCR usually consists of a series of about 30 to 35 cycles. Most commonly, PCR is carried out in three repeating steps, with some modifications for the first and last step.

    PCR is usually performed in small tubes or wells in a tray, each often beginning with the complete genome of the species being studied. As only a specific sequence from that genome is of interest, the sequence specific primers are targeted to that sequence. PCR is done with all the building blocks necessary to create DNA: template DNA, primers, dNTPs, and a polymerase.

    Figure 6: PCR: A repeating cycle of denaturation (1), annealing (2), and extension (3). Notice that initially there is a double strand of DNA, and after denaturation, the DNA is single stranded. In the annealing step (2), single stranded primers bind. These primers are extended by Taq Polymerase, represented by the green ball (3).

    The three basic steps of PCR (Figure 6) are:

    • Denaturation step: This step is the first regular cycling event and consists of heating the reaction to 94 - 98°C for 30 to 60 seconds. It disrupts the hydrogen bonds between complementary bases of the DNA strands, yielding single strands of DNA.

    • Annealing step: The reaction temperature is lowered to 50-65°C for 30 to 60 seconds, allowing annealing of the primers to the single-stranded DNA template. Stable hydrogen bonds form between the DNA strand (the template) and the primers when the primer sequence very closely matches the complementary template sequence. Primers are usually 17 - 22 nucleotides long and are carefully designed to bind to only one site in the genome. The polymerase binds to the primer-template hybrid and begins DNAsynthesis.

    • Extension step: A temperature of around 72°C is used for this step, which is close to the optimum temperature of Taq polymerase. At this step the Taq polymerase extends the primer by adding dNTPs, using one DNA strand as a template to create a the other (new) DNA strand. The extension time depends on the length of the DNA fragment to be amplified. As a standard, at its optimum temperature, the DNA polymerase will polymerize a thousand bases in one minute.

    Utilizing the PCR, DNA can be amplified millions of times to generate quantities of DNA that can be used for a number of purposes. These include the use of DNA for prenatal or genetic testing, such as testing for a specific mutation. PCR has revolutionized the fields of biotechnology, human genetics, and a number of other sciences. PCR was developed in 1983 by Kary Mullis. Due to the importance of this process and the significance it has had on scientific research, Dr. Mullis was awarded the Nobel Prize in Chemistry in 1993, just 10 years after his discovery.

    To say that PCR, molecular cloning and the Human Genome Project has revolutionized biology and medicine would be an understatement. These efforts have led to numerous accolades, including Nobel prizes, and more may follow.

    Images and table courtesy of:

    Table Source: Doug Wilkin, License: CC-BY-SA

    http://commons.wikimedia.org/wiki/Image:DNA_sequence.png. Public Domain.

    http://commons.wikimedia.org/wiki/Image:Example_plasmid.png. GNU-FDL.

    http://commons.wikimedia.org/wiki/Image:Ligation.svg. GNU-FDL.

    http://en.wikipedia.org/wiki/File:Agarosegelphoto.jpg. GNU-FDL.

    http://commons.wikimedia.org/wiki/Image:PCR.svg. GNU-FDL.