Difference between revisions of "Sequencing"

From Biolecture.org
imported>Jeongmin Oh
imported>Eunjin RYU
Line 1: Line 1:
<p><font size="4">[[Sequencing technology]]<br />
+
<p><span style="font-size:24px"><strong>DNA Sequencing</strong></span></p>
[[Sequencing companies]]<br />
+
 
[[Sequencing assembly program]]<br />
 
</font><br />
 
<br />
 
&nbsp;</p>
 
<hr />
 
<p><font size="6">What is sequencing?<br />
 
<font size="3">Sequencing means to determine the order of signals in a polymer.</font>&nbsp;<br />
 
<br />
 
</font><font size="3">In genetics and biochemistry, <strong>sequencing</strong> means to determine the primary structure (sequence) of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a <strong>sequence</strong> which succinctly summarizes much of the atomic-level structure of the sequenced molecule. </font></p>
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
 +
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<h2><span class="mw-headline">[[DNA sequencing]]</span></h2>
+
 
<p><font size="3">DNA sequencing is the process of determining the nucleotide order of a given <font color="#810081">DNA</font> fragment. Thus far, most DNA sequencing has been performed using the chain termination method developed by Frederick Sanger. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as Pyrosequencing are gaining an increasing share of the sequencing market. More genome data is being produced by pyrosequencing than Sanger DNA sequencing these days. Pyrosequencing has enabled rapid genome sequencing. Bacterial genome can be sequenced in a single run with several X coverage with this technique. This technique was also used to sequence the genome of James Watson recently.</font></p>
+
<p><span style="font-size:20px"><strong>What is Sequencing?</strong></span></p>
<p><font size="3">The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in 'pure' research into why and how organisms live, as well as in applied subjects. Because of the key nature of DNA to living things, knowledge of DNA sequence may come in useful in practically any biological research. For example, in medicine it can be used to identify, diagnose and potentially develop treatments for genetic diseases. Similarly, research into pathogens may lead to treatments for contagious diseases. Biotechnology is a burgeoning discipline, with the potential for many useful products and services.</font></p>
+
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<h3><span class="mw-headline">Sanger sequencing</span></h3>
+
 
<div class="thumb tright">
+
<p><span style="font-size:14px">DNA sequencing is the process of determining the nucleotide order of a given DNA fragment. Thus far, most DNA sequencing has been performed using the chain termination method developed by Frederick Sanger. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as Pyrosequencing are gaining an increasing share of the sequencing market. More genome data is being produced by pyrosequencing than Sanger DNA sequencing these days. Pyrosequencing has enabled rapid genome sequencing. Bacterial genome can be sequenced in a single run with several X coverage with this technique. This technique was also used to sequence the genome of James Watson recently.</span></p>
<div style="width: 162px" class="thumbinner"><img class="thumbimage" border="0" alt="Part of a radioactively labelled sequencing gel" width="160" height="332" src="http://upload.wikimedia.org/wikipedia/commons/c/cb/Sequencing.jpg" />
+
 
<div class="thumbcaption">
 
<div style="float: right" class="magnify"><img alt="" width="15" height="11" src="http://en.wikipedia.org/skins-1.5/common/images/magnify-clip.png" /></div>
 
Part of a radioactively labelled sequencing gel</div>
 
</div>
 
</div>
 
<p><font size="3">In chain terminator sequencing (Sanger sequencing), extension is initiated at a specific site on the template DNA by using a short oligonucleotide 'primer' complementary to the template at that region. The oligonucleotide primer is extended using a DNA polymerase, an enzyme that replicates DNA. Included with the primer and DNA polymerase are the four deoxynucleotide bases (DNA building blocks), along with a low concentration of a chain terminating nucleotide (most commonly a <strong>di-</strong>deoxynucleotide). Limited incorporation of the chain terminating nucleotide by the DNA polymerase results in a series of related DNA fragments that are terminated only at positions where that particular nucleotide is used. The fragments are then size-separated by electrophoresis in a slab polyacrylamide gel, or more commonly now, in a narrow glass tube (capillary) filled with a viscous polymer.</font></p>
 
<div class="thumb tright">
 
<div style="width: 182px" class="thumbinner"><img class="thumbimage" border="0" alt="View of the start of an example dye-terminator read (click to expand)" width="180" height="42" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/44/Sanger_sequencing_read_display.gif/180px-Sanger_sequencing_read_display.gif" />
 
<div class="thumbcaption">
 
<div style="float: right" class="magnify"><img alt="" width="15" height="11" src="http://en.wikipedia.org/skins-1.5/common/images/magnify-clip.png" /></div>
 
View of the start of an example dye-terminator read (click to expand)</div>
 
</div>
 
</div>
 
<p><font size="3">An alternative to the labelling of the primer is to label the terminators instead, commonly called 'dye terminator sequencing'. The major advantage of this approach is the complete sequencing set can be performed in a single reaction, rather than the four needed with the labeled-primer approach. This is accomplished by labelling each of the dideoxynucleotide chain-terminators with a separate fluorescent dye, which fluoresces at a different wavelength. This method is easier and quicker than the dye primer approach, but may produce more uneven data peaks (different heights), due to a template dependent difference in the incorporation of the large dye chain-terminators. This problem has been significantly reduced with the introduction of new enzymes and dyes that minimize incorporation variability.</font></p>
 
<p><font size="3">This method is now used for the vast majority of sequencing reactions as it is both simpler and cheaper. The major reason for this is that the primers do not have to be separately labelled (which can be a significant expense for a single-use custom primer), although this is less of a concern with frequently used 'universal' primers.</font></p>
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<h3><span class="mw-headline">Pyrosequencing</span></h3>
+
 
<p><font size="3">Pyrosequencing, which was originally developed by Mostafa Ronaghi, has been commercialized by Biotage (for low throughput sequencing) and 454 Life Sciences (for high-throughput sequencing). The latter platform sequences roughly 100 megabases in a 7-hour run with a single machine. In the array-based method (commercialized by 454 Life Sciences), single-stranded DNA is annealed to beads and amplified via emPCR. These DNA-bound beads are then placed into wells on a fiber-optic chip along with enzymes which produce light in the presence of ATP. When free nucleotides are washed over this chip, light is produced as ATP is generated when nucleotides join with their complementary base pairs. Addition of one (or more) nucleotide(s) results in a reaction that generates a light signal that is recorded by the CCD camera in the instrument. The signal strength is proportional to the number of nucleotides, for example, homopolymer stretches, incorporated in a single nucleotide flow. [1]</font></p>
+
<p><span style="font-size:20px"><strong>Why sequencing is important?</strong></span></p>
<p><span style="font-size: medium"><b>Synthesis based sequencing by Illumina</b></span></p>
+
 
<p><font size="3">Solexa, now part of Illumina developed a sequencing technology based on reversible dye-terminators. DNA molecules are first attached to primers on a slide and amplified so that local clonal colonies are formed (bridge amplification). Four types of ddNTPs are added, and non-incorporated nucleotides are washed away. Unlike pyrosequencing, the DNA can only be extended one nucleotide at a time. A camera takes images of the fluorescently labeled nucleotides then the dye along with the terminal 3' blocker is chemically removed from the DNA, allowing a next cycle. The final product of the SBS is many [[DNA reads]].</font>&nbsp;</p>
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
 +
 +
<p><span style="font-size:14px">Genome sequencing is figuring out the order of DNA nucleotides, or bases. When you read a sentence, the meaning is not just in the sequence of the letters. It is also in the words those letters make and in the grammar of the language. Similarly, the human genome is more than just its sequence. Sequencing the genome is an important step towards understanding it. Scientists also hope that being able to study the entire genome sequence will help them understand how the genome works&mdash;how genes work together to direct the growth, development and maintenance of an entire organism. Finally, genes account for less than 25 percent of the DNA in the genome, and so knowing the entire genome sequence will help scientists study the parts of the genome outside the genes. This includes the regulatory regions that control how genes are turned on an off, as well as long stretches of junk DNA.</span></p>
 +
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<p><b style="font-size: medium;">ligation based sequencing&nbsp;</b></p>
+
 
 +
<p><span style="font-size:20px"><strong>How sequencing is operated?</strong></span></p>
 +
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<p><span style="font-size: medium;">In this method, octomer-oligonucleotide probe is used, and instead of DNA polymerase, DNA ligase is used, as its name implies. Each of probes are dyed according to its first two nucleotides. First, primer is annealed to the template strand, probe follows, and DNA ligase is applied on the junction. unbound probe is washed away, and signal from bound probe is detected. End of probe is cleaved away from 3 nucleotide behind from first two nucleotides. Cycle is repeated with another primer, which has different starting base. The data are collected and compared to compensate empty nucleotides. This method is usually used for short strands.&nbsp;</span></p>
+
 
 +
<p><span style="font-size:14px">The whole genome can&#39;t be sequenced all at once because available methods of DNA sequencing can only handle short stretches of DNA at a time. So instead, scientists must break the genome into small pieces, sequence the pieces, and then reassemble them in the proper order. An automatic sequencing machine spits out what genome scientists call &quot;raw&quot; sequence. In raw sequence, the reads or short DNA sequences are all jumbled together. The process of transforming the fragmented rough draft into a long, continuous final product without breaks or errors is called finishing. Finishing often takes longer than the sequencing itself.</span></p>
 +
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
 +
 +
<p><span style="font-size:20px"><strong>Which kinds of sequencing techniques are available?</strong></span></p>
 +
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<h2><span class="mw-headline">RNA sequencing</span></h2>
+
 
<p><font size="3"><font color="#810081">RNA</font> is less stable in the cell, and also more prone to nuclease attack experimentally. As RNA is generated by transcription from DNA, the information is already present in the cell's DNA. However, it is sometimes desirable to sequence RNA molecules. In particular, in Eukaryotes RNA molecules are not necessarily co-linear with their DNA template, as introns are excised. To sequence RNA, the usual method is first to convert the sample to complimertary DNA fragments, by reverse transcriptase. This can then be sequenced as described above.&nbsp;</font></p>
+
<p><span style="font-size:14px">To facilitate their full genome sequencing initiatives, &ldquo;Illumina&rdquo; licensed nanopore sequencing technology from Oxford Nanopore Technologies. Illumina sequencing system based on reversible terminators for sequence determination. It is an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Another possible way to accomplish cost-effective high-throughput sequencing is by utilizing fluorophore technology. Pacific Biosciences is currently using this approach in their SMRT (Single Molecule Real Time) DNA sequencing technology. SMRT sequencing is a harnesses the natural process of DNA replication and enables real-time observation of DNA synthesis by using phosphorlinked nucleotides.</span></p>
<p><font size="3"><br />
+
 
</font></p>
+
<h3>&nbsp;</h3>
 +
 
 +
<p><span style="font-size:20px"><strong>References</strong></span></p>
 +
 
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
<h2><span class="mw-headline">Protein sequencing</span></h2>
+
 
<p><font size="3">Methods for performing protein sequencing include:</font></p>
+
<ol>
<ul>
+
<li><span style="font-size:14px">N/A. (2008). Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry. <em>Nature.</em> <em>456</em>(7218): 53&ndash;59.</span></li>
    <li><font size="3">Edman degradation </font></li>
+
<li><span style="font-size:14px">Xiaoge, G., Kevin, L., Karen, O., Jenny Z., Sandeep S. D., &amp; Sue J.R. (2015). SMRT Sequencing for Parallel Analysis of Multiple Targets and Accurate SNP Phasing. <em>G3</em>. g3.115.023317</span></li>
    <li><font size="3">Peptide mass fingerprinting </font></li>
 
    <li><font size="3">Mass spectrometry </font></li>
 
    <li><font size="3">Protease digests </font></li>
 
</ul>
 
<p><font size="3">If the gene encoding the protein can be identified it is currently much easier to sequence the DNA and infer the protein sequence. Determining part of a protein's amino-acid sequence (often one end) by one of the above methods may be sufficient to enable the identification of a clone carrying the gene.</font></p>
 
<p>&nbsp;</p>
 
<h2><span class="mw-headline">Polysaccharide sequencing</span></h2>
 
<p><font size="3">Though polysaccharides are also biopolymers, it is not so common to talk of 'sequencing' a polysaccharide, for several reasons. Although many polysaccharides are linear, many have branches. Many different units (individual monosaccharides) can be used, and bonded in different ways. However, the main theoretical reason is that whereas the other polymers listed here are primarily generated in a 'template-dependent' manner by one processive enzyme, each individual join in a polysaccharide may be formed by a different enzyme. In many cases the assembly is not uniquely specified; depending on which enzyme acts, one of several different units may be incorporated. This can lead to a family of similar molecules being formed. This is particularly true for plant polysaccharides. Methods for the structure determination of oligosaccharides and polysaccharides include NMR spectroscopy and methylation analysis<sup id="_ref-0" class="reference">[1]</sup>.</font></p>
 
<p>&nbsp;</p>
 
<h2><span class="mw-headline">See also</span></h2>
 
<ul>
 
    <li><font size="3">Genetic code </font></li>
 
    <li><font size="3">Sequence motif </font></li>
 
    <li><font size="3">[http://sequenceome.org Sequenceome.org] </font></li>
 
    <li><font size="3">[http://glycome.net Glycome.net]</font></li>
 
</ul>
 
<p><a id="References" name="References"></a></p>
 
<h2><span class="mw-headline">References</span></h2>
 
<div class="references-small">
 
<ol class="references">
 
    <li id="_note-0"><strong><a title="" href="http://en.wikipedia.org/wiki/Sequencing#_ref-0">^</a></strong> <a class="external text" title="http://www.stenutz.eu/sop" rel="nofollow" href="http://www.stenutz.eu/sop">A practical guide to structural analysis of carbohydrates</a></li>
 
 
</ol>
 
</ol>
</div>
+
 
 +
<div class="references-small">&nbsp;</div>

Revision as of 22:13, 10 December 2015

DNA Sequencing

 

 

What is Sequencing?

 

DNA sequencing is the process of determining the nucleotide order of a given DNA fragment. Thus far, most DNA sequencing has been performed using the chain termination method developed by Frederick Sanger. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as Pyrosequencing are gaining an increasing share of the sequencing market. More genome data is being produced by pyrosequencing than Sanger DNA sequencing these days. Pyrosequencing has enabled rapid genome sequencing. Bacterial genome can be sequenced in a single run with several X coverage with this technique. This technique was also used to sequence the genome of James Watson recently.

 

Why sequencing is important?

 

Genome sequencing is figuring out the order of DNA nucleotides, or bases. When you read a sentence, the meaning is not just in the sequence of the letters. It is also in the words those letters make and in the grammar of the language. Similarly, the human genome is more than just its sequence. Sequencing the genome is an important step towards understanding it. Scientists also hope that being able to study the entire genome sequence will help them understand how the genome works—how genes work together to direct the growth, development and maintenance of an entire organism. Finally, genes account for less than 25 percent of the DNA in the genome, and so knowing the entire genome sequence will help scientists study the parts of the genome outside the genes. This includes the regulatory regions that control how genes are turned on an off, as well as long stretches of junk DNA.

 

How sequencing is operated?

 

The whole genome can't be sequenced all at once because available methods of DNA sequencing can only handle short stretches of DNA at a time. So instead, scientists must break the genome into small pieces, sequence the pieces, and then reassemble them in the proper order. An automatic sequencing machine spits out what genome scientists call "raw" sequence. In raw sequence, the reads or short DNA sequences are all jumbled together. The process of transforming the fragmented rough draft into a long, continuous final product without breaks or errors is called finishing. Finishing often takes longer than the sequencing itself.

 

Which kinds of sequencing techniques are available?

 

To facilitate their full genome sequencing initiatives, “Illumina” licensed nanopore sequencing technology from Oxford Nanopore Technologies. Illumina sequencing system based on reversible terminators for sequence determination. It is an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Another possible way to accomplish cost-effective high-throughput sequencing is by utilizing fluorophore technology. Pacific Biosciences is currently using this approach in their SMRT (Single Molecule Real Time) DNA sequencing technology. SMRT sequencing is a harnesses the natural process of DNA replication and enables real-time observation of DNA synthesis by using phosphorlinked nucleotides.

 

References

 

  1. N/A. (2008). Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry. Nature. 456(7218): 53–59.
  2. Xiaoge, G., Kevin, L., Karen, O., Jenny Z., Sandeep S. D., & Sue J.R. (2015). SMRT Sequencing for Parallel Analysis of Multiple Targets and Accurate SNP Phasing. G3. g3.115.023317