Difference between revisions of "Essay !4 - About Sequencing Code : KSI0004"

From Biolecture.org
imported>김상인
imported>김상인
 
Line 1: Line 1:
<p style="text-align: center;"><span style="font-size:36px"><strong>About Sequencinhg</strong></span></p>
+
<p>S1.4 Genomics Essay #4</p>
  
<p style="text-align: right;">Sangin Kim</p>
+
<p><a href="http://biolecture.org/index.php/Essay_!4_-_About_Sequencing_Code_:_KSI0004"><u>http://biolecture.org/index.php/Essay_!4_-_About_Sequencing_Code_:_KSI0004</u></a></p>
  
<p>In&nbsp;<a href="https://en.wikipedia.org/wiki/Genetics" title="Genetics">genetics</a>&nbsp;and&nbsp;<a href="https://en.wikipedia.org/wiki/Biochemistry" title="Biochemistry">biochemistry</a>,&nbsp;<strong>sequencing</strong>&nbsp;means to determine the&nbsp;<a href="https://en.wikipedia.org/wiki/Primary_structure" title="Primary structure">primary structure</a>&nbsp;(sometimes falsely called primary sequence) of an unbranched&nbsp;<a href="https://en.wikipedia.org/wiki/Biopolymer" title="Biopolymer">biopolymer</a>. Sequencing results in a symbolic linear depiction known as a&nbsp;<strong>sequence</strong>&nbsp;which succinctly summarizes much of the atomic-level structure of the sequenced molecule.</p>
+
<p>&nbsp;</p>
 +
 
 +
<p>Essay 4 &ndash; About Sequencing</p>
 +
 
 +
<p>Sangin Kim</p>
 +
 
 +
<p>&nbsp;</p>
 +
 
 +
<p>In genetics and biochemistry, sequencing means to determine the primary structure of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succinctly summarizes much of the atomic-level structure of the sequenced molecule.</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><span style="color:#800080"><strong>1. DNA sequencing&nbsp;</strong></span></p>
+
<p>DNA sequencing is the process of determining the nucleotide order of a given DNA fragment. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as pyrosequencing are gaining an increasing share of the sequencing market. More genome data are now being produced by pyrosequencing than Sanger DNA sequencing. Pyrosequencing has enabled rapid genome sequencing. Bacterial genomes can be sequenced in a single run with several times coverage with this technique.</p>
  
<p>DNA sequencing is the process of determining the nucleotide&nbsp;order of a given DNA fragment. So far, most DNA sequencing has been performed using the chain termination method developed by Fredrick Sanger. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as pyrosequencing are gaining an increasing share of the sequencing market. More genome data are now being produced by pyrosequencing than Sanger DNA sequencing. Pyrosequencing has enabled rapid genome sequencing. Bacterial genomes can be sequenced in a single run with several times coverage with this technique. This technique was also used to sequence the genome of James Watson recently</p>
+
<p>The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in fundamental research into why and how organisms live, as well as in applied subjects. Because of the key importance DNA has to living things, knowledge of DNA sequences are useful in practically any area of biological research. For example, in medicine it can be used to identify, diagnose, and potentially develop treatments for genetic diseases. Similarly, research into pathogens may lead to treatments for contagious diseases. Biotechnology is a burgeoning discipline, with the potential for many useful products and services.</p>
  
<p>The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in fundamental research into why and how organisms live, as well as in applied subjects. Because of the key importance DNA has to living things, knowledge of DNA sequences are useful in practically any area of biological research. For example, in medicine it can be used to identify, diagnose, and potentially develop treatments for genetic diseases. Similarly, research into pathogens may lead to treatments for contagious diseases. Biotechnology&nbsp;is a burgeoning discipline, with the potential for many useful products and services.</p>
+
<p>&nbsp;</p>
  
<p><strong>&nbsp;1.1 sanger sequencing</strong></p>
+
<p>1.1 sanger sequencing</p>
  
<p>In chain terminator sequencing (Sanger sequencing), extension is initiated at a specific site on the template DNA by using a short oligonucleotide &#39;primer&#39; complementary to the template at that region. The oligonucleotide primer is extended using a DNA polymeraase&nbsp;an enzyme that replicates DNA. Included with the primer and DNA polymerase are the four deoxynucleotide bases (DNA building blocks), along with a low concentration of a chain terminating nucleotide (most commonly a&nbsp;<strong>di-</strong>deoxynucleotide). Limited incorporation of the chain terminating nucleotide by the DNA polymerase results in a series of related DNA fragments that are terminated only at positions where that particular nucleotide is used. The fragments are then size-separated by electrophoresis in a slab polyacrylamide gel, or more commonly now, in a narrow glass tube (capillary) filled with a viscous polymer.</p>
+
<p>In chain terminator sequencing (Sanger sequencing), extension is initiated at a specific site on the template DNA by using a short oligonucleotide &#39;primer&#39; complementary to the template at that region. The oligonucleotide primer is extended using a DNA polymeraase an enzyme that replicates DNA. Included with the primer and DNA polymerase are the four deoxynucleotide bases (DNA building blocks), along with a low concentration of a chain terminating nucleotide (most commonly a di-deoxynucleotide). Limited incorporation of the chain terminating nucleotide by the DNA polymerase results in a series of related DNA fragments that are terminated only at positions where that particular nucleotide is used. The fragments are then size-separated by electrophoresis in a slab polyacrylamide gel, or more commonly now, in a narrow glass tube (capillary) filled with a viscous polymer.</p>
  
<p><img alt="" src="/ckfinder/userfiles/images/ssssss.jpg" style="height:332px; width:160px" /></p>
+
<p>&nbsp;</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><strong>&nbsp;1.2 Pyrosequencing</strong></p>
+
<p>1.2 Pyrosequencing</p>
  
<p>Pyrosequencing&nbsp;which was developed by P&aring;l Nyr&eacute;n and Mostafa Ronaghi, has been commercialized by Biotage (for low-throughput sequencing) and 454 Life Sciences (for high-throughput sequencing). The latter platform sequences roughly 100 megabases&nbsp;[now up to 400 megabases] in a seven-hour run with a single machine. In the array-based method (commercialized by 454 Life Sciences), single-stranded DNA is annealed to beads and amplified via EmPCR. These DNA-bound beads are then placed into wells on a fiber-optic chip along with enzymes which produce light in the presence of ATP. When free nucleotides are washed over this chip, light is produced as ATP is generated when nucleotides join with their complementary base pairs. Addition of one (or more) nucleotide(s) results in a reaction that generates a light signal that is recorded by the CCD camera in the instrument. The signal strength is proportional to the number of nucleotides, for example, homopolymer stretches, incorporated in a single nucleotide flow.&nbsp;</p>
+
<p>Pyrosequencing which was developed by P&aring;l Nyr&eacute;n and Mostafa Ronaghi, has been commercialized by Biotage (for low-throughput sequencing) and 454 Life Sciences (for high-throughput sequencing). The latter platform sequences roughly 100 megabases [now up to 400 megabases] in a seven-hour run with a single machine. In the array-based method (commercialized by 454 Life Sciences), single-stranded DNA is annealed to beads and amplified via EmPCR. These DNA-bound beads are then placed into wells on a fiber-optic chip along with enzymes which produce light in the presence of ATP. When free nucleotides are washed over this chip, light is produced as ATP is generated when nucleotides join with their complementary base pairs. Addition of one (or more) nucleotide(s) results in a reaction that generates a light signal that is recorded by the CCD camera in the instrument. The signal strength is proportional to the number of nucleotides, for example, homopolymer stretches, incorporated in a single nucleotide flow.</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><strong>&nbsp;1.3&nbsp;Large scale sequencing</strong></p>
+
<p>1.3 Large scale sequencing</p>
 +
 
 +
<p>Whereas the methods above describe various sequencing methods, separate related terms are used when a large portion of a genome is sequenced. Several platforms were developed to perform exome sequencing (a subset of all DNA across all chromosomes that encode genes) or whole genome sequencing (sequencing of the all nuclear DNA of a human).</p>
 +
 
 +
<p>&nbsp;</p>
  
<p>Whereas the methods above describe various sequencing methods, separate related terms are used when a large portion of a genome is sequenced. Several platforms were developed to perform exome sequencing (a subset of all DNA across all chromosomes that encode genes) or whole genome sequencing&nbsp;(sequencing of the all nuclear DNA of a human).</p>
+
<p>&nbsp;</p>
  
<p><strong>&nbsp;</strong></p>
+
<p>Taxonomy</p>
  
<p>&bull;<u>Sequencing</u>: determining the precise order of nucleotides in a DNA or RNA molecule</p>
+
<p>&bull;Sequencing: determining the precise order of nucleotides in a DNA or RNA molecule</p>
  
<p>&bull;<u>Sanger dideoxy method </u></p>
+
<p>&bull;Sanger dideoxy method</p>
  
 
<p>&bull;Invented by Nobel Prize winner Fred Sanger</p>
 
<p>&bull;Invented by Nobel Prize winner Fred Sanger</p>
Line 45: Line 57:
 
<p>&bull;Bases are labeled with radioactivity</p>
 
<p>&bull;Bases are labeled with radioactivity</p>
  
<p>&bull;Gel electrophoresis is then performed on products&nbsp;</p>
+
<p>&bull;Gel electrophoresis is then performed on products</p>
  
 
<p>&bull;Large-scale sequencing projects have led to automated DNA sequencing systems</p>
 
<p>&bull;Large-scale sequencing projects have led to automated DNA sequencing systems</p>
Line 51: Line 63:
 
<p>&bull;Based on Sanger method</p>
 
<p>&bull;Based on Sanger method</p>
  
<p>&bull;Radioactivity replaced by fluorescent dye&nbsp;</p>
+
<p>&bull;Radioactivity replaced by fluorescent dye</p>
  
 
<p>&bull;Virtually all genomic sequencing projects use shotgun sequencing</p>
 
<p>&bull;Virtually all genomic sequencing projects use shotgun sequencing</p>
Line 65: Line 77:
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><span style="color:#0000FF"><span style="font-size:18px"><strong>NGS(Next generation sequencing)</strong></span></span></p>
+
<p>NGS(Next generation sequencing)</p>
  
<p>&bull;<u>Second-generation DNA sequencing </u></p>
+
<p>&bull;Second-generation DNA sequencing</p>
  
 
<p>&bull;Generates data 100x faster than Sanger method</p>
 
<p>&bull;Generates data 100x faster than Sanger method</p>
  
<p>&bull;<u>Massively parallel methods</u></p>
+
<p>&bull;Massively parallel methods</p>
  
 
<p>&bull;Large number of samples sequenced side by side</p>
 
<p>&bull;Large number of samples sequenced side by side</p>
Line 83: Line 95:
 
<p>&bull;SOLID/Applied Biosystems method</p>
 
<p>&bull;SOLID/Applied Biosystems method</p>
  
<p>&bull;<u>454 sequencing system</u>&nbsp;</p>
+
<p>&bull;454 sequencing system</p>
  
 
<p>&bull;DNA is broken into small segments</p>
 
<p>&bull;DNA is broken into small segments</p>
Line 99: Line 111:
 
<p>&bull;Sequencing of single molecules of DNA</p>
 
<p>&bull;Sequencing of single molecules of DNA</p>
  
<p>&bull;<u><a href="https://www.youtube.com/watch?v=TboL7wODBj4">HeliScope Single Molecule Sequencer</a></u><a href="https://www.youtube.com/watch?v=TboL7wODBj4"> </a></p>
+
<p>&bull;HeliScope Single Molecule Sequencer</p>
  
 
<p>&bull;Single-stranded DNA fragments attached in array on glass slide</p>
 
<p>&bull;Single-stranded DNA fragments attached in array on glass slide</p>
Line 111: Line 123:
 
<p>&bull;Third-generation DNA sequencing</p>
 
<p>&bull;Third-generation DNA sequencing</p>
  
<p>&bull;<u><a href="https://www.youtube.com/watch?v=v8p4ph2MAvI">Pacific Biosciences SMRT</a></u><u> </u></p>
+
<p>&bull;Pacific Biosciences SMRT</p>
  
<p>&bull;<u>S</u>ingle <u>M</u>olecule <u>R</u>eal <u>T</u>ime sequencing</p>
+
<p>&bull;Single Molecule Real Time sequencing</p>
  
<p>&bull;Reactions carried out in nanocontainers (<u>zero mode wave guides</u>)</p>
+
<p>&bull;Reactions carried out in nanocontainers (zero mode wave guides)</p>
  
 
<p>&bull;Single-stranded DNA fragments attached</p>
 
<p>&bull;Single-stranded DNA fragments attached</p>
Line 131: Line 143:
 
<p>&bull;Optical detection no longer used</p>
 
<p>&bull;Optical detection no longer used</p>
  
<p>&bull;<a href="https://www.youtube.com/watch?v=WYBzbxIfuKs">Ion torrent semiconductor sequencing&nbsp;</a></p>
+
<p>&bull;Ion torrent semiconductor sequencing</p>
  
 
<p>&bull;Measures release of protons whenever a deoxyribonucleotide is added</p>
 
<p>&bull;Measures release of protons whenever a deoxyribonucleotide is added</p>
Line 145: Line 157:
 
<p>&bull;Fourth-generation DNA sequencing</p>
 
<p>&bull;Fourth-generation DNA sequencing</p>
  
<p>&bull;<a href="https://www.youtube.com/watch?v=3UHw22hBpAk">Oxford Nanopore Technologies</a> system&nbsp;</p>
+
<p>&bull;Oxford Nanopore Technologies system</p>
  
 
<p>&bull;Passes DNA through nanoscale biological pores</p>
 
<p>&bull;Passes DNA through nanoscale biological pores</p>
Line 159: Line 171:
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><span style="color:#800080"><strong>2. RNA sequencing&nbsp;</strong></span></p>
+
<p>2. RNA sequencing</p>
  
<p><a href="https://en.wikipedia.org/wiki/RNA" title="RNA">RNA</a>&nbsp;is less stable in the cell, and also more prone to nuclease attack experimentally. As RNA is generated by&nbsp;<a href="https://en.wikipedia.org/wiki/Transcription_(genetics)" title="Transcription (genetics)">transcription</a>&nbsp;from DNA, the information is already present in the cell&#39;s DNA. However, it is sometimes desirable to&nbsp;<a href="https://en.wikipedia.org/wiki/RNA-Seq" title="RNA-Seq">sequence RNA</a>&nbsp;molecules. While sequencing DNA gives a genetic profile of an organism, sequencing RNA reflects only the sequences that are actively&nbsp;<a href="https://en.wikipedia.org/wiki/Gene_expression" title="Gene expression">expressed</a>&nbsp;in the cells. To sequence RNA, the usual method is first to&nbsp;<a href="https://en.wikipedia.org/wiki/Reverse_transcriptase" title="Reverse transcriptase">reverse transcribe</a>&nbsp;the RNA extracted from the sample to generate cDNA fragments. This can then be sequenced as described above. The bulk of RNA expressed in cells are&nbsp;<a href="https://en.wikipedia.org/wiki/Ribosomal_RNA" title="Ribosomal RNA">ribosomal RNAs</a>&nbsp;or&nbsp;<a href="https://en.wikipedia.org/wiki/Small_RNA" title="Small RNA">small RNAs</a>, detrimental for cellular translation, but often not the focus of a study. This fraction can fortunately be removed&nbsp;<em>in vitro</em>, however, to enrich for the messenger RNA, also included, that usually&nbsp;<u>is</u>&nbsp;of interest. Derived from the&nbsp;<a href="https://en.wikipedia.org/wiki/Exon" title="Exon">exons</a>&nbsp;these mRNAs are to be later&nbsp;<a href="https://en.wikipedia.org/wiki/Translation" title="Translation">translated</a>&nbsp;to&nbsp;<a href="https://en.wikipedia.org/wiki/Protein" title="Protein">proteins</a>&nbsp;that support particular cellular functions. The&nbsp;<a href="https://en.wikipedia.org/wiki/Expression_profile" title="Expression profile">expression profile</a>&nbsp;therefore indicates cellular activity, particularly desired in the studies of diseases, cellular behaviour, responses to reagents or stimuli.&nbsp;<a href="https://en.wikipedia.org/wiki/Eukaryote" title="Eukaryote">Eukaryotic</a>&nbsp;RNA molecules are not necessarily&nbsp;<a href="https://en.wikipedia.org/wiki/Co-linear" title="Co-linear">co-linear</a>&nbsp;with their DNA template, as&nbsp;<a href="https://en.wikipedia.org/wiki/Intron" title="Intron">introns</a>&nbsp;are excised. This gives a certain complexity to map the read sequences back to the genome and thereby identify their origin. For more information on the capabilities of next-generation sequencing applied to whole&nbsp;<a href="https://en.wikipedia.org/wiki/Transcriptome" title="Transcriptome">transcriptomes</a>&nbsp;see:&nbsp;<a href="https://en.wikipedia.org/wiki/RNA-Seq" title="RNA-Seq">RNA-Seq</a>&nbsp;and&nbsp;<a href="https://en.wikipedia.org/wiki/MicroRNA_Sequencing" title="MicroRNA Sequencing">MicroRNA Sequencing</a>.</p>
+
<p>RNA is less stable in the cell, and also more prone to nuclease attack experimentally. As RNA is generated by transcription from DNA, the information is already present in the cell&#39;s DNA. However, it is sometimes desirable to sequence RNA molecules. While sequencing DNA gives a genetic profile of an organism, sequencing RNA reflects only the sequences that are actively expressed in the cells. To sequence RNA, the usual method is first to reverse transcribe the RNA extracted from the sample to generate cDNA fragments. This can then be sequenced as described above. The bulk of RNA expressed in cells are ribosomal RNAs or small RNAs, detrimental for cellular translation, but often not the focus of a study. This fraction can fortunately be removed in vitro, however, to enrich for the messenger RNA, also included, that usually is of interest. Derived from the exons these mRNAs are to be later translated to proteins that support particular cellular functions. The expression profile therefore indicates cellular activity, particularly desired in the studies of diseases, cellular behaviour, responses to reagents or stimuli. Eukaryotic RNA molecules are not necessarily co-linear with their DNA template, as introns are excised. This gives a certain complexity to map the read sequences back to the genome and thereby identify their origin. For more information on the capabilities of next-generation sequencing applied to whole transcriptomes see: RNA-Seq and MicroRNA Sequencing.</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><span style="color:#800080"><strong>3. Protein sequcing&nbsp;</strong></span></p>
+
<p>3. Protein sequcing</p>
  
<p>If the gene encoding the protein is known, it is currently much easier to sequence the DNA and infer the protein sequence. Determining part of a protein&#39;s&nbsp;<a href="https://en.wikipedia.org/wiki/Amino_acid" title="Amino acid">amino-acid</a>sequence (often one end) by one of the above methods may be sufficient to identify a&nbsp;<a href="https://en.wikipedia.org/wiki/Clone_(genetics)" title="Clone (genetics)">clone</a>&nbsp;carrying this gene.</p>
+
<p>If the gene encoding the protein is known, it is currently much easier to sequence the DNA and infer the protein sequence. Determining part of a protein&#39;s amino-acidsequence (often one end) by one of the above methods may be sufficient to identify a clone carrying this gene.</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
  
<p><span style="color:#800080"><strong>4. Polysacharride sequencing&nbsp;</strong></span></p>
+
<p>4. Polysacharride sequencing</p>
  
<p>Though&nbsp;<a href="https://en.wikipedia.org/wiki/Polysaccharide" title="Polysaccharide">polysaccharides</a>&nbsp;are also biopolymers, it is not so common to talk of &#39;sequencing&#39; a polysaccharide, for several reasons. Although many polysaccharides are linear, many have branches. Many different units (individual&nbsp;<a href="https://en.wikipedia.org/wiki/Monosaccharide" title="Monosaccharide">monosaccharides</a>) can be used, and&nbsp;<a href="https://en.wikipedia.org/wiki/Chemical_bond" title="Chemical bond">bonded</a>&nbsp;in different ways. However, the main theoretical reason is that whereas the other polymers listed here are primarily generated in a &#39;template-dependent&#39; manner by one processive enzyme, each individual join in a polysaccharide may be formed by a different&nbsp;<a href="https://en.wikipedia.org/wiki/Enzyme" title="Enzyme">enzyme</a>. In many cases the assembly is not uniquely specified; depending on which enzyme acts, one of several different units may be incorporated. This can lead to a family of similar molecules being formed. This is particularly true for plant polysaccharides. Methods for the&nbsp;<a href="https://en.wikipedia.org/wiki/Structure_determination" title="Structure determination">structure determination</a>&nbsp;of&nbsp;<a href="https://en.wikipedia.org/wiki/Oligosaccharide" title="Oligosaccharide">oligosaccharides</a>&nbsp;and&nbsp;<a href="https://en.wikipedia.org/wiki/Polysaccharide" title="Polysaccharide">polysaccharides</a>include&nbsp;<a href="https://en.wikipedia.org/wiki/NMR" title="NMR">NMR</a>&nbsp;spectroscopy and&nbsp;<a href="https://en.wikipedia.org/w/index.php?title=Methylation_analysis&amp;action=edit&amp;redlink=1" title="Methylation analysis (page does not exist)">methylation analysis</a>.</p>
+
<p>Though polysaccharides are also biopolymers, it is not so common to talk of &#39;sequencing&#39; a polysaccharide, for several reasons. Although many polysaccharides are linear, many have branches. Many different units (individual monosaccharides) can be used, and bonded in different ways. However, the main theoretical reason is that whereas the other polymers listed here are primarily generated in a &#39;template-dependent&#39; manner by one processive enzyme, each individual join in a polysaccharide may be formed by a different enzyme. In many cases the assembly is not uniquely specified; depending on which enzyme acts, one of several different units may be incorporated. This can lead to a family of similar molecules being formed. This is particularly true for plant polysaccharides. Methods for the structure determination of oligosaccharides and polysaccharidesinclude NMR spectroscopy and methylation analysis.</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>
Line 183: Line 195:
 
<p>1. https://www.youtube.com/watch?v=jFCD8Q6qSTM</p>
 
<p>1. https://www.youtube.com/watch?v=jFCD8Q6qSTM</p>
  
<p>2. Brook biology of microorganisms 14th edition.&nbsp;</p>
+
<p>2. Brook biology of microorganisms 14th edition.</p>
 +
 
 +
<p>&nbsp;</p>
 +
 
 +
<p>&nbsp;</p>
  
 
<p>&nbsp;</p>
 
<p>&nbsp;</p>

Latest revision as of 03:26, 3 December 2016

S1.4 Genomics Essay #4

http://biolecture.org/index.php/Essay_!4_-_About_Sequencing_Code_:_KSI0004

 

Essay 4 – About Sequencing

Sangin Kim

 

In genetics and biochemistry, sequencing means to determine the primary structure of an unbranched biopolymer. Sequencing results in a symbolic linear depiction known as a sequence which succinctly summarizes much of the atomic-level structure of the sequenced molecule.

 

DNA sequencing is the process of determining the nucleotide order of a given DNA fragment. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as pyrosequencing are gaining an increasing share of the sequencing market. More genome data are now being produced by pyrosequencing than Sanger DNA sequencing. Pyrosequencing has enabled rapid genome sequencing. Bacterial genomes can be sequenced in a single run with several times coverage with this technique.

The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in fundamental research into why and how organisms live, as well as in applied subjects. Because of the key importance DNA has to living things, knowledge of DNA sequences are useful in practically any area of biological research. For example, in medicine it can be used to identify, diagnose, and potentially develop treatments for genetic diseases. Similarly, research into pathogens may lead to treatments for contagious diseases. Biotechnology is a burgeoning discipline, with the potential for many useful products and services.

 

1.1 sanger sequencing

In chain terminator sequencing (Sanger sequencing), extension is initiated at a specific site on the template DNA by using a short oligonucleotide 'primer' complementary to the template at that region. The oligonucleotide primer is extended using a DNA polymeraase an enzyme that replicates DNA. Included with the primer and DNA polymerase are the four deoxynucleotide bases (DNA building blocks), along with a low concentration of a chain terminating nucleotide (most commonly a di-deoxynucleotide). Limited incorporation of the chain terminating nucleotide by the DNA polymerase results in a series of related DNA fragments that are terminated only at positions where that particular nucleotide is used. The fragments are then size-separated by electrophoresis in a slab polyacrylamide gel, or more commonly now, in a narrow glass tube (capillary) filled with a viscous polymer.

 

 

1.2 Pyrosequencing

Pyrosequencing which was developed by Pål Nyrén and Mostafa Ronaghi, has been commercialized by Biotage (for low-throughput sequencing) and 454 Life Sciences (for high-throughput sequencing). The latter platform sequences roughly 100 megabases [now up to 400 megabases] in a seven-hour run with a single machine. In the array-based method (commercialized by 454 Life Sciences), single-stranded DNA is annealed to beads and amplified via EmPCR. These DNA-bound beads are then placed into wells on a fiber-optic chip along with enzymes which produce light in the presence of ATP. When free nucleotides are washed over this chip, light is produced as ATP is generated when nucleotides join with their complementary base pairs. Addition of one (or more) nucleotide(s) results in a reaction that generates a light signal that is recorded by the CCD camera in the instrument. The signal strength is proportional to the number of nucleotides, for example, homopolymer stretches, incorporated in a single nucleotide flow.

 

1.3 Large scale sequencing

Whereas the methods above describe various sequencing methods, separate related terms are used when a large portion of a genome is sequenced. Several platforms were developed to perform exome sequencing (a subset of all DNA across all chromosomes that encode genes) or whole genome sequencing (sequencing of the all nuclear DNA of a human).

 

 

Taxonomy

•Sequencing: determining the precise order of nucleotides in a DNA or RNA molecule

•Sanger dideoxy method

•Invented by Nobel Prize winner Fred Sanger

•Dideoxy analogs of dNTPs used in conjunction with dNTPs

•Analog prevents further extension of DNA chain

•Bases are labeled with radioactivity

•Gel electrophoresis is then performed on products

•Large-scale sequencing projects have led to automated DNA sequencing systems

•Based on Sanger method

•Radioactivity replaced by fluorescent dye

•Virtually all genomic sequencing projects use shotgun sequencing

•Entire genome is cloned, and resultant clones are sequenced

•Much of the sequencing is redundant

•Generally 7- to 10-fold coverage

•Computer algorithms are used to look for replicate sequences and assemble them

 

NGS(Next generation sequencing)

•Second-generation DNA sequencing

•Generates data 100x faster than Sanger method

•Massively parallel methods

•Large number of samples sequenced side by side

•Uses increased computer power and miniaturization

•454 Life Sciences pyrosequencing

•Illumina/Solexa sequencing

•SOLID/Applied Biosystems method

•454 sequencing system

•DNA is broken into small segments

•DNA is amplified using polymerase chain reaction (PCR)

•Light is released each time a base is added to DNA strand

•Instrument actually measures release of light

•Can handle only short stretches of DNA strand

•Third-generation DNA sequencing

•Sequencing of single molecules of DNA

•HeliScope Single Molecule Sequencer

•Single-stranded DNA fragments attached in array on glass slide

•Complementary strand synthesized

•Fluorescent tags monitored on microscope

•Computer assembles fragments into sequence

•Third-generation DNA sequencing

•Pacific Biosciences SMRT

•Single Molecule Real Time sequencing

•Reactions carried out in nanocontainers (zero mode wave guides)

•Single-stranded DNA fragments attached

•Complementary strand synthesized

•Fluorescent tags monitored

•Computer assembles fragments into sequence

 

•Fourth-generation DNA sequencing

•Optical detection no longer used

•Ion torrent semiconductor sequencing

•Measures release of protons whenever a deoxyribonucleotide is added

•Silicon chip measures "pH"

•Extremely fast

•Less expensive

 

•Fourth-generation DNA sequencing

•Oxford Nanopore Technologies system

•Passes DNA through nanoscale biological pores

•Detector measures change in electric current

•Extremely fast

•Measures long chains of DNA

 

 

2. RNA sequencing

RNA is less stable in the cell, and also more prone to nuclease attack experimentally. As RNA is generated by transcription from DNA, the information is already present in the cell's DNA. However, it is sometimes desirable to sequence RNA molecules. While sequencing DNA gives a genetic profile of an organism, sequencing RNA reflects only the sequences that are actively expressed in the cells. To sequence RNA, the usual method is first to reverse transcribe the RNA extracted from the sample to generate cDNA fragments. This can then be sequenced as described above. The bulk of RNA expressed in cells are ribosomal RNAs or small RNAs, detrimental for cellular translation, but often not the focus of a study. This fraction can fortunately be removed in vitro, however, to enrich for the messenger RNA, also included, that usually is of interest. Derived from the exons these mRNAs are to be later translated to proteins that support particular cellular functions. The expression profile therefore indicates cellular activity, particularly desired in the studies of diseases, cellular behaviour, responses to reagents or stimuli. Eukaryotic RNA molecules are not necessarily co-linear with their DNA template, as introns are excised. This gives a certain complexity to map the read sequences back to the genome and thereby identify their origin. For more information on the capabilities of next-generation sequencing applied to whole transcriptomes see: RNA-Seq and MicroRNA Sequencing.

 

3. Protein sequcing

If the gene encoding the protein is known, it is currently much easier to sequence the DNA and infer the protein sequence. Determining part of a protein's amino-acidsequence (often one end) by one of the above methods may be sufficient to identify a clone carrying this gene.

 

4. Polysacharride sequencing

Though polysaccharides are also biopolymers, it is not so common to talk of 'sequencing' a polysaccharide, for several reasons. Although many polysaccharides are linear, many have branches. Many different units (individual monosaccharides) can be used, and bonded in different ways. However, the main theoretical reason is that whereas the other polymers listed here are primarily generated in a 'template-dependent' manner by one processive enzyme, each individual join in a polysaccharide may be formed by a different enzyme. In many cases the assembly is not uniquely specified; depending on which enzyme acts, one of several different units may be incorporated. This can lead to a family of similar molecules being formed. This is particularly true for plant polysaccharides. Methods for the structure determination of oligosaccharides and polysaccharidesinclude NMR spectroscopy and methylation analysis.

 

 

Reference

1. https://www.youtube.com/watch?v=jFCD8Q6qSTM

2. Brook biology of microorganisms 14th edition.