Personal genomics, bioinformatics, and variomics
Revision as of 17:53, 16 December 2008
Jong Bhak, Ho Ghang, Rohit Reja, and Sangsoo Kim
Abstract
At least five complete human genome sequences are available in 2008. The dbSNP database holds over 15,000,000 genetic variants known as single nucleotide polymorphisms (SNPs). Claims that a full genome can be sequenced for less than $5,000 USD are expected in 2009. The genomics era has arrived. This review introduces sequencing technologies, bioinformatics, visions for genomics, and variomics projects. Variomics is the study of the total genetic variation in individuals and populations. Research on genetic variation is arguably the most valuable of the many branches of genomics research. Genomics and variomics projects will change biology and society so dramatically that biology will become an everyday technology, like personal computers and the internet. 'BioRevolution' is a term that adequately describes this change.
Introduction
Since the US National Institutes of Health (NIH) launched the Human Genome Project (HGP) in 1990, researchers have been developing faster DNA sequencers (Shendure, Mitra et al. 2004; Chan 2005; Metzker 2005; Gupta 2008; Mardis 2008). In its early years the HGP was led by James Watson, who modeled the structure of DNA in Cambridge, UK, in 1953. In 2003, the International Human Genome Sequencing Consortium held a press conference to announce the completion of the human genome (IHGSC 2004). In 2007, Craig Venter, the founder of Celera, published his own personal genome in PLoS Biology (Levy, Sutton et al. 2007). In 2008, 55 years after the DNA model, Watson's complete genome sequence, produced with commercially developed 454 DNA sequencers, was published (Wheeler, Srinivasan et al. 2008). With the advent of next-generation sequencing technologies, we are entering the era of personalized biology.
DNA sequencing
The first breakthrough in genome sequencing came from Fred Sanger, Watson's colleague in Cambridge. In 1977, Sanger and his team produced the first practical DNA sequencing method and published the first complete genome, that of a tiny virus known as phi X 174 (Sanger, Air et al. 1977). Soon afterwards, his group published the first complete organelle genome, that of the mitochondrion (Anderson, Bankier et al. 1981). By 1998, researchers in the US had evaluated multiplex genome sequencing technologies and were aware that one person's whole genome could be sequenced in one day using contemporary technologies. George Church was a Ph.D. student of Walter Gilbert, who shared a Nobel Prize with Sanger for developing a sequencing method. Gilbert's method did not come into wide use, but Church kept developing sequencing methods. One of them is based on the polony idea (Porreca, Shendure et al. 2006), and this technology is used by KNOME Inc., a full genome sequencing company. Genome sequencing technology is heading toward the kind of universal use that computer CPUs have today. Because it is used constantly and will keep finding new applications, DNA sequencing is one of the most important industrial technologies in biology.
Personal Genomics
In 2009, genome sequencing technologies will reach a throughput of one person's whole genome per day, measured in DNA fragments sequenced. Personal genomics is a new term for the field that such fast sequencers make possible. In 2008, one personal genome costs less than $300,000 USD. If the cost falls below $1,000 USD, personal genomics is predicted to have the largest impact on ordinary people's lives of any development in biology. The Personal Genome Project (PGP) aims to sequence as many people as possible at low cost (Church 2005). Google Inc. and the Church group are working together to sequence the genetic regions of 100,000 people's DNA. In Saudi Arabia, the government is planning to sequence 100 Arab people. In Europe, various groups and nations have been genotyping their populations. Iceland has been especially successful in that effort by using its well-kept genealogical data, which cover hundreds of thousands of people. In Asia, Jeongsun Seo of Seoul National University has been working on the East Asia Genome Project in recent years. His group collected thousands of samples from Mongolian tribes, together with a gigantic genealogical tree connecting them (Park et al. 2008; Sung et al. 2008). Seo is planning to sequence at least 100 Korean genomes in collaboration with Church and Green Cross Inc. of Korea. The aim of Seo's genome project is to produce a resource for East Asians as well as Koreans. He is presently sequencing at least two Korean people. In China, the Beijing Genomics Institute has been successful in sequencing. Its first achievement was a plant genome, that of rice. After rice, it launched a project to sequence 99 Han Chinese genomes. In November 2008, it published the first Chinese genome in the journal Nature. In December 2008, another Korean group, the Lee Gilyeo Cancer and Diabetes Institute and the Korean Bioinformation Center (KOBIC), made a Korean genome sequence public. The genome was sequenced with a Solexa paired-end sequencer, and the comparative genomics analyses and SNP data were uploaded as a public resource for everyone.
Genome revolution
These public genome data, alongside the previously published genomes of Craig Venter and James Watson, show that full genome sequencing is no longer confined to academia. Anyone with the money and the will can have a human genome sequenced. This 'genomic revolution' will eventually lead to the 'BioRevolution' by making the most essential human information completely mapped and publicly available. This is revolutionary because humans can now engineer themselves with a map or blueprint, instead of relying directly on the trial and error of conventional evolution. It indicates that evolution has reached a stage at which it is consciously driven. It is almost like designing evolution with computers.
Genomes and personalized medicine
One consequence of the 'BioRevolution', in which scientists use genomic information to engineer all kinds of biological processes, including evolution itself, will be personalized medicine. The essence of personalized medicine is that enzymes in our tissues, such as cytochrome P450, differ distinctly among individuals and populations. As a result, the same drug can produce different responses in different individuals.
Cytochrome P450 family example
With respect to drug metabolism by cytochrome P450 enzymes, individuals can be grouped into four categories:
- Extensive metabolizers: Individuals who can be given the normal drug dosage.
- Intermediate metabolizers: Individuals who metabolize a drug at a rate slower than normal.
- Poor metabolizers: Individuals with a poor metabolizing rate. Drugs may accumulate and cause serious adverse effects.
- Ultra metabolizers: Individuals with a metabolizing rate faster than that of extensive metabolizers. They may experience no effect from the drug.
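The four categories above can be written down as a small lookup table. The following is only a minimal sketch: the names `Metabolizer`, `EXPECTED_RESPONSE`, and `expected_response` are hypothetical, the notes simply paraphrase the list, and nothing here is dosing guidance.

```python
from enum import Enum


class Metabolizer(Enum):
    """The four metabolizer categories from the list above."""
    POOR = "poor"
    INTERMEDIATE = "intermediate"
    EXTENSIVE = "extensive"
    ULTRA = "ultra"


# Qualitative notes paraphrased from the list; illustrative only.
EXPECTED_RESPONSE = {
    Metabolizer.POOR: "drug may accumulate and cause serious adverse effects",
    Metabolizer.INTERMEDIATE: "drug is metabolized more slowly than normal",
    Metabolizer.EXTENSIVE: "normal dosage can be administered",
    Metabolizer.ULTRA: "drug may be metabolized too fast to have an effect",
}


def expected_response(category: Metabolizer) -> str:
    """Return the qualitative response note for a metabolizer category."""
    return EXPECTED_RESPONSE[category]
```

In practice the category would have to be inferred from a person's cytochrome P450 genotype, which this sketch does not attempt.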
Variomics
Asian Variome Project (AVP)
In association with eIMBL, A-IMBN, and HVP, a variome project that aims to map the variome of Asian populations was launched in 2008. It was a group effort by Korean researchers interested in genome sequences, SNPs, and copy number variations (CNVs). They formed the Korean Variome Consortium (KOVAC: http://variome.kr) and supported AVP as one of its first projects. eIMBL, a virtual laboratory network in Asia that links key groups of biologists and is modeled after EMBL, acquired $80,000 USD in 2008 to support AVP. eIMBL aims to establish a virtual bioinformatics center in the Asia-Pacific region that links the many scientists in Asia who process biological information.
Bioinformatics for personal genomes and variomes
Bioinformatics is the key to personal genome projects and variome projects. It is not merely a set of tools but a scientific discipline in its own right. It regards life as a gigantic information-processing phenomenon and tries to map its components and the networks that emerge from them. In 2008, bioinformatics is turning biology into an information science. Most biological research now involves massive amounts of data that cannot be processed by hand. Nearly all biological research outcomes in the next five years will include some form of high-throughput data, such as genome sequences, microarray data, proteome analyses, SNPs, epigenome chips, and large-scale phenotype mapping. Bioinformatics tools for genomics and variomics can be found in various internet resources. Bioinformatics hubs include NCBI (National Center for Biotechnology Information), EBI (European Bioinformatics Institute), DDBJ (DNA Data Bank of Japan), and KOBIC. Others are the Bioinformatics Organization (http://Bioinformatics.Org), EMBnet (http://www.embnet.org/), and the International Society for Computational Biology (http://iscb.org). The following are major bioinformatics journals:
Algorithms in Molecular Biology (http://www.almob.org/)
Bioinformatics (http://bioinformatics.oxfordjournals.org/)
BMC Bioinformatics (http://www.biomedcentral.com/bmcbioinformatics)
Briefings in Bioinformatics (http://bib.oxfordjournals.org/)
Genome Research (http://genome.cshlp.org/)
Genomics and Informatics (http://kogo.or.kr)
The International Journal of Biostatistics (http://www.bepress.com/ijb/)
Journal of Computational Biology (http://www.liebertpub.com/Products/Product.aspx?pid=31&AspxAutoDetectCookieSupport=1)
Cancer Informatics (http://www.la-press.com/journal.php?pa=description&journal_id=10)
Molecular Systems Biology (http://www.nature.com/msb/index.html)
PLoS Computational Biology (http://www.ploscompbiol.org/home.action)
International Journal of Bioinformatics Research and Applications (http://www.inderscience.com/browse/index.php?journalcode=ijbra)
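As a small illustration of the data processing that this section says cannot be done by hand, the sketch below lists single-base differences between a reference sequence and a sample sequence. It is a toy under strong assumptions: the two sequences are already aligned, have equal length, and contain no insertions or deletions. The function name `find_snps` and the example sequences are invented for illustration, and real variant calling from sequencing reads is far more involved.

```python
from typing import List, Tuple

BASES = set("ACGT")


def find_snps(reference: str, sample: str) -> List[Tuple[int, str, str]]:
    """List single-base differences between two pre-aligned sequences.

    Returns (1-based position, reference base, sample base) tuples.
    Assumes equal length and no insertions or deletions.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    snps = []
    for position, (ref_base, alt_base) in enumerate(
        zip(reference.upper(), sample.upper()), start=1
    ):
        # Skip ambiguous calls such as 'N' instead of reporting them as variants.
        if ref_base in BASES and alt_base in BASES and ref_base != alt_base:
            snps.append((position, ref_base, alt_base))
    return snps


# Toy sequences invented for illustration.
print(find_snps("ACGTACGT", "ACGTTCGT"))  # prints [(5, 'A', 'T')]
```

Applied across a whole genome and many individuals, comparisons of this kind are what produce the SNP collections discussed earlier in this review.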
Sequencing DNA, Metagenomics, and Ecogenomics
Next-generation sequencing methods are not limited to mapping genomes. They can also be used to map the environment. For humans, the environment includes the various microbial, plant, and animal interactions around us. Microbial interactions are especially critical to our health, and gut bacteria are a natural environment for us. Metagenomics is a methodology that sequences the whole set of microbes in, for example, our digestive tract. Researchers are realizing that the human genome is complemented by such environmental genomes. A new term, 'ecogenomics', is now used to describe these concepts. Metagenomics and ecogenomics map the variation of environmental genetic factors.
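One simple way to summarize a metagenomic sample is to tally how many sequenced reads were assigned to each microbe. The sketch below assumes that each read has already been assigned a taxon label by some upstream classifier, which is not shown. The function name `relative_abundance` and the example genus names are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable


def relative_abundance(read_taxa: Iterable[str]) -> Dict[str, float]:
    """Return the fraction of reads assigned to each taxon."""
    counts = Counter(read_taxa)
    total = sum(counts.values())
    if total == 0:
        # No reads means no composition to report.
        return {}
    return {taxon: n / total for taxon, n in counts.items()}


# Invented read assignments for illustration.
sample = ["Bacteroides", "Bacteroides", "Escherichia", "Lactobacillus"]
profile = relative_abundance(sample)
print(profile["Bacteroides"])  # prints 0.5
```

Comparing such profiles between individuals is one way to look at variation in the environmental genetic factors described above.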
Mapping expression using DNA sequencing
From 2009 onwards, personal genome projects will produce unprecedented amounts of biological data, and new bioinformatics technologies will be required to handle them. New sequencing technologies will drive biology and transform medical practice in hospitals over the coming decades. Fast sequencing has also brought unexpected and interesting applications such as metagenomics and ecogenomics. In this review, we have examined the current trends in genomics and variomics.