Internship report

From Biolecture.org

    UNIST UNDERGRADUATE SCHOOL

     School of Life Science

 

 

RESEARCH INTERNSHIP REPORT

 

 

 

Student:      ___20132023____    ___Madina           _  Seidualy____

                  Student ID                 First name                  Last name

 

Track:      ____BIO ___ _     _____BME_ ___          Grade:  __Senior ___

                1 Track               2 Track

 

Advisor:   ____SLS_ ___ __    ___Jong Hwa_____    ____Bhak ____

                School                    First name                   Last name

 

 

Starting Date:     12/26/2016 (mm/dd/yyyy)

Ending Date:   02/20/2017 (mm/dd/yyyy)

 

 

 

 

 

  • The Genomic Lab

  The Genomics lab was established in July 2014 by a group of researchers in Korea.

The genomics lab is a team of dedicated people who want to understand the structure and function of the genomes on Earth. The main application area is anti-cancer and anti-aging. The topics are unlimited. TGL belongs to BME (Biomedical Engineeing department of UNIST) in SLS (School of Life Science).

Professor Bhak Jong Hwa is a director of the The genomic Institute in the UNIST. I had a chance to get an experience in this lab for two months. The Genomic Institute run diverse projects like whole genome sequencing, transcriptome analysis and etc..

 Currently, TGI doing genome sequencing analysis of whale shark, coral genome and jellyfish. Moreover, TGI is not confined on sequencing of the living species, either do relevance projects concerning problems in society, and trying to identify some genetic markers for those characteristic issues.

 

 

 

  • Performance in the Lab

Time

Tasks

Accomplishment

1st Week

Introduction to Lab, learning general tools, introduction to genomic databases

I’ve learned basic multiple alignment programs, and compute protein alignments. I’ve constructed phylogenetic tree and studied about genome browsers.

2nd Week

Literature search about candidate genes of the Whale shark

By using information from genome browsers (DAVID, Ensembl, NCBI, Unipot) have collected data about positively selected genes of Whale Shark. I’ve found out fascinating facts about whale shark through protein analysis.

3rd Week

Literature search about candidate genes of the Whale shark

Close search about HP1BP3, and did multiple sequence alignments via MEGA7. I’ve recognized W2 domain gain in HP1BP3 through InterPro database.

4th Week

Introduction to Bioinformatics, learning Perl

I’ve introduced to Linux Terminal, and started study on Perl.

5th Week

Perl coding, constructing HeatMap using R

Doing simple calculations in DNA, protein data in terminal through by writing my own coding scripts. Learned and build HeatMap in R studio.

6th Week

Calculation of the length of each scaffolds with certain gene in it from whale shark genome data.

Through tasks improved coding skills, and got used to genomic data. I’ve started using BioPerl.

7th Week

Mapping paired end reads to the reference genome of the Whale Shark

Study on DNA sequencing processes. In order to sequence alignment against a large reference genome learned about BWA and corresponding steps.

8th Week

Learning about various integrative analyses for multi omics data

Read paper about “Integrated Genomic Characterization of Papillary Thyroid Carcinoma”. I’ve learned about Circos program and other tool for integrating genomic data.

 

 

 

 

 

 

 

 

 

  • Self-Evaluation

 During the two month of internship in The Genomic Institute I’ve learned important values of the genomics. Before doing internship, I had known diverse things and learned plenty of stuffs in research area but I had had difficulties in correlating them. All seemed to me complicated. However, after having internship experience, I organized every research step in sequential order, and I could make whole picture of it. Now I know well how genome analysis have done. As an example, in below table I provided brief summary of the main genomic analysis I’ve learned so far:

STEPS

EXPLANATION

FORMATS&TOOLS

   Raw Data

   From Illumina sequencing

    .bcl format  (base cells per cycle)

  Demultiplexing

 Changing format from .bcl -> fastq

 .fastq (+quality scores)

Downstream alignment

 Sequence alignment to reference genome or creating de novo assembly.

 Fastq->SAM

 Sam ->bam (univercal file format for mapped seq. reads)

BWA, SOAPdenovo

  Variant Calling

  find SNP,SNV,INDELS

 SAMtools, GATK

Visualization of DATA

Integrative genomic viewer,

UCSC Genome browser

  Circos,Heatmap,etc

  

In above table, in each step there are plenty of tools used for, which I have to learn in detail in near future.

 In addition, I’ve acknowledged that Bioinformatics is the main tool in genomic analysis. I cannot imagine how would researches in big mass of information of the DNA and protein would computed without programming codes. I have strong desire to fortify my bioinformatics skill further in order to do rewarding researches, and correctly interpret biological data.

 Since DNA carries all genetic instructions used in growth, development, functioning and reproduction of all living organisms, fascinating thing is that, while conducting researches in genomic lab you learn, work and know about everything every day. Compare to laboratories, where researches are confined in specific topic Genomic department researchers observe everything and create big pictures of the life. For a student like me who has big passion in learning about living organisms it was best experiences.