Difference between revisions of "DATA extraction from FASTA"

Latest revision as of 21:38, 14 June 2016

FASTA format is an arranged sequence data file.

FASTA file starts from > in the first row.

This row contains basic information such as sequence ID.

1. Go to the NCBI (http://www.ncbi.nlm.nih.gov/) , National Center for Biotechnology Information

2. Choose the database that you want and search the information that you want to find

Ex) Database - Gene / Search - Homo sapiens chromosome 2 / Choose what you want to look at

3. Choose FASTA from the 'Go to nucleotide' section

4. Using <STDIN>, we can open the file in a perl program

Ex) print "Please type the filename to extract: ";

$DNAfilename = <STDIN>;

chomp $DNAfilename;

open(DNAFILE, $DNAfilename);

@FileData = <DNAFILE>;

close DNAFILE;

print @FileData;

exit;

@@ Line 22: / Line 22: @@
 <p>&nbsp;</p>
+<p>4. Using &lt;STDIN&gt;, we can open the file in a perl program</p>
+<p>Ex) print &quot;Please type the filename to extract: &quot;;</p>
+<p>$DNAfilename = &lt;STDIN&gt;;</p>
+<p>chomp $DNAfilename;</p>
+<p>open(DNAFILE, $DNAfilename);</p>
+<p>@FileData = &lt;DNAFILE&gt;;</p>
+<p>close DNAFILE;</p>
+<p>print @FileData;</p>
+<p>exit;</p>
 <h3><span style="background-color:rgb(245,245,255); color:rgb(0,0,0)">Reference</span></h3>
@@ Line 29: / Line 47: @@
 <p>&nbsp;</p>
-<p>[[<a href="http://biolecture.org/index.php/KSH_0608_Bioinformatics_with_Bioperl" title="KSH 0608 Bioinformatics with Bioperl">KSH_0608 Bioinformatics with Bioperl</a>]]</p>
+<p><a href="http://biolecture.org/index.php/KSH_0608_Bioinformatics_with_Bioperl" title="KSH 0608 Bioinformatics with Bioperl">KSH_0608 Bioinformatics with Bioperl</a></p>