Kernel: Python 2
File handling – The Hidden Message
The file "genome.fa" contains a 1 million bp piece of a bacterial genome
Find all open reading frames (ORFs) of at least 450 nucleotides (150 amino acids)
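A minimal sketch of loading the sequence, assuming "genome.fa" is a single-record FASTA file (header lines starting with ">", sequence wrapped over many lines). The helper name read_fasta is hypothetical; it accepts any iterable of lines, so an open file handle works, and it runs under both Python 2 and Python 3.

```python
def read_fasta(lines):
    """Join the sequence lines of a FASTA record into one uppercase string.

    Assumption: the input holds a single record, so every header line
    (starting with '>') is simply skipped.
    """
    parts = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith(">"):
            continue  # skip blank lines and headers
        parts.append(line.upper())
    return "".join(parts)


# Usage, assuming genome.fa sits next to the notebook:
# with open("genome.fa") as handle:
#     genome = read_fasta(handle)
```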
Remember an ORF can also be on the complementary strand!
An ORF starts with "ATG"
An ORF stops with "TAA", "TAG" or "TGA"
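One possible way to scan for ORFs, sketched under some assumptions the exercise leaves open: in each of the three reading frames an ORF runs from the first "ATG" after the previous stop codon up to and including the next in-frame stop, nested start codons are not reported separately, and the minimum length counts the stop codon. The complementary strand is handled by scanning the reverse complement. All function names here are hypothetical.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement; characters other than ACGT become N."""
    return "".join(COMPLEMENT.get(base, "N") for base in reversed(seq))


def find_orfs(seq, min_length=450):
    """Return ORFs (ATG through stop codon, inclusive) found on one strand."""
    orfs = []
    for frame in range(3):
        start = None  # position of the first ATG since the last stop codon
        for pos in range(frame, len(seq) - 2, 3):
            codon = seq[pos:pos + 3]
            if start is None:
                if codon == "ATG":
                    start = pos
            elif codon in STOP_CODONS:
                orf = seq[start:pos + 3]
                if len(orf) >= min_length:
                    orfs.append(orf)
                start = None  # look for the next start codon
    return orfs


def find_orfs_both_strands(seq, min_length=450):
    """Scan the given strand and its reverse complement."""
    return (find_orfs(seq, min_length)
            + find_orfs(reverse_complement(seq), min_length))
```

If the 450-nucleotide limit is meant to exclude the stop codon, pass min_length=453 instead.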
Translate each ORF into a single-letter amino acid sequence
ATG --> M
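A sketch of the translation step, assuming the standard genetic code (the bacterial code assigns the same amino acids to each codon). The table is built from the conventional "TCAG" codon ordering rather than typed out by hand; translate is a hypothetical helper that stops at the first stop codon and writes "X" for any codon it does not recognise.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    ("".join(codon), aa)
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
)


def translate(orf):
    """Translate a nucleotide string codon by codon until a stop codon."""
    protein = []
    for pos in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE.get(orf[pos:pos + 3], "X")  # X = unknown codon
        if aa == "*":
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)
```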
Sort the ORFs on length (large to small)
From each ORF, in sorted order, take the 25th AA
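The sorting and selection steps might look like the sketch below. It assumes the ORFs have already been translated, so sorting the proteins by length gives the same order as sorting the ORFs; ties keep their original order because Python's sort is stable, which is an assumption since the exercise does not say how to break ties. The name hidden_message is hypothetical.

```python
def hidden_message(proteins, position=25):
    """Sort proteins from long to short and join the residue at `position`.

    `position` is 1-based, so the 25th amino acid is index 24.
    Proteins shorter than `position` are skipped.
    """
    ordered = sorted(proteins, key=len, reverse=True)
    return "".join(p[position - 1] for p in ordered if len(p) >= position)
```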
What is the hidden message?
In [1]:
In [2]:
In [3]:
In [ ]:
In [ ]: