GENSCAN 1.0 Date run: 6-Nov-116 Time: 21:44:31 Sequence gi568815588r:79457221_79659483 : 202263 bp : 43.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3192 3258 67 2 1 50 74 91 0.599 5.15 1.02 Term + 10971 11128 158 1 2 109 44 78 0.604 3.60 1.03 PlyA + 11858 11863 6 1.05 2.04 PlyA - 12798 12793 6 1.05 2.03 Term - 13031 12976 56 0 2 109 54 37 0.009 0.12 2.02 Intr - 30978 30889 90 1 0 107 78 -2 0.175 0.67 2.01 Init - 34257 34197 61 0 1 73 89 26 0.252 0.66 2.00 Prom - 35466 35427 40 -1.56 3.00 Prom + 43431 43470 40 -6.56 3.01 Sngl + 49228 49611 384 0 0 63 48 624 0.680 52.09 3.02 PlyA + 49614 49619 6 -10.57 4.06 PlyA - 49626 49621 6 -10.46 4.05 Term - 50130 49648 483 0 0 74 37 866 0.925 74.85 4.04 Intr - 50340 50194 147 0 0 76 -108 231 0.585 2.73 4.03 Intr - 50565 50368 198 0 0 111 -108 338 0.712 16.05 4.02 Intr - 51002 50910 93 2 0 100 56 21 0.534 0.26 4.01 Init - 51190 51185 6 1 0 64 77 8 0.145 -2.25 4.00 Prom - 52380 52341 40 -6.16 5.00 Prom + 53691 53730 40 -8.76 5.01 Sngl + 55430 55894 465 1 0 53 36 623 0.955 49.75 5.02 PlyA + 56509 56514 6 1.05 6.04 PlyA - 67183 67178 6 1.05 6.03 Term - 67829 67734 96 2 0 43 43 80 0.105 -3.03 6.02 Intr - 69412 69293 120 1 0 130 113 85 0.977 15.89 6.01 Init - 74124 74089 36 0 0 56 101 40 0.729 0.11 6.00 Prom - 74700 74661 40 -9.26 7.00 Prom + 74809 74848 40 -7.66 7.01 Sngl + 78415 79200 786 0 0 86 47 221 0.962 13.76 7.02 PlyA + 80014 80019 6 1.05 8.05 PlyA - 81367 81362 6 1.05 8.04 Term - 100365 99989 377 1 2 85 54 465 0.902 37.70 8.03 Intr - 100909 100832 78 2 0 101 94 76 0.984 9.02 8.02 Intr - 101785 101666 120 2 0 114 99 50 0.755 9.17 8.01 Init - 102263 102092 172 2 1 71 84 180 0.908 13.55 8.00 Prom - 121541 121502 40 -5.46 9.05 PlyA - 122595 122590 6 1.05 9.04 Term - 125439 125159 281 1 2 8 48 272 0.071 10.81 9.03 Intr - 127156 127040 117 2 0 86 110 11 0.121 3.54 9.02 Intr - 138437 138156 282 0 0 82 43 300 0.086 22.29 9.01 Init - 138515 138500 16 2 1 47 94 15 0.301 -1.56 9.00 Prom - 143247 143208 40 -7.16 10.00 Prom + 143660 143699 40 -3.56 10.01 Init + 154606 154777 172 0 1 71 84 122 0.363 8.91 10.02 Intr + 155092 155211 120 2 0 114 99 95 0.860 13.67 10.03 Intr + 155969 156046 78 0 0 101 94 75 0.989 8.92 10.04 Term + 156517 156893 377 2 2 87 54 467 0.947 38.10 10.05 PlyA + 158181 158186 6 1.05 11.03 PlyA - 158736 158731 6 1.05 11.02 Term - 163848 163755 94 2 1 107 47 33 0.602 -1.60 11.01 Init - 164614 164196 419 1 2 73 53 141 0.722 5.00 11.00 Prom - 164812 164773 40 -1.06 12.03 PlyA - 166532 166527 6 1.05 12.02 Term - 175969 175904 66 1 0 74 40 108 0.554 2.54 12.01 Init - 180690 180556 135 0 0 74 87 46 0.491 3.24 12.00 Prom - 181010 180971 40 -2.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 138437 138112 326 0 2 82 48 332 0.807 23.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_1|74_aa MSFLSLHQKQMLALCFTYSLQNSLSLHCTLAAAPDLHTLEKEGLGFFHFQPSESEGRTLP NPACPEVKEAPEEG >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_1|225_bp atgagcttcctgagtctccaccagaagcagatgctggccctatgcttcacgtacagcctg cagaactctctctccctgcactgcacactggcagcagctccagacctgcacaccctggag aaggaaggattgggattcttccacttccaacccagtgaatctgagggaagaactctgcct aatccagcttgtcctgaggtcaaagaagccccagaggaagggtga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_2|68_aa MGRLYLGQGVNHSLPACSWADIWPYQMGDLLSPFPPIQLDGWHLLLGCYLCTAHPAATVA RDVAMLTD >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_2|207_bp atgggccgcttgtatctggggcagggggtgaatcactcactccctgcatgctcctgggca gatatttggccgtatcagatgggggatctcctctcccccttcccccccatccagttagat ggctggcatttgctccttggttgttatctttgtactgcccacccagcagccactgtggca agagatgtggctatgctgactgactga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_3|127_aa MDEVPEDRNMQKMDKVREDKELQEVDEVPEDQQLQEVDEIPEDLLVQEMDEVREDHQLQE VDEVPEDLQLQKVDKAPEDQQLQEVDEVPEDLQLQEVDEVPENLPLQEVDEVPEDQQMQE VDEVPED >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_3|384_bp atggatgaggtcccggaggacagaaacatgcagaagatggataaggtccgggaggacaag gagctgcaggaggtggatgaggtcccagaggaccaacagctgcaggaggtggatgagatc ccagaggacctactggtgcaggagatggatgaggtccgagaggaccaccagctgcaggag gtggatgaggtcccagaagacctacagttgcagaaggtggataaggccccagaggaccaa cagcttcaggaggtggatgaggtcccagaagacctacagttgcaggaggtggatgaggtc ccagagaacctaccactgcaggaagtggatgaggtcccagaggaccaacagatgcaggag gtggatgaggtcccagaggactga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_4|308_aa MMDSVEGRLYGIHAAILKPGGGKGARDYHPRHKGLIQLLQLSVLSDLIHLQLSVLSDLIN LQLSVLWDLIHLLQLAVLPDLIYLQLSVLRDLIHLLQLSLSVLWDLIHLLQLAVLPDLIY LPLSVLQDLIHLLQLSVLQDLIHLLQLSLSVLPDLIYLQLSVLRDLIHLLQLAVLPDLIY LQLSVLRDLIHLLQLSVLRDLIHLQLSVLWDLIHLLQLSVLRDLIHLQLSVLWDLIHLLQ LAVLPDLIYLQLSVLQDLIHLLQLSVLRDLIDLQLSVLWDLIHLLQLSVLPDIIHLQLSV LQDLIHLL >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_4|927_bp atgatggactcagttgaagggagactttatggcattcatgctgccattttgaaacctgga ggagggaaaggtgcaagggactatcacccgaggcataagggcctcatccagctcctgcag ctgtcagtcctctcagacctcatccacctgcagctgtcagtcctctcagacctcatcaac ctgcagctgtcagtcctctgggacctcatccacctcctgcaactggcagtcctcccggat ctcatctacctgcagctgtcagtcctccgggacctcatccacctcctgcagctgtcactg tcagtcctctgggacctcatccacctcctgcagctggcagttctcccggacctcatctac ctgccgctgtcagtcctccaggacctcatccacctcctgcagctgtcagtcctccaggac ctcatccacctcctgcagctgtcactatcagtcctcccggacctcatctacctgcagctg tcagtcctccgggacctcatccacctcctgcaactggcagtcctcccggacctcatctac ctgcagctgtcagtcctccgggacctcatccacctcctacagctgtcagtcctccgggac ctcatccacctgcagctgtcagtcctctgggacctcatccacctcctgcagctgtcagtc ctccgggacctcatccacctgcagctgtcagtcctctgggacctcatccacctcctgcag ctggcagtcctcccggacctcatctacctgcagctgtcagtcctccaggacctcatccac ctcctgcagctgtcagtcctccgggacctcatcgacctgcagctgtcagtcctctgggac ctcatccacctcctgcagctgtcagtcctcccggacatcatccacctgcagctgtcagtt ctccaggacctcatccacctcctgtag >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_5|154_aa MADDLDFETGDAGASATFPMQCSALRKNGFVVLKGWPCKIVEMSASKTGKHGHAKVHLVG IDIFTGKKYEDICPSTHNMDVPNIKRNDFQLIGIQDGYLSLLQDSGEVPEDLRLPEGDLG KEIEQKYDCGEEILITVLSAMTEEAAVAIKAMAK >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_5|465_bp atggcagatgatttggacttcgagacaggagatgcaggggcctcagccaccttcccaatg cagtgctcagcattacgtaagaatggctttgtggtgctcaaaggctggccatgtaagatc gtggagatgtctgcttcgaagactggcaagcacggccacgccaaggtccatctggttggt attgacatctttactgggaagaaatatgaagatatctgcccgtcaactcataatatggat gtccccaacatcaaaaggaatgacttccagctgattggcatccaggatgggtacctatca ctgctccaggacagcggggaggtaccagaggaccttcgtctccctgagggagaccttggc aaggagattgagcagaagtacgactgtggagaagagatcctgatcacggtgctgtctgcc atgacagaggaggcagctgttgcaatcaaggccatggcaaaataa >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_6|83_aa MPWPPKVLGLQAVVKVFIVLTPQFLSRDKDQLTKELQKHVKSVTVSCKSPRKMEFVPELR KTVTGKIKQGELEKRSLVRCNQQ >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_6|252_bp atgccttggcctcccaaagtgctgggattacaggcggtcgtgaaggtgtttattgtcctg accccacagtttctgtcccgtgacaaggaccagctgaccaaggagctgcagaagcatgta aagtcagtgacagtctcatgcaagtccccaaggaagatggagtttgtcccggagctgcgg aaaactgtcactggaaagattaaacaaggtgaacttgagaaaaggagcttggtcagatgt aatcagcagtga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_7|261_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGIMLPD FKLYYKATVTKTAWYWYRNRDIDQWNRTEPSEIMPRIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKIISRWIKDLNVRPKTIKTLEENLGITIQDIDMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCAAKQTTIRVNRQPTKWEKIFATYSSDKGLISRIY NELKQIYKKKQTTPSKSGRRT >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_7|786_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcgtggtactggtaccgaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccacgtatctacaactat ttgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattatttcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacttaggcattaccattcaggacatagacatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcgcagcaaaacaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaacaaacaaccccatcaaaaagtgggcgaagg acatga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_8|248_aa MWLCPLALTLILMAASGAACEVKDVCVGSPGIPGTPGSHGLPGRDGRDGVKGDPGPPGPM GPPGETPCPPGNNGLPGAPGVPGERGEKGEAGERGPPGLPAHLDEELQATLHDFRHQILQ TRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIASFVKK YNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAGRGKEQCVEMYTDGQWNDRNCLY SRLTICEF >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_8|747_bp atgtggctgtgccctctggccctcaccctcatcttgatggcagcctctggtgctgcgtgc gaagtgaaggacgtttgtgttggaagccctggtatccccggcactcctggatcccacggc ctgccaggcagggacgggagagatggtgtcaaaggagaccctggccctccaggccccatg ggtccgcctggagaaacaccatgtcctcctgggaataatgggctgcctggagcccctggt gtccctggagagcgtggagagaagggggaggctggcgagagaggccctccagggcttcca gctcatctagatgaggagctccaagccacactccacgacttcagacatcaaatcctgcag acaaggggagccctcagtctgcagggctccataatgacagtaggagagaaggtcttctcc agcaatgggcagtccatcacttttgatgccattcaggaggcatgtgccagagcaggcggc cgcattgctgtcccaaggaatccagaggaaaatgaggccattgcaagcttcgtgaagaag tacaacacatatgcctatgtaggcctgactgagggtcccagccctggagacttccgctac tcagatgggacccctgtaaactacaccaactggtaccgaggggagcctgcaggtcgggga aaagagcagtgtgtggagatgtacacagatgggcagtggaatgacaggaactgcctgtac tcccgactgaccatctgtgagttctga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_9|231_aa MADSPALSLQGSIPAVGEKVFFTNRQLVYLDAISEACARAGSCIAVPRSPEENEAIARFL KKYNMYAYMALAKGPSPGDFCYLDEAPVDYTNRCPGELIDQGLRGLQGPPGKLGPPGNPG APGIPGPRSQKGDHGDNSAFSLGKMSGKKLFMTNGERMPFSKVKALCAGLQATVAAPKNA KENKAIQDVAKDTAFLGITDEATEGQFIYLTGGRLTYSNWKKDEPNDHGSG >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_9|696_bp atggcagacagcccagccctcagtcttcagggctccataccggcagtgggagagaaggtc ttcttcacaaacaggcagttggtctatttagatgccattagtgaggcatgtgccagagca ggcagctgcattgctgtccctaggagtccagaggaaaacgaggccattgcaagatttttg aagaagtacaacatgtatgcctacatggccctggccaagggtcccagccctggagacttc tgctacctagatgaggcccctgtggactacacgaaccggtgcccaggggagctcatagat caagggctcagaggtttgcagggccctcctgggaagttggggcccccaggaaacccaggg gctcctggaattccaggaccaaggagccaaaaaggagatcatggggacaattcagccttc tccttggggaaaatgtctgggaagaagcttttcatgaccaacggtgagcggatgcctttc tccaaagtgaaggctctgtgtgctgggctccaggccacagtggctgcccccaagaatgcc aaggagaataaggccatccaggatgtggccaaagacactgccttcctgggcatcacagat gaggcaactgaaggccagttcatatacttgacgggtgggaggctgacctacagcaactgg aagaaggatgagccaaatgaccacggctcagggtag >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_10|248_aa MWLCPLALNLILMAASGAVCEVKDVCVGSPGIPGTPGSHGLPGRDGRDGLKGDPGPPGPM GPPGEMPCPPGNDGLPGAPGIPGECGEKGEPGERGPPGLPAHLDEELQATLHDFRHQILQ TRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIASFVKK YNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAGRGKEQCVEMYTDGQWNDRNCLY SRLTICEF >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_10|747_bp atgtggctgtgccctctggccctcaacctcatcttgatggcagcctctggtgctgtgtgc gaagtgaaggacgtttgtgttggaagccctggtatccccggcactcctggatcccacggc ctgccaggcagggacgggagagatggtctcaaaggagaccctggccctccaggccccatg ggtccacctggagaaatgccatgtcctcctggaaatgatgggctgcctggagcccctggt atccctggagagtgtggagagaagggggagcctggcgagaggggccctccagggcttcca gctcatctagatgaggagctccaagccacactccacgactttagacatcaaatcctgcag acaaggggagccctcagtctgcagggctccataatgacagtaggagagaaggtcttctcc agcaatgggcagtccatcacttttgatgccattcaggaggcatgtgccagagcaggcggc cgcattgctgtcccaaggaatccagaggaaaatgaggccattgcaagcttcgtgaagaag tacaacacatatgcctatgtaggcctgactgagggtcccagccctggagacttccgctac tcagacgggacccctgtaaactacaccaactggtaccgaggggagcccgcaggtcgggga aaagagcagtgtgtggagatgtacacagatgggcagtggaatgacaggaactgcctgtac tcccgactgaccatctgtgagttctga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_11|170_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFATYSSDKGL ISRIYKELKQIYKKKPNNPIKKWAKDMNRHFSKEDIYAGKKHMKKCSSSLAIREMQIKTT MRYHLTPVRMAIIKKSGNNRDMDETGIIMLSKLSQGQKTKHHMFSLIGGN >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_11|513_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactcaagagcttctgcacagcaaaagaaactaccatcagagtg aacaggcaacctacagaatgggagaaaatttttgcaacctactcatctgataaagggcta atatccagaatctacaaagaacttaaacaaatttacaagaaaaaaccaaacaaccccatc aaaaagtgggcgaaggatatgaacagacacttctcaaaagaagacatttatgcaggcaaa aaacacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccaca atgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagg gacatggatgaaactggaatcatcatgctcagcaaattatcgcaaggacaaaaaaccaaa caccacatgttctcactcataggtgggaattga >gi568815588r:79457221_79659483|GENSCAN_predicted_peptide_12|66_aa MLWMTRREKKKRAVALWEPRPGSSRGQSCDSRYGTLWFLESASFQPTQHEDDEDEDLYDD PFPLSE >gi568815588r:79457221_79659483|GENSCAN_predicted_CDS_12|201_bp atgttgtggatgacaagaagagagaagaaaaaaagagctgtggctctatgggaacccaga cctgggagctcccgaggccagagctgtgactcccgctatgggaccctatggttcctggag tctgcaagcttccagcctactcaacatgaagatgatgaggatgaagacctttatgatgat ccatttccgctcagtgaatag