GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:36:05 Sequence gi568815588f:79412650_79613111 : 200462 bp : 46.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 3475 3470 6 1.05 1.03 Term - 12590 12510 81 2 0 84 52 77 0.666 1.39 1.02 Intr - 20109 19909 201 0 0 108 110 389 0.999 42.58 1.01 Init - 32791 32546 246 1 0 64 90 511 0.780 46.50 1.00 Prom - 35092 35053 40 -5.96 2.00 Prom + 38578 38617 40 -5.36 2.01 Init + 47763 47829 67 2 1 50 74 91 0.581 5.15 2.02 Term + 55542 55699 158 1 2 109 44 78 0.588 3.60 2.03 PlyA + 56429 56434 6 1.05 3.04 PlyA - 57369 57364 6 1.05 3.03 Term - 57602 57547 56 0 2 109 54 37 0.009 0.12 3.02 Intr - 75549 75460 90 1 0 107 78 -2 0.175 0.67 3.01 Init - 78828 78768 61 0 1 73 89 26 0.252 0.66 3.00 Prom - 80037 79998 40 -1.56 4.00 Prom + 88002 88041 40 -6.56 4.01 Sngl + 93799 94182 384 0 0 63 48 624 0.680 52.09 4.02 PlyA + 94185 94190 6 -10.57 5.06 PlyA - 94197 94192 6 -10.46 5.05 Term - 94701 94219 483 0 0 74 37 866 0.925 74.85 5.04 Intr - 94911 94765 147 0 0 76 -108 231 0.585 2.73 5.03 Intr - 95136 94939 198 0 0 111 -108 338 0.712 16.05 5.02 Intr - 95573 95481 93 2 0 100 56 21 0.534 0.26 5.01 Init - 95761 95756 6 1 0 64 77 8 0.145 -2.25 5.00 Prom - 96951 96912 40 -6.16 6.00 Prom + 98262 98301 40 -8.76 6.01 Sngl + 100001 100465 465 1 0 53 36 623 0.955 49.75 6.02 PlyA + 101080 101085 6 1.05 7.04 PlyA - 111754 111749 6 1.05 7.03 Term - 112400 112305 96 2 0 43 43 80 0.105 -3.03 7.02 Intr - 113983 113864 120 1 0 130 113 85 0.977 15.89 7.01 Init - 118695 118660 36 0 0 56 101 40 0.729 0.11 7.00 Prom - 119271 119232 40 -9.26 8.00 Prom + 119380 119419 40 -7.66 8.01 Sngl + 122986 123771 786 0 0 86 47 221 0.962 13.76 8.02 PlyA + 124585 124590 6 1.05 9.05 PlyA - 125938 125933 6 1.05 9.04 Term - 144936 144560 377 1 2 85 54 465 0.902 37.70 9.03 Intr - 145480 145403 78 2 0 101 94 76 0.984 9.02 9.02 Intr - 146356 146237 120 2 0 114 99 50 0.755 9.17 9.01 Init - 146834 146663 172 2 1 71 84 180 0.908 13.55 9.00 Prom - 166112 166073 40 -5.46 10.05 PlyA - 167166 167161 6 1.05 10.04 Term - 170010 169730 281 1 2 8 48 272 0.071 10.81 10.03 Intr - 171727 171611 117 2 0 86 110 11 0.121 3.54 10.02 Intr - 183008 182727 282 0 0 82 43 300 0.086 22.29 10.01 Init - 183086 183071 16 2 1 47 94 15 0.301 -1.56 10.00 Prom - 187818 187779 40 -7.16 11.00 Prom + 188231 188270 40 -3.56 11.01 Init + 199177 199348 172 0 1 71 84 122 0.362 8.91 11.02 Intr + 199663 199782 120 2 0 114 99 95 0.856 13.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 183008 182683 326 0 2 82 48 332 0.807 23.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_1|175_aa MSLLSAIDTSAASVYQPAQLLNWVYLSLQDTHQASAFDAFRPEPTAGAAPPELAFGKGRP EQLGSPLHSSYLNSFFQLQRGEALSNSVYKGASPYGSLNNIADGLSSLTEHFSDLTLTSE ARKPSKRPPPNYLCHLCFNKGHYIKDCPQGRLVSRFGLCTLMGQDSEGLSSNVLL >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_1|528_bp atgagcctgctgtcggccatcgacacgagcgccgcctcggtgtaccagcccgcccagctg ctcaactgggtctacctgtcgctgcaggacacgcaccaggctagcgccttcgatgccttc cggcccgagccgaccgccggcgccgcacccccggagctggccttcggcaagggccgcccc gagcagctgggctcgcccctgcactccagctatctcaacagcttcttccagctgcagcgc ggagaggcgctgagcaacagtgtgtacaagggcgcctcaccctatggctccctcaacaac atcgccgatggcctcagctccctcaccgagcacttctcagacctgaccctcacctccgag gctcgcaagcccagcaagcggcccccacccaactacctgtgccacctgtgcttcaacaaa ggacactacatcaaggactgcccccagggtcgcctggtatctcgttttggcctctgcacg ctgatgggtcaggactctgaaggcctgagcagcaatgtgttgctgtga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_2|74_aa MSFLSLHQKQMLALCFTYSLQNSLSLHCTLAAAPDLHTLEKEGLGFFHFQPSESEGRTLP NPACPEVKEAPEEG >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_2|225_bp atgagcttcctgagtctccaccagaagcagatgctggccctatgcttcacgtacagcctg cagaactctctctccctgcactgcacactggcagcagctccagacctgcacaccctggag aaggaaggattgggattcttccacttccaacccagtgaatctgagggaagaactctgcct aatccagcttgtcctgaggtcaaagaagccccagaggaagggtga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_3|68_aa MGRLYLGQGVNHSLPACSWADIWPYQMGDLLSPFPPIQLDGWHLLLGCYLCTAHPAATVA RDVAMLTD >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_3|207_bp atgggccgcttgtatctggggcagggggtgaatcactcactccctgcatgctcctgggca gatatttggccgtatcagatgggggatctcctctcccccttcccccccatccagttagat ggctggcatttgctccttggttgttatctttgtactgcccacccagcagccactgtggca agagatgtggctatgctgactgactga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_4|127_aa MDEVPEDRNMQKMDKVREDKELQEVDEVPEDQQLQEVDEIPEDLLVQEMDEVREDHQLQE VDEVPEDLQLQKVDKAPEDQQLQEVDEVPEDLQLQEVDEVPENLPLQEVDEVPEDQQMQE VDEVPED >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_4|384_bp atggatgaggtcccggaggacagaaacatgcagaagatggataaggtccgggaggacaag gagctgcaggaggtggatgaggtcccagaggaccaacagctgcaggaggtggatgagatc ccagaggacctactggtgcaggagatggatgaggtccgagaggaccaccagctgcaggag gtggatgaggtcccagaagacctacagttgcagaaggtggataaggccccagaggaccaa cagcttcaggaggtggatgaggtcccagaagacctacagttgcaggaggtggatgaggtc ccagagaacctaccactgcaggaagtggatgaggtcccagaggaccaacagatgcaggag gtggatgaggtcccagaggactga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_5|308_aa MMDSVEGRLYGIHAAILKPGGGKGARDYHPRHKGLIQLLQLSVLSDLIHLQLSVLSDLIN LQLSVLWDLIHLLQLAVLPDLIYLQLSVLRDLIHLLQLSLSVLWDLIHLLQLAVLPDLIY LPLSVLQDLIHLLQLSVLQDLIHLLQLSLSVLPDLIYLQLSVLRDLIHLLQLAVLPDLIY LQLSVLRDLIHLLQLSVLRDLIHLQLSVLWDLIHLLQLSVLRDLIHLQLSVLWDLIHLLQ LAVLPDLIYLQLSVLQDLIHLLQLSVLRDLIDLQLSVLWDLIHLLQLSVLPDIIHLQLSV LQDLIHLL >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_5|927_bp atgatggactcagttgaagggagactttatggcattcatgctgccattttgaaacctgga ggagggaaaggtgcaagggactatcacccgaggcataagggcctcatccagctcctgcag ctgtcagtcctctcagacctcatccacctgcagctgtcagtcctctcagacctcatcaac ctgcagctgtcagtcctctgggacctcatccacctcctgcaactggcagtcctcccggat ctcatctacctgcagctgtcagtcctccgggacctcatccacctcctgcagctgtcactg tcagtcctctgggacctcatccacctcctgcagctggcagttctcccggacctcatctac ctgccgctgtcagtcctccaggacctcatccacctcctgcagctgtcagtcctccaggac ctcatccacctcctgcagctgtcactatcagtcctcccggacctcatctacctgcagctg tcagtcctccgggacctcatccacctcctgcaactggcagtcctcccggacctcatctac ctgcagctgtcagtcctccgggacctcatccacctcctacagctgtcagtcctccgggac ctcatccacctgcagctgtcagtcctctgggacctcatccacctcctgcagctgtcagtc ctccgggacctcatccacctgcagctgtcagtcctctgggacctcatccacctcctgcag ctggcagtcctcccggacctcatctacctgcagctgtcagtcctccaggacctcatccac ctcctgcagctgtcagtcctccgggacctcatcgacctgcagctgtcagtcctctgggac ctcatccacctcctgcagctgtcagtcctcccggacatcatccacctgcagctgtcagtt ctccaggacctcatccacctcctgtag >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_6|154_aa MADDLDFETGDAGASATFPMQCSALRKNGFVVLKGWPCKIVEMSASKTGKHGHAKVHLVG IDIFTGKKYEDICPSTHNMDVPNIKRNDFQLIGIQDGYLSLLQDSGEVPEDLRLPEGDLG KEIEQKYDCGEEILITVLSAMTEEAAVAIKAMAK >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_6|465_bp atggcagatgatttggacttcgagacaggagatgcaggggcctcagccaccttcccaatg cagtgctcagcattacgtaagaatggctttgtggtgctcaaaggctggccatgtaagatc gtggagatgtctgcttcgaagactggcaagcacggccacgccaaggtccatctggttggt attgacatctttactgggaagaaatatgaagatatctgcccgtcaactcataatatggat gtccccaacatcaaaaggaatgacttccagctgattggcatccaggatgggtacctatca ctgctccaggacagcggggaggtaccagaggaccttcgtctccctgagggagaccttggc aaggagattgagcagaagtacgactgtggagaagagatcctgatcacggtgctgtctgcc atgacagaggaggcagctgttgcaatcaaggccatggcaaaataa >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_7|83_aa MPWPPKVLGLQAVVKVFIVLTPQFLSRDKDQLTKELQKHVKSVTVSCKSPRKMEFVPELR KTVTGKIKQGELEKRSLVRCNQQ >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_7|252_bp atgccttggcctcccaaagtgctgggattacaggcggtcgtgaaggtgtttattgtcctg accccacagtttctgtcccgtgacaaggaccagctgaccaaggagctgcagaagcatgta aagtcagtgacagtctcatgcaagtccccaaggaagatggagtttgtcccggagctgcgg aaaactgtcactggaaagattaaacaaggtgaacttgagaaaaggagcttggtcagatgt aatcagcagtga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_8|261_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGIMLPD FKLYYKATVTKTAWYWYRNRDIDQWNRTEPSEIMPRIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKIISRWIKDLNVRPKTIKTLEENLGITIQDIDMGKDF MSKTPKAMATKAKIDKWDLIKLKSFCAAKQTTIRVNRQPTKWEKIFATYSSDKGLISRIY NELKQIYKKKQTTPSKSGRRT >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_8|786_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cacatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcgtggtactggtaccgaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccacgtatctacaactat ttgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattatttcaagatggattaaagacttaaacgttagacctaaaaccata aaaaccctagaagaaaacttaggcattaccattcaggacatagacatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcgcagcaaaacaaactaccatcagagtgaacaggcaacctaca aaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaacaaacaaccccatcaaaaagtgggcgaagg acatga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_9|248_aa MWLCPLALTLILMAASGAACEVKDVCVGSPGIPGTPGSHGLPGRDGRDGVKGDPGPPGPM GPPGETPCPPGNNGLPGAPGVPGERGEKGEAGERGPPGLPAHLDEELQATLHDFRHQILQ TRGALSLQGSIMTVGEKVFSSNGQSITFDAIQEACARAGGRIAVPRNPEENEAIASFVKK YNTYAYVGLTEGPSPGDFRYSDGTPVNYTNWYRGEPAGRGKEQCVEMYTDGQWNDRNCLY SRLTICEF >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_9|747_bp atgtggctgtgccctctggccctcaccctcatcttgatggcagcctctggtgctgcgtgc gaagtgaaggacgtttgtgttggaagccctggtatccccggcactcctggatcccacggc ctgccaggcagggacgggagagatggtgtcaaaggagaccctggccctccaggccccatg ggtccgcctggagaaacaccatgtcctcctgggaataatgggctgcctggagcccctggt gtccctggagagcgtggagagaagggggaggctggcgagagaggccctccagggcttcca gctcatctagatgaggagctccaagccacactccacgacttcagacatcaaatcctgcag acaaggggagccctcagtctgcagggctccataatgacagtaggagagaaggtcttctcc agcaatgggcagtccatcacttttgatgccattcaggaggcatgtgccagagcaggcggc cgcattgctgtcccaaggaatccagaggaaaatgaggccattgcaagcttcgtgaagaag tacaacacatatgcctatgtaggcctgactgagggtcccagccctggagacttccgctac tcagatgggacccctgtaaactacaccaactggtaccgaggggagcctgcaggtcgggga aaagagcagtgtgtggagatgtacacagatgggcagtggaatgacaggaactgcctgtac tcccgactgaccatctgtgagttctga >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_10|231_aa MADSPALSLQGSIPAVGEKVFFTNRQLVYLDAISEACARAGSCIAVPRSPEENEAIARFL KKYNMYAYMALAKGPSPGDFCYLDEAPVDYTNRCPGELIDQGLRGLQGPPGKLGPPGNPG APGIPGPRSQKGDHGDNSAFSLGKMSGKKLFMTNGERMPFSKVKALCAGLQATVAAPKNA KENKAIQDVAKDTAFLGITDEATEGQFIYLTGGRLTYSNWKKDEPNDHGSG >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_10|696_bp atggcagacagcccagccctcagtcttcagggctccataccggcagtgggagagaaggtc ttcttcacaaacaggcagttggtctatttagatgccattagtgaggcatgtgccagagca ggcagctgcattgctgtccctaggagtccagaggaaaacgaggccattgcaagatttttg aagaagtacaacatgtatgcctacatggccctggccaagggtcccagccctggagacttc tgctacctagatgaggcccctgtggactacacgaaccggtgcccaggggagctcatagat caagggctcagaggtttgcagggccctcctgggaagttggggcccccaggaaacccaggg gctcctggaattccaggaccaaggagccaaaaaggagatcatggggacaattcagccttc tccttggggaaaatgtctgggaagaagcttttcatgaccaacggtgagcggatgcctttc tccaaagtgaaggctctgtgtgctgggctccaggccacagtggctgcccccaagaatgcc aaggagaataaggccatccaggatgtggccaaagacactgccttcctgggcatcacagat gaggcaactgaaggccagttcatatacttgacgggtgggaggctgacctacagcaactgg aagaaggatgagccaaatgaccacggctcagggtag >gi568815588f:79412650_79613111|GENSCAN_predicted_peptide_11|98_aa MWLCPLALNLILMAASGAVCEVKDVCVGSPGIPGTPGSHGLPGRDGRDGLKGDPGPPGPM GPPGEMPCPPGNDGLPGAPGIPGECGEKGEPGERGPPX >gi568815588f:79412650_79613111|GENSCAN_predicted_CDS_11|294_bp atgtggctgtgccctctggccctcaacctcatcttgatggcagcctctggtgctgtgtgc gaagtgaaggacgtttgtgttggaagccctggtatccccggcactcctggatcccacggc ctgccaggcagggacgggagagatggtctcaaaggagaccctggccctccaggccccatg ggtccacctggagaaatgccatgtcctcctggaaatgatgggctgcctggagcccctggt atccctggagagtgtggagagaagggggagcctggcgagaggggccctccagnn