GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:11:02 Sequence gi568815583f:75803964_76033099 : 229136 bp : 39.29% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 970 965 6 1.05 1.06 Term - 2420 2258 163 1 1 65 47 105 0.726 0.53 1.05 Intr - 4567 4499 69 0 0 46 91 79 0.335 1.38 1.04 Intr - 6578 6330 249 1 0 76 35 140 0.733 3.13 1.03 Intr - 11360 11211 150 1 0 69 76 169 0.783 12.16 1.02 Intr - 12174 11978 197 0 2 118 20 137 0.631 7.19 1.01 Init - 23707 23549 159 1 0 79 116 111 0.864 12.87 1.00 Prom - 29943 29904 40 -5.25 2.00 Prom + 38455 38494 40 -6.25 2.01 Init + 38806 39013 208 0 1 50 80 96 0.501 4.03 2.02 Intr + 39051 39257 207 1 0 28 53 141 0.600 2.73 2.03 Intr + 39693 39883 191 1 2 15 89 242 0.695 15.38 2.04 Intr + 40352 40626 275 1 2 68 60 148 0.889 5.41 2.05 Intr + 40689 40824 136 0 1 10 36 172 0.780 3.85 2.06 Intr + 50423 50524 102 1 0 98 91 109 0.948 11.65 2.07 Intr + 55915 56019 105 0 0 63 103 60 0.907 4.49 2.08 Intr + 64988 65047 60 1 0 85 50 126 0.491 6.51 2.09 Intr + 67079 67178 100 1 1 65 98 76 0.871 5.06 2.10 Intr + 67274 67461 188 0 2 60 49 65 0.803 -1.71 2.11 Intr + 69465 69605 141 2 0 102 101 146 0.997 16.93 2.12 Intr + 79403 79461 59 1 2 119 110 45 0.917 6.56 2.13 Intr + 99764 99915 152 2 2 59 6 138 0.019 1.59 2.14 Intr + 100047 100140 94 1 1 53 81 93 0.131 3.20 2.15 Intr + 100528 100703 176 1 2 74 77 205 0.189 16.66 2.16 Intr + 101615 101643 29 0 2 86 72 27 0.024 -2.18 2.17 Intr + 109240 109327 88 0 1 108 84 72 0.268 7.42 2.18 Intr + 113267 113431 165 0 0 67 97 44 0.083 2.21 2.19 Intr + 125921 126086 166 0 1 132 53 98 0.961 8.80 2.20 Intr + 128722 129067 346 1 1 77 83 323 0.425 25.17 2.21 Intr + 130497 130516 20 2 2 71 115 5 0.158 -4.01 2.22 Term + 136512 136659 148 0 1 76 37 128 0.167 2.89 2.23 PlyA + 136713 136718 6 1.05 3.05 PlyA - 137811 137806 6 1.05 3.04 Term - 141085 141066 20 2 2 110 49 14 0.248 -2.80 3.03 Intr - 152048 151969 80 1 2 78 79 93 0.738 5.68 3.02 Intr - 158011 157865 147 0 0 79 111 93 0.968 9.13 3.01 Init - 168483 168239 245 0 2 85 11 165 0.457 5.75 3.00 Prom - 168537 168498 40 -4.95 4.02 PlyA - 169157 169152 6 1.05 4.01 Sngl - 172500 172108 393 0 0 78 42 382 0.840 28.59 4.00 Prom - 174870 174831 40 -4.95 5.07 PlyA - 175463 175458 6 1.05 5.06 Term - 183221 183141 81 2 0 65 39 68 0.637 -3.69 5.05 Intr - 183748 183587 162 1 0 -7 100 172 0.888 8.15 5.04 Intr - 186519 186427 93 0 0 36 85 105 0.278 4.24 5.03 Intr - 193816 193732 85 0 1 75 39 36 0.047 -3.70 5.02 Intr - 201846 201781 66 2 0 120 55 87 0.220 5.70 5.01 Init - 210457 210213 245 1 2 64 87 150 0.878 9.75 5.00 Prom - 210511 210472 40 -5.65 6.04 PlyA - 210628 210623 6 1.05 6.03 Term - 214424 214024 401 0 2 55 48 297 0.916 16.69 6.02 Intr - 219435 219341 95 2 2 53 50 20 0.237 -6.61 6.01 Init - 220234 220065 170 1 2 60 90 127 0.512 9.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:75803964_76033099|GENSCAN_predicted_peptide_1|328_aa MQFGVLKVFFKTAQLYLEELQKAFAPRNPVAPLDNNTTYSNDASKDSLLTFRKLLLLLVV RASMEEKASSLSSGKLLWQLPKQSPLQVHHSTASFGMAVTHCVLVTELPSMVALLTWSSA YIITSGPCSGAWERDIPNSAKKLATEGSIPREHTAAGGETTARGLAIGGLKEILPVRRTL SPLRRVTNYGSDLLLTSQSWDLNASLCNYKAHTLLSARSPTPAPIPGPMSYIEAYSQNEG SKIPQQGKKVAGRLAEALFGVQTYDQDAYGFLTPRKPLENVLQHSERVNQEKRRSGAKEQ EMQQKRQEEIHTMVQRHARMPDVPWAES >gi568815583f:75803964_76033099|GENSCAN_predicted_CDS_1|987_bp atgcagttcggggtcttaaaggtcttcttcaagacagcacaattatatttagaagaactg caaaaagcatttgcccccagaaatcctgtggccccattggataacaatactacctattcg aatgatgccagcaaggactctcttctcacattcaggaagctgctactgctgctggtggtg agggcttccatggaggagaaagcatcctccttatcttctgggaagctcctctggcagctc cccaaacaaagccctttgcaagtgcaccacagcactgccagctttgggatggctgtaact cactgtgtgctggtcaccgagctcccctccatggttgccttgttgacatggagcagtgcc tatatcataaccagtggcccttgctcaggtgcctgggaaagggacataccaaacagtgca aagaaacttgcaactgaaggcagcatccccagggaacacaccgctgctggtggcgaaacg acagcaagaggcctcgcaatcggaggtctcaaagagatcttgccggttaggcggacactt tctccacttcgcagggtaaccaactatggatcagatcttcttctgacatcacagagctgg gacttgaatgcaagtctgtgtaactacaaagcccacactctgctctctgccaggagcccc actcctgccccgatccctgggccgatgagttatatcgaggcctatagccaaaatgaaggc tcaaagattccccagcaggggaagaaagtggctggcaggctggcagaagccctttttggg gtccagacttatgaccaggatgcctatggcttcctaacccccaggaagccactggaaaat gtgctgcaacacagtgagagagtaaaccaggaaaaaagaagatctggggccaaggaacag gagatgcaacagaagaggcaggaggaaattcacacgatggtgcagagacatgccaggatg ccagatgtgccatgggctgagagctga >gi568815583f:75803964_76033099|GENSCAN_predicted_peptide_2|1051_aa MKSWDTSRQQPPGAKPSSWVNGWFESKGGSSESGEKVLTGTQAGEKVLTGTQASEAVART RVKVAEVTLVRLAGAAPSAKLRSRPQGLALVCSGADVAGAIRHARVRLQAGGGGARGSPA PPRLLGPGRQLAPSLQSADEGKMSVSGLKAELKFLASIFDKNHERFRIVSWKLDELHCQF LVPQQGSPHSLPPPLTLHCNITPIVFKFAFKEKLTMRMDSLTEEKLECRLWCCLSDPSPP GLAARCCVLERSIVPSLRQVVLSVYSCGPSLVCKPRKWIGERRYAPGLLPGKITNPRLGK GIEVLSFSIKVYREAIRREIDVGYWSKLQNILLVIIGVKESYPSSSPIWFVDSEDPNLTS VLERLEDTKNNNLLRQQLKWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSEEE EEEEEMAEVFLREGDVSGSQDNSGERVSRQTREQRSLHHRQALRRFFPISVDGTYNRVLY RDIPLPRDGQEADAFLLSQLQEACLPLILIFLSTDPLRVSGWGTDIEDLDHYEMKEEEPI SGKKSEDEGIEKENLAILEKIRKTQRQDHLNDNFPFDPPFVRVVLPVLSGGIRPNLKNEE NSSERITERSTPRSTGSRRGNAPALPRPPSPSSRKWRGRLLNPRSTFVLSNLAEVVERVL TFLPAKALLRVACVCRLWRECVRRVLRTHRSVTWISAGLAEAGHLEGHCLVRVVAEELEA SSKGCHAQSFADSVQILPVVPNVRILPHTVLYMADSETFISLEECRGHKRVTPMGSGSNR PQEIEIGESGFALLFPQIEGIKIQPFHFIKDPKNLTLERHQLTEVGLLDNPELRVVLVFG YNCCKVGASNYLQQVVSTFSDMNIILAGGQVDNLSSLTSEKNPLDIDASGVVGLSFSGHR IQSATVLLNEDVSDEKTAEAAMQRLKAANIPEHNTIGFMFACVGRGFQYYRAKGNVEADA FRKFFPSVPLFGFFGNGEIGCDRIVTGNFILRKCNEDSRTLARNRKTHPEFHVESEGIPN SQVSLEKEEQIGGLKPAEFKTYYNATVIKTM >gi568815583f:75803964_76033099|GENSCAN_predicted_CDS_2|3156_bp atgaaatcatgggacacctccaggcaacagcctccaggggctaaaccatcgtcttgggta aatggatggtttgagagcaaaggcgggagcagcgagtccggtgagaaggtcctcacagga acccaggctggtgagaaggtcctcacaggaacccaggctagcgaagcggtggctcggact agggtgaaagtagcggaggtaacgctggtgaggttagcgggtgccgcgccgagcgccaag ctgaggagccgcccccaggggctggcgctagtctgcagcggcgccgacgtggccggcgcg atccgccacgcacgtgtccggctccaggccgggggcgggggcgcgcgggggtctccggct cctccccgacttctggggcccgggcgccagctggccccgagcctccagtccgcggatgag gggaagatgtccgtgtcagggctcaaggccgagctgaagttcctggcgtccatcttcgac aagaaccacgagcgattccgcatcgtcagttggaagctggacgagctgcactgccagttc ctggtgccgcagcagggcagcccgcactcgctgccgccgccactcacgctccactgcaac atcacgccaattgtttttaagtttgcgtttaaggagaaactgacaatgagaatggactcg ttgacggaggaaaagttggaatgcagactctggtgctgtttgagcgacccctctcccccg ggcctggctgcgcgctgctgtgttctggaaaggagcatagtaccgtcgttgcggcaggtg gtgttgagtgtgtattcgtgtggcccctcccttgtgtgtaagcctcgaaaatggataggt gaaaggagatacgcgccagggctgctccccggaaagatcacaaacccaaggcttggtaaa ggaattgaagtattaagtttctccataaaagtgtatcgtgaagctattcgtcgtgaaatt gatgtaggatattggtccaaactgcagaatattctcttggttataataggcgtcaaggaa tcctatccatcttcttcaccgatatggtttgtggattctgaagacccaaatctgacatca gttctggaacgtctagaagatactaagaacaacaatttgcttcgtcagcaattgaagtgg ttgatatgtgaactctgcagtttatataaccttcctaagcacctggatgttgagatgcta gatcaaccactacccacgggtcagaatgggacaacagaagaagtgacttcagaagaagag gaagaagaagaagagatggctgaagtgtttctccgagagggggatgtgtcagggtcacaa gacaatagtggggagagggtcagcagacaaacacgtgaacaaaggtctttgcatcataga caagccctaaggcggtttttccctatctcagtagatggaacgtacaatcgggttttatac cgagacattccattgcccagggacgggcaggaggcagatgccttcctcttgtctcaactg caagaggcatgccttcctcttatactaatcttcctcagcacagaccctttacgggtgtca ggctgggggacggatatagaagacttagatcactatgagatgaaggaagaagagcctatt agtgggaaaaagtcagaggatgaaggaattgaaaaagaaaatttggcaatattagagaaa attaggaagactcaaaggcaagaccatttaaatgataactttccatttgatcctccattt gttcgagtggtgttacctgttctctcaggaggcattagacccaatctcaaaaatgaagaa aacagttccgagcgtattacggaacggagtacacctcggagtacgggctccagacgaggg aacgcaccggcgttgccacgcccaccttccccgtccagccggaagtggcgcggacgcctg ctcaacccgcggagcaccttcgtgttgagtaacctggcggaggtggtggagcgtgtgctc accttcctgcccgccaaggcgttgctgcgggtggcctgcgtgtgccgcttatggagggag tgtgtgcgcagagtattgcggacccatcggagcgtaacctggatctccgcaggcctggcg gaggccggccacctggaggggcattgcttggttcgcgtggtagcagaggagcttgaggca agtagcaaggggtgtcatgcacagtcatttgcagactcggttcagattttgccagttgtc ccaaatgttcgcatcttaccacatacagttctttacatggctgattcagaaactttcatt agtctggaagagtgtcgtggccataagagagtgactccaatgggatcaggtagcaatcga cctcaggaaatagaaattggagaatctggttttgctttattattccctcaaattgaagga ataaaaatacaaccctttcattttattaaggatccaaagaatttaacattagaaagacat caactcactgaagtaggtcttttagataaccctgaacttcgtgtggtccttgtctttggt tataattgctgtaaggtgggagccagtaattatctgcagcaagtagtcagcactttcagt gatatgaatatcatcttggctggaggccaggtggacaacctgtcatcactgacttctgaa aagaaccctctggatattgatgcctcgggtgtggttggactgtcatttagtggacaccga atccagagtgccactgtgctcctcaacgaggacgtcagtgatgagaagactgctgaggct gcgatgcagcgcctcaaagcggccaacattccagagcataacaccattggcttcatgttt gcatgcgttggcaggggctttcagtattacagagccaaggggaatgttgaggctgatgca tttagaaagttttttcctagtgttcccttattcggcttctttggaaatggagaaattgga tgtgatcggatagtcactgggaactttatattgaggaaatgtaatgaggacagtagaaca cttgcaagaaatagaaaaacccatcctgaatttcatgtggaatctgaagggatcccaaat agccaagtcagtcttgagaaagaagaacaaattggaggacttaaacctgcagagtttaaa acttactataatgctacagtaatcaaaacaatgtga >gi568815583f:75803964_76033099|GENSCAN_predicted_peptide_3|163_aa MGKDFVTKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRQPTECEKIFAIYPSDKGL ISRIHKELKQIYKKKTNNPIKKCVENYTGARCEEVFLPGSSIQTKSNLFEAFVALAVLVT LIIGAFYFLCRKGHFQRASSVQYDINLVETSSTSAHHKCSVTW >gi568815583f:75803964_76033099|GENSCAN_predicted_CDS_3|492_bp atgggcaaagacttcgtgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg aacaggcaacctacagaatgtgagaaaatttttgcaatctatccatctgacaaagggcta atatccagaatccataaggaacttaaacaaatttacaagaaaaaaaccaacaaccccatc aaaaagtgcgttgaaaactatacaggagctcgttgtgaagaggtttttctcccaggctcc agcatccaaactaaaagtaacctgtttgaagcttttgtggcattggcggtcctagtaaca cttatcattggagccttctacttcctttgcaggaaaggccactttcagagagccagttca gtccagtatgatatcaacctggtagagacgagcagtaccagtgcccaccacaaatgttca gtgacatggtaa >gi568815583f:75803964_76033099|GENSCAN_predicted_peptide_4|130_aa MRKNQCKKAENSKSQNASSPPKDHNSLPAREQNWMENEFDKLTEVGFRRWVITNSFKLNE HVLTQYKEAKNLEKRLEELLTGRTSVEKNISDLMELKNIAREFRKAYTGINRQIDQVEER ISDVEDQLMK >gi568815583f:75803964_76033099|GENSCAN_predicted_CDS_4|393_bp atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaagccagaacgcctcttctcct ccaaaggatcacaactccctgccagcaagggaacaaaactggatggagaatgagtttgac aaattgacagaagtaggcttcagaaggtgggtaataacaaactccttcaagctaaacgag catgttctaacccaatacaaggaagctaagaaccttgaaaaaaggttagaggaactgcta actggaagaaccagtgtagagaagaacataagtgacctgatggagctgaaaaacatagca cgagaatttcgtaaagcatacacgggtatcaatagacaaatcgatcaagtggaagaaagg atatcagatgttgaagatcaattgatgaaataa >gi568815583f:75803964_76033099|GENSCAN_predicted_peptide_5|243_aa MGKDFVTKTPKAMATKAKTDKWDLIKPKSFCTAKETLVRVNRQPTEWEKIFAIYSSEKGL ISRIYKELKQIYKKKTNNPIKNLHENENDNNEDLYDDLLPLNEYDLGGHVLQVSGPRDGE GLPDWHWALDSSTVDNRQYGTVIPERRRTSLEIPLIFRAPFLETRWLKHQKLLRFLEDGK SKVKVLADSVGGEGPLSGLQMDIFVLHTYMAERGRAEASSELVIEIYFFPGDRKAFIAHF LLL >gi568815583f:75803964_76033099|GENSCAN_predicted_CDS_5|732_bp atgggcaaagacttcgtgactaaaacaccaaaagcaatggcaacaaaagccaaaactgac aaatgggatctaattaaaccaaagagcttctgcacagcaaaagaaactctcgtcagagta aacaggcaacctacagaatgggagaaaatttttgcaatctattcatctgagaaagggcta atatccagaatctacaaggaacttaaacaaatttacaagaaaaaaaccaacaaccccatc aaaaatctacacgaaaatgaaaatgacaacaatgaagacctttatgatgatctacttcca cttaatgaatatgaccttggaggccatgttctccaggtgtcagggccaagagatggagaa gggctgcctgactggcattgggcattagacagtagtacagttgacaacaggcagtacggg acagtgatccctgagagaaggagaacaagcttagagatccctctaatcttccgggctccc ttcctggagacaagatggcttaaacatcagaaacttctccgatttctggaggatgggaag tccaaggtcaaggtgctagctgattcagttggtggtgagggccctctttctggtttgcag atggacatcttcgtgctgcatacttacatggcagagagagggagagcggaagcctcatct gagttagtgattgaaatatacttctttcctggggaccgcaaagctttcatagctcacttc ctgctgttgtga >gi568815583f:75803964_76033099|GENSCAN_predicted_peptide_6|221_aa MLARGSLQGCFLGPRLTCWQLGGLGAGELGVAHESVSQALIAGMGPLGKPEVSQQERESL SCLQINLIQAWQTVPQRLDDSPLLPITVGSHSSPAREQNWMENEFDKSTELGFRRWVITN SSKLKEDILTQCKEAKNHEKRLEELLTGISSLEKNISDLIELKNIARELQQAYTSINSQI DQVEERISEIEDQLNEIKCEDKIREKRLKRNKRASKKYGTM >gi568815583f:75803964_76033099|GENSCAN_predicted_CDS_6|666_bp atgctagccaggggcagcctgcagggctgtttcttaggcccaagacttacatgttggcag cttggaggcctaggggcaggtgagctaggagtagcccatgaatctgtttctcaggctctg attgcaggcatggggcctttgggcaagccagaggtgtctcagcaggagagggaaagtctc tcctgtctccaaatcaatctgatccaggcatggcagacagtgccacagaggctagatgac tctccgttgcttccaatcactgtaggatcacactcctcgccagcaagggaacaaaactgg atggagaatgagtttgacaaatcgacagaattaggcttcagaaggtgggtaataacaaac tcctccaagctaaaggaggatattctaactcaatgcaaggaagctaagaaccatgaaaaa aggttagaggaattactaactggaataagcagtttagagaagaacataagtgacctgatt gagctgaaaaacatagcacgagaacttcaacaagcatacacaagtatcaatagccaaatc gatcaagtggaagaaaggatatcagagattgaagatcaacttaatgaaataaagtgtgaa gacaagattagagaaaaaagattgaaaagaaacaaacgagcctccaagaaatatgggact atgtga