GENSCAN 1.0    Date run: 24-Oct-119    Time: 21:45:48

Sequence gi568815583r:75855930_76109295 : 253366 bp : 39.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1229   1234    6  1  0   87  116    10 0.885   4.06
 1.02 Intr +   3949   4053  105  0  0   63  103    60 0.897   4.49
 1.03 Intr +  13022  13081   60  1  0   85   50   126 0.491   6.51
 1.04 Intr +  15113  15212  100  1  1   65   98    76 0.871   5.06
 1.05 Intr +  15308  15495  188  0  2   60   49    65 0.803  -1.71
 1.06 Intr +  17499  17639  141  2  0  102  101   146 0.997  16.93
 1.07 Intr +  27437  27495   59  1  2  119  110    45 0.917   6.56
 1.08 Intr +  47798  47949  152  2  2   59    6   138 0.019   1.59
 1.09 Intr +  48081  48174   94  1  1   53   81    93 0.131   3.20
 1.10 Intr +  48562  48737  176  1  2   74   77   205 0.189  16.66
 1.11 Intr +  49649  49677   29  0  2   86   72    27 0.024  -2.18
 1.12 Intr +  57274  57361   88  0  1  108   84    72 0.268   7.42
 1.13 Intr +  61301  61465  165  0  0   67   97    44 0.083   2.21
 1.14 Intr +  73955  74120  166  0  1  132   53    98 0.961   8.80
 1.15 Intr +  76756  77101  346  1  1   77   83   323 0.425  25.17
 1.16 Intr +  78531  78550   20  2  2   71  115     5 0.158  -4.01
 1.17 Term +  84546  84693  148  0  1   76   37   128 0.167   2.89
 1.18 PlyA +  84747  84752    6                               1.05

 2.05 PlyA -  85845  85840    6                               1.05
 2.04 Term -  89119  89100   20  2  2  110   49    14 0.248  -2.80
 2.03 Intr - 100082 100003   80  1  2   78   79    93 0.738   5.68
 2.02 Intr - 106045 105899  147  0  0   79  111    93 0.968   9.13
 2.01 Init - 116517 116273  245  0  2   85   11   165 0.457   5.75
 2.00 Prom - 116571 116532   40                              -4.95

 3.02 PlyA - 117191 117186    6                               1.05
 3.01 Sngl - 120534 120142  393  0  0   78   42   382 0.840  28.59
 3.00 Prom - 122904 122865   40                              -4.95

 4.07 PlyA - 123497 123492    6                               1.05
 4.06 Term - 131255 131175   81  2  0   65   39    68 0.637  -3.69
 4.05 Intr - 131782 131621  162  1  0   -7  100   172 0.888   8.15
 4.04 Intr - 134553 134461   93  0  0   36   85   105 0.278   4.24
 4.03 Intr - 141850 141766   85  0  1   75   39    36 0.047  -3.70
 4.02 Intr - 149880 149815   66  2  0  120   55    87 0.220   5.70
 4.01 Init - 158491 158247  245  1  2   64   87   150 0.878   9.75
 4.00 Prom - 158545 158506   40                              -5.65

 5.05 PlyA - 158662 158657    6                               1.05
 5.04 Term - 162458 162058  401  0  2   55   48   297 0.878  16.69
 5.03 Intr - 196248 196138  111  1  0   90  111     8 0.040   2.73
 5.02 Intr - 204235 204135  101  0  2   69   45   127 0.064   5.33
 5.01 Init - 218043 217955   89  0  2   48  109    83 0.377   6.56
 5.00 Prom - 237669 237630   40                              -3.65

 6.02 PlyA - 238936 238931    6                               1.05
 6.01 Sngl - 240012 239578  435  0  0   55   41   231 0.822  11.12
 6.00 Prom - 240885 240846   40                              -6.15

 7.03 PlyA - 241061 241056    6                               1.05
 7.02 Term - 242053 241745  309  1  0    7   39   232 0.363   4.28
 7.01 Init - 243091 243002   90  1  0  112   36    56 0.694   3.34
 7.00 Prom - 244356 244317   40                              -5.05

 8.00 Prom + 245732 245771   40                              -9.05
 8.01 Init + 247058 247302  245  1  2   81   68   129 0.342   7.46
 8.02 Intr + 250114 250321  208  1  1   77   48   119 0.063   4.96
 8.03 Intr + 252258 252362  105  2  0   83    1    98 0.034   0.09
 8.04 Intr + 253044 253254  211  2  1   99  100    88 0.044   8.66

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 225217 225336  120  0  0   93   33   153 0.834   7.79

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_1|680_aa
MKLRQQLKWLICELCSLYNLPKHLDVEMLDQPLPTGQNGTTEEVTSEEEEEEEEMAEVFL
REGDVSGSQDNSGERVSRQTREQRSLHHRQALRRFFPISVDGTYNRVLYRDIPLPRDGQE
ADAFLLSQLQEACLPLILIFLSTDPLRVSGWGTDIEDLDHYEMKEEEPISGKKSEDEGIE
KENLAILEKIRKTQRQDHLNDNFPFDPPFVRVVLPVLSGGIRPNLKNEENSSERITERST
PRSTGSRRGNAPALPRPPSPSSRKWRGRLLNPRSTFVLSNLAEVVERVLTFLPAKALLRV
ACVCRLWRECVRRVLRTHRSVTWISAGLAEAGHLEGHCLVRVVAEELEASSKGCHAQSFA
DSVQILPVVPNVRILPHTVLYMADSETFISLEECRGHKRVTPMGSGSNRPQEIEIGESGF
ALLFPQIEGIKIQPFHFIKDPKNLTLERHQLTEVGLLDNPELRVVLVFGYNCCKVGASNY
LQQVVSTFSDMNIILAGGQVDNLSSLTSEKNPLDIDASGVVGLSFSGHRIQSATVLLNED
VSDEKTAEAAMQRLKAANIPEHNTIGFMFACVGRGFQYYRAKGNVEADAFRKFFPSVPLF
GFFGNGEIGCDRIVTGNFILRKCNEDSRTLARNRKTHPEFHVESEGIPNSQVSLEKEEQI
GGLKPAEFKTYYNATVIKTM

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_1|2043_bp
atgaagcttcgtcagcaattgaagtggttgatatgtgaactctgcagtttatataacctt
cctaagcacctggatgttgagatgctagatcaaccactacccacgggtcagaatgggaca
acagaagaagtgacttcagaagaagaggaagaagaagaagagatggctgaagtgtttctc
cgagagggggatgtgtcagggtcacaagacaatagtggggagagggtcagcagacaaaca
cgtgaacaaaggtctttgcatcatagacaagccctaaggcggtttttccctatctcagta
gatggaacgtacaatcgggttttataccgagacattccattgcccagggacgggcaggag
gcagatgccttcctcttgtctcaactgcaagaggcatgccttcctcttatactaatcttc
ctcagcacagaccctttacgggtgtcaggctgggggacggatatagaagacttagatcac
tatgagatgaaggaagaagagcctattagtgggaaaaagtcagaggatgaaggaattgaa
aaagaaaatttggcaatattagagaaaattaggaagactcaaaggcaagaccatttaaat
gataactttccatttgatcctccatttgttcgagtggtgttacctgttctctcaggaggc
attagacccaatctcaaaaatgaagaaaacagttccgagcgtattacggaacggagtaca
cctcggagtacgggctccagacgagggaacgcaccggcgttgccacgcccaccttccccg
tccagccggaagtggcgcggacgcctgctcaacccgcggagcaccttcgtgttgagtaac
ctggcggaggtggtggagcgtgtgctcaccttcctgcccgccaaggcgttgctgcgggtg
gcctgcgtgtgccgcttatggagggagtgtgtgcgcagagtattgcggacccatcggagc
gtaacctggatctccgcaggcctggcggaggccggccacctggaggggcattgcttggtt
cgcgtggtagcagaggagcttgaggcaagtagcaaggggtgtcatgcacagtcatttgca
gactcggttcagattttgccagttgtcccaaatgttcgcatcttaccacatacagttctt
tacatggctgattcagaaactttcattagtctggaagagtgtcgtggccataagagagtg
actccaatgggatcaggtagcaatcgacctcaggaaatagaaattggagaatctggtttt
gctttattattccctcaaattgaaggaataaaaatacaaccctttcattttattaaggat
ccaaagaatttaacattagaaagacatcaactcactgaagtaggtcttttagataaccct
gaacttcgtgtggtccttgtctttggttataattgctgtaaggtgggagccagtaattat
ctgcagcaagtagtcagcactttcagtgatatgaatatcatcttggctggaggccaggtg
gacaacctgtcatcactgacttctgaaaagaaccctctggatattgatgcctcgggtgtg
gttggactgtcatttagtggacaccgaatccagagtgccactgtgctcctcaacgaggac
gtcagtgatgagaagactgctgaggctgcgatgcagcgcctcaaagcggccaacattcca
gagcataacaccattggcttcatgtttgcatgcgttggcaggggctttcagtattacaga
gccaaggggaatgttgaggctgatgcatttagaaagttttttcctagtgttcccttattc
ggcttctttggaaatggagaaattggatgtgatcggatagtcactgggaactttatattg
aggaaatgtaatgaggacagtagaacacttgcaagaaatagaaaaacccatcctgaattt
catgtggaatctgaagggatcccaaatagccaagtcagtcttgagaaagaagaacaaatt
ggaggacttaaacctgcagagtttaaaacttactataatgctacagtaatcaaaacaatg
tga

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_2|163_aa
MGKDFVTKTPKAMATKAKIDKWDLIKLKSFCTAKETIIRVNRQPTECEKIFAIYPSDKGL
ISRIHKELKQIYKKKTNNPIKKCVENYTGARCEEVFLPGSSIQTKSNLFEAFVALAVLVT
LIIGAFYFLCRKGHFQRASSVQYDINLVETSSTSAHHKCSVTW

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_2|492_bp
atgggcaaagacttcgtgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg
aacaggcaacctacagaatgtgagaaaatttttgcaatctatccatctgacaaagggcta
atatccagaatccataaggaacttaaacaaatttacaagaaaaaaaccaacaaccccatc
aaaaagtgcgttgaaaactatacaggagctcgttgtgaagaggtttttctcccaggctcc
agcatccaaactaaaagtaacctgtttgaagcttttgtggcattggcggtcctagtaaca
cttatcattggagccttctacttcctttgcaggaaaggccactttcagagagccagttca
gtccagtatgatatcaacctggtagagacgagcagtaccagtgcccaccacaaatgttca
gtgacatggtaa

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_3|130_aa
MRKNQCKKAENSKSQNASSPPKDHNSLPAREQNWMENEFDKLTEVGFRRWVITNSFKLNE
HVLTQYKEAKNLEKRLEELLTGRTSVEKNISDLMELKNIAREFRKAYTGINRQIDQVEER
ISDVEDQLMK

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_3|393_bp
atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaagccagaacgcctcttctcct
ccaaaggatcacaactccctgccagcaagggaacaaaactggatggagaatgagtttgac
aaattgacagaagtaggcttcagaaggtgggtaataacaaactccttcaagctaaacgag
catgttctaacccaatacaaggaagctaagaaccttgaaaaaaggttagaggaactgcta
actggaagaaccagtgtagagaagaacataagtgacctgatggagctgaaaaacatagca
cgagaatttcgtaaagcatacacgggtatcaatagacaaatcgatcaagtggaagaaagg
atatcagatgttgaagatcaattgatgaaataa

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_4|243_aa
MGKDFVTKTPKAMATKAKTDKWDLIKPKSFCTAKETLVRVNRQPTEWEKIFAIYSSEKGL
ISRIYKELKQIYKKKTNNPIKNLHENENDNNEDLYDDLLPLNEYDLGGHVLQVSGPRDGE
GLPDWHWALDSSTVDNRQYGTVIPERRRTSLEIPLIFRAPFLETRWLKHQKLLRFLEDGK
SKVKVLADSVGGEGPLSGLQMDIFVLHTYMAERGRAEASSELVIEIYFFPGDRKAFIAHF
LLL

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_4|732_bp
atgggcaaagacttcgtgactaaaacaccaaaagcaatggcaacaaaagccaaaactgac
aaatgggatctaattaaaccaaagagcttctgcacagcaaaagaaactctcgtcagagta
aacaggcaacctacagaatgggagaaaatttttgcaatctattcatctgagaaagggcta
atatccagaatctacaaggaacttaaacaaatttacaagaaaaaaaccaacaaccccatc
aaaaatctacacgaaaatgaaaatgacaacaatgaagacctttatgatgatctacttcca
cttaatgaatatgaccttggaggccatgttctccaggtgtcagggccaagagatggagaa
gggctgcctgactggcattgggcattagacagtagtacagttgacaacaggcagtacggg
acagtgatccctgagagaaggagaacaagcttagagatccctctaatcttccgggctccc
ttcctggagacaagatggcttaaacatcagaaacttctccgatttctggaggatgggaag
tccaaggtcaaggtgctagctgattcagttggtggtgagggccctctttctggtttgcag
atggacatcttcgtgctgcatacttacatggcagagagagggagagcggaagcctcatct
gagttagtgattgaaatatacttctttcctggggaccgcaaagctttcatagctcacttc
ctgctgttgtga

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_5|233_aa
MCHKLQSSNIPLSVVNFNGLISTRTGQEARGMKVSGKIPALPSTHPANLDRSPHPPPRPQ
DSGMATNSIKAEEIVVNHKREPVSSLFCNLEEKFSSKDRSGSHSSPAREQNWMENEFDKS
TELGFRRWVITNSSKLKEDILTQCKEAKNHEKRLEELLTGISSLEKNISDLIELKNIARE
LQQAYTSINSQIDQVEERISEIEDQLNEIKCEDKIREKRLKRNKRASKKYGTM

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_5|702_bp
atgtgtcacaaactacagtcctcaaacatcccattgagtgtggtcaatttcaacggcctc
atctccacaagaactggacaagaggccagggggatgaaggtcagcggcaagatcccagcc
cttccgagcacgcatcctgcaaaccttgaccggtccccccaccccccgccccgaccacaa
gactcgggcatggccacaaattctattaaagcagaagaaatagtggtgaaccataaaaga
gaaccagtttcctctctattctgcaatttagaggaaaaattttcatccaaggacagatca
ggatcacactcctcgccagcaagggaacaaaactggatggagaatgagtttgacaaatcg
acagaattaggcttcagaaggtgggtaataacaaactcctccaagctaaaggaggatatt
ctaactcaatgcaaggaagctaagaaccatgaaaaaaggttagaggaattactaactgga
ataagcagtttagagaagaacataagtgacctgattgagctgaaaaacatagcacgagaa
cttcaacaagcatacacaagtatcaatagccaaatcgatcaagtggaagaaaggatatca
gagattgaagatcaacttaatgaaataaagtgtgaagacaagattagagaaaaaagattg
aaaagaaacaaacgagcctccaagaaatatgggactatgtga

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_6|144_aa
MQTTTIREYYKYLYANKLENLEEMDKFLDTYTLPSLNQEEVESLNTPITSSEIEAVINSL
PTKKSPGPDGFTAEFYQRYKEELVPFLLKLFRSIEKEGILPNSFYEASIILIPKPGRDTI
KKENFRPISLMTIDAKILNKILAN

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_6|435_bp
atgcaaactaccaccatcagagaatactataaatacctctacgcaaataaactagaaaat
ctagaagaaatggataaattcctggacacatacaccctcccaagtctaaaccaggaagaa
gtcgaatccctgaatacaccaataacaagttctgaaattgaggcagtaattaatagccta
ccaaccaagaaaagcccaggaccagatggattcacagccgaattctaccagaggtacaag
gaggagctggtaccattccttctgaaactattccgatcaatagaaaaagagggaatcctc
cctaactcattttatgaggccagcatcatcctgataccaaaacctggcagagacacaata
aaaaaagaaaatttcaggccaatatccctgatgacaatcgatgcgaaaatcctcaataaa
atactggcaaactga

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_7|132_aa
MEPSKVRSTSLKFPLPAQQLSEMDPGHSSLNSVEKNINDLMELKNIPRELHEAYTSFNSQ
INQVEERIPVIEDQLNEIKREDKIREKRIKRNKQSLQEIWDYVKRPNLRLIGVSESDREN
GTKLENTLQDII

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_7|399_bp
atggagcccagcaaagtaagatccactagtttgaaattcccactgccagcacagcagctg
tctgagatggacccgggacactctagcttgaacagtgtagagaagaacataaatgacctg
atggagctgaaaaatataccacgagaacttcatgaagcatacacaagtttcaatagccaa
atcaatcaagtggaagaaaggataccagtgattgaagatcaacttaatgaaataaagaga
gaagacaagattagagaaaaaagaataaaaaggaataaacaaagcctccaagaaatatgg
gactatgtgaaaagaccaaatctacgtttgattggtgtatctgaaagtgacagggagaat
ggtaccaagttggaaaacactcttcaggatattatctag

>gi568815583r:75855930_76109295|GENSCAN_predicted_peptide_8|257_aa
MGGGKRGASHLTWVSAMWRTGKGGSRETGVYGSNRPRDSGGLDQDERGGEQWYNLGGIMK
VKLTYSLTDRMWGMKGRSQPGRFPDAQAPSLAGSGFSQTSPLCSSSLCTVSVFPSASHPP
APAKNKYSSGTCHVHGMVGESKIRTEGAHSLEGSTNLREGHTNLQPVAACHAWMPGIGAW
LLRHPRTVGSKAVSYSPVYLPLVPPERPHELWKAHYLGPGHLRRSLYQMFLGSKAQWNEP
QVGTRKTSAGSRPEVQX

>gi568815583r:75855930_76109295|GENSCAN_predicted_CDS_8|771_bp
atgggaggtggtaaacggggagcaagtcatctgacttgggtttctgctatgtggaggaca
ggcaaagggggaagcagggagacaggagtctatggcagtaatcggccgagagatagtggg
ggtttggaccaggatgagagaggtggtgagcagtggtacaatttgggaggtattatgaag
gtaaaactgacatactcgctaacagatcggatgtggggcatgaaagggaggagtcagcca
ggcagattcccagatgcacaggcccccagtttagcgggatcaggcttttcacaaacaagc
cctctgtgttcatcctccctctgcactgtttctgtgttcccctctgcctcccatccccca
gcccctgcaaagaataagtactcatcaggtacctgccacgtgcacggcatggtgggagaa
tcaaagatccgcactgaaggagctcacagtctggagggcagcacgaatctccgagagggt
cacactaacctgcagccagtggctgcctgccatgcctggatgcctggcattggagcctgg
ttgctgaggcaccctcggactgttggaagcaaggctgtatcttactctcctgtgtatctc
cctttggtcccacccgagaggccacatgagctgtggaaagcacattacctgggaccgggt
cacctgagaagatctttgtatcagatgttcctgggatcaaaggcacagtggaatgagccc
caagttggaaccagaaagacctcagcaggtagcagacctgaagtgcaagnn