GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:18:54 Sequence gi568815582r:66517280_66796525 : 279246 bp : 45.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 598 510 89 0 2 63 66 98 0.040 4.71 1.05 Intr - 11788 11715 74 1 2 131 79 18 0.743 3.40 1.04 Intr - 14190 14101 90 0 0 92 96 54 0.888 6.79 1.03 Intr - 19738 19685 54 1 0 120 105 48 0.991 8.88 1.02 Intr - 24674 24600 75 2 0 67 77 87 0.922 5.21 1.01 Init - 25215 25213 3 0 0 70 66 0 0.035 -4.00 1.00 Prom - 25528 25489 40 -8.26 2.00 Prom + 26729 26768 40 -0.46 2.01 Init + 35437 35514 78 0 0 88 81 102 0.957 10.56 2.02 Intr + 40911 41069 159 2 0 97 89 15 0.806 2.68 2.03 Intr + 45843 45938 96 2 0 92 98 103 0.958 11.91 2.04 Term + 45972 46070 99 2 0 67 40 7 0.201 -8.07 2.05 PlyA + 46152 46157 6 1.05 3.00 Prom + 47654 47693 40 -5.16 3.01 Init + 49235 49681 447 1 0 105 85 284 0.603 25.77 3.02 Intr + 54507 54599 93 2 0 13 116 46 0.319 0.06 3.03 Intr + 59825 59923 99 1 0 105 98 39 0.920 6.91 3.04 Intr + 61813 62046 234 0 0 71 59 126 0.320 5.89 3.05 Intr + 62341 62613 273 0 0 -33 85 281 0.315 13.43 3.06 Intr + 62747 62905 159 1 0 97 84 71 0.993 7.78 3.07 Intr + 69718 69819 102 0 0 101 89 119 0.999 13.67 3.08 Term + 70640 70840 201 1 0 76 38 146 0.897 5.79 3.09 PlyA + 73974 73979 6 1.05 4.00 Prom + 81559 81598 40 -1.96 4.01 Init + 87527 87673 147 1 0 95 109 141 0.572 16.99 4.02 Intr + 91030 91185 156 0 0 128 77 135 0.996 16.61 4.03 Intr + 92156 92251 96 1 0 93 72 210 0.983 20.11 4.04 Intr + 92604 92724 121 2 1 115 121 196 0.998 25.67 4.05 Intr + 95614 95691 78 2 0 125 35 50 0.481 2.92 4.06 Term + 100748 100857 110 0 2 116 49 59 0.344 3.47 4.07 PlyA + 101981 101986 6 1.05 5.07 PlyA - 103327 103322 6 1.05 5.06 Term - 104943 104779 165 0 0 69 55 273 0.672 20.02 5.05 Intr - 106223 106125 99 2 0 102 91 20 0.935 4.01 5.04 Intr - 119302 119126 177 1 0 81 79 132 0.813 11.72 5.03 Intr - 158485 158462 24 1 0 85 103 53 0.383 4.72 5.02 Intr - 178960 178844 117 1 0 67 69 55 0.482 2.16 5.01 Init - 179246 179061 186 2 0 91 66 382 0.936 35.36 5.00 Prom - 190384 190345 40 -4.16 6.17 PlyA - 190660 190655 6 1.05 6.16 Term - 206543 206443 101 0 2 125 46 76 0.987 5.39 6.15 Intr - 208665 208549 117 1 0 97 92 62 0.990 7.94 6.14 Intr - 211000 210922 79 1 1 60 65 22 0.489 -3.78 6.13 Intr - 211820 211761 60 2 0 77 110 107 0.879 10.73 6.12 Intr - 212944 212833 112 0 1 90 61 76 0.489 5.48 6.11 Intr - 217032 216953 80 0 2 90 8 66 0.549 -2.85 6.10 Intr - 218965 218796 170 2 2 105 93 111 0.980 12.97 6.09 Intr - 225389 225159 231 0 0 81 65 211 0.924 15.84 6.08 Intr - 232034 231918 117 0 0 77 101 114 0.991 12.04 6.07 Intr - 234067 233994 74 0 2 129 97 75 0.613 11.55 6.06 Intr - 234320 234206 115 0 1 35 77 185 0.001 11.61 6.05 Intr - 242011 241862 150 2 0 105 25 55 0.152 1.03 6.04 Intr - 257964 257839 126 1 0 75 83 65 0.943 5.35 6.03 Intr - 260055 259924 132 1 0 58 103 74 0.961 6.52 6.02 Intr - 261736 261584 153 2 0 50 78 92 0.881 4.44 6.01 Init - 264601 264595 7 1 1 45 121 0 0.448 0.09 6.00 Prom - 273967 273928 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 31730 31699 32 0 2 106 103 22 0.831 3.35 S.002 Term + 189320 189439 120 1 0 102 46 77 0.871 3.37 S.003 Init - 234312 234206 107 0 2 110 77 167 0.998 17.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:66517280_66796525|GENSCAN_predicted_peptide_1|129_aa MICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQT YVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRN MDVSVDLIX >gi568815582r:66517280_66796525|GENSCAN_predicted_CDS_1|387_bp atgatctgtgtcgagggcaatattgcaagtgggaagacgacatgcctggaattcttctcc aacgcgacagacgtcgaggtgttaacggagcctgtgtccaagtggagaaatgtccgtggc cacaatcctctgggcctgatgtaccacgatgcctctcgctggggtcttacgctacagact tatgtgcagctcaccatgctggacaggcatactcgtcctcaggtgtcatctgtacggttg atggagaggtcgattcacagcgcaagatacatttttgtagaaaacctgtatagaagtggg aagatgccagaagtggactatgtagttctgtcggaatggtttgactggatcttgaggaac atggacgtgtctgttgatttgatagnn >gi568815582r:66517280_66796525|GENSCAN_predicted_peptide_2|143_aa MDNVQPKIKHRPFCFSVKGHVKMLRLALTVTSMTFFIIAQAPEPYIVITGFEVTVILFFI LLYVLRLDRLMKWLFWPLLDIINSLVTTVFMLIVSVLALIPETTTLTVGGGVLSECKFTW KTDVRIESYTIFILEIPRPIYNA >gi568815582r:66517280_66796525|GENSCAN_predicted_CDS_2|432_bp atggataacgtgcagccgaaaataaaacatcgccccttctgcttcagtgtgaaaggccac gtgaagatgctgcggctggcactaactgtgacatctatgaccttttttatcatcgcacaa gcccctgaaccatatattgttatcactggatttgaagtcaccgttatcttatttttcata cttttatatgtactcagacttgatcgattaatgaagtggttattttggcctttgcttgat attatcaactcactggtaacaacagtattcatgctcatcgtatctgtgttggcactgata ccagaaaccacaacattgacagttggtggaggggttttatcagaatgcaaatttacttgg aaaacagatgtaagaatagaaagctatactatattcatccttgagattcctaggccaata tacaatgcctag >gi568815582r:66517280_66796525|GENSCAN_predicted_peptide_3|535_aa MDPEHAKPESSEAPSGNLKQPETAAALASSGSVVSSVPKAQRNISAKTAPRKHPAVSIRS AQSAAAARPQGSEGTAPSRKATTRPPPKPTLPPPTPSAHTESKLLNEMAIKERVEGRAKV PYKFRDSLKRFSFSPTGMLKILRLVSGELAGSTHKAGEGQETSYFAVGFSGNAQEVSQTK DLTNSIITAVFLSVVAILAMQEKKRRHLLYVGGPKCDHKIRAAAFIENTCFLSLPCVKVA LYFPALGAAPAELRAVTRCSQPPDPAAERRLTDLGPPGDAGMGPRGQLRAEAAKGAKPEP APAPPPPGAKPEEDKKDGKEPSDKPQKAVQDHKEPSDKPQKAVQPKHEVGTRRGCRRYRW ELKDSNKEFWLLGHAEIKIRSLGCLIAAMILLSSLTVHPILRLIITMEISFFSFFILLYS FAIHRYIPFILWPISDLFNDLIACAFLVGAVVFAVRSRRSMNLHYLLAVILIGAAGVFAF IDVCLQRNHFRGKKAKKHMLVPPPGKEKGPQQGKGPEPAKPPEPGKPPGPAKGKK >gi568815582r:66517280_66796525|GENSCAN_predicted_CDS_3|1608_bp atggatcctgaacacgccaaacctgagtcatccgaggcaccttcagggaacttgaaacaa ccggagactgccgcagccctggcaagtagcggcagcgtagtgagttctgtacccaaggca cagcgcaacatctcagcgaagaccgcaccccggaagcaccccgcagtctcaattcgcagt gcgcagtccgcagccgccgcacgtccccaaggcagtgagggcaccgcaccctcaaggaaa gccaccacacgcccacccccaaagcccacactcccaccccccacgccctctgcacacact gaatccaaactcttaaatgagatggcgatcaaagagcgcgtggagggccgagccaaagtc ccgtacaaattcagggacagcctcaaacgtttctccttctcgcccactggaatgttgaag atcctgagactggtgagcggagagctggctgggtccacacataaagctggagagggccag gagacatcatactttgctgtagggttcagtggtaatgcccaagaagtatcccagacaaag gatcttaccaacagtatcattacagctgtgttcctttcagtagttgccatcttggccatg caagaaaagaaaagaaggcatttactctatgtcggggggcctaaatgtgaccataaaatt agggctgctgcttttatcgagaacacctgcttcctctcgttgccttgtgtgaaagtcgcg ttgtattttcctgcgcttggcgctgcgcccgcggagctcagggccgtgacccggtgctcg cagccccccgaccccgcagcggagcggcggcttacagacctggggccacctggagacgcc gggatgggaccgcgagggcagctccgcgcggaggcggcaaagggggccaagccagagcca gcaccagctccacctccacccggggccaaacccgaggaagacaagaaggacggtaaggag ccatcggacaaacctcaaaaggcggtgcaggaccataaggagccatcggacaaacctcaa aaggcggtgcagcccaagcacgaagtgggcacgaggagggggtgtcgccgctaccggtgg gaattaaaagacagcaataaagagttctggctcttggggcacgctgagatcaagattcgg agtttgggctgcctaatagctgcaatgatactgttgtcctcactcaccgtgcaccccatc ttgaggcttatcatcaccatggagatatccttcttcagcttcttcatcttactgtacagc tttgccattcatagatacatacccttcatcctgtggcccatttctgacctcttcaacgac ctgattgcttgtgcgttccttgtgggagccgtggtctttgctgtgagaagtcggcgatcc atgaatctccactacttacttgctgtgatccttattggtgcggctggagtttttgctttt atcgatgtgtgtcttcaaagaaaccacttcagaggcaagaaggccaaaaagcatatgctg gttcctcctccaggaaaggaaaaaggaccccagcagggcaagggaccagaacccgccaag ccaccagaacctggcaagccaccagggccagcaaagggaaagaaatga >gi568815582r:66517280_66796525|GENSCAN_predicted_peptide_4|235_aa MWPPDPDPDPDPEPAGGSRPGPAVPGLRALLPARAFLCSLKGRLLLAESGLSFITFICYV ASSASAFLTAPLLEFLLALYFLFADAMQLNDKWQGLCWPMMDFLRCVTAALIYFAISITA IAKYSDGASKAAGVFGFFATIVFATDFYLIFNDVAKFLKQGDSADETTAHKTEGLQPFNV SAKNQHKETKQSLVCIWAADLAVCGPHQPTKLLGLDVGADKIRQTWKKPWILQLH >gi568815582r:66517280_66796525|GENSCAN_predicted_CDS_4|708_bp atgtggcccccagaccccgaccccgacccggaccccgagcctgccggcggctcccgtccc ggccccgcggtccccgggctccgcgccctgctgccggcgcgggctttcctctgctctctc aaaggccgcctcctgctggccgagtcgggtctctcattcatcacttttatctgctatgtg gcgtcctcagcatctgccttcctcacagcgcctctgctggagttcctgctggccttgtac ttcctctttgctgatgccatgcagctgaatgacaagtggcagggcttgtgctggcccatg atggacttcctgcgctgtgtcaccgcggccctcatctactttgctatctccatcacggcc atcgccaagtactcggatggggcttccaaagccgctggggtgtttggcttctttgctacc atcgtgtttgcaactgatttctacctgatctttaacgacgtggccaaattcctcaaacaa ggggactctgcagatgagaccacagcccacaagacagaaggccttcagccctttaacgtc tctgccaaaaaccagcacaaggagacaaagcagagccttgtctgtatctgggcagcagac ctggccgtgtgtggacctcaccagcctaccaagctgttgggcctggacgtgggtgctgat aaaataagacaaacctggaaaaaaccctggattctgcaacttcactag >gi568815582r:66517280_66796525|GENSCAN_predicted_peptide_5|255_aa MRSGEELDGFEGEASSTSMISGASSPYQPTTEPVSQRRGLAGLRCDPDYLRGALGRLKVA QVAGARSSRPLSLVGSRDHSAAFSGDPAFGQSLPFPGRQPQCVALVEVVILALIAFICIE TIMACSPCEGLYFFEFVSCSAFVVTGVLLIMFSLNLHMRIPQINWNLTDLVNTGLSAFLF FIASIVLAALNHRAGAEIAAVIFGFLATAAYAVNTFLAVQKWRVSVRQQSTNDYIRARTE SRDVDSRPEIQRLDT >gi568815582r:66517280_66796525|GENSCAN_predicted_CDS_5|768_bp atgcggagcggcgaggagctggacggcttcgagggcgaggcctcgagcacctccatgatc tcgggcgccagcagcccgtaccagcccaccaccgagccggtgagccagcgccgcgggctg gccggcctgcgctgcgaccccgactacctgcgcggcgcgctcggccgcctcaaggtcgcc caagtggccggagcccgctcctcgcgccccctctccctcgtggggtcccgggaccactct gcagccttctctggggaccccgccttcggccagagcctgcccttcccaggccggcagcct cagtgtgtggcgctggtggaggtggtgatcttggccctgattgcattcatctgcatagag accatcatggcatgctccccgtgtgaaggcctctacttttttgagtttgtgagctgcagt gcgtttgtggtgactggcgtcttgctgattatgttcagtctcaacctgcacatgaggatc ccccagatcaactggaatctgacagatttggtcaacactggactcagcgctttccttttc tttattgcttcaatcgtactggctgctttaaaccatagagccggagcagaaattgctgcc gtgatatttggcttcttggcgactgcggcatatgcagtgaacacattcctggcagtgcag aaatggagagtcagcgtccgccagcagagcaccaatgactacatccgagcccgcacggag tccagggatgtggacagtcgccctgagatccagcgcctggacacgtga >gi568815582r:66517280_66796525|GENSCAN_predicted_peptide_6|607_aa MKAYVQKYFVSVGGLDVLSQVLMQLESDSHETLSSAKLAVVVTKTVDACIADNPTFGIVL SKYHIVSKLLALLLHESLDSGEKFSIMLTLGHCTEDCEENQYDLFKNNGLPLMIQALTES QNEELNKAATFVLHNCKKISCIAVEKSLNSRNFSKLLHSCPYQCDRHKVIVEAEDRYKSE LRKSLICNKIGKMAPVGVEKKLLLGPNGPAVAAAGDLTSEEEEGQSLWSSILSEVSTRAR SKLPSGKNILVFGEDGSGKTTLMTKLQGAEHGKKGRGLEYLYLSVHDEDRDDHTRCNVWI LDGDLYHKGLLKFAVSAESLPETLVIFVADMSRPWTVMESLQKWASVLREHIDKMKIPPE KMRELERKFVKDFQDYMEPEEGCQGSPQRRGPLTSGSDEENVALPLGDNVLTHNLGIPVL VVCTKCDAVSVLEKEHDYRDEHLDFIQSHLRRPAGWDNEKKIAILHENFTTVKPEDAYED FIVKPPVRKLVHDKELAAEDEQVFLMKQQALVNSQFVPLFAVTPCQATSHSHESFYNAAS EGVLASFFNSLLSKKTGSPGSPGAGGVQSTAKKSGQKTVLSNVQEELDRMTRKPDSMVTN SSTENEA >gi568815582r:66517280_66796525|GENSCAN_predicted_CDS_6|1824_bp atgaaagcatatgttcagaaatacttcgtatctgtgggcggactggatgtattgtctcaa gttctcatgcagctggaatctgattcacatgagactctttccagtgctaaacttgcagtg gttgtgacgaagactgtggatgcatgcattgctgataatcctacttttgggatagtactc tccaagtaccacattgtttctaaacttctggcattactgcttcatgaaagtctggattca ggagaaaaatttagcatcatgcttactcttggtcattgcacagaggattgtgaggaaaat cagtatgacctttttaaaaacaatgggcttccactcatgattcaagccttaactgaatcg cagaatgaggaactgaacaaagctgccacatttgtgcttcacaactgcaaaaaaattagt tgcatagcagtagaaaaatccctgaatagccgaaactttagcaagctcttgcactcttgc ccataccaatgtgatcgtcacaaagtcattgtggaagctgaagacagatataaaagtgaa ctaaggaaatcacttatctgtaacaaaattggcaagatggcgccggtgggggtggagaag aagctgctgctaggtcccaacgggcccgcggtggcggccgccggcgacctgaccagtgag gaggaggaaggccagagcctatggtcctccattctgagcgaagtgtccacccgcgccagg tccaagctgccgtccggcaagaacatcctggtcttcggtgaagatggttctggtaaaaca accctcatgactaaactacaaggagctgagcatggcaaaaaaggaagaggcctagaatat ctctacctcagtgtccatgatgaggaccgagatgatcacacgcgctgcaacgtgtggatt ctggatggagacttgtaccacaaaggcctgctgaaatttgcagtttctgctgaatccttg ccagagaccctcgtcatttttgttgcagacatgtctagaccttggactgtgatggaatct ctgcagaaatgggctagtgttttacgtgagcacattgataaaatgaaaattccaccagaa aaaatgagggagctggaacggaagtttgtgaaagattttcaagactatatggaacctgaa gaaggttgtcaaggttccccacagagaagaggccctctgacctcaggctccgatgaagaa aatgttgccctgcctctgggtgacaatgtgctgactcataacctggggatcccggtgttg gtggtgtgcacaaagtgtgatgcggtgagtgtcctggagaaggagcacgattacagggat gagcatttggactttatccagtcacacctgcggagacctgcaggctgggacaatgaaaag aaaatagctattttacatgaaaattttacaaccgtgaagccggaagatgcatatgaagac tttattgtgaaacctcccgtgagaaagctggtccacgacaaagagttggcagcagaagat gagcaggtgttcctaatgaagcaacaggccttggttaatagccagtttgtccctctcttt gcagtcactccttgccaagcaaccagccactcccacgagagcttctataatgcagcaagt gaaggggtgttggccagcttcttcaacagtctgttgagtaaaaagacaggctctcctgga agtcctggtgctggtggggtgcagagcacagccaagaagtcaggacaaaagactgtgttg tcaaatgttcaggaagaactggatagaatgactcgaaagccagactctatggtaacaaac tcttcaacagaaaatgaagcctga