GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:25:17 Sequence gi568815591r:74943035_75173645 : 230611 bp : 49.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10056 10140 85 2 1 79 77 47 0.788 3.68 1.02 Intr + 21464 22064 601 0 1 75 115 301 0.039 23.07 1.03 Intr + 64960 65030 71 1 2 88 97 76 0.785 7.33 1.04 Intr + 71155 71268 114 2 0 50 34 93 0.377 0.52 1.05 Intr + 72005 72094 90 0 0 71 91 24 0.416 0.97 1.06 Intr + 74564 74757 194 0 2 107 93 340 0.996 35.61 1.07 Intr + 74956 75088 133 0 1 87 80 289 0.999 28.22 1.08 Intr + 75938 76061 124 0 1 102 45 125 0.996 9.24 1.09 Intr + 77005 77115 111 1 0 70 83 312 0.999 28.39 1.10 Intr + 78840 78922 83 0 2 70 97 54 0.964 3.78 1.11 Intr + 81406 81500 95 2 2 63 70 147 0.917 10.08 1.12 Intr + 81600 81641 42 2 0 86 82 94 0.239 7.04 1.13 Term + 89606 89644 39 1 0 104 42 47 0.083 -1.11 1.14 PlyA + 90832 90837 6 1.05 2.12 PlyA - 91686 91681 6 1.05 2.11 Term - 100075 99998 78 1 0 87 40 95 0.188 2.36 2.10 Intr - 109762 109677 86 2 2 126 86 50 0.837 8.14 2.09 Intr - 113040 112867 174 1 0 83 100 147 0.993 15.31 2.08 Intr - 114582 114495 88 0 1 95 93 -43 0.918 -3.56 2.07 Intr - 115735 115554 182 2 2 57 85 211 0.954 17.29 2.06 Intr - 118257 118173 85 0 1 102 80 3 0.887 0.29 2.05 Intr - 120309 120258 52 2 1 52 110 67 0.921 4.21 2.04 Intr - 121614 121548 67 1 1 43 109 41 0.897 -0.34 2.03 Intr - 123758 123630 129 0 0 96 105 58 0.956 8.97 2.02 Intr - 127735 127606 130 1 1 61 61 100 0.835 4.87 2.01 Init - 130703 130380 324 2 0 81 56 512 0.746 42.63 2.00 Prom - 134135 134096 40 -6.56 3.00 Prom + 136223 136262 40 -3.46 3.01 Init + 151515 151995 481 2 1 87 110 331 0.705 30.72 3.02 Intr + 165926 166029 104 0 2 95 81 70 0.530 6.89 3.03 Intr + 169363 169501 139 0 1 91 43 156 0.898 11.44 3.04 Intr + 180102 180285 184 1 1 73 29 120 0.544 3.45 3.05 Intr + 180454 180482 29 1 2 87 106 -8 0.374 -1.34 3.06 Intr + 182882 182936 55 0 1 103 93 16 0.583 1.64 3.07 Intr + 183308 183351 44 2 2 128 68 45 0.973 4.38 3.08 Intr + 191967 192023 57 1 0 98 72 44 0.859 2.66 3.09 Intr + 193751 193816 66 0 0 64 96 73 0.866 4.58 3.10 Intr + 195916 195993 78 2 0 76 80 73 0.962 4.82 3.11 Intr + 198069 198152 84 1 0 72 111 43 0.965 4.79 3.12 Intr + 199448 199631 184 0 1 108 111 147 0.988 17.85 3.13 Intr + 200916 200959 44 0 2 108 94 -12 0.345 -0.72 3.14 Term + 204660 206263 1604 1 2 38 45 1615 0.825 142.49 3.15 PlyA + 207884 207889 6 1.05 4.12 PlyA - 208320 208315 6 1.05 4.11 Term - 213881 213760 122 0 2 119 54 259 0.999 24.24 4.10 Intr - 214361 214216 146 1 2 77 109 239 0.837 24.83 4.09 Intr - 214936 214832 105 0 0 89 115 90 0.993 11.13 4.08 Intr - 217742 217625 118 0 1 71 91 202 0.939 18.32 4.07 Intr - 219399 219292 108 1 0 90 49 92 0.980 5.76 4.06 Intr - 219985 219863 123 2 0 123 41 327 0.998 32.06 4.05 Intr - 222145 222090 56 0 2 99 94 32 0.981 3.52 4.04 Intr - 223668 223503 166 1 1 78 81 305 0.990 27.82 4.03 Intr - 223843 223768 76 1 1 68 99 18 0.980 -0.01 4.02 Intr - 225804 225598 207 0 0 78 84 226 0.882 20.47 4.01 Init - 228909 228838 72 0 0 88 73 146 0.813 14.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 10959 11005 47 1 2 100 54 47 0.862 -0.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:74943035_75173645|GENSCAN_predicted_peptide_1|593_aa MVPASAASEDRRKLPIIVEDEGGPTRSRVLRVPALLHFPCTGLPVAPAAQPPRQQQQQQH RGSPPPGGLQVPAGRARAAHAQRGPAFEWQVVCYSEHQLLPAPGAARRCSAARPLGCCSA GAASLAPRLPLATWRASSLLSGPARPPPPPAPGAELPGLRAGCRCRQGAEGAGAAARPPA DEPPRTGRGRTMELHILEHRLQVASVAKESIPLFTYGLIKLAFLSSKTRCKFFSLTETPE DYTIIVDEEGFLVPLSVNWGQRYIIVVHATHGKAWSGGSDRSKGGEVAGQATHWLTTGPS QPAGPMGSLGCGHHGDRSWAELPSSEHLSVADATWLALNVVSGGGSFSSSQPIGVTKIAK SVIAPLADQNISVFMLSTYQTDFILVRERDLPFVTHTLSSEFTILRVVNGETVAAENLGI TNGFVKPKLVQRPVIHPLSSPSNRFCVTSLDPDTLPAVATLLMDVMFYSNGVKDPMATGD DCGHIRFFSFSLIEGYISLVMDVQTQQRFPSNLLFTSASGELWKMVRIGGQPLGFDECGI VAQISEPLAAADIPAYYISTFKFDHALVPEENINGVISALKEENLETGSENGS >gi568815591r:74943035_75173645|GENSCAN_predicted_CDS_1|1782_bp atggtgccagcatctgctgctagtgaggaccgcaggaagcttcctatcatagtggaagat gaaggggggccgacaagatcacgtgtcctgagggtgcccgcgctcctgcatttcccgtgc accgggctgccggtagctccggccgcccagcccccgcggcagcaacagcagcaacagcac cgggggagcccccccccaggcggactacaagtcccggcaggccgcgcgcgggccgcgcat gcgcagcggggaccggcgtttgagtggcaagttgtttgttacagcgaacaccagctgctc cccgcgccgggcgccgcgcgccgctgctccgccgctcggcccctcggctgctgctccgcc ggcgctgcctccctcgccccgcggctcccccttgcaacttggcgggcctcctcccttttg tccggcccggcccggccgccgccgccccccgcgcccggcgccgagctcccgggtctccgg gccggctgtcggtgccggcagggcgcggagggggcgggggccgcggctcgtccccccgcg gatgagccgccgcggacggggcgcgggcggacgatggaactccacatcctggagcaccgg ctgcaagttgccagcgtcgccaaggagagtatcccgctgttcacctacggcctgatcaaa cttgccttcctgtcctccaagaccaggtgcaagttcttcagtctgactgagacgccagag gattacactatcattgtcgatgaggaaggattcctagtccccttgtctgtgaactgggga cagcggtacatcattgtggtccatgccacgcatggcaaggcatggtcaggaggcagtgac cgatcaaaaggtggagaggtggccgggcaagccacccactggctgaccactggtccttct cagccagcgggaccaatgggctccttgggctgtggtcaccatggtgaccggagctgggca gagctgccctcctcggagcacctgagtgtggcagatgccacctggctggccctgaacgtg gtgtccggcggtggcagcttctccagctcccagcccatcggcgtgaccaagatcgccaag tcagtcatcgccccactggctgaccagaacatatccgtgttcatgctgtccacgtatcag acagacttcatcctggtgcgcgagcgggacctgccctttgtcacccacacattgtcatca gagttcaccatcctgcgggtcgtcaatggcgagaccgtggcagccgagaacctcggcatc accaatggcttcgtgaagcccaagctggtccagaggccagtcatccacccactgtccagc ccgagcaacaggttctgtgtcaccagcctggaccctgacacgctgcctgctgttgccaca ctcctcatggatgtcatgttctactccaatggagtgaaggaccccatggccactggggat gactgcggccacatccgcttcttctccttctccctcatcgagggctacatctccctggtg atggacgtgcagacgcagcagaggtttcctagtaacttgctgttcacaagcgcatccgga gagctctggaagatggtccggattggaggacagcccctggggtttgatgagtgtggcatc gtggcccagatctcagagcccttggctgctgcagacatcccagcctactacatcagtact ttcaagtttgatcatgcacttgtccccgaagagaacatcaatggtgtcatcagtgccctg aaggaagagaacctggagaccgggagtgaaaatggaagctga >gi568815591r:74943035_75173645|GENSCAN_predicted_peptide_2|464_aa MALVALVAGARLGRRLSGPGLGRGHWTAAGRSRSRREAAEAEAEVPVVQYVGERAARADR VFVWGFSFSGALGVPSFVVPSSGPGPRAGARPRRRIQPVPYRLELDQKISSAACGYGFTL LSSKTADVTKVWGMGLNKDSQLGFHRSRKDKTRGYEYVLEPSPVSLPLDRPQETRVLQVS CGRAHSLVLTDREGVFSMGNNSYGQCGRKVVENEIYSESHRVHRMQDFDGQVVQVACGQD HSLFLTDKGEVYSCGWGADGQTGLGHYNITSSPTKLGGDLAGVNVIQVATYGDCCLAVSA DGGLFGWGNSEYLQLASVTDSTQVNVPRCLHFSGVGKVRQAACGGTGCAVLNGEGHVFVW GYGILGKGPNLVESAVPEMIPPTLFGLTEFNPEIQVSRIRCGLSHFAALTNKGELFVWGK NIRGCLGIGRLEDQYFPWRVTMPGEPVDVACGVDHMVTLAKSFI >gi568815591r:74943035_75173645|GENSCAN_predicted_CDS_2|1395_bp atggcgctggtggcgttggtggctggggctcggctggggcggcggctgagcgggccgggg ctggggcgagggcactggacggcggccgggcgctcccggagccggcgcgaagcggcagaa gccgaggcggaggtgcccgtggtccagtacgtgggcgagcgcgctgcccgcgccgatcgc gtcttcgtgtggggcttcagcttctcgggggcgctgggcgtgccttcctttgtggtgccc agctccgggcccgggccccgcgccggcgcccgaccgcgccgcaggatccagcccgtgccc tatcgcctggagctggaccaaaagatttcatctgctgcttgcggctatggattcacactg ctgtcctctaagactgcggatgttacgaaagtctgggggatgggactcaacaaagattct cagcttggatttcacaggagccggaaagataaaacgaggggctacgagtatgtgttggag ccctcacccgtctccctgcctctggacagacctcaggagacacgggtgctgcaggtctcc tgcggccgagctcactctcttgtgttgactgacagggaaggagtcttcagcatgggaaac aattcttatgggcaatgtggaagaaaggtggtcgaaaatgaaatttacagtgaaagtcac agagtccacaggatgcaggacttcgatggccaggtggtccaggtcgcctgtggtcaggat catagtctgttcctgacggataaaggagaagtctattcttgtggatggggtgctgatggg caaacaggtctgggtcactacaatatcaccagctcgcccaccaagctgggtggagacctg gcgggagtgaacgttatccaagttgccacctacggtgattgctgcctggccgtgtccgcc gacggaggactttttggttggggaaactcggagtacctgcagctggcctctgtcactgac tccacacaggtgaatgtgccccgctgcttacacttctcaggagtggggaaggtgcgacag gctgcatgcggtggcacgggctgtgcagtgttaaacggagaaggacatgtttttgtctgg ggctatggaattcttgggaaaggtccaaacctagtggaaagtgccgtccctgaaatgatt ccacccactctctttggcttgacggagttcaacccagaaatccaggtttcccgcatccga tgtggactcagccactttgctgcactgaccaacaaaggagagctgtttgtatggggcaag aacatccgagggtgcctgggaatcggtcgcctggaggaccagtatttcccatggagggtg acgatgcctggggagcctgtggacgtggcatgtggcgtggaccacatggtgaccctggcc aagtcattcatctaa >gi568815591r:74943035_75173645|GENSCAN_predicted_peptide_3|1050_aa MAPKHKSSDAGNLDRPKRSRKVLPLSEKVKVLDLIRKDKKSYAEVAKIYGKNESSIREIV KKEKEIRASFAVSPPTAKVTATVRDKCLVKMEQALHLWVEEMNRKRVPIDSNMLRQKALS LYQDFCKGCSETDTKPFTASKGWLHRFRHRFSHHYKKKKKGIMAQVAVSTLPVEEESSSE TRMVVTFLVSALESMCKELAKSKAEVACIAVYETDVFVVGTERGCAFVNARTDFQKDFAK YCKALGTTVMVPVPYEKMLRDQSAVVVQGLPEGVAFQHPENYDLATLKWILENKAGISFI INRPFLGPESQLGGPGMVTDAERSIVSPSESCGPINVKTEPMEDSGSHPSSTSNEVIEME LPMEDSTPLVPSEEPNEDPEAEVKIEGNTNSSSVTNSAAGVEDLNIVQVTVPDNEKERLS SIEKIKQLREQVNDLFSRKFGEAIGVDFPVKVPYRKITFNPGCVVIDGMPPGVVFKAPGY LEISSMRRILEAAEFIKFTVIRPLPGLELSNGEYSTVGKRKIDQEGRVFQEKWERAYFFV EVQNIPTCLICKQSMSVSKEYNLRRHYQTNHSKHYDQYTERMRDEKLHELKKGLRKYLLG SSDTECPEQKQVFANPSPTQKSPVQPVEDLAGNLWEKLREKIRSFVAYSIAIDEITDINN TTQLAIFIRGVDENFDVSEELLDTVPMTGTKSGNEIFLRVEKSLKKFCINWSRLVSVAST GTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGK PLPQLSSIDWIRDLAFLVDMTMHLNALNISLQGHSQIVTQMYDLIRAFLAKLCLWETHLT RNNLAHFPTLKLVSRNESDGLNYIPKIAELKTEFQKRLSDFKLYESELTLFSSPFSTKID SVHEELQMEVIDLQCNTVLKTKYDKVGIPEFYKYLWGSYPKYKHHCAKILSMFGSTYICE QLFSIMKLSKTKYCSQLKDSQWDSVLHIAT >gi568815591r:74943035_75173645|GENSCAN_predicted_CDS_3|3153_bp atggccccaaagcacaagagtagtgatgctgggaatttggataggccaaagagaagccgt aaagtgcttcctctaagtgaaaaggtgaaagttctcgacttaatcaggaaagacaaaaaa tcctatgctgaggttgctaagatctacgggaagaatgaatcttccatccgtgaaattgtg aagaaggaaaaagaaattcgtgctagttttgctgtctcacctccaactgctaaagtgacg gccacagtgcgtgataagtgcttagttaagatggaacaggcactgcatttgtgggtggaa gagatgaacagaaaacgtgttcccattgacagcaacatgttgcgccagaaagctttgagc ctataccaagacttctgcaagggatgctctgaaactgacaccaagccatttactgcgagt aagggatggttacacagattcaggcatagattctcacatcattacaagaagaagaagaag gggatcatggcccaggtagcagtgtccaccctgcctgttgaagaagagtcctcctcagag accaggatggtggtgacattcctcgtgtctgccctcgaatccatgtgtaaagaactggcc aagtccaaggcagaagtggcctgcatcgcagtgtacgaaacagacgtgtttgtcgtcgga accgagagaggatgcgcttttgttaatgccaggacggattttcagaaagattttgcaaaa tactgtaaagccttagggacaacagtgatggtgcctgttccctatgagaagatgctgcga gaccagtcggctgtggtagtgcaggggcttccggaaggcgttgcctttcaacaccctgag aattacgaccttgcaaccctgaaatggattttggagaacaaagcagggatttcattcatc ataaatagacccttcctaggaccagagagtcagctgggtggccctgggatggtaacagat gcggagagatccatagtatcaccaagtgaaagctgcggccccatcaatgtgaaaactgaa cccatggaagattctggaagccacccttcttccacaagcaatgaagtaatagaaatggaa ttaccaatggaagattccactccgctggtcccttcagaagaaccaaatgaggaccctgaa gccgaggtgaaaatcgaaggaaacacaaattcatccagtgttacaaattctgcagcaggt gttgaagatcttaacatcgttcaagtgactgttccagataatgagaaggaaagattatca agcattgaaaagattaaacagctaagagaacaagttaatgacctctttagccgaaaattt ggtgaagcaattggcgtggatttccctgtgaaagttccctacaggaagatcacattcaac cctggctgtgtggtgattgatggcatgcccccgggggtggtattcaaggcccccggctat ctggaaatcagttccatgaggaggatcttggaggcagctgagtttatcaaattcacagtc atcaggccgcttccagggcttgagctcagtaatggtgagtattctacagtgggaaaacgc aagatagaccaggagggccgtgtgtttcaagaaaagtgggagagagcgtatttcttcgtg gaagtacagaatattccaacatgtctcatatgcaaacaaagcatgtctgtgtccaaagaa tataacctaagacgccactatcaaaccaatcacagcaagcattatgaccagtatacggaa agaatgcgtgacgagaagcttcacgagctgaaaaaagggctcaggaagtatctcttaggc tcgtcagacaccgagtgtcccgagcaaaaacaagtgtttgcaaacccaagtccaacccag aaatcccccgtgcagcctgtagaggacctagctgggaacttatgggagaagttacgtgaa aaaatcaggtcttttgtggcatattctatcgcaatcgatgagatcacggatataaataat accacccagttggccatattcatccgtggtgtcgatgagaatttcgatgtgtccgaagaa cttctggacacggtgcccatgacgggtacaaaatctggcaacgagatctttttgcgtgtt gagaagagcctgaaaaagttctgtatcaactggtcgagattagtaagcgtggcctccact ggcaccccagcgatggtggatgccaataacgggcttgtcacaaaactgaagtccagggtg gcgacgttctgcaagggtgcggaactgaagtccatctgttgtataattcatccggaatca ctctgtgctcagaagttgaagatggaccacgtcatggacgtggtagtgaagtccgtgaac tggatatgctcccggggactgaaccacagcgagttcacaaccttgctctatgagctggac agccagtatggtagcctcctgtactacacggagattaagtggctcagtcgcgggctcgtg ctaaagagatttttcgaatccttggaagaaatcgactccttcatgtcatccagagggaaa cccctgcctcaactgagctccatagattggatccgagacctggccttcttggttgacatg acgatgcatctgaacgctttgaacatctctctccaaggacactcccaaatcgtcacgcag atgtatgacctgatccgggcgttcctagcaaaactgtgcctctgggagactcatttgacg aggaataatctggcccactttcccaccctgaaattggtttccagaaatgaaagcgatggc ctgaactacattcccaaaatcgcggaactcaagaccgaattccagaaaaggctgtctgat ttcaaactctacgaaagcgaactgactctgttcagctccccgttctccacgaagatcgac agtgtgcacgaggagctccagatggaggttatcgacctgcaatgcaacacggtcctgaag acgaaatacgacaaggtgggaataccagaattctacaagtacctctggggtagctacccg aaatacaagcaccattgcgcaaagattctttccatgttcgggagcacctacatctgcgaa cagctgttctccattatgaaactgagcaaaacaaaatactgctcccagttaaaggattcc cagtgggattctgtactccacatcgcaacgtga >gi568815591r:74943035_75173645|GENSCAN_predicted_peptide_4|432_aa MGDTFIRHIALLGFEKRFVPSQHYDNCKDGPHPNPLGFLQWVVGSWVHTAKPLWRLNGVP RLWLSPRYMFLVKWQDLSEKVVYRRFTEIYEFHKTLKEMFPIEAGAINPENRIIPHLPAP KWFDGQRAAENHQGTLTEYCSTLMSLPTKISRCPHLLDFFKVRPDDLKLPTDNQTKKPET YLMPKDGKSTATDITGPIILQTYRAIADYEKTSGSEMALSTGDVVEVVEKSESGWWFCQM KAKRGWIPASFLEPLDSPDETEDPEPNYAGEPYVAIKAYTAVEGDEVSLLEGEAVEVIHK LLDGWWVIRKDDVTGYFPSMYLQKSGQDVSQAQRQIKRGAPPRRSSIRNAHSIHQRSRKR LSQDAYRRNSVRFLQQRRRQARPGPQSPGSPLEEERQTQRSKPQPAVPPRPSADLILNRC SESTKRKLASAV >gi568815591r:74943035_75173645|GENSCAN_predicted_CDS_4|1299_bp atgggggacaccttcatccgtcacatcgccctgctgggctttgagaagcgcttcgtaccc agccagcactatgacaactgcaaagatggtcctcaccccaatcctctgggcttcctccag tgggtagtgggatcctgggtgcacacagcaaagcctctttggaggctgaatggggtcccc cgactctggctttcccccaggtacatgttcctggtgaaatggcaggacctgtcggagaag gtggtctaccggcgcttcaccgagatctacgagttccataaaaccttaaaagaaatgttc cctattgaggcaggggcgatcaatccagagaacaggatcatcccccacctcccagctccc aagtggtttgacgggcagcgggccgccgagaaccaccagggcacacttaccgagtactgc agcacgctcatgagcctgcccaccaagatctcccgctgtccccacctcctcgacttcttc aaggtgcgccctgatgacctcaagctccccacggacaaccagacaaaaaagccagagaca tacttgatgcccaaagatggcaagagtaccgcgacagacatcaccggccccatcatcctg cagacgtaccgcgccattgccgactacgagaagacctcgggctccgagatggctctgtcc acgggggacgtggtggaggtcgtggagaagagcgagagcggttggtggttctgtcagatg aaagcaaagcgaggctggatcccagcatccttcctcgagcccctggacagtcctgacgag acggaagaccctgagcccaactatgcaggtgagccatacgtcgccatcaaggcctacact gctgtggagggggacgaggtgtccctgctcgagggtgaagctgttgaggtcattcacaag ctcctggacggctggtgggtcatcaggaaagacgacgtcacaggctactttccgtccatg tacctgcaaaagtcggggcaagacgtgtcccaggcccaacgccagatcaagcggggggcg ccgccccgcaggtcgtccatccgcaacgcgcacagcatccatcagcggtcgcggaagcgc ctcagccaggacgcctatcgccgcaacagcgtccgttttctgcagcagcgacgccgccag gcgcggccgggaccgcagagccccgggagcccgctcgaggaggagcggcagacgcagcgc tctaaaccgcagccggcggtgcccccgcggccgagcgccgacctcatcctgaaccgctgc agcgagagcaccaagcggaagctggcgtctgccgtctga