GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:29:06 Sequence gi568815589f:111797216_112032994 : 235779 bp : 42.17% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 15421 15490 70 0 1 55 65 73 0.861 2.96 1.02 Term + 15658 15731 74 2 2 95 38 94 0.542 2.19 1.03 PlyA + 17048 17053 6 1.05 2.07 PlyA - 17818 17813 6 -0.45 2.06 Term - 19895 19705 191 0 2 72 54 91 0.514 0.73 2.05 Intr - 22437 22348 90 1 0 61 101 61 0.245 3.75 2.04 Intr - 33210 32871 340 0 1 64 80 105 0.893 1.42 2.03 Intr - 34140 33965 176 1 2 109 92 161 0.913 17.34 2.02 Intr - 49274 49139 136 2 1 54 83 87 0.006 4.02 2.01 Init - 58800 58684 117 0 0 42 58 113 0.150 3.95 2.00 Prom - 91746 91707 40 -4.05 3.04 PlyA - 94213 94208 6 1.05 3.03 Term - 99374 99216 159 2 0 90 43 193 0.999 11.96 3.02 Intr - 100092 99916 177 0 0 61 66 137 0.685 7.99 3.01 Init - 100751 100458 294 2 0 60 52 273 0.916 18.13 3.00 Prom - 101997 101958 40 -7.85 4.00 Prom + 102967 103006 40 -7.35 4.01 Init + 112431 112440 10 2 1 62 100 4 0.204 -1.10 4.02 Intr + 113934 114021 88 1 1 92 37 69 0.323 0.31 4.03 Intr + 117390 117531 142 0 1 60 106 98 0.729 8.23 4.04 Intr + 118456 118621 166 0 1 56 47 62 0.538 -2.49 4.05 Intr + 125659 125736 78 2 0 47 97 62 0.505 1.60 4.06 Intr + 127562 127663 102 0 0 89 78 47 0.815 3.03 4.07 Intr + 129169 129281 113 2 2 78 109 123 0.918 12.58 4.08 Intr + 132285 132463 179 2 2 80 111 146 0.766 13.90 4.09 Intr + 134056 134142 87 1 0 80 99 20 0.411 0.47 4.10 Intr + 140936 141186 251 2 2 84 26 145 0.052 4.06 4.11 Intr + 144464 144495 32 0 2 94 116 -18 0.008 -1.47 4.12 Intr + 164509 164580 72 0 0 34 115 80 0.874 3.98 4.13 Term + 166875 167066 192 2 0 103 36 169 0.904 9.64 4.14 PlyA + 167619 167624 6 1.05 5.08 PlyA - 170373 170368 6 1.05 5.07 Term - 185842 185520 323 2 2 41 49 186 0.069 4.00 5.06 Intr - 186950 186850 101 1 2 72 77 23 0.045 -1.57 5.05 Intr - 196391 196329 63 1 0 100 86 45 0.012 2.41 5.04 Intr - 213801 213619 183 2 0 94 46 163 0.140 10.68 5.03 Intr - 228951 228793 159 2 0 58 52 136 0.037 5.18 5.02 Intr - 231734 231563 172 0 1 40 -14 133 0.296 -3.62 5.01 Intr - 233950 233853 98 0 2 72 94 79 0.871 5.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 37771 37726 46 1 1 52 36 119 0.807 4.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:111797216_112032994|GENSCAN_predicted_peptide_1|47_aa MQQTHSLNKMSPFEENVSNRKVRDVSQAHENHRQKKEGATLLTDVAT >gi568815589f:111797216_112032994|GENSCAN_predicted_CDS_1|144_bp atgcagcagactcactctttgaataagatgtcaccctttgaagaaaatgtcagcaacaga aaagtcagagatgtgtctcaggcacatgagaatcatcgtcagaagaaagagggagccacc cttttgacagacgttgcaacgtaa >gi568815589f:111797216_112032994|GENSCAN_predicted_peptide_2|349_aa MIPACDGIRGGSGKETSDSETQEPGRCNAHKKAKGPVCDLLKFCASLISSYSWSFWVPLA DEVRVFSAFIDSTGQMVLVSVTGRDAGSRGLQAGLAANLILRFQPLAAKRYNHDFPEQES PNPLSLPIKTVEPREIYRLIQEQWVSTLICKLPDLPIRELQSENTGLLPRTNSTSVEIRK RCPVLWVLSLIAQRAEQVLGFCPDLPSPGISLHGRTCTTTRGINPFYSSFQELRGSSGIF PSVTLMIAHRIPPTKSGTPKIQLHKYPCSGVGLEDDHQYDYVVVSDDQVPCYRQASLPNV PSYQATPGELDDDFPALRGPLILSRIADVFPRLLLRTQLEVHKGEKLHP >gi568815589f:111797216_112032994|GENSCAN_predicted_CDS_2|1050_bp atgatacctgcttgtgatgggataagaggaggctcagggaaagaaacctctgacagcgag acccaggaacctggaagatgcaacgctcataaaaaagccaagggtccagtgtgcgatctc cttaaattctgtgcctcactcatctcatcctattcgtggtccttctgggttcccctggca gatgaagttcgggtgttttctgccttcattgactctactggacaaatggtacttgtcagc gtaacaggtagagatgcagggtccagggggctacaagcagggcttgctgccaacttgatt ttgcgcttccagcctctggcagccaaaagatataaccatgacttcccagaacaagaaagt cccaatcctcttagcttaccgatcaagacagtggagcccagagagatctacagacttatt caagagcagtgggtttccactctgatttgtaaacttcctgaccttcccatcagagaatta caatcagagaacacaggtctgcttcctagaacaaacagtacctctgtggaaataagaaaa aggtgccctgtgctctgggtccttagcctcatagctcagcgagcagagcaggtgctaggt ttctgccctgacttaccttctccaggcatctctttgcatggccgaacttgcacaaccaca cgaggcatcaaccctttctattcctccttccaagaactcagaggaagcagtggaatcttt ccatctgtcacacttatgattgcccacagaatcccaccaaccaaatcaggtaccccaaag atccagcttcataaatacccatgttctggggttggtctagaagatgaccatcaatatgac tatgtggttgtatcagatgatcaggtgccctgctatagacaggcttcactgccaaatgtg ccatcataccaagcaacacctggggagcttgatgatgattttcctgctctgagaggcccc ctaatcctcagcagaatagctgatgtgtttccccgtctccttcttaggactcagcttgaa gttcataaaggagagaaactacatccatga >gi568815589f:111797216_112032994|GENSCAN_predicted_peptide_3|209_aa MKRCGRHGESTNLHKTIKRGTEKRKENHAEMYKLTGLKAIMPARRFGGHFSGWTGASHLS KRIEPFQEEENPQRGVQIAVSSSQKSGHNHHPNRNVAQMIAMKCISHSTKKRTNPKTAIP SKARSSSAIPRPAGPARPAAAANTDNAPAGGGQGGRPAESAAALGRAPSVSVAARDPVPG WRLLSLRLALLFLGAPEKDTGSGQLPELV >gi568815589f:111797216_112032994|GENSCAN_predicted_CDS_3|630_bp atgaaaagatgtggccggcacggggagagtacaaatctacataaaacaataaaacgagga actgaaaaaaggaaagaaaaccacgctgagatgtacaagctgacaggactcaaagctatc atgccagctcggaggtttggtggacatttttcagggtggacaggtgcttcacacctatct aagcgcatagagcccttccaggaagaagaaaatccacagagaggtgtccagatagctgtg agcagctcccagaaaagtggacacaatcaccaccctaacagaaacgtggcccagatgata gccatgaaatgcatcagccacagcaccaagaagaggacgaacccgaagacggccattccc tccaaggccaggtccagcagcgccatcccccggcccgccggaccggcccggcccgctgcg gccgccaacacggacaacgctcccgcaggaggaggacagggtgggcgcccggcggaaagc gctgcagccctgggccgggctccctcggtgtcggtggcagctcgggaccctgtgcctggc tggaggctgctctctctccgtctggccctgctgttcctgggtgccccagaaaaagacacg ggttccgggcagctgccggagcttgtctag >gi568815589f:111797216_112032994|GENSCAN_predicted_peptide_4|503_aa MLSRSPDLPKALRTVVLSFLSQASLHLQVTSRCRLHLNKKATDKQPYSKLPGVSLLKPLK GVDPNLINNLETFFELDYPKGSSNNQKEDDMRTTLLMVTGFHALISFLLAFIFLGSTNEK NREDTDVIALMQHLLNHDDPAIDVCKKLLGKYPNVDARLFIGGKKVGINPKINNLMPGYE VAKYDLIWICDSGIRVIPDTLTDMVNQMTEKVGLVHGLPYVADRQGFAATLEQVYFGTSH PRYYISANVTGFKCVTGMSCLMRKDVLDQAGGLIAFAQYIAEDYFMAKAIADRGWRFAMS TQVAMQNSGSYSISQFQSRMISDIFSTVRDKGVGLSKTEIAPDPESTHMTLMPRKQFKKG SSQEASSSEWKKQKSDALGLAACEKDMKAMGTDIWKLQQRCISTQEPLHLGRCFLKPTFI EHLLCLGTVPDTCRRCRVVAKLENLMLAVSAGLCFPSPHDGHPDTTFTNIWCAGDPEDLD YGRKCFGCQDTPSGRGEDNASPL >gi568815589f:111797216_112032994|GENSCAN_predicted_CDS_4|1512_bp atgttgtccaggagtccagatttacccaaggctctcagaacagttgtcctgtcattctta tctcaggcttctctccatctccaggtcaccagtagatgccgattacacctcaacaagaag gcaactgacaaacagccttatagcaagctcccaggtgtctctcttctgaaaccactgaaa ggggtagatcctaacttaatcaacaacctggaaacattctttgaattggattatcccaaa ggtagctcaaataatcagaaggaagatgatatgcgcactacacttcttatggtgactggt tttcatgctcttatttcttttcttttagcctttatttttctcggtagcaccaatgaaaag aatagagaagacacagatgtgatagccttaatgcagcacttattaaatcatgatgatcca gccattgatgtatgtaagaagcttcttggaaaatatccaaatgttgatgctagattgttt ataggtggcaaaaaagttggcattaatcctaaaattaataatttaatgccaggatatgaa gttgcaaagtatgatcttatatggatttgtgatagtggaataagagtaattccagatacg cttactgacatggtgaatcaaatgacagaaaaagtaggcttggttcacgggctgccttac gtagcagacagacagggctttgctgccaccttagagcaggtatattttggaacttcacat ccaagatactatatctctgccaatgtaactggtttcaaatgtgtgacaggaatgtcttgt ttaatgagaaaagatgtgttggatcaagcaggaggacttatagcttttgctcagtacatt gccgaagattactttatggccaaagcgatagctgaccgaggttggaggtttgcaatgtcc actcaagttgcaatgcaaaactctggctcatattcaatttctcagtttcaatccagaatg atcagtgatatattttctacagtgagagacaaaggggtgggactttcaaagacagaaatt gctcctgaccctgagagcacccatatgactctgatgccaaggaaacagtttaagaaaggc agctcccaggaagccagttcatcagagtggaagaagcagaagtcagatgctttaggtttg gctgcgtgtgaaaaagacatgaaagcgatgggcacagatatttggaaactgcaacaacga tgcatttccacccaagaacctttgcatttaggaaggtgtttcctcaagccaacattcatt gagcacctgctgtgtcttggcactgtaccagatacttgccgtagatgccgtgttgtggcg aagctggagaacctaatgcttgctgtgtcagcaggtctgtgctttccgagtccacatgat ggtcacccagataccactttcaccaacatatggtgtgctggagatccagaggaccttgac tatggaagaaagtgctttggctgtcaagacacacccagtggcagaggagaagataatgct tctcctctatag >gi568815589f:111797216_112032994|GENSCAN_predicted_peptide_5|366_aa XESEATNEEHENTGVESWVGIRQLTKIFTIKMAVTLEDPTKLGKGGEFPDTIDNEDGQAN GRTSEGSDFRYARSVPWLHFTKTVPFCKGRNCVIKGLFNAPCDREVDFASRYCPNLIEND DGGKTALSRAECLATPHSLVINNGSLDLLFRLAGPPKIFRHKLTFRYYDWFSSESWTISP KLESFDGMSPPSESSLDSVAHITLWWRKGSNSLATSEELTYGLVVSLQGTPSLLCSKKWG STNFQKKHSEVNPRRLIEVGSVLSREDAAVKKTDADMVPALQWVLTLALTQSWEPVDFCA LKSSSHVGAEDSAYKSPHFRAKARIDSLSGCPAQPEAVQPGVDVLVLVEEGKPEVRAKGA VSFVKE >gi568815589f:111797216_112032994|GENSCAN_predicted_CDS_5|1101_bp nnagaaagtgaagcaacaaatgaggaacatgagaacactggagtagaaagctgggtgggc atcagacagctgaccaaaattttcactatcaaaatggcagtcacccttgaagacccaaca aagctggggaaaggaggggaatttcccgataccattgacaatgaagatggacaagctaac ggcagaacctcagaaggatctgacttcagatatgcacgcagcgtaccctggcttcatttc acaaaaacagtacccttttgcaaagggaggaattgtgtcatcaagggattgttcaatgct ccctgtgatcgtgaagtagactttgcctccagatactgccccaatctgatagaaaatgat gatggaggaaaaactgcgctcagcagggcagaatgtcttgcaacaccacattccttagta atcaataatggtagtcttgacctgctttttcgacttgcaggtccacctaagatttttcgg cataaacttactttcagatattatgactggttttcctcagagtcttggacaatttctccc aaattagaaagttttgatgggatgtctccaccctctgaatcttccctggattctgttgcc cacatcacattatggtggagaaaggggtcaaacagccttgccacatctgaagagttgacc tatggccttgttgtaagccttcagggcaccccaagtttgctgtgctctaagaaatgggga agcacaaatttccagaaaaagcatagcgaagtcaacccgaggaggttgatagaggttggg agtgtcctgagccgtgaggatgcagcagtgaaaaagacagatgcagacatggtccctgcc ctccagtgggtcttgactcttgccttgacacaatcatgggagcctgtcgacttctgcgct ctgaagtcctcatcccatgtaggagctgaagacagtgcctacaagagcccacattttagg gcaaaggcaagaatagacagtctgagtggttgccctgcacagccagaagcagtgcagcca ggggtggatgtgctagtcctggtagaggaggggaagccagaagtcagggccaaaggggca gtgtcatttgttaaggagtga