GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:39:40 Sequence gi568815587f:44494663_44719123 : 224461 bp : 49.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6331 6413 83 0 2 93 82 90 0.700 9.34 1.02 Intr + 28757 28963 207 2 0 -102 25 437 0.256 16.59 1.03 Intr + 43349 43424 76 2 1 43 70 59 0.032 -0.88 1.04 Intr + 43710 43830 121 2 1 69 81 74 0.528 4.87 1.05 Term + 43856 44007 152 0 2 88 53 75 0.358 2.07 1.06 PlyA + 44244 44249 6 1.05 2.03 PlyA - 44673 44668 6 1.05 2.02 Term - 46031 45745 287 0 2 67 46 163 0.545 5.57 2.01 Init - 48043 47665 379 1 1 67 50 325 0.308 23.67 2.00 Prom - 49220 49181 40 -7.86 3.04 PlyA - 50981 50976 6 1.05 3.03 Term - 52176 52025 152 1 2 84 47 60 0.025 -0.43 3.02 Intr - 54820 54717 104 0 2 90 115 10 0.031 3.72 3.01 Init - 57391 57093 299 1 2 22 71 455 0.445 33.89 3.00 Prom - 59292 59253 40 -5.76 4.00 Prom + 65164 65203 40 -2.26 4.01 Init + 66785 66793 9 1 0 89 113 9 0.239 3.51 4.02 Intr + 74157 74385 229 2 1 55 59 137 0.319 4.94 4.03 Intr + 76148 76284 137 0 2 3 80 94 0.232 0.39 4.04 Intr + 84002 84090 89 1 2 99 10 64 0.232 -1.53 4.05 Intr + 85007 85232 226 2 1 78 68 90 0.397 3.99 4.06 Intr + 90481 90634 154 0 1 86 20 90 0.169 1.65 4.07 Intr + 92813 92894 82 0 1 84 115 60 0.160 7.00 4.08 Intr + 98700 98833 134 0 2 67 76 37 0.150 0.69 4.09 Intr + 99981 100063 83 1 2 138 78 20 0.899 5.36 4.10 Intr + 105496 105568 73 0 1 132 57 66 0.801 6.88 4.11 Intr + 110396 110520 125 0 2 136 105 227 0.987 29.50 4.12 Intr + 110693 110767 75 1 0 108 121 121 0.991 17.11 4.13 Intr + 112168 112286 119 0 2 71 55 48 0.130 -0.94 4.14 Intr + 114166 114351 186 1 0 93 95 28 0.143 2.80 4.15 Intr + 117026 117131 106 2 1 85 89 61 0.746 6.12 4.16 Intr + 120610 120711 102 0 0 137 94 198 0.997 25.67 4.17 Intr + 123500 123703 204 1 0 63 53 233 0.962 16.70 4.18 Intr + 123978 124061 84 2 0 121 25 162 0.940 13.22 4.19 Term + 124387 124464 78 0 0 137 54 81 0.941 7.36 4.20 PlyA + 124945 124950 6 -0.45 5.04 PlyA - 125227 125222 6 1.05 5.03 Term - 127227 127067 161 1 2 63 55 63 0.363 -1.40 5.02 Intr - 128881 128774 108 2 0 109 49 49 0.432 3.36 5.01 Init - 130506 130422 85 0 1 98 44 119 0.640 7.51 5.00 Prom - 150272 150233 40 -3.06 6.00 Prom + 150275 150314 40 -8.36 6.01 Init + 151391 151468 78 1 0 67 110 53 0.554 6.46 6.02 Term + 158028 158141 114 2 0 118 45 47 0.698 1.97 6.03 PlyA + 160040 160045 6 1.05 7.08 PlyA - 162373 162368 6 1.05 7.07 Term - 169566 169472 95 1 2 85 43 62 0.102 -0.61 7.06 Intr - 191524 191442 83 0 2 99 78 23 0.644 1.68 7.05 Intr - 191761 191702 60 0 0 86 115 90 0.918 9.35 7.04 Intr - 196597 196466 132 0 0 71 56 52 0.155 0.06 7.03 Intr - 201651 201604 48 2 0 140 54 33 0.272 2.90 7.02 Intr - 214450 214364 87 0 0 108 82 4 0.054 0.89 7.01 Init - 214816 214734 83 1 2 83 91 18 0.060 0.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 114233 114351 119 1 2 73 95 114 0.806 8.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_1|212_aa MDVGECLVKMKVEIMVMPKIVNKLPEARRRRKKKKKKKKKKEKEKEKEKEKEKEKEKEKE KEKKKKKKKKKKKRQEEHFRQRNHSMRRAQDIDDLQSASASEEKRPAAYPDVLMPAQKSV SKDCPRFGGEETKAQRGNGRSWCPPLGVELKPGGSKQPVACSGPPSSPTQNNSDLLVPRR PQVSSDRQWAWTAVSRAQTAVSETRATPAGDK >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_1|639_bp atggacgtgggagaatgccttgtaaagatgaaggtggagattatggtgatgccaaagatt gtcaacaaactaccagaagctagaagaagaagaaagaagaagaagaagaagaagaagaag aaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggag aaggagaagaagaagaagaagaagaagaagaagaagaagaggcaggaagagcatttcagg cagaggaaccacagcatgcgcagagcccaggatatcgacgatctacaaagcgcgtctgcc tctgaggaaaagagacctgcagcctacccggatgtgctcatgccagcgcagaagagtgtg tccaaggattgcccccgctttggaggggaggaaaccaaggctcagagaggaaatgggcgc tcctggtgtcccccgctgggagtagaactcaagcctggaggctccaagcagcctgtggcc tgctctgggcctccttcctccccgacgcagaacaactcggatctgctggtcccacggcgt cctcaggtctcatcggaccgtcagtgggcctggacagcagtgagcagggcccagacggcc gtttcagagacaagggcgacacctgctggagataagtga >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_2|221_aa MVVLLVTLMVGEVVITLMAMITMLMMVSDENYVVAMVMRMVMVFAMVVMMEMVAMVMTME VAVAIAMMMQMLVATVMMGVMTVVMRRMVKCDGEVMRGHYGDDNDGGGGDSDDCVADDGG DPDDNNDAKRKEAQGEESGPGPCCHGEAGDQESGESPLLRKGLRILALDIQPKSDLSRFV KWPCYIRLQRQRAILYKWLKVPPVINQFTQDLDHQLLSCLS >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_2|666_bp atggtggtgttactagtgacactgatggtgggagaggtggtgataacattgatggcaatg ataacaatgctgatgatggtatctgatgagaattatgtggttgctatggtgatgaggatg gtaatggtgtttgctatggtggtgatgatggagatggttgctatggtgatgacgatggag gtggcggttgctatagcgatgatgatgcagatgctggttgctacagtgatgatgggagtg atgactgtagtgatgagaagaatggtgaagtgtgatggtgaggtaatgagaggtcattat ggtgatgataatgatggtggtggtggtgatagtgatgattgtgttgctgatgatggtggt gatcctgatgacaacaatgatgccaagaggaaagaagcgcagggggaagaaagtggcccc ggcccctgctgtcatggagaagcaggagaccaagaaagtggtgaatcccctctgttaaga aaaggcctaagaattttggcactggacatccagcccaaaagtgacctcagccgctttgtg aaatggccctgctatatcaggttgcagcggcagagagccatcctctataagtggctgaaa gtgccccctgtgattaaccagttcacccaggacctggaccaccagctactcagctgccta agctga >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_3|184_aa MSSASPPTPSSFSPLPLQSTITTFTITKTISIVFVIITIIIIIPIITALTKVIMNDIIDI ITAIFIIVITIIISIFITNNSSFAITIPTHGMTLSPWTVSDSIPLSGTKTSRPAERNVMG TGSKNFASGITRADGRGSSKGFHLAFTTIIALTLPVRESMWGYCQLLSMSMPIMHAGTGE MTTG >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_3|555_bp atgtcatcagcgtcaccaccaacaccatcatcattctcaccactaccattacaatccact atcaccaccttcacaatcaccaagaccatctctattgtgtttgtcatcatcaccatcatc atcatcatccccatcatcactgcattaaccaaagtcatcatgaatgacatcatcgacatc atcactgccatcttcatcatcgtcatcacaattattatcagcatcttcattaccaacaac agtagctttgccatcactattcccacccatggaatgactctcagtccatggactgtcagt gacagcattcctctctctggaactaagacctctaggccagcagaacgtaatgtcatggga acagggagcaaaaattttgctagtgggatcactagggctgatggtagaggcagctctaag ggcttccatttggcctttaccaccataatagcccttaccctaccagtcagggagtcaatg tgggggtactgccagctgctaagtatgtctatgccaattatgcatgctggcactggggaa atgaccacaggatga >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_4|764_aa MPKDQTVLPQPMAEQPQVLSVVSDSEHLGPAGTPDSGLTLALAFCSKPLYPLPLPHGKLT GGGYDLIKNIKRPGAGSVRARKSILESTIVCGRCCYPVGTQAGAWRNGTVASRKPILDII HADPQLSVDVKFLAKSISNGKLVGPGKRRGRRMASVPARAAKCLGWLNGRFPAWEPGALC GKHSPFIPSLAGPDLRSRPWKWVSQGHPLQELGSRHGEGSVRWRLGPVGWPLTQEQQWSS SGTEPLQDGHMARDWIRPQRASDCWSGKQQLFQIIIIIIIVESSLLLCGRHVGTGRSGPC DQLHWFRGSPALPMQLSQASCPGPSCPAVLAPFAPKGLKAGGWHSWIFWPLSQGSSRTGG MGSACIKVTKYFLFLFNLIFFILGAVILGFGVWILADKSSFISVLQTSSSSLRMGAYVFI GVGAVTMLMGFLGCIGAVNEVRCLLGLYFAFLLLILIAQVTAGALFYFNMGKLSPCTAYT KYSVDHVTLLLSQILTRPSQGRGRIPVLQMKEPFLKSDPVSSRSPQGLPCSTSEMKSTNK REGRRAALSRRLMLLASLPVFIIIAAIIHFMKACSYLVARVTSLHSQHRPPEHQLQAEPA PVAMVPVAQLKQEMGGIVTELIRDYNSSREDSLQDAWDYVQAQVKCCGWVSFYNWTDNAE LMNRPEVTYPCSCEVKGEEDNSLSVRKGFCEAPGNRTQSGNHPEDWPVYQEGCMEKVQAW LQENLGIILGVGVGVAIIELLGMVLSICLCRHVHSEDYSKVPKY >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_4|2295_bp atgcccaaggaccagacagtcctccctcaacccatggctgagcagccccaagttctgagt gttgtttcagattcggagcatctggggccagctggcacccctgattctggcttgaccttg gccttggccttctgctccaaaccgttatacccacttcctctgccccatgggaagctcaca gggggcggttatgacttaatcaagaacatcaaacggccaggagctggctctgtgagagcc agaaagagcatcttggagtcaacaattgtttgtggaaggtgctgctatcctgtggggact caagctggggcctggagaaatgggacggtggcctccaggaagcccatcctggacatcatc catgcagacccccagctctcagtggatgtcaaattcttggctaaatccatcagcaatgga aagttggttggccctgggaagaggagaggccggcggatggctagtgttcctgccagagct gccaagtgtctgggctggttaaatggccgcttccctgcctgggaaccaggggccctctgt gggaagcacagcccctttattcccagcttggcaggcccagatctccgcagtagaccatgg aagtgggtctctcagggccaccccttgcaggagctgggctcccgtcatggagagggctca gtacgctggagactggggcctgtgggctggccccttacccaggagcagcagtggtcatct tctggcacggagcctctccaggatgggcacatggctagagactggatcaggcctcagagg gcttcagactgttggtcaggaaagcaacaacttttccagatcatcatcatcatcatcata gttgagtcctccctgctgctgtgtggacgacacgtgggcacaggcagaagtgggccctgt gaccagctgcactggtttcgtggaagcccagctctgcccatgcagctgtcccaggccagc tgtccaggccccagctgcccggcggtgctggctccctttgccccaaagggtctgaaggct ggcggctggcactcctggattttctggcctctgagtcagggaagctccaggactggcggg atgggctcagcctgtatcaaagtcaccaaatactttctcttcctcttcaacttgatcttc tttatcctgggcgcagtgatcctgggcttcggggtgtggatcctggccgacaagagcagt ttcatctctgtcctgcaaacctcctccagctcgcttaggatgggggcctatgtcttcatc ggcgtgggggcagtcactatgctcatgggcttcctgggctgcatcggcgccgtcaacgag gtccgctgcctgctggggctgtactttgctttcctgctcctgatcctcattgcccaggtg acggccggggccctcttctacttcaacatgggcaagttgagtccatgcacagcatatacc aagtacagtgttgaccacgttacactcctcctctcgcagatcctcactaggccctctcaa ggtcgtgggagaatccccgtcctgcagatgaaggagccctttctaaaatctgatcctgtc tcgagccggagcccccaggggctgccttgcagcacttcagagatgaagagcactaacaaa cgggaagggcgtcgtgcggcactcagccggaggctgatgttgctggcatcattgccagtg tttattattattgccgccattattcacttcatgaaggcctgtagctacctggttgccagg gtgacctccctgcactcccagcacaggcctcctgagcatcagctccaagctgaaccagcc cctgtggccatggtccccgtggcccagctgaagcaggagatgggcggcatcgtgactgag ctcattcgagactacaacagcagtcgcgaggacagcctgcaggatgcctgggactacgtg caggctcaggtgaagtgctgcggctgggtcagcttctacaactggacagacaacgctgag ctcatgaatcgccctgaggtcacctacccctgttcctgcgaagtcaagggggaagaggac aacagcctttctgtgaggaagggcttctgcgaggcccccggcaacaggacccagagtggc aaccaccctgaggactggcctgtgtaccaggagggctgcatggagaaggtgcaggcgtgg ctgcaggagaacctgggcatcatcctcggcgtgggcgtgggtgtggccatcatcgagctc ctggggatggtcctgtccatctgcttgtgccggcacgtccattccgaagactacagcaag gtccccaagtactga >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_5|117_aa MPSAVAACTFLLLPCSLFACCESYLTPKGRGLGWAPGIPQQHLAQTVLSAVGSVCWYVPV YVWQVLYFANSFEPTLSLPGPVSHRKMEMQLDSNSGPLAPSEKAFMKRHPTALCYPS >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_5|354_bp atgccctccgcagtggctgcctgcactttcctcttgctgccctgctccctctttgcctgc tgcgagtcctacctcaccccaaagggaaggggccttggatgggccccagggatcccacag cagcacttggctcagactgtgctgtcagcagtcggcagtgtgtgctggtacgtacctgtg tacgtgtggcaggtcctctattttgcaaacagctttgagcccaccctcagcctgccaggc cccgtgtcgcacaggaagatggagatgcagctggactcaaactcaggtccccttgctccc agtgagaaagcatttatgaaaagacacccaacagctctctgctacccttcctga >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_6|63_aa MRNEARMSAVTTSIQHSTGSFGQCVQVFNEHRASWGWGPSSHLPGTVGPEFTPILQGLGE LKL >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_6|192_bp atgaggaacgaggcaaggatgtctgctgtcaccacttccattcagcacagtactggaagt tttggccagtgcgttcaggtgtttaatgagcacagggcatcctggggatggggacccagc agccacctgccggggaccgtgggccctgagttcactcccatattacaaggcctgggggag ctgaagctgtga >gi568815587f:44494663_44719123|GENSCAN_predicted_peptide_7|195_aa MRWGRRKRGGSHRAAAGAALASAHQVHSLNNSFWLEVWVEIILYPSTHTQSPDLSSRWRN MGLNDDLEKGPHSLLASSEFFHRLTGMEESSRLVMGLLDGALEGSLAMSPTCGGLSLPCA VRFTAVIAFNPDNGFVSSANLFLRGISIAHPGGLLPPHPLYPGQNTATIRFLNLFPTFPA FLFHKAAIVILARSQ >gi568815587f:44494663_44719123|GENSCAN_predicted_CDS_7|588_bp atgaggtggggaaggaggaagagaggtggtagccaccgagctgcagctggagcggcccta gcatctgctcaccaggttcacagtctgaataattccttctggctggaggtgtgggtggag atcattctttatccatctacccacacgcaatccccagatctatcatccaggtggagaaac atggggctaaatgatgatcttgaaaagggccctcactctttactggcctcttcagaattc ttccacaggttgactggaatggaggagtcttcccgcctagtgatggggctgctagacggg gctctggagggttcactggccatgtccccgacctgtggaggactcagtctaccatgtgct gtgcgctttactgctgtcattgcattcaatcctgataacggctttgtaagttctgcaaac ctctttctacgaggtatcagtattgctcatcctggtggcctccttccacctcatcccctg tacccaggacaaaacacggcaaccatccgatttctcaatcttttccccacctttcccgcc tttctattccacaaagccgccattgtcatcctggcccgttctcaataa