GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:23:19 Sequence gi568815591f:74674032_74889157 : 215126 bp : 45.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 15093 15196 104 1 2 107 77 98 0.739 10.49 1.02 Intr + 16942 17080 139 0 1 83 61 120 0.995 8.84 1.03 Intr + 24930 25064 135 1 0 104 53 134 0.999 12.04 1.04 Intr + 26216 26399 184 0 1 101 57 136 0.705 10.65 1.05 Intr + 32359 32402 44 1 2 88 84 34 0.886 0.88 1.06 Intr + 37001 37078 78 0 0 95 86 73 0.925 7.32 1.07 Intr + 40829 40885 57 0 0 40 70 84 0.378 0.66 1.08 Intr + 42863 42919 57 0 0 103 72 50 0.764 3.76 1.09 Intr + 54755 54865 111 0 0 65 123 71 0.936 8.65 1.10 Intr + 56198 56263 66 0 0 60 80 76 0.879 2.88 1.11 Intr + 58448 58631 184 0 1 116 63 130 0.169 12.15 1.12 Intr + 59892 59950 59 0 2 66 92 22 0.118 -1.07 1.13 Intr + 62469 62652 184 1 1 36 116 73 0.064 3.75 1.14 Intr + 67742 67830 89 2 2 12 81 75 0.021 -1.19 1.15 Intr + 69418 69489 72 2 0 84 111 42 0.963 5.48 1.16 Intr + 70727 70910 184 0 1 88 106 134 0.990 14.05 1.17 Intr + 71852 71910 59 2 2 118 116 -41 0.910 0.23 1.18 Intr + 72306 72380 75 1 0 38 116 69 0.901 4.19 1.19 Intr + 73984 74085 102 2 0 120 111 -6 0.967 5.05 1.20 Intr + 74994 75059 66 1 0 62 103 97 0.951 7.48 1.21 Intr + 75237 75420 184 1 1 81 78 94 0.768 6.55 1.22 Intr + 77330 77385 56 2 2 102 81 29 0.973 2.22 1.23 Intr + 78059 78139 81 0 0 74 80 39 0.705 1.31 1.24 Intr + 79063 79146 84 2 0 43 111 52 0.873 2.79 1.25 Intr + 79815 79998 184 1 1 94 97 164 0.987 16.75 1.26 Intr + 91379 91456 78 2 0 68 64 71 0.003 1.37 1.27 Intr + 98199 98242 44 0 2 72 96 52 0.459 2.28 1.28 Intr + 99948 100072 125 1 2 60 73 146 0.499 10.60 1.29 Intr + 103236 103316 81 2 0 88 84 220 0.933 21.43 1.30 Intr + 105051 105126 76 2 1 68 99 18 0.932 -0.01 1.31 Intr + 105226 105391 166 2 1 78 81 321 0.993 29.42 1.32 Intr + 106749 106804 56 0 2 99 94 32 0.982 3.52 1.33 Intr + 108908 109030 123 0 0 123 41 303 0.998 29.66 1.34 Intr + 109494 109601 108 1 0 90 49 96 0.984 6.16 1.35 Intr + 111151 111268 118 2 1 71 91 202 0.939 18.32 1.36 Intr + 113953 114057 105 1 0 89 115 105 0.992 12.63 1.37 Intr + 114528 114673 146 0 2 77 109 244 0.837 25.33 1.38 Term + 115008 115129 122 1 2 119 54 259 0.999 24.24 1.39 PlyA + 115263 115268 6 1.05 2.15 PlyA - 116987 116982 6 1.05 2.14 Term - 124234 122631 1604 2 2 38 45 1558 0.825 136.79 2.13 Intr - 127982 127939 44 1 2 108 94 -12 0.340 -0.72 2.12 Intr - 129450 129267 184 1 1 108 111 147 0.988 17.85 2.11 Intr - 130829 130746 84 0 0 72 111 43 0.965 4.79 2.10 Intr - 132980 132903 78 0 0 76 80 73 0.962 4.82 2.09 Intr - 135142 135077 66 2 0 64 96 73 0.866 4.58 2.08 Intr - 136921 136865 57 2 0 98 72 44 0.859 2.66 2.07 Intr - 145566 145523 44 2 2 128 68 45 0.971 4.38 2.06 Intr - 145992 145938 55 1 1 103 93 16 0.577 1.64 2.05 Intr - 148424 148396 29 1 2 87 106 -8 0.383 -1.34 2.04 Intr - 148776 148593 184 1 1 73 29 120 0.503 3.45 2.03 Intr - 158912 158774 139 2 1 91 43 156 0.894 11.44 2.02 Intr - 162352 162249 104 2 2 95 81 70 0.621 6.89 2.01 Init - 175561 175081 481 1 1 87 110 331 0.649 30.72 2.00 Prom - 190829 190790 40 -3.46 3.06 PlyA - 190901 190896 6 1.05 3.05 Term - 209010 208855 156 0 0 81 47 95 0.705 2.63 3.04 Intr - 209432 209318 115 1 1 84 56 66 0.691 3.45 3.03 Intr - 209804 209751 54 1 0 108 -1 118 0.669 2.99 3.02 Intr - 210192 210113 80 0 2 109 78 81 0.969 7.55 3.01 Intr - 213768 213652 117 0 0 79 98 114 0.806 12.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 58448 58635 188 0 2 116 47 146 0.831 10.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:74674032_74889157|GENSCAN_predicted_peptide_1|1328_aa XIMAQVAMSTLPVEDEESSESRMVVTFLMSALESMCKELAKSKAEVACIAVYETDVFVVG TERGRAFVNTRKDFQKDFVKYCVEEEEKAAEMHKMKSTTQANRMSVDAVEIETLRKTVED YFCFCYGKALGKSTVVPVPYEKMLRDQSAVVVQGLPEGVAFKHPENYDLATLKWILENKA GISFIIKSCGPIKVKTEPTEDSGISLEMAAVTVKEESEDPDYYQYNIQGPSETDDVDEKQ PLSKPLQGSHHSSEGNEGTEMEVPAEDDDYSPPSKRPKANELPQPPVPEPANAGKRKVRE FNFEKWNARITDLRKQVEELFERKYAQAIKAKGPVTIPYPLFQSHVEDLYVEGLPEGIPF RRPSTYGIPRLERILLAKERIRFVIKKHELLNSTREDLQLDKPASGAEALGSTEAKAVPY QKFEAHPNDLYVEGLPENIPFRSPSWYGIPRLEKIIQVGNRIKFVIKSLRSSAVMDGSTF FIGFSSNWKMQGPGQQQVKEDWNVRITKLRKQVEEIFNLKFAQALGLTEAVKVPYPVFES NPEFLYVEGLPEGIPFRSPTWFGIPRLERIVRGSNKIKFVVKKPELVISYLPPGMASKIN TKALQSPKRPRSPGSNSKVPEIEVTVEGPNNNNPQTSAVRTPTQTNGSNVPFKPRGREFS FEAWNAKITDLKQKVENLFNEKCGEALGLKQAVKVPFALFESFPEDFYVEGLPEGVPFRR PSTFGIPRLEKILRNKAKIKFIIKKPEMFETAIKESTSSKSPPRKINSSPNVNTTASGVE DLNIIQVTIPDDDNERLSKVEKARQLREQVNDLFSRKFGEAIGMGFPVKVPYRKITINPG CVVVDGMPPGVSFKAPSYLEISSMRRILDSAEFIKFTVIRSNSRACCGIRFLRGLKVARS SGYIKQRSGPDHELTLDSGKVHLRRSLEVPGSTGGHPVMGDTFIRHIALLGFEKRFVPSQ HYVYMFLVKWQDLSEKVVYRRFTEIYEFHKTLKEMFPIEAGAINPENRIIPHLPAPKWFD GQRAAENRQGTLTEYCSTLMSLPTKISRCPHLLDFFKVRPDDLKLPTDNQTKKPETYLMP KDGKSTATDITGPIILQTYRAIANYEKTSGSEMALSTGDVVEVVEKSESGWWFCQMKAKR GWIPASFLEPLDSPDETEDPEPNYAGEPYVAIKAYTAVEGDEVSLLEGEAVEVIHKLLDG WWVIRKDDVTGYFPSMYLQKSGQDVSQAQRQIKRGAPPRRSSIRNAHSIHQRSRKRLSQD AYRRNSVRFLQQRRRQARPGPQSPGSPLEEERQTQRSKPQPAVPPRPSADLILNRCSEST KRKLASAV >gi568815591f:74674032_74889157|GENSCAN_predicted_CDS_1|3987_bp nggatcatggcccaagttgcaatgtccaccctccccgttgaagatgaggagtcctcggag agcaggatggtggtgacattcctcatgtcagctctcgagtccatgtgtaaagaactggcc aagtccaaagccgaagtggcctgcattgcagtgtatgaaacagacgtgtttgtcgtcgga actgaaagaggacgtgcttttgtcaataccagaaaggattttcaaaaagattttgtaaaa tattgtgttgaagaagaagaaaaagctgcagagatgcataaaatgaaatctacaacccag gcaaatcggatgagtgtagatgctgtagaaattgaaacactcagaaaaacagttgaggac tatttctgcttttgctatgggaaagctttaggcaaatccacagtggtacctgtaccatat gagaagatgctgcgagaccagtcggctgtggtagtgcaggggcttccggaaggtgttgcc tttaaacaccccgagaactatgatcttgcaaccctgaaatggattttggagaacaaagca gggatttcattcatcattaagagttgtggccccatcaaagtgaaaactgaacccacagaa gattctggcatttccctggaaatggcagctgtgacagtaaaggaagaatcagaagatcct gattattatcaatataacattcaaggcccttctgaaactgatgatgttgatgaaaaacag cccctatcgaagcctttgcaaggaagccaccattcttcagagggcaatgaaggcacagaa atggaagtaccagcagaagatgatgattattctccaccgtctaagagaccaaaggccaat gagctaccgcagccaccagtcccggaacccgccaatgctgggaagcggaaagtgagggag ttcaacttcgagaaatggaatgctcgcatcactgatctacgtaaacaagttgaagaattg tttgaaaggaaatatgctcaagccataaaagccaaaggtccggtgacgatcccgtaccct cttttccagtctcatgttgaagatctttatgtagaaggacttcctgaaggaattcctttt agaaggccatctacttacggaattcctcgcctggagaggatattacttgcaaaggaaagg attcgttttgtgattaagaaacatgagcttctgaattcaacacgtgaagatttacagctt gataagccagcttcaggagcggaagccttggggagcactgaagccaaggctgtaccgtac caaaaatttgaggcacacccgaatgatctgtacgtggaaggactgccagaaaacattcct ttccgaagtccctcatggtatggaatcccaaggctggaaaaaatcattcaagtgggcaat cgaattaaatttgttattaaaagcttgaggagctccgcagtgatggatggcagcacattc ttcattggcttctccagtaattggaagatgcaaggacctggccagcaacaagtcaaagaa gattggaatgtcagaattaccaagctacggaagcaagtggaagagatttttaatttgaaa tttgctcaagctcttggactcaccgaggcagtaaaagtaccatatcctgtgtttgaatca aacccggagttcttgtatgtggaaggcttgccagaggggattcccttccgaagccctacc tggtttggaattccacgacttgaaaggatcgtccgcgggagtaataaaatcaagttcgtt gttaaaaaacctgaactagttatttcctacttgcctcctgggatggctagtaaaataaac actaaagctttgcagtcccccaaaagaccacgaagtcctgggagtaattcaaaggttcct gaaattgaggtcaccgtggaaggccctaataacaacaatcctcaaacctcagctgttcga accccgacccagactaacggttctaacgttcccttcaagccacgagggagagagttttcc tttgaggcctggaatgccaaaatcacggacctaaaacagaaagttgaaaatctcttcaat gagaaatgtggggaagctcttggccttaaacaagctgtgaaggtgccgttcgcgttattt gagtctttcccggaagacttttatgtggaaggcttacctgagggtgtgccattccgaaga ccatcgacttttggcattccgaggctggagaagatactcagaaacaaagccaaaattaag ttcatcattaaaaagcccgaaatgtttgagacggcgattaaggagagcacctcctctaag agccctcccagaaaaataaattcatcacccaatgttaatactactgcatcaggtgttgaa gaccttaacatcattcaggtgacaattccagatgatgataatgaaagactctcgaaagtt gaaaaagctagacagctaagagaacaagtgaatgacctctttagtcggaaatttggtgaa gctattggtatgggttttcctgtgaaagttccctacaggaaaatcacaattaaccctggc tgtgtggtagttgatggcatgcccccgggggtgtccttcaaagcccccagctacctggaa atcagctccatgagaaggatcttagactctgccgagtttatcaaattcacggtcattaga agcaatagcagggcttgctgtggcatccgcttcctgcgggggctcaaggttgctcgttcc tcaggctacataaagcagagatcaggtccggaccatgagctgaccctggactcaggcaag gtgcatttaaggcgcagcctggaagtgccagggagcactggaggccacccagtcatgggg gacaccttcatccgtcacatcgccctgctgggctttgagaagcgcttcgtacccagccag cactatgtgtacatgttcctggtgaaatggcaggacctgtcggagaaggtggtctaccgg cgcttcaccgagatctacgagttccataaaaccttaaaagaaatgttccctattgaggca ggggcgatcaatccagagaacaggatcatcccccacctcccagctcccaagtggtttgac gggcagcgggccgccgagaaccgccagggcacacttaccgagtactgcagcacgctcatg agcctgcccaccaagatctcccgctgtccccacctcctcgacttcttcaaggtgcgccct gatgacctcaagctccccacggacaaccagacaaaaaagccagagacatacttgatgccc aaagatggcaagagtaccgcgacagacatcaccggccccatcatcctgcagacgtaccgc gccattgccaactacgagaagacctcgggctccgagatggctctgtccacgggggacgtg gtggaggtcgtagagaagagcgagagcggttggtggttctgtcagatgaaagcaaagcga ggctggatcccagcgtccttcctcgagcccctggacagtcctgacgagacggaagaccct gagcccaactatgcaggtgagccatacgtcgccatcaaggcctacactgctgtggagggg gacgaggtgtccctgctcgagggtgaagctgttgaggtcattcacaagctcctggacggc tggtgggtcatcaggaaagacgacgtcacaggctacttcccgtccatgtacctgcaaaag tcagggcaagacgtgtcccaggcccaacgccagatcaagcggggggcgccgccccgcagg tcgtccatccgcaacgcgcacagcatccaccagcggtcgcggaagcgcctcagccaggac gcctatcgccgcaacagcgtccgttttctgcagcagcgacgccgccaggcgcggccggga ccgcagagccccgggagcccgctcgaggaggagcggcagacgcagcgctctaaaccgcag ccggcggtgcccccgcggccgagcgccgacctcatcctgaaccgctgcagcgagagcacc aagcggaagctggcgtctgccgtctga >gi568815591f:74674032_74889157|GENSCAN_predicted_peptide_2|1050_aa MAPKHKSSDAGNLDRPKRSRKVLPLSEKVKVLDLIRKDKKSYAEVAKIYGKNESSIREIV KKEKEIRASFAVSPPTAKVTATVRDKCLVKMEQALHLWVEEMNRKRVPIDSNMLRQKALS LYQDFCKGCSETDTKPFTASKGWLHRFRHRFSHHYKKKKKGIMAQVAVSTLPVEEESSSE TRMVVTFLVSALESMCKELAKSKAEVACIAVYETDVFVVGTERGCAFVNARTDFQKDFAK YCKALGTTVMVPVPYEKMLRDQSAVVVQGLPEGVAFQHPENYDLATLKWILENKAGISFI INRPFLGPESQLGGPGMVTDAERSIVSPSESCGPINVKTEPMEDSGSHPSSTSNEVIEME LPMEDSTPLVPSEEPNEDPEAEVKIEGNTNSSSVTNSAAGVEDLNIVQVTVPDNEKERLS SIEKIKQLREQVNDLFSRKFGEAIGVDFPVKVPYRKITFNPGCVVIDGMPPGVVFKAPGY LEISSMRRILEAAEFIKFTVIRPLPGLELSNGEYSTVGKRKIDQEGRVFQEKWERAYFFV EVQNIPTCLICKQSMSVSKEYNLRRHYQTNHSKHYDQYMERMRDEKLHELKKGLRKYLLG SSDTECPEQKQVFANPSPTQKSPVQPVEDLAGNLWEKLREKIRSFVAYSIAIDEITDINN TTQLAIFIRGVDENFDVSEELLDTVPMTGTKSGNEIFSRVEKSLKNFCIDWSKLVSVAST GTPAMVDANNGLVTKLKSRVATFCKGAELKSICCIIHPESLCAQKLKMDHVMDVVVKSVN WICSRGLNHSEFTTLLYELDSQYGSLLYYTEIKWLSRGLVLKRFFESLEEIDSFMSSRGK PLPQLSSIDWIRDLAFLVDMTMHLNALNISLQGHSQIVTQMYDLIRAFLAKLCLWETHLT RNNLAHFPTLKLASRNESDGLNYIPKIAELQTEFQKRLSDFKLYESELTLFSSPFSTKID SVHEELQMEVIDLQCNTVLKTKYDKVGIPEFYKYLWGSYPKYKHHCAKILSMFGSTYICE QLFSIMKLSKTKYCSQLKDSQWDSVLHIAT >gi568815591f:74674032_74889157|GENSCAN_predicted_CDS_2|3153_bp atggccccaaagcacaagagtagtgatgctgggaatttggataggccaaagagaagccgt aaagtgcttcctctaagtgaaaaggtgaaagttctcgacttaatcaggaaagacaaaaaa tcctatgctgaggttgctaagatctacgggaagaatgaatcttccatccgtgaaattgtg aagaaggaaaaagaaattcgtgctagttttgctgtctcacctccaactgctaaagtgacg gccacagtgcgtgataagtgcttagttaagatggaacaggcactgcatttgtgggtggaa gagatgaacagaaaacgtgttcccattgacagcaacatgttgcgccagaaagctttgagc ctataccaagacttctgcaagggatgctctgaaactgacaccaagccatttactgcgagt aagggatggttacacagattcaggcatagattctcacatcattacaagaagaagaagaag gggatcatggcccaggtagcagtgtccaccctgcctgttgaagaagagtcctcctcagag accaggatggtggtgacattcctcgtgtctgccctcgaatccatgtgtaaagaactggcc aagtccaaggcagaagtggcctgcatcgcagtgtacgaaacagacgtgtttgtcgtcgga accgagagaggatgcgcttttgttaatgccaggacggattttcagaaagattttgcaaaa tactgtaaagccttagggacaacagtgatggtgcctgttccctatgagaagatgctgcga gaccagtcggctgtggtagtgcaggggcttccggaaggcgttgcctttcaacaccctgag aattacgaccttgcaaccctgaaatggattttggagaacaaagcagggatttcattcatc ataaatagacccttcctaggaccagagagtcagctgggtggccctgggatggtaacagat gcggagagatccatagtatcaccaagtgaaagctgcggccccatcaatgtgaaaactgaa cccatggaagattctggaagccacccttcttccacaagcaatgaagtaatagaaatggaa ttaccaatggaagattccactccgctggtcccttcagaagaaccaaatgaggaccctgaa gccgaggtgaaaatcgaaggaaacacaaattcatccagtgttacaaattctgcagcaggt gttgaagatcttaacatcgttcaagtgactgttccagataatgagaaggaaagattatca agcattgaaaagattaaacagctaagagaacaagttaatgacctctttagccgaaaattt ggtgaagcaattggcgtggatttccctgtgaaagttccctacaggaagatcacattcaac cctggctgtgtggtgattgatggcatgcccccgggggtggtattcaaggcccccggctat ctggaaatcagttccatgaggaggatcttggaggcagctgagtttatcaaattcacagtc atcaggccgcttccagggcttgagctcagtaatggtgagtattctacagtgggaaaacgc aagatagaccaggagggccgtgtgtttcaagaaaagtgggagagagcgtatttcttcgtg gaagtacagaatattccaacatgtctcatatgcaaacaaagcatgtctgtgtccaaagaa tataacctaagacgccactatcaaaccaatcacagcaagcattatgaccagtatatggaa agaatgcgtgacgagaagcttcacgagctgaaaaaagggctcaggaagtatctcttaggc tcatcagacaccgagtgtcccgagcaaaaacaagtgtttgcaaacccaagtccaacccag aaatcccccgtgcagcctgtagaggacctagctgggaacttatgggagaagttacgtgaa aaaatcaggtcttttgtggcatattctatcgcaatcgatgagatcacggatataaataat accacccagttggccatattcatccgtggtgtcgatgagaatttcgatgtgtccgaagaa cttctggacacggtgcccatgacgggtacaaaatctggcaacgagatcttttcgcgtgtt gagaaaagcctgaaaaacttctgtatcgactggtcgaaattagtaagcgtggcctccact ggcaccccagcgatggtggatgccaataacgggcttgtcacaaaactgaagtccagggtg gcgacgttctgcaagggtgcggaactgaagtccatctgttgtataattcatccggaatca ctctgtgctcagaagttgaagatggaccacgtcatggacgtggtagtgaagtccgtgaac tggatatgctcccggggactgaaccacagtgagttcacaaccttgctctatgagctggac agccagtatggtagcctcctgtactacacggagattaagtggctcagtcgcgggctcgtg ctaaagagatttttcgaatccttggaagaaatcgactccttcatgtcatccagagggaaa cccctgcctcaactgagctccatagattggatccgagacctggccttcttggttgacatg acgatgcatctgaacgctttgaacatctctctccaaggacactcccaaatcgtcacgcag atgtatgacctgatccgggcgttcctagcaaaactgtgcctctgggagactcatttgacg aggaataatctggcccactttcccaccctgaaattggcttccagaaatgaaagcgatggc ctgaactacattcccaaaatcgcggaactccagaccgaattccagaaaaggctgtctgat ttcaaactctacgaaagcgaactgactctgttcagctccccgttctccacgaagatcgac agtgtgcacgaggagctccagatggaggttatcgacctgcaatgcaacacggtcctgaag acgaaatacgacaaggtgggaataccagaattctacaagtacctctggggtagctacccg aaatacaagcaccattgcgcaaagattctttccatgttcgggagcacctacatctgcgaa cagctgttctccattatgaaactgagcaaaacaaaatactgctcccagttaaaggattcc cagtgggattctgtactccacatcgcaacgtga >gi568815591f:74674032_74889157|GENSCAN_predicted_peptide_3|173_aa VAKYPKKGSQAVHRHSRKQSEPPANDIFNAAKAAKSDMQDWMVSMIVDREYSVAVEAVRL LILILKNMEGVLMDVDCESVYPIVLFYPECEIRTMGGREQRQSPGAQRTFFQLLLSFFVE SKLHDHAAYLVDNLWDCAGTQLKDWEGLTSLLLEKDQSTCHMEPGPGTFHLLG >gi568815591f:74674032_74889157|GENSCAN_predicted_CDS_3|522_bp gtggcaaaatatccaaagaaagggtcccaagcggtacatcgtcatagccggaaacagtca gagccaccagccaatgatattttcaatgctgcgaaagctgccaaaagtgacatgcaggac tggatggtttccatgatcgtggacagagagtacagtgtggcagtggaggccgtcagatta ctgatacttatccttaagaacatggaaggggtgctgatggacgtggactgtgagagcgtc taccccattgtacttttctaccctgagtgcgagataagaacgatgggtggaagagagcaa cgccagagcccaggcgcccagaggactttcttccagcttctgctgtccttctttgtggag agcaagctccacgaccacgctgcttacttagtagacaacctgtgggactgtgcagggact cagctgaaggactgggagggtctgacaagcctgctgctggagaaggaccagagcacgtgc cacatggagccagggccagggaccttccacctcctagggtga