GENSCAN 1.0    Date run: 5-Nov-116    Time: 08:12:38

Sequence gi568815583f:78442176_78648869 : 206694 bp : 43.05% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -  10308  10167  142  2  1   96   84    73 0.733   8.06
 1.01 Init -  16408  16332   77  1  2   72   78    60 0.505   1.96
 1.00 Prom -  27918  27879   40                              -4.26

 2.00 Prom +  32955  32994   40                              -2.26
 2.01 Init +  34109  34184   76  1  1   78  110    22 0.988   4.65
 2.02 Intr +  36122  36222  101  0  2   94   76   104 0.991   9.63
 2.03 Intr +  41143  41259  117  0  0  110  100     6 0.960   4.66
 2.04 Intr +  42586  42745  160  0  1   11   94   106 0.998   3.06
 2.05 Intr +  43530  43665  136  1  1   80  106    52 0.960   5.83
 2.06 Intr +  51734  51881  148  2  1   82   93    46 0.827   4.74
 2.07 Intr +  65421  65496   76  2  1   63  105    52 0.548   3.49
 2.08 Intr +  70909  71250  342  2  0  123   82   243 0.967  22.60
 2.09 Intr +  72847  72932   86  2  2   44  110    69 0.692   4.24
 2.10 Intr +  85223  85388  166  1  1   15   82   163 0.843   7.93
 2.11 Term +  91035  91495  461  1  2  123   42   279 0.994  22.05
 2.12 PlyA +  92447  92452    6                               1.05

 3.00 Prom +  92952  92991   40                              -4.76
 3.01 Init +  97163  97235   73  1  1   85   66    65 0.787   5.23
 3.02 Intr +  98245  98364  120  2  0   57   93   136 0.906  11.47
 3.03 Intr + 100308 100470  163  1  1   83   80   107 0.974   8.43
 3.04 Intr + 102694 102782   89  1  2  100   53    33 0.990   0.61
 3.05 Intr + 103459 103589  131  2  2   67   68    89 0.996   5.21
 3.06 Intr + 104400 104523  124  2  1   71   86    57 0.995   3.96
 3.07 Term + 106615 106769  155  2  2   71   39   137 0.903   5.18
 3.08 PlyA + 106996 107001    6                               1.05

 4.00 Prom + 118989 119028   40                              -4.26
 4.01 Init + 123545 123650  106  1  1  102   99   200 0.965  20.88
 4.02 Intr + 138636 138787  152  1  2   47   66    83 0.038   1.88
 4.03 Intr + 146139 146248  110  2  2   50   97    47 0.864   0.88
 4.04 Intr + 147630 148461  832  0  1   63   58   424 0.301  28.30
 4.05 Term + 150917 151078  162  1  0  121   48    11 0.225  -1.66
 4.06 PlyA + 151925 151930    6                               1.05

 5.08 PlyA - 151972 151967    6                               1.05
 5.07 Term - 154557 154429  129  0  0  112   47    28 0.636  -0.72
 5.06 Intr - 160089 159078 1012  2  1  122   94  1392 0.996 133.90
 5.05 Intr - 166349 166164  186  1  0   74   39   110 0.018   3.50
 5.04 Intr - 174958 174849  110  1  2   53   80   110 0.678   5.78
 5.03 Intr - 176486 176442   45  2  0  122  108    58 0.999   9.81
 5.02 Intr - 176740 176601  140  2  2   38   89   179 0.809  13.18
 5.01 Init - 178619 178538   82  2  1   96  101   313 0.994  32.53
 5.00 Prom - 181165 181126   40                              -5.96

 6.08 PlyA - 181233 181228    6                               1.05
 6.07 Term - 183116 182958  159  2  0  137   55   240 0.998  23.54
 6.06 Intr - 185220 185119  102  0  0    3   30   146 0.594   0.67
 6.05 Intr - 187770 186792  979  2  1   52   84  1602 0.979 147.14
 6.04 Intr - 189010 188901  110  1  2   56   89   212 0.590  17.18
 6.03 Intr - 189157 189113   45  1  0  112  111    70 0.998  10.31
 6.02 Intr - 193412 193264  149  0  2   64   66   247 0.627  20.05
 6.01 Init - 198958 198904   55  1  1   90  101   124 0.900  13.35

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 202966 202879   88  1  1   85  109    79 0.848  10.56

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:78442176_78648869|GENSCAN_predicted_peptide_1|73_aa
MMTSWERWLMLVILALWEVRLEDHLREYNCSYGFNYHYIYTNNSKIFSVVQVCLLSSKTT
CPIQEVNMFTLIL

>gi568815583f:78442176_78648869|GENSCAN_predicted_CDS_1|219_bp
atgatgaccagctgggaacggtggctcatgcttgtaatcctagcactttgggaggtgagg
ctggaggatcatttgagggagtacaactgctcctatggcttcaattatcactacatctac
actaataattccaaaatattttctgtagtccaagtctgtcttctaagctccaaaaccaca
tgcccgattcaggaggtgaacatgtttactttaatactg

>gi568815583f:78442176_78648869|GENSCAN_predicted_peptide_2|622_aa
MCPEYGAILSFFPVDNVTLKHLEHTGFSKAKLESMETYLKAVKLFRNDQNSSGEPEYSQV
IQINLNSIVPSVSGPKRPQDRVAVTDMKSDFQACLNEKVGFKGFQIAAEKQKDIVSIHYE
GSEYKLSHGSVVIAAVISCTNNCNPSVMLAAGLLAKKAVEAGLRVKPYIRTSLSPGSGMV
THYLSSSGVLPYLSKLGLTPREFNSYGARRGNDAVMTRGTFANIKLFNKFIGKPAPKTIH
FPSGQTRRCAAEAALPVCGKAGSTPGRRVAADIMSSGNYQQSEALSKPTFSEEQASALVE
SVFGLKVSKVRPLPSYDDQNFHVYVSKTKDGPTEYVLKISNTKASKNPDLIEVQNHIIMF
LKAAGFPTASVCHTKGDNTASLVSVGRPIAELPVSPQLLYEIGKLAAKLDKTLQLSSLHR
ENFIWNLKNVPLLEKYLYALGQNRNREIVEHVIHLFKEEVMTKLSHFRECINHGDLNDHN
ILIESSKSASGNAEYQVSGILDFGDMSYGYYVFEVAITIMYMMIESKSPIQVGGHVLAGF
ESITPLTAVEKGALFLLVCSRFCQSLVMAAYSCQLYPENKDYLMVTAKTGWKHLQQMFDM
GQKAVEEIWFETAKSYESGISM

>gi568815583f:78442176_78648869|GENSCAN_predicted_CDS_2|1869_bp
atgtgtccggaatatggtgctatcctcagctttttccctgttgacaatgtgacattaaaa
catttagaacatacaggttttagcaaagccaaactcgaatcaatggaaacataccttaaa
gctgtgaaattgtttcgaaatgaccagaattcttcaggagaacctgaatactcccaggtg
atccagattaatctgaattcaatagttccatctgttagtggtccaaaaagacctcaggat
agagttgctgtgacagatatgaaaagcgatttccaggcttgcttaaatgaaaaggttgga
tttaaaggcttccaaattgcagctgaaaaacaaaaggatattgtctccattcattatgaa
ggaagtgaatataagctgtctcatggatcagtggtcattgctgcagttatcagttgtacc
aataattgcaatccatctgtcatgcttgctgcaggtcttttggctaaaaaggctgttgaa
gctggtctgcgtgttaaaccttatataagaacaagtttatctccaggcagtgggatggtt
acacattacctcagttcaagtggagtattaccatatctaagtaagcttggccttacccct
cgtgaattcaactcttacggagctcgaagaggtaatgatgctgtaatgacaagaggcact
tttgcaaatatcaagctttttaataagtttattggaaaaccagctcctaaaacaattcat
tttccatcaggacagacgcgccggtgcgcggccgaggccgcactacctgtctgcgggaaa
gcgggatccaccccaggacgtcgggtcgctgccgacataatgtcaagtggaaactatcag
cagtcagaggctcttagcaaacccactttcagtgaggaacaagcctctgcgttagtggag
tcagtgtttgggttgaaagtttccaaggtccggccacttcctagctatgatgaccaaaac
tttcatgtctacgtttcaaaaaccaaagatggcccaactgaatatgtcctcaaaataagc
aacaccaaggctagcaaaaatccagacctgattgaagtgcagaatcacatcatcatgttt
ctgaaagccgctggatttccaacagcctctgtgtgtcacactaaaggagacaacacagct
tctctcgtgtctgtaggaagacccatcgctgagcttcccgtcagcccccagctattgtat
gaaattggaaaactagctgccaaattggataagacactgcagttaagtagtcttcatcgg
gagaacttcatctggaatctgaaaaatgttcctcttctggagaaatacctgtatgccctg
ggccagaatcgaaaccgagagattgttgagcatgtcattcatctgttcaaggaggaagta
atgaccaaattaagtcattttcgagaatgtatcaatcacggagatcttaatgaccataat
attttaatagagtccagcaagtcagcctctggaaatgctgaatatcaagtgtctgggatt
ttagactttggtgacatgagctatggctactatgtgtttgaagtggcaattaccatcatg
tacatgatgattgagagcaagagtcctatacaagtaggaggccatgtccttgcagggttt
gaaagcatcaccccactgacagctgtagagaagggtgctttgtttttacttgtatgcagt
cgtttttgtcagtcacttgtcatggctgcatactcttgccagctatacccagagaacaaa
gactatctcatggttactgcaaaaaccgggtggaaacacttacagcaaatgtttgacatg
ggtcagaaagctgtagaagaaatctggtttgaaactgccaaatcctatgaatctgggatc
tccatgtga

>gi568815583f:78442176_78648869|GENSCAN_predicted_peptide_3|284_aa
MDNEHRNKYDNFSKEDKMVRVYADAVLRMRGGHISSGYSVSGGGLFFRGVKGSVDISGLQ
GLPSGRLYQVEYAMEAIGHAGTCLGILANDGVLLAAERRNIHKLLDEVFFSEKIYKLNEY
LLQYQEPIPCEQLVTALCDIKQAYTQFGGKRPFGVSLLYIGWDKHYGFQLYQSDPSGNYG
GWKATCIGNNSAAAVSMLKQDYKEGEMTLKSALALAIKVLNKTMDVSKLSAEKVEIATLT
RENGKTVIRVLKQKEVEQLIKKHEEEEAKAEREKKEKEQKEKDK

>gi568815583f:78442176_78648869|GENSCAN_predicted_CDS_3|855_bp
atggataatgaacacagaaacaagtatgacaatttcagtaaagaagataaaatggtgcga
gtatatgctgacgcggttctgcgcatgcgcgggggccatattagcagcggttattcggtg
agcggtggtggtttattcttccgtggagttaagggctccgtggacatctcaggtcttcag
ggtcttccatctggtcgcttataccaagttgaatatgccatggaagctattggacatgca
ggcacctgtttgggaattttagcaaatgatggtgttttgcttgcagcagagagacgcaac
atccacaagcttcttgatgaagtctttttttctgaaaaaatttataaactcaatgagtat
ttattacagtatcaggagccaataccttgtgagcagttggttacagcgctgtgtgatatc
aaacaagcttatacacaatttggaggaaaacgtccctttggtgtttcattgctgtacatt
ggctgggataagcactatggctttcagctctatcagagtgaccctagtggaaattacggg
ggatggaaggccacatgcattggaaataatagcgctgcagctgtgtcaatgttgaaacaa
gactataaagaaggagaaatgaccttgaagtcagcacttgctttagctatcaaagtacta
aataagaccatggatgttagtaaactctctgctgaaaaagtggaaattgcaacactaaca
agagagaatggaaagacagtaatcagagttctcaaacaaaaagaagtggagcagttgatc
aaaaaacatgaggaagaagaagccaaagctgagcgtgagaagaaagaaaaagaacagaaa
gaaaaggataaatag

>gi568815583f:78442176_78648869|GENSCAN_predicted_peptide_4|453_aa
MAARGSGPRALRLLLLVQLVAGRCGLAGAAGGAQRGLSEPSSIAKHEDSLLKDLFQDYER
WVRPVEHLNDKIKIKFGLAISQLVDVEWIDVKLRWNPDDYGGIKVIRVPSDSVWTPDIVL
FDNADGRFEGTSTKTVIRYNGTVTWTPPANYKSSCTIDVTFFPFDLQNCSMKFGSWTYDG
SQVDIILEDQDVDKRDFFDNGEWEIVSATGSKGNRTDSCCWYPYVTYSFVIKRLPLFYTL
FLIIPCIGLSFLTVLVFYLPSNEGEKICLCTSVLVSLTVFLLVIEEIIPSSSKVIPLIGE
YLVFTMIFVTLSIMVTVFAINIHHRSSSTHNAMAPLVRKIFLHTLPKLLCMRSHVDRYFT
QKEETESGSGPKSSRNTLEAALDSIRYITRHIMKENDVREVVEDWKFIAQVLDRMFLWTF
LFVSIVGSLGLFVPVIYKWANILIPVHIGNANK

>gi568815583f:78442176_78648869|GENSCAN_predicted_CDS_4|1362_bp
atggcggcgcgggggtcagggccccgcgcgctccgcctgctgctcttggtccagctggtc
gcggggcgctgcggtctagcgggcgcggcgggcggcgcgcagagaggattatctgaacct
tcttctattgcaaaacatgaagatagtttgcttaaggatttatttcaagactacgaaaga
tgggttcgtcctgtggaacacctgaatgacaaaataaaaataaaatttggacttgcaata
tctcaattggtggatgtggaatggatagatgtaaaattaagatggaaccctgatgactat
ggtggaataaaagttatacgtgttccttcagactctgtctggacaccagacatcgttttg
tttgataatgcagatggacgttttgaagggaccagtacgaaaacagtcatcaggtacaat
ggcactgtcacctggactccaccggcaaactacaaaagttcctgtaccatagatgtcacg
tttttcccatttgaccttcagaactgttccatgaaatttggttcttggacttatgatgga
tcacaggttgatataattctagaggaccaagatgtagacaagagagatttttttgataat
ggagaatgggagattgtgagtgcaacagggagcaaaggaaacagaaccgacagctgttgc
tggtatccgtatgtcacttactcatttgtaatcaagcgcctgcctctcttttataccttg
ttccttataataccctgtattgggctctcatttttaactgtacttgtcttctatcttcct
tcaaatgaaggtgaaaagatttgtctctgcacttcagtacttgtgtctttgactgtcttc
cttctggttattgaagagatcataccatcatcttcaaaagtcatacctctaattggagag
tatctggtatttaccatgatttttgtgacactgtcaattatggtaaccgtcttcgctatc
aacattcatcatcgttcttcctcaacacataatgccatggcgcctttggtccgcaagata
tttcttcacacgcttcccaaactgctttgcatgagaagtcatgtagacaggtacttcact
cagaaagaggaaactgagagtggtagtggaccaaaatcttctagaaacacattggaagct
gcgctcgattctattcgctacattacaagacacatcatgaaggaaaatgatgtccgtgag
gttgttgaagattggaaattcatagcccaggttcttgatcggatgtttctgtggactttt
cttttcgtttcaattgttggatctcttgggctttttgttcctgttatttataaatgggca
aatatattaataccagttcatattggaaatgcaaataagtga

>gi568815583f:78442176_78648869|GENSCAN_predicted_peptide_5|567_aa
MGSGPLSLPLALSPPRLLLLLLLSLLPVARASEAEHRLFERLFEDYNEIIRPVANVSDPV
IIHFEVSMSQLVKVDEVNQIMETNLWLKQIWNDYKLKWNPSDYGGAEFMRVPAQKIWKPD
IVLYNKQDPQLQVCWSLLEVHSRPCLPGYQQRWLQNSGGCRTVDLGDPQMLLPDRSSGSF
VSEEYPAVAVGDFQVDDKTKALLKYTGEVTWIPPAIFKSSCKIDVTYFPFDYQNCTMKFG
SWSYDKAKIDLVLIGSSMNLKDYWESGEWAIIKAPGYKHDIKYNCCEEIYPDITYSLYIR
RLPLFYTINLIIPCLLISFLTVLVFYLPSDCGEKVTLCISVLLSLTVFLLVITETIPSTS
LVIPLIGEYLLFTMIFVTLSIVITVFVLNVHYRTPTTHTMPSWVKTVFLNLLPRVMFMTR
PTSNEGNAQKPRPLYGAELSNLNCFSRAESKGCKEGYPCQDGMCGYCHHRRIKISNFSAN
LTRSSSSESVDAVLSLSALSPEIKEAIQSVKYIAENMKAQNEAKEIQDDWKYVAMVIDRI
FLWVFTLVCILGTAGLFLQPLMAREDA

>gi568815583f:78442176_78648869|GENSCAN_predicted_CDS_5|1704_bp
atgggctctggcccgctctcgctgcccctggcgctgtcgccgccgcggctgctgctgctg
ctgctgctgtctctgctgccagtggccagggcctcagaggctgagcaccgtctatttgag
cggctgtttgaagattacaatgagatcatccggcctgtagccaacgtgtctgacccagtc
atcatccatttcgaggtgtccatgtctcagctggtgaaggtggatgaagtaaaccagatc
atggagaccaacctgtggctcaagcaaatctggaatgactacaagctgaaatggaacccc
tctgactatggtggggcagagttcatgcgtgtccctgcacagaagatctggaagccagac
attgtgctgtataacaaacaggaccctcagctgcaggtctgttggagtttgctagaggtc
cactccagaccctgtttgcctgggtatcagcagcggtggctgcagaacagcggtggctgt
agaacagtggatcttggtgacccacagatgctgctgcctgatcgttcctctggaagtttt
gtatcagaggagtacccggccgttgctgttggggatttccaggtggacgacaagaccaaa
gccttactcaagtacactggggaggtgacttggatacctccggccatctttaagagctcc
tgtaaaatcgacgtgacctacttcccgtttgattaccaaaactgtaccatgaagttcggt
tcctggtcctacgataaggcgaaaatcgatctggtcctgatcggctcttccatgaacctc
aaggactattgggagagcggcgagtgggccatcatcaaagccccaggctacaaacacgac
atcaagtacaactgctgcgaggagatctaccccgacatcacatactcgctgtacatccgg
cgcctgcccttgttctacaccatcaacctcatcatcccctgcctgctcatctccttcctc
actgtgctcgtcttctacctgccctccgactgcggtgagaaggtgaccctgtgcatttct
gtcctcctctccctgacggtgtttctcctggtgatcactgagaccatcccttccacctcg
ctggtcatccccctgattggagagtacctcctgttcaccatgatttttgtaaccttgtcc
atcgtcatcaccgtcttcgtgctcaacgtgcactacagaaccccgacgacacacacaatg
ccctcatgggtgaagactgtattcttgaacctgctccccagggtcatgttcatgaccagg
ccaacaagcaacgagggcaacgctcagaagccgaggcccctctacggtgccgagctctca
aatctgaattgcttcagccgcgcagagtccaaaggctgcaaggagggctacccctgccag
gacgggatgtgtggttactgccaccaccgcaggataaaaatctccaatttcagtgctaac
ctcacgagaagctctagttctgaatctgttgatgctgtgctgtccctctctgctttgtca
ccagaaatcaaagaagccatccaaagtgtcaagtatattgctgaaaatatgaaagcacaa
aatgaagccaaagagattcaagatgattggaagtatgttgccatggtgattgatcgtatt
tttctgtgggttttcaccctggtgtgcattctagggacagcaggattgtttctgcaaccc
ctgatggccagggaagatgcataa

>gi568815583f:78442176_78648869|GENSCAN_predicted_peptide_6|532_aa
MRRAPSLVLFFLVALCGRGNCRVANAEEKLMDDLLNKTRYNNLIRPATSSSQLISIKLQL
SLAQLISVNEREQIMTTNVWLKQEWTDYRLTWNSSRYEGVNILRIPAKRIWLPDIVLYNN
ADGTYEVSVYTNLIVRSNGSVLWLPPAIYKSACKIEVKYFPFDQQNCTLKFRSWTYDHTE
IDMVLMTPTASMDDFTPSGEWDIVALPGRRTVNPQDPSYVDVTYDFIIKRKPLFYTINLI
IPCVLTTLLAILVFYLPSDCGEKMTLCISVLLALTFFLLLISKIVPPTSLDVPLIGKYLM
FTMVLVTFSIVTSVCVLNVHHRSPSTHTMAPWVKRCFLHKLPTFLFMKRPGPDSSPARAF
PPSKSCVTKPEATATSTSPSNFYGNSMYFVNPASAASKSPAGSTPVAIPRDFWLRSSGRF
RQDVQEALEGVSFIAQHMKNDDEDQSFLDQQHRHRLRAYRKCRVSGPNPDLLAPNLHFNQ
VVEDWKYVAMVVDRLFLWVFMFVCVLGTVGLFLPPLFQTHAASEGPYAAQRD

>gi568815583f:78442176_78648869|GENSCAN_predicted_CDS_6|1599_bp
atgaggcgcgcgccttccctggtccttttcttcctggtcgccctttgcgggcgcgggaac
tgccgcgtggccaatgcggaggaaaagctgatggacgaccttctgaacaaaacccgttac
aataacctgatccgcccagccaccagctcctcacagctcatctccatcaagctgcagctc
tccctggcccagcttatcagcgtgaatgagcgagagcagatcatgaccaccaatgtctgg
ctgaaacaggaatggactgattaccgcctgacctggaacagctcccgctacgagggtgtg
aacatcctgaggatccctgcaaagcgcatctggttgcctgacatcgtgctttacaacaac
gccgacgggacctatgaggtgtctgtctacaccaacttgatagtccggtccaacggcagc
gtcctgtggctgccccctgccatctacaagagcgcctgcaagattgaggtgaagtacttt
cccttcgaccagcagaactgcaccctcaagttccgctcctggacctatgaccacacggag
atagacatggtcctcatgacgcccacagccagcatggatgactttactcccagtggtgag
tgggacatagtggccctcccagggagaaggacagtgaacccacaagaccccagctacgtg
gacgtgacttacgacttcatcatcaagcgcaagcctctgttctacaccatcaacctcatc
atcccctgcgtgctcaccaccttgctggccatcctcgtcttctacctgccatccgactgc
ggcgagaagatgacactgtgcatctcagtgctgctggcactgacattcttcctgctgctc
atctccaagatcgtgccacccacctccctcgatgtgcctctcatcggcaagtacctcatg
ttcaccatggtgctggtcaccttctccatcgtcaccagcgtctgtgtgctcaatgtgcac
caccgctcgcccagcacccacaccatggcaccctgggtcaagcgctgcttcctgcacaag
ctgcctaccttcctcttcatgaagcgccctggccccgacagcagcccggccagagccttc
ccgcccagcaagtcatgcgtgaccaagcccgaggccaccgccacctccaccagcccctcc
aacttctatgggaactccatgtactttgtgaaccccgcctctgcagcttccaagtctcca
gccggctctaccccggtggctatccccagggatttctggctgcggtcctctgggaggttc
cgacaggatgtgcaggaggcattagaaggtgtcagcttcatcgcccagcacatgaagaat
gacgatgaagaccagagtttcttggaccagcagcaccggcatcgcctgcgagcttatcgg
aaatgcagagtctcaggccccaatccagacctgctagctccaaacctgcattttaaccag
gtcgttgaggactggaagtacgtggctatggtggtggaccggctgttcctgtgggtgttc
atgtttgtgtgcgtcctgggcactgtggggctcttcctaccgcccctcttccagacccat
gcagcttctgaggggccctacgctgcccagcgtgactga