GENSCAN 1.0 Date run: 2-Nov-116 Time: 23:28:33 Sequence gi568815595f:45294454_45647527 : 353074 bp : 44.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 7811 7850 40 -2.76 1.01 Init + 12483 12678 196 2 1 100 7 165 0.013 6.70 1.02 Intr + 37336 37410 75 2 0 102 37 49 0.078 0.69 1.03 Intr + 39047 39182 136 0 1 115 87 88 0.564 11.03 1.04 Intr + 39235 39408 174 1 0 83 100 23 0.627 2.05 1.05 Intr + 50700 50748 49 0 1 22 87 69 0.029 -1.12 1.06 Intr + 75036 75137 102 2 0 52 97 51 0.023 2.77 1.07 Intr + 86420 86458 39 1 0 111 85 2 0.001 0.62 1.08 Intr + 100019 100234 216 1 0 65 76 90 0.271 4.20 1.09 Intr + 105792 105920 129 2 0 90 110 146 0.778 17.89 1.10 Intr + 118793 118840 48 1 0 89 79 21 0.302 0.18 1.11 Intr + 123029 123120 92 1 2 94 115 59 0.665 7.99 1.12 Intr + 125216 125276 61 2 1 90 110 29 0.772 4.04 1.13 Intr + 152438 152527 90 1 0 110 107 50 0.489 9.29 1.14 Intr + 164290 164433 144 0 0 44 113 108 0.880 9.38 1.15 Intr + 179790 179897 108 2 0 93 55 111 0.818 8.78 1.16 Intr + 182015 182174 160 1 1 87 105 148 0.920 15.96 1.17 Intr + 185885 185933 49 0 1 68 101 34 0.395 0.44 1.18 Intr + 193198 193385 188 1 2 30 92 88 0.369 2.73 1.19 Intr + 194244 194359 116 1 2 78 73 80 0.984 5.67 1.20 Intr + 197064 197347 284 2 2 61 116 195 0.941 15.82 1.21 Intr + 201822 201920 99 0 0 83 98 64 0.928 6.13 1.22 Intr + 205989 206126 138 0 0 101 87 71 0.965 7.88 1.23 Intr + 218682 218782 101 0 2 112 110 74 0.760 11.75 1.24 Intr + 221641 221823 183 2 0 59 106 242 0.842 22.86 1.25 Intr + 223450 223619 170 2 2 40 71 69 0.462 0.07 1.26 Intr + 225766 225843 78 0 0 45 123 46 0.562 3.55 1.27 Intr + 229544 229655 112 1 1 107 98 75 0.991 10.35 1.28 Intr + 247376 247503 128 0 2 91 119 142 0.583 18.00 1.29 Term + 252898 253077 180 0 0 116 48 208 0.957 17.21 1.30 PlyA + 254362 254367 6 1.05 2.00 Prom + 254504 254543 40 -5.56 2.01 Init + 256419 256456 38 2 2 85 64 77 0.298 2.59 2.02 Intr + 260888 260906 19 2 1 99 94 6 0.209 -1.09 2.03 Intr + 260979 261041 63 2 0 49 113 70 0.387 4.51 2.04 Term + 295822 296115 294 0 0 74 40 149 0.121 4.01 2.05 PlyA + 297724 297729 6 1.05 3.00 Prom + 297778 297817 40 -9.85 3.01 Sngl + 300427 301842 1416 0 0 80 54 745 0.936 66.24 3.02 PlyA + 304730 304735 6 -1.75 4.04 PlyA - 304980 304975 6 1.05 4.03 Term - 307070 306879 192 2 0 100 55 91 0.706 4.42 4.02 Intr - 309783 309692 92 1 2 87 76 22 0.652 0.61 4.01 Init - 316108 316054 55 1 1 104 85 47 0.579 7.55 4.00 Prom - 324876 324837 40 -3.46 5.04 PlyA - 326963 326958 6 1.05 5.03 Term - 328949 328887 63 2 0 95 48 55 0.331 0.09 5.02 Intr - 335425 335281 145 0 1 74 62 78 0.662 4.08 5.01 Init - 338318 338272 47 2 2 70 89 69 0.628 3.39 5.00 Prom - 341648 341609 40 -6.66 6.05 PlyA - 342231 342226 6 1.05 6.04 Term - 343223 343147 77 0 2 91 48 78 0.606 2.10 6.03 Intr - 348554 348496 59 1 2 101 35 63 0.523 0.83 6.02 Intr - 350086 350035 52 2 1 80 115 43 0.634 4.17 6.01 Init - 352981 352897 85 1 1 43 95 42 0.448 1.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 12483 12734 252 2 0 100 37 206 0.889 9.80 S.002 Init - 88216 88101 116 1 2 55 116 128 0.989 11.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:45294454_45647527|GENSCAN_predicted_peptide_1|1214_aa MPAPPPPTPPPAPHPGLLRGPSLPTSTTPYSTARGPIDRSRAEECGCTARDWQAAPPVAP VWDPLGNWKQQEYLLRTQNDIQDEEALEVSGTTMNPTYRTGELCRWKEVGDVETLSTTEA AGQEKVSIAVLLPTKKLLTGLCPLLSEKCQCLGPSTGDISLLRRVMTTDPRHFLLAGTVP RVLGTGSLSPYIRSSGVWKYKMEMQLIQFLLWGGSSNFERGEKSGKDYISWFECQLSCST IEHQVNQGFLLGLDPLLRLGFYASLLKRQLNGGPDVIKWERRVIPGCTRSIYSATGKWTK EYTLQTRKDVEKWWHQRIKEQASKISEADKSKPKFYVLSMFPYPSGKLHMGHVRVYTISD TIARFQKMRGMQKATMLNVAKEMEAEIVVINPMGWDAFGLPAENAAVERNLHPQSWTQSN IKHMRKQLDRLGLCFSWDREITTCLPDYYKWTQYLFIKLYEAGLAYQKEALVNWDPVDQT VLANEQVDEHGCSWRSGAKVEQKYLRQWFIKTTAYAKAMQDALADLPEWYGIKGMQAHWI GDCVGCHLDFTLKVHGQATGEKLTAYTATPEAIYGTSHVAISPSHRLLHGHSSLKEALRM ALVPGKVSLETSYMDCRLGEHQMECYWYSSTSCNAKDSPRNKVHNVSSTKVEKSCPYKTK ELGGKEKKTHSYEYPKPKQQALAPLGIPSTSSEDTILAQTLGLAYSEVIETLPDGTERLS SSAEFTGMTRQDAFLALTQKARGKRVGGDVTSDKLKDWLISRQRYWGTPIPIVHCPVCGP TPVPLEDLPVTLPNIASFTGKGGPPLAMASEWVNCSCPRCKGAAKRETDTMDTFVDSAWY YFRYTDPHNPHSPFNTAVADYWMPVDLYIGGKEHAVMHLFYARFFSHFCHDQKMVKHREP FHKLLAQGLIKGQTFRLPSGQYLQREEVDLTGSVPVHAKTKEKLEVTWEKMSKSKHNGVD PEEVVEQYGIDTIRLYILFAAPPEKDILWDVKTDALPGVLRWQQRLWTLTTRFIEARASG KSPQPQLLSNKEKAEARKLWEYKNSVISQVTTHFTEDFSLNSAISQLMGLSNALSQASQS VILHSPEFEDALCALMVMAAPLAPHVTSEIWAGLALVPRKLCAHYTWDASVLLQAWPAVD PEFLQQPEVVQMAVLINNKACGKIPVPQQVARDQDKVHEFVLQSELGVRLLQGRSIKKSF LSPRTALINFLVQD >gi568815595f:45294454_45647527|GENSCAN_predicted_CDS_1|3645_bp atgcctgcacctcccccccccaccccacccccggctccccaccctgggctcctgcgcggc ccgagcctcccaacgagcaccaccccctactccacggcacgtggtcccatcgaccgctca agggctgaggagtgcggatgcacagcacgggactggcaggcagctccacctgtggcccca gtgtgggatccactgggcaattggaaacagcaggaatatctccttagaacccaaaacgat attcaggatgaggaggcattagaggtctcagggactaccatgaatcctacttaccgcact ggagagctgtgcagatggaaggaagtgggagatgtggaaacactttcaacaactgaagca gcaggccaagaaaaggtgtcaatagctgtgcttttgcccacaaaaaagttgttgactggg ctgtgccccctgttgtcagagaagtgtcagtgtctgggaccatctactggtgatatttcc ctgctcagaagagtcatgaccacagaccccaggcatttcctcttggcaggcactgtgccc agagttttaggaacaggctcactgagtccatacatcagaagttctggagtctggaagtac aagatggagatgcagctgattcagttcttgttatggggaggctcctccaactttgaaagg ggagagaagagtgggaaggactacatctcatggtttgagtgccagctcagctgcagtaca atagaacaccaggtaaatcagggatttcttcttggtttggatccattgttgagattgggt ttttatgcctctcttctgaaaagacagctaaatggtgggccagatgtcatcaagtgggaa aggagagtaattcccggatgtaccagaagcatctacagtgccacgggaaagtggacaaaa gagtatacattgcagacaagaaaggatgttgagaaatggtggcatcaacgaataaaagaa caggcctccaaaatttcagaagctgataaatcgaagccaaaattttacgtgctttccatg ttcccttatccttctggtaagctgcacatgggccatgtgcgtgtctacaccatcagcgac accatagcacggttccagaagatgagagggatgcagaaagcaactatgttaaatgtggcc aaagagatggaggctgaaattgtggtcatcaaccccatgggatgggatgcttttggattg cctgctgaaaatgccgcagtcgagaggaatctacatccacaaagttggacacaaagtaat attaaacacatgaggaaacagcttgatcgtctgggcctgtgtttcagctgggatagggaa ataactacgtgtttgccagattactacaagtggactcagtatctctttattaaactgtat gaggctgggctggcctatcaaaaggaggccctggttaactgggacccagtggatcaaaca gtgcttgccaatgagcaggtggatgaacatggctgttcatggcgttctggagcaaaggtg gaacagaagtacctcagacaatggtttattaagacaaccgcttatgcaaaggccatgcag gacgcgttggcagaccttccagaatggtatggaataaaaggcatgcaagcccactggatt ggggactgtgtgggctgccacctggacttcacattaaaggttcatgggcaagccacgggc gaaaagctgactgcctatacggccacccctgaagccatttatggcacctcccacgtggcc atctcgcccagccacagactcctacatgggcacagctctctgaaggaagccttgaggatg gcccttgtccctggcaaagtatccctagaaactagttatatggactgccgccttggtgaa caccaaatggagtgctattggtacagttcaacatcctgcaatgcaaaagacagcccccgc aacaaagtccacaatgtcagtagtaccaaggttgagaaatcctgcccttacaaaaccaaa gagctggggggaaaagaaaaaaaaacccattcttatgaatacccgaagccaaaacagcag gccttggcccccctgggaattcccagtactagctcagaggacaccatcttagcccaaacc ctgggcctggcctactctgaagtcattgaaactttgccagatggcacagagagactgagc agctctgctgagttcacaggtatgacccggcaggatgcttttctagccctgactcagaaa gcccgggggaagagagtgggtggagacgtgacaagtgataaactgaaagactggctgatt tcacggcagcggtactggggcacaccaatccccattgtccactgcccagtctgtggcccc acacctgtgcccctggaggacttgcctgtgaccctgcccaacatcgcgtctttcactggc aagggaggccccccactggccatggcttcagagtgggtgaactgctcctgcccaaggtgc aagggagcagccaagagagagacagacacgatggatacctttgttgattctgcttggtac tacttcagatacactgaccctcataatccacacagcccttttaacacagcagtggccgat tactggatgcctgtggatttgtacattggagggaaagaacatgccgtcatgcacttgttc tatgcaagattctttagtcatttttgccatgatcaaaaaatggttaaacatagggagcct tttcataagctgctggcccaaggccttatcaaggggcagacattccgcctaccatctgga cagtatctacagagagaggaagtggatctcacaggttccgttcctgttcatgcaaaaacg aaagagaagttagaggtgacgtgggagaagatgagtaagtccaaacacaacggggtggac ccagaggaagttgtggagcagtatgggatcgacacgattcggctctacatcctttttgct gcccctcctgagaaggatatcttgtgggatgtgaaaactgatgctctccctggggtgctg agatggcaacaacgactgtggaccttgacaactcggtttattgaggccagggcttctggg aagtctccccagcctcagctgctgagtaacaaggagaaagctgaggccaggaagctctgg gagtacaagaactccgtcatctctcaggtgaccacccatttcacagaggacttctcactg aattctgcaatttctcagctgatgggactcagcaatgccctctcgcaagcctctcagagc gtcattctccacagccccgagtttgaggatgctttgtgtgccctgatggtaatggctgct ccactggcccctcatgtaacctcagagatctgggcaggcctggcgctggtgccgaggaag ctctgtgcccactacacttgggatgccagtgtgctgctccaggcatggcctgctgtggac ccggagttcctgcagcagcctgaggttgtccagatggcagttctgatcaacaataaagct tgtggcaaaattcctgtgccccaacaagttgcccgggaccaggacaaagtccacgaattt gttcttcaaagcgagctgggtgtcaggcttttgcaaggacgaagcatcaagaagtccttc ctttccccgagaactgccctcatcaacttcctggtgcaagattga >gi568815595f:45294454_45647527|GENSCAN_predicted_peptide_2|137_aa MKFLLTELLSLGRSPTLAWCPEEQPPEEEELPVSLACQEKPDRLCSARATGVDPMLAKGK PAMEQRELCEQASMGSGHCTQPGMPAVAVQAAPGASTGTAVAGPGVPQATSTAGTGECGG TWKLGDTRNRRAPKRVS >gi568815595f:45294454_45647527|GENSCAN_predicted_CDS_2|414_bp atgaagttcctgctcacggagttgctcagtctgggcagaagccccaccctggcctggtgc cctgaggagcagcctccagaggaagaggagttgccagtctccctggcctgccaggaaaag cctgacaggctgtgctcagctcgtgctactggtgtggatcccatgcttgccaagggcaag ccagccatggagcagcgagagctgtgtgagcaagcaagcatggggtctggccactgcaca cagccaggcatgccagctgtggcagtgcaggcagctccaggtgccagcacgggcacagct gtggctggaccaggcgtaccacaggcaacttccaccgctggcactggggaatgcggtggc acctggaagcttggagacaccaggaaccgcagagccccaaagagggtatcataa >gi568815595f:45294454_45647527|GENSCAN_predicted_peptide_3|471_aa MDKYDDLGLEASKFIEDLNMYEASKDGLFRVDKGAGNNPEFEETRRVFATKMAKIHLQQQ QQQLLQEETLPRGSRGPVNGGGRLGPQARWEVVGSKLTVDGAAKPPLAASTGAPGAVTTL AAGQPPYPPQEQRSRPYLHGTRHGSQDCGSRESLATSEMSAFHQPGPCEDPSCLTHGDYY DNLSLASPKWGDKPGVSPSIGLSVGSGWPSSPGSDPPLPKPCGDHPLNHRQLSLSSSRSS EGSLGGQNSGIGGRSSEKPTGLWSTASSQRVSPGLPSPNLENGAPAVGPVQPRTPSVSAP LALSCPRQGGLPRSNSGLGGEVSGVMSKPNVDPQPWFQDGPKSYLSSSAPSSSPAGLDGS QQGAVPGLGPKPGCTDLGTGPKLSPTSLVHPVMSTLPELSCKEGPLGWSSDGSLGSVLLD SPSSPRVRLPCQPLVPGPELRPSAAELKLEALTQRLEREMDAHPKADYFGE >gi568815595f:45294454_45647527|GENSCAN_predicted_CDS_3|1416_bp atggataagtatgacgacctgggcctggaggccagtaaattcatcgaggacctgaacatg tatgaggcctctaaggatgggctcttccgagtggacaagggtgcaggcaacaaccccgag tttgaggaaactcgcagggtgttcgccaccaagatggccaaaatccacctccagcagcag cagcagcagctcctgcaggaggagactctgcccagggggagtagaggccctgtcaatgga gggggccgcctgggcccacaggcccgttgggaagttgtgggcagcaagctgactgtggat ggtgctgccaagcctcctcttgctgcctcgacaggggcacctggggcagtcaccaccctc gctgctgggcagcccccgtacccaccgcaggagcagagatccaggccatacctgcatggc acgaggcatggcagccaggactgtggttccagggagagcctggcgacttctgagatgtct gctttccaccagccaggcccctgtgaggatccttcctgcctcactcatggagactattat gacaacctctccttggcaagcccaaagtggggtgacaaaccaggagtgtcccccagcatc ggcctgagtgtagggagtgggtggcctagctccccggggagtgacccaccactgcccaaa ccctgcggggaccatcccctaaatcaccgacagctctccctgagctccagcaggtcttct gagggtagcctcggtggtcagaatagtggcattggtggccgcagcagcgagaagccaaca ggcctttggtccactgcctcctcccagcgggtgagccctggcctgccttccccaaacttg gagaacggagcaccagctgtggggcctgttcagcccaggaccccttctgtgtcagcaccc ttggccctgagctgccccaggcaaggaggtcttccaagatcaaactcggggctggggggt gaggtttcaggtgtgatgtccaaacccaatgtggacccccaaccctggttccaggatggg cccaaatcttacctttccagttctgccccgtcatcctcgccagctggtctggacggttca cagcagggtgcggtccctgggctggggccgaagcctggctgcacagaccttggcactggt cccaagctcagccccaccagtcttgtccatccagtgatgtccaccctgcctgagttatct tgtaaagagggtcccctgggctggtcttctgatggtagcctgggatctgtgctcctggac agccccagctcccctagggtaaggctgccctgccagcccctcgtcccaggtcctgagctg agaccctctgctgctgagttgaaattagaagccctcacccaacgtctggagcgagagatg gatgctcacccgaaggctgattactttggtgagtga >gi568815595f:45294454_45647527|GENSCAN_predicted_peptide_4|112_aa MHACQHVSLLPAKPAQAFGRASSPGGHTVEVALGRRHQLLAVSSPSFTQEGGTVGMLQRP FDEGEDRGGHSCWGAAGRLPSCQPLKRLSQLHRAASPKVTIFLGSPQPTADP >gi568815595f:45294454_45647527|GENSCAN_predicted_CDS_4|339_bp atgcatgcctgccagcatgtctccctgctccctgccaaacctgcgcaggcctttgggcga gcttccagcccaggaggacatacagtggaggtggccctgggcagacggcaccaactcctt gctgtctccagcccaagcttcacccaggaagggggcactgtggggatgctgcaaaggccc ttcgatgaaggcgaggacaggggagggcacagctgctggggtgcggcaggcaggttgccc tcctgtcagcccctgaagagactatcccagctgcacagagctgcctcgcccaaagtcacc atcttcctgggcagcccacagccaacagctgatccatga >gi568815595f:45294454_45647527|GENSCAN_predicted_peptide_5|84_aa MVVVHLPLGTAPTMLWVCNGSVARFFSAPLLKPLGEHTGSPPTAPAKLCLVLPADGLPVW WRLSGNKELKVPLSKSCTPPPTPT >gi568815595f:45294454_45647527|GENSCAN_predicted_CDS_5|255_bp atggtggtggtgcatctgccactgggcacagcacctaccatgctgtgggtgtgcaatggg agtgtggctcgcttcttcagtgcccccctgctcaaacctctaggggagcatacaggctct cctccgactgccccagccaaactctgcctcgttctgccggctgatggcctgccagtgtgg tggcgtctgtcgggaaacaaggagctcaaagtacccctgagcaagtcctgcaccccaccc ccaactcctacatga >gi568815595f:45294454_45647527|GENSCAN_predicted_peptide_6|90_aa MHLVLENLATGSTLSQPQYSWEESGSSHGFSAPPWVEEGYAVSGPRVCQQLPESVQCQWG GMAAWEGFKMRIPKPMVVSPDHTLEVSEKL >gi568815595f:45294454_45647527|GENSCAN_predicted_CDS_6|273_bp atgcatctggtccttgaaaacctggccacaggatccacactcagccagcctcagtactcc tgggaagagtctgggagctcccatggatttagtgcgccgccatgggtggaggagggttat gcggtgtcaggcccacgggtgtgccaacagcttcctgagagtgtccagtgccagtggggg ggcatggccgcctgggaaggattcaaaatgaggatccccaaaccaatggtagtcagccct gatcacaccctggaagtatctgagaagctgtga