GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:23:57 Sequence gi568815597f:32982250_33220179 : 237930 bp : 46.61% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1619 1614 6 1.05 1.04 Term - 5147 4765 383 0 2 40 43 190 0.178 4.70 1.03 Intr - 10074 9981 94 0 1 102 75 -11 0.199 -1.46 1.02 Intr - 15580 15462 119 2 2 33 72 94 0.346 2.48 1.01 Init - 16364 16262 103 2 1 66 80 76 0.649 5.00 1.00 Prom - 19183 19144 40 -3.56 2.07 PlyA - 21011 21006 6 1.05 2.06 Term - 31153 30932 222 1 0 112 35 269 0.953 20.92 2.05 Intr - 32345 32273 73 1 1 124 52 104 0.999 9.81 2.04 Intr - 39212 39118 95 2 2 30 63 123 0.998 2.76 2.03 Intr - 39454 39344 111 1 0 90 64 107 0.870 9.08 2.02 Intr - 42318 42193 126 0 0 82 82 58 0.612 5.48 2.01 Init - 54579 54487 93 0 0 105 99 105 0.893 13.68 2.00 Prom - 57720 57681 40 -3.46 3.00 Prom + 59991 60030 40 -7.66 3.01 Init + 68921 69164 244 1 1 72 53 134 0.099 6.24 3.02 Intr + 76088 76158 71 0 2 64 35 62 0.070 -2.60 3.03 Intr + 82177 82320 144 0 0 96 91 65 0.236 8.08 3.04 Intr + 88399 88767 369 0 0 -118 78 351 0.004 8.80 3.05 Intr + 99929 100105 177 1 0 116 89 190 0.924 22.02 3.06 Intr + 101705 101878 174 1 0 84 105 100 0.912 11.44 3.07 Intr + 109801 109973 173 0 2 66 82 203 0.426 16.24 3.08 Intr + 112299 112464 166 0 1 82 76 169 0.728 15.06 3.09 Intr + 114458 114620 163 1 1 92 73 210 0.995 19.45 3.10 Intr + 115818 115930 113 1 2 95 105 200 0.986 22.50 3.11 Term + 117286 117408 123 0 0 79 39 46 0.401 -2.82 3.12 PlyA + 119313 119318 6 1.05 4.00 Prom + 123810 123849 40 -6.16 4.01 Init + 124402 124581 180 0 0 87 77 108 0.848 8.78 4.02 Intr + 135653 135867 215 1 2 75 99 223 0.997 19.61 4.03 Intr + 137795 137929 135 2 0 100 84 165 0.996 16.98 4.04 Intr + 138168 138189 22 0 1 99 64 7 0.878 -3.05 4.05 Term + 138694 138822 129 0 0 118 53 135 0.987 11.18 4.06 PlyA + 139330 139335 6 1.05 5.03 PlyA - 141125 141120 6 1.05 5.02 Term - 147574 147505 70 0 1 95 36 84 0.619 1.31 5.01 Init - 151354 151209 146 1 2 78 115 73 0.515 6.87 5.00 Prom - 156856 156817 40 -7.56 6.07 PlyA - 158088 158083 6 -0.45 6.06 Term - 165478 164928 551 2 2 107 37 1069 0.998 98.16 6.05 Intr - 176119 176004 116 0 2 106 110 152 0.999 19.29 6.04 Intr - 177695 177439 257 2 2 93 115 678 0.891 67.24 6.03 Intr - 183317 183222 96 2 0 93 105 191 0.753 21.51 6.02 Intr - 199267 198776 492 1 0 49 85 897 0.360 78.70 6.01 Init - 199933 199745 189 1 0 63 74 162 0.622 9.31 6.00 Prom - 225186 225147 40 -2.46 7.02 PlyA - 225932 225927 6 1.05 7.01 Sngl - 227924 227046 879 2 0 45 42 284 0.600 15.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_1|232_aa MPKELQGSMERATDCLQCPGWLPYRKDVELLHLPDREGIPELQFEHEVLENQLPEAAQEE CYQVRDDLFQVIWPGLLSGLKRGIEYQVLVFVLLSENFLKIAWERDGRGYLLGALNTQTN VATVVPNGNTHLEPGMLVSAGLLLHGRGLQNLILERCPQEKVNDLRLLDGQGEGLDFLQG LDLHVLDQVAQLGDRHPFLIFILASASSMAQALTPTTIQAPMPLPKPPWKPL >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_1|699_bp atgcccaaggagctgcagggctccatggagagagcaactgactgcctgcagtgtcctgga tggcttccctataggaaggacgtggagctactacatcttccagacagggaaggcatccct gagttgcagtttgaacatgaggttctagagaaccagcttcctgaagcagcacaggaagag tgttaccaagtacgagatgacttgtttcaggtcatctggccagggctgcttagtggactc aaaagaggtatcgagtatcaggttttggtttttgttctgctatcagaaaattttttgaag attgcctgggagagagatggccgtggctacctcctcggagcacttaacacccagaccaac gtggccactgtagtccccaatggcaacacacaccttgaacctggtatgctggtcagcgca ggtctgcttttgcacgggcgtggtcttcaaaacctcatccttgagagatgcccccaggaa aaagtcaatgatctcagactcctcgatggtcaaggagaagggctagatttcctccaggga cttgatcttcatgtccttgaccaggtagcccaacttggtgacaggcatccattcctcatc ttcatccttgcctctgcaagctccatggcccaggccctgaccccaaccactatccaggcc ccaatgccactgccaaagcctccgtggaaaccactatga >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_2|239_aa MAPSVPAAEPEYPKGIRAVLLGPPGAGKGTQAPRLAENFCVCHLATGDMLRAMVASGSEL GKKLKATMDAGKLVSDEMVVELIEKNLETPLCKNGFLLDGFPRTVRQAEMLDDLMEKRKE KLDSVIEFSIPDSLLIRRITGRLIHPKSGRSYHEEFNPPKEPMKDDITGEPLIRRSDDNE KALKIRLQAYHTQTTPLIEYYRKRGIHSAIDASQTPDVVFASILAAFSKATCKDLVMFI >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_2|720_bp atggctcccagcgtgccagcggcagaacccgagtatcctaaaggcatccgggccgtgctg ctggggcctcccggggccggtaaagggacccaggcacccagattggctgaaaacttctgt gtctgccatttagctactggggacatgctgagggccatggtggcttctggctcagagcta ggaaaaaagctgaaggcaactatggatgctgggaaactggtgagtgatgaaatggtagtg gagctcattgagaagaatttggagacccccttgtgcaaaaatggttttcttctggatggc ttccctcggactgtgaggcaggcagaaatgctcgatgacctcatggagaagaggaaagag aagcttgattctgtgattgaattcagcatcccagactctctgctgatccgaagaatcaca ggaaggctgattcaccccaagagtggccgttcctaccacgaggagttcaaccctccaaaa gagcccatgaaagatgacatcaccggggaacccttgatccgtcgatcagatgataatgaa aaggccttgaaaatccgcctgcaagcctaccacactcaaaccaccccactcatagagtac tacaggaaacgggggatccactccgccatcgatgcatcccagacccccgatgtcgtgttc gcaagcatcctagcagccttctccaaagccacatgtaaagacttggttatgtttatctaa >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_3|638_aa MKRSRCRDRPQPPPPDRREDGVQRAAELSQSLPPRRRAPPGRQRLEERTGPAGPEGKEQP PALASQSAEIAASARPPPRLGRLLGFQKACRCWSLNPHILMALLRSLVPPDKKHPQVWRG RPPLHLAPNVGLFSRVKVRSSVVIEDKSMRDSRRGLSQRRRRRKKKKRGSSSKKKKRRKK RKKKKRKKRKRRKNRKKKKKRKNKRKKKRKKEEKKEEEEERRKKEEEDEEGRGRGRRKRK RKKRKKRRSRKKKETAAAAAAGERLGKWWPGECPVECVAYFLRRRLQQRLHPARQLLLQG MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ IKYAAKHGIQLLSFDNEMELAKVVKSHPSANFHIGSGCPDPQAYAQSIADARLVFEMGTE LGHKMHVLDLGGGFPGTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAF TVAVSIIAKKEVLLDQPGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKYS SSCCMLALTHSTPFIGSSEEEMMVPAHCHAPHRDLCFG >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_3|1917_bp atgaagcggagccgctgccgtgaccgaccgcagccgccgccgcccgaccgccgggaggat ggagttcagcgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgccc gggaggcagcggctggaggagcggacgggccccgcggggcccgagggcaaggagcagccg cctgccttggcctcccaaagtgccgagattgcagcctctgcccggccgccaccccgtctg ggaaggcttctgggattccagaaagcctgcaggtgttggagcctcaaccctcatatcctc atggccctgctgaggtctcttgtcccacctgacaagaaacacccacaggtgtggaggggc aggccaccccttcatctggcgcccaacgtggggcttttctctagggtgaaggtacgctcg agcgtggtcattgaggacaagtcaatgagagattcccgaagaggcctatctcaaagaaga agaaggagaaagaagaagaagagaggtagcagcagcaagaagaagaagaggaggaagaag agaaagaagaagaagaggaagaagaggaagaggaggaagaataggaagaagaagaagaag aggaagaacaagaggaagaagaagaggaagaaggaagaaaagaaagaagaggaagaagaa agaagaaagaaggaggaggaagatgaagaaggaagaggaagaggaagaaggaagaggaag aggaagaagaggaagaaaagaagaagcagaaagaagaaagaaacagcagcagcagcagca gctggagaaaggctgggaaaatggtggccaggagagtgtccggtagagtgtgttgcatac tttctaaggcggcggctgcagcagcggctccatccagcccgtcagctcctcctgcaaggc atggctggctacctgagtgaatcggactttgtgatggtggaggagggcttcagtacccga gacctgctgaaggaactcactctgggggcctcacaggccaccacggacgaggtagctgcc ttcttcgtggctgacctgggtgccatagtgaggaagcacttttgctttctgaagtgcctg ccacgagtccggcccttttatgctgtcaagtgcaacagcagcccaggtgtgctgaaggtt ctggcccagctggggctgggctttagctgtgccaacaaggcagagatggagttggtccag catattggaatccctgccagtaagatcatctgcgccaacccctgtaagcaaattgcacag atcaaatatgctgccaagcatgggatccagctgctgagctttgacaatgagatggagctg gcaaaggtggtaaagagccaccccagtgccaattttcacattggcagtggctgtcctgac cctcaggcctatgctcagtccatcgcagacgcccggctcgtgtttgaaatgggcaccgag ctgggtcacaagatgcacgttctggaccttggtggtggcttccctggcacagaaggggcc aaagtgagatttgaagagattgcttccgtgatcaactcagccttggacctgtacttccca gagggctgtggcgtggacatctttgctgagctggggcgctactacgtgacctcggccttc actgtggcagtcagcatcattgccaagaaggaggttctgctagaccagcctggcagggag gaggaaaatggttccacctccaagaccatcgtgtaccaccttgatgagggcgtgtatggg atcttcaactcagtcctgtttgacaacatctgccctacccccatcctgcagaagtactcg tcttcctgttgcatgctggctctcacccactccactcccttcattggctcctcagaagag gagatgatggtcccagcccactgccacgccccccaccgggatctttgctttggctga >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_4|226_aa MKDKNHMIISVDGEQAFDKIQHPFMIKTFNKLGMEGMYLNMIRAIYDKLTANVILNKDPK KPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQA CHITYAMSRVAWEALRRQLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIIARG LVSQVGKAVRVKQTPGGRFCFFGQCLLEQGWPRSIAVSSADSPGIP >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_4|681_bp atgaaggataaaaatcatatgatcatctcagtagatggagaacaagcatttgacaaaatt cagcatcctttcatgataaaaactttcaacaaattaggtatggaaggaatgtacctcaac atgataagggccatatatgacaagctcacagctaatgttatactcaacaaggatcctaag aaaccatccacggagcagcccctgtacagcagcagcctgtggggcccggcggttgatggc tgtgattgcgtggctgagggcctgtggctgccgcaactacacgtaggggactggctggtc tttgacaacatgggcgcctacactgtgggcatgggttcccccttttgggggacccaggcc tgccacatcacctatgccatgtcccgggtggcctgggaagcgctgcgaaggcagctgatg gctgcagaacaggaggatgacgtggagggtgtgtgcaagcctctgtcctgcggctgggag atcacagacaccctgtgcgtgggccctgtcttcaccccagcgagcatcattgcaaggggc ctggtcagccaggttggcaaggcagtcagagtaaagcagacacctggtggtcgcttttgc ttctttgggcagtgcctgttagaacagggctggccacggagtattgctgtgtccagtgcc gacagccctggcatcccctga >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_5|71_aa MGGLQAAVITIPALLSSSTLLVISCINGKRAAGGPRSPAPAHHPHEGPRNSDDLLILRNI TLYCPKDHQNG >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_5|216_bp atgggtggcttgcaagctgcagtgatcaccatcccggccctgctctcctcatccactctg ctggtcatcagctgtataaatggaaaacgtgccgccggtggtcccaggtcacctgcccca gcccaccatccccatgaggggcccagaaattctgatgacttgctgatacttcggaacatc accctttattgtccaaaagatcaccagaatgggtaa >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_6|566_aa MVGVGRGQPAGGPAPPPPAAGAELLGSRPRGRGGSRGHRQLGFGWFRVERLRWTEAVAAK LAGPSVLPPTAPRSLSRPPAPRAPLSAAPGAMACSLKDELLCSICLSIYQDPVSLGCEHY FCRRCITEHWVRQEAQGARDCPECRRTFAEPALAPSLKLANIVERYSSFPLDAILNARRA ARPCQAHDKVKLFCLTDRALLCFFCDEPALHEQHQVTGIDDAFDELQRELKDQLQALQDS EREHTEALQLLKRQLAETKSSTKSLRTTIGEAFERLHRLLRERQKAMLEELEADTARTLT DIEQKVQRYSQQLRKVQEGAQILQERLAETDRHTFLAGVASLSERLKGKIHETNLTYEDF PTSKYTGPLQYTIWKSLFQDIHPVPAALTLDPGTAHQRLILSDDCTIVAYGNLHPQPLQD SPKRFDVEVSVLGSEAFSSGVHYWEVVVAEKTQWVIGLAHEAASRKGSIQIQPSRGFYCI VMHDGNQYSACTEPWTRLNVRDKLDKVGVFLDYDQGLLIFYNADDMSWLYTFREKFPGKL CSYFSPGQSHANGKNVQPLRINTVRI >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_6|1701_bp atggtgggggtggggcggggtcaaccggctggtggccccgcccctcccccgcccgctgcg ggggcggagttgcttgggtcccgcccacgggggcggggaggcagccgcggccaccggcag ctcggattcggctggttccgggttgagaggctgcgctggaccgaagcggtggctgctaag ctcgcggggccctcggtgctgcctccgacagcgccgcgctctctcagccgcccccctgcc cctcgggcccccctctctgctgcccctggcgccatggcgtgcagcctcaaggacgagctg ctgtgctccatctgcctgagcatctaccaggacccggtgagcctgggctgcgagcattac ttctgccgccgctgcatcacggagcactgggtgcggcaggaggcgcagggcgcccgcgac tgccccgagtgccggcgcacgttcgccgagcccgcgctggcgcccagcctcaagctggcc aacatcgtggagcgctacagctccttcccgctggacgccatcctcaacgcgcgccgcgcc gcgcgaccctgccaggcgcacgacaaggtcaagctcttctgcctcacggaccgcgcgctt ctctgcttcttctgcgacgagcctgcactgcacgagcagcatcaggtcaccggcatcgac gacgccttcgacgagctgcagagggagctgaaggaccaacttcaggcccttcaagacagc gagcgggaacacaccgaagcgctgcagctgctcaagcgacaactggcggagaccaagtct tccaccaagagcctgcggaccactatcggcgaggccttcgagcggctgcaccggctgctg cgtgaacgccagaaggccatgctagaggagctggaggcggacacggcccgcacgctgacc gacatcgagcagaaagtccagcgctacagccagcagctgcgcaaggtccaggagggagcc cagatcctgcaggagcggctggctgaaaccgaccggcacaccttcctggctggggtggcc tcactgtccgagcggctcaagggaaaaatccatgagaccaacctcacatatgaagacttc ccgacctccaagtacacaggccccctgcagtacaccatctggaagtccctgttccaggac atccacccagtgccagccgccctaaccctggacccgggcacagcccaccagcgcctgatc ctgtcggacgactgcaccattgtggcttacggcaacttgcacccacagccactgcaggac tcgccaaagcgcttcgatgtggaggtgtcggtgctgggttctgaagccttcagtagtggc gtccactactgggaggtggtggtggcggagaagacccagtgggtgatcgggctggcacac gaagccgcaagccgcaagggcagcatccagatccagcccagccgcggcttctactgcatc gtgatgcacgatggcaaccagtacagcgcctgcacggagccctggacgcggcttaacgtc cgggacaagcttgacaaggtgggtgtcttcctggactatgaccaaggcttgctcatcttc tacaatgctgatgacatgtcctggctctacaccttccgcgagaagttccctggcaagctc tgctcttacttcagccctggccagagccacgccaatggcaagaacgttcagccgctgcgg atcaacaccgtccgcatctag >gi568815597f:32982250_33220179|GENSCAN_predicted_peptide_7|292_aa MKAEIKMFFETNKNKDTTYQNLWGTFKAVCRGNFIALNTHKRKQERSKIDTLTSQLKQLE KEEQTNSKASRRQQITKIRAELKEIETKQTLQKKINESRSWFFEKINKIHRPLARLIKKK REKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLSRLNQE EVESPNRPITGSEIEAIINSLPTKKSPGPDGVTAEFYQRYKEELVSFLLKLFQSIEKERI LPNSFYEASIILIPKPGRDTTTKKENFRPISLMNINAKILNKILANRIQQHI >gi568815597f:32982250_33220179|GENSCAN_predicted_CDS_7|879_bp atgaaggcagaaataaagatgttctttgaaaccaataagaacaaagacacaacataccag aatctctggggcacatttaaagcagtgtgtagagggaactttatagcactaaatacccac aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaacaactagag aaggaagagcaaacaaattcaaaagctagcagaaggcaacaaataactaagatcagagca gaactgaaggaaatagagacaaaacaaacccttcaaaaaaaaatcaatgaatccaggagc tggttttttgaaaagatcaacaaaattcatagaccactagcaagactaataaagaagaaa agagagaagaatcaaatagatgcaataaaaaatgataaaggggatatcaccaccgatccc acagaaatacaaactaccatcagggaatattataaacacctctatgcaaataaactagaa aacctagaagaaatggataaattcctggacacatacaccctctcaagactaaaccaggaa gaagttgaatctccaaatagaccaataacaggctctgaaattgaggcaataattaatagc ttaccaaccaaaaaaagtccaggaccagacggagtcacagccgaattctaccagaggtac aaagaagagctggtatcattccttctgaaattattccaatcaatagaaaaagagagaatc ctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcagagacaca acaacaaaaaaagagaattttaggccaatatccctgatgaacatcaatgcaaaaatcctt aataaaatactggcaaaccgaatccagcagcacatctaa