GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:29:10 Sequence gi568815587r:118654715_118886251 : 231537 bp : 45.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 891 976 86 1 2 91 77 131 0.965 11.84 1.02 Intr + 1146 1178 33 2 0 106 65 38 0.748 1.72 1.03 Term + 1969 2109 141 0 0 106 36 209 0.999 15.43 1.04 PlyA + 3292 3297 6 1.05 2.15 PlyA - 3536 3531 6 -1.95 2.14 Term - 3727 3575 153 1 0 91 48 145 0.939 8.72 2.13 Intr - 4157 3966 192 2 0 73 99 38 0.473 3.09 2.12 Intr - 4303 4191 113 2 2 11 105 131 0.961 7.20 2.11 Intr - 4767 4656 112 0 1 28 75 118 0.900 4.45 2.10 Intr - 5367 5033 335 1 2 40 78 268 0.795 16.29 2.09 Intr - 6019 5825 195 2 0 103 80 159 0.981 15.99 2.08 Intr - 6201 6152 50 2 2 71 105 74 0.590 5.72 2.07 Intr - 6568 6446 123 0 0 71 94 66 0.970 5.20 2.06 Intr - 7015 6923 93 0 0 53 64 146 0.995 7.78 2.05 Intr - 7276 7176 101 1 2 86 93 53 0.970 4.51 2.04 Intr - 8254 8167 88 0 1 85 64 76 0.997 4.87 2.03 Intr - 8482 8338 145 2 1 126 80 192 0.999 21.54 2.02 Intr - 8725 8625 101 0 2 130 72 122 0.998 14.55 2.01 Init - 24913 24825 89 1 2 103 75 141 0.806 12.42 2.00 Prom - 28547 28508 40 -6.86 3.08 PlyA - 29482 29477 6 1.05 3.07 Term - 33471 33319 153 0 0 94 48 68 0.367 1.32 3.06 Intr - 35068 35015 54 1 0 85 99 52 0.659 5.18 3.05 Intr - 35454 35278 177 0 0 34 65 94 0.400 1.82 3.04 Intr - 35922 35671 252 0 0 28 78 284 0.491 19.03 3.03 Intr - 45843 45819 25 2 1 128 66 -13 0.002 -1.47 3.02 Intr - 63529 63397 133 2 1 95 87 18 0.009 2.10 3.01 Init - 65551 65479 73 1 1 31 110 35 0.020 1.38 3.00 Prom - 66097 66058 40 -2.56 4.06 PlyA - 66476 66471 6 1.05 4.05 Term - 67354 67222 133 0 1 70 43 124 0.088 3.66 4.04 Intr - 73253 72850 404 2 2 49 36 238 0.228 7.93 4.03 Intr - 74461 74356 106 0 1 37 21 140 0.144 2.42 4.02 Intr - 87176 86980 197 2 2 49 92 117 0.879 6.41 4.01 Init - 87509 87483 27 2 0 68 77 32 0.428 -0.19 4.00 Prom - 92062 92023 40 -3.46 5.13 PlyA - 92150 92145 6 1.05 5.12 Term - 100173 99998 176 1 2 66 42 224 0.994 13.52 5.11 Intr - 100789 100688 102 2 0 63 94 49 0.856 3.15 5.10 Intr - 101609 101546 64 2 1 76 119 78 0.999 7.99 5.09 Intr - 102573 102457 117 0 0 71 109 20 0.498 3.06 5.08 Intr - 104188 104060 129 1 0 119 90 134 0.973 17.59 5.07 Intr - 105330 105208 123 0 0 116 101 13 0.983 6.18 5.06 Intr - 108592 108498 95 2 2 28 80 65 0.984 -0.62 5.05 Intr - 110641 110495 147 2 0 59 110 62 0.850 5.71 5.04 Intr - 113638 113509 130 1 1 79 96 44 0.863 4.57 5.03 Intr - 125022 124918 105 0 0 83 91 55 0.972 5.71 5.02 Intr - 126470 126407 64 1 1 85 123 35 0.999 5.42 5.01 Init - 131537 131338 200 2 2 69 97 126 0.831 9.97 5.00 Prom - 132991 132952 40 -7.96 6.04 PlyA - 133095 133090 6 1.05 6.03 Term - 135440 135292 149 0 2 106 45 60 0.878 1.56 6.02 Intr - 137486 137336 151 2 1 83 40 160 0.680 10.44 6.01 Init - 157005 156955 51 0 0 83 64 34 0.051 1.73 6.00 Prom - 158900 158861 40 -4.36 7.02 PlyA - 159487 159482 6 1.05 7.01 Sngl - 179346 179074 273 0 0 13 38 282 0.824 11.03 7.00 Prom - 179945 179906 40 -4.56 8.00 Prom + 180840 180879 40 -7.26 8.01 Init + 203084 203086 3 1 0 87 101 0 0.133 1.20 8.02 Intr + 217701 217886 186 2 0 69 80 97 0.260 6.89 8.03 Intr + 225736 225811 76 0 1 112 92 38 0.273 5.69 8.04 Intr + 229217 229278 62 0 2 27 105 143 0.117 8.35 8.05 Term + 229605 229694 90 2 0 118 37 25 0.661 -1.88 8.06 PlyA + 229805 229810 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_1|86_aa XKHETKLKGVIYFQAIEEVYYDHLRSAAKKRFFRFTMVTESPNPALTFCVKTHDRLYYMV APSAEAMRIWMDVIVTGAEGYTQFMN >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_1|261_bp nacaagcatgagacgaagctgaagggagtcatctatttccaggccattgaggaagtgtac tacgaccacctgcgcagtgcagccaagaagaggtttttccgcttcactatggtgactgag agcccgaacccagccctcaccttctgcgtaaagacccatgaccggctgtactacatggtg gccccatctgcagaggccatgcgtatctggatggatgtcattgtcacaggggctgagggc tacactcagttcatgaactaa >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_2|629_aa MPGRTWELCLLLLLGLGLGSQEALPPPCESEIYCHGELLNQVQMAKLYQDDKQFVDMPLS IAPEQVLQTFTELSRDHNHSIPREQLQAFVHEHFQAKGQELQPWTPADWKDSPQFLQKIS DAKLRAWAGQLHQLWKKLGKKMKPEVLSHPERFSLIYSEHPFIVPGGRFVEFYYWDSYWV MEGLLLSEMAETVKGMLQNFLDLVKTENIETLALELDFWTKNRTVSVSLEGKNYLLNRYY VPYGGPRPESYSKDVELADTLPEGDREALWAELKAGAESGWDFSSRWLIGGPNPNSLSGI RTSKLVPVDLNAFLCQAEELMSNFYSRLGCLSLCRETQSGEAKVEPRKKPGAAGRALAAG SKGSTAPGNDSQATKYRILRSQRLAALNTVLWDEQTGAWFDYDLEKKKKNREFYPSNLTP LWAGCFSDPGVADKALKYLEDNRILTYQYGIPTSLQKTGQQWDFPNAWAPLQDLVIRGLA KAPLRRAQEVAFQLAQNWIRTNFDVYSQKSAMYEKAVPVATPEHCRGCWAGRAASPQGLW LFVVPGAGGQYLTYSPPACPQYDVSNGGQPGGGGEYEVQEGFGWTNGVVLMLLDRYGDRL TSGAKLAFLEPHCLAATLLPSLLLSLLPW >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_2|1890_bp atgccagggaggacctgggagctgtgcctgctactgctgctggggctgggactggggtcc caggaggccctacccccaccctgtgagagtgagatttactgccacggggagctcctaaac caagttcaaatggccaagctctaccaggatgacaagcagtttgtggacatgccactgtct atagctccagaacaagtcctgcagaccttcactgagctgtccagggaccacaatcacagc atccccagggagcagctgcaggcgtttgtccacgaacacttccaggccaaggggcaggag ctgcagccctggacccctgcagactggaaagacagcccccagttcctgcagaagatttca gatgccaaactgcgtgcctgggcagggcagctgcatcagctctggaagaagctggggaag aagatgaagccagaggttctcagccaccctgagcggttctctctcatctactcagaacat cccttcattgtgcctggcggtcgctttgttgagttctactactgggactcctactgggtc atggagggtctgctcctctcagagatggctgagacggtgaagggcatgctgcagaacttc ttggacctggtgaaaacggaaaacattgaaacactagccttggaattggacttttggacc aagaacaggactgtctctgtgagcttggagggaaagaactacctcctgaatcgctattat gtcccttatgggggacccaggcctgagtcctacagcaaagatgtggagttggctgacacc ttgccagaaggagaccgggaggctctgtgggctgagctcaaggctggggctgagtctggc tgggacttctcttcacgctggctcattggaggcccaaaccccaactcgcttagcggcatc cgaacaagcaaactggtgcctgttgacctgaatgccttcctatgccaagcagaggagctg atgagcaacttctattccaggctggggtgtctcagcctttgcagagaaacacaaagcggg gaggctaaagtagaaccacgaaaaaagcctggggcagcaggcagggcactggctgctggc tctaaaggcagcactgccccagggaacgactcccaggccacgaagtacagaatcctgcgg tcgcagcgcttggccgccctgaacacagtcctgtgggatgagcagaccggagcctggttc gattacgaccttgagaagaagaagaaaaaccgggagttttacccatccaacctcactcca ctctgggccgggtgtttctctgaccctggcgtggcggacaaggctctgaaatacctggag gacaaccggatcctgacttaccagtatgggatcccgacctctctccagaagacaggccag cagtgggatttccccaatgcctgggcccccctgcaggacctggtcatcagaggcctggcc aaggcacctttacgtcgggcccaggaagtggctttccagctggctcagaattggatccga accaattttgatgtctactcgcagaagtcagccatgtatgagaaggctgtcccagtggcc actcctgagcactgcaggggctgctgggcaggcagagcagcttctccccaggggctctgg ttgtttgttgttcctggagctggggggcagtacctgacatacagcccacccgcttgtccc cagtatgacgtcagcaacggtggacagcccggtgggggaggagaatatgaagttcaggag ggatttggctggacgaatggcgtggtcctgatgctgctggaccgctatggtgaccggctg acctcaggggccaagctggctttcctggagccccactgcctggcggccacccttctgccc agcctcctgctcagcctcctgccatggtga >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_3|288_aa MLPDRLGSPVDHRGCRALSNLTVEGKVERTKGLLKTHLTKLSHQLKKDWTILLPLSLLRS QTCPQNATRSFLSPIKQRAPVGPKGSTAPGNDSQATKYRILPAQRLAALNTVLWDEQTSA WFDYDLENKENREFYPSNLTPLWNGCFSDPGVADKALKYLESSPTHSHGNVWNLLGQPDP DLPICPRIGSEPTLMSTSKSAMYEKMSWPEPSPYPQGVPVYDISNGGEPGGGGEYEVQEG FGWMNGVVLMLLDCCGDRLTLGAQLAFLEPHCLAATLFPSLLLSLLPQ >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_3|867_bp atgctgcctgatcgccttggaagccccgtagaccatcgcggatgccgagctttaagtaat ctcacagtggagggaaaagtagaacggactaaaggtcttttaaaaacacacctcaccaag ctcagccaccaacttaaaaaggactggacaatacttttaccactttcccttctcagaagt cagacatgtcctcagaatgctacaaggtcatttctctctccaataaagcagagggctcct gtagggcccaaaggcagcactgccccagggaacgactcccaggccacaaagtacagaatc ctgccggcgcagcgcttggccgccctgaacacagtcctgtgggatgagcagaccagtgcc tggttcgactacgaccttgagaacaaggagaaccgggagttttacccatccaacctcact ccactctggaacgggtgtttctctgaccctggcgtggcggacaaggctctgaaatacctg gagtccagccccacccacagtcacgggaatgtctggaaccttctagggcagccagatcct gatctaccaatatgccccagaattggatccgaaccaactttgatgtctaccagcaagtca gccatgtatgagaagatgagctggcctgagccctccccgtacccccagggcgtccccgtg tatgacattagcaacggtggagagcccggtggcggaggggagtatgaagttcaggagggc tttggctggatgaatggcgtggtcctgatgctactggactgctgtggtgaccggctgacc ttaggggcccagctggctttcttggagccccactgtctggcggccaccctttttcccagc ctcctgctcagccttctgccacagtga >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_4|288_aa MLIHGIMDVGGPEQEKALKQAQAALQVALPLVSYGLAEVSMADRNAYMEALKGFFRCVVA QPFRNVKKSYAAVYRVKVLSSVVIEDKSTRDPEYVYSQPYGKLVGSEEARTWWIDGVQEQ VRRNRAANPPVNIDADQLLGRGQNWSTISQQAFMQNEAIEQVRAICLRAWEKIQDPGSAC PSFNTVRQGSKEPYPDFVARLQDVAQKSIADEKAHKVIVELMAYENANPECQSAIKPLKG KVPTGCTQQLRRDSDHREWAMMTMAVLSKRKGGNVGKSKRDQIVTVSV >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_4|867_bp atgctgatccacggtatcatggacgtgggtggcccagaacaagagaaggctctgaaacag gcccaggctgcattacaagtagctctgccacttgtgtcttatggtctggcagaagtgtct atggctgacaggaatgcttatatggaggctttgaaaggctttttcagatgtgtggtagca cagccctttaggaatgtaaagaaaagctatgccgccgtctacagggtgaaggtactctca agcgtggtcattgaggacaagtcgacgagagatcccgagtacgtctacagtcagccttac ggtaagcttgtgggctcagaagaagctaggacttggtggattgatggggtacaagaacag gtccgaagaaatagggctgccaatcctccagttaacatagatgcagatcaactattagga agaggtcaaaattggagtactattagtcaacaagcattcatgcaaaatgaggccattgag caagttagagctatctgccttagagcctgggaaaaaatccaagacccaggaagcgcctgc ccctcatttaatacagtaagacaaggttcaaaagagccctaccctgattttgtggcaagg ctccaagatgttgctcaaaagtcaattgccgacgaaaaagcccataaggtcatagtggag ttgatggcatatgaaaacgccaatcctgagtgtcaatcagccattaagccattaaaagga aaggttcccacagggtgtacccaacagctccgaagagacagcgaccatcgagaatgggcc atgatgacgatggcggttttgtcgaaaagaaaagggggaaatgtggggaaaagcaagaga gatcagattgttactgtgtctgtatag >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_5|483_aa MSTARTENPVIMGLSSQNGQLRGPVKPTGGPGGGGTQTQQQMNQLKNTNTINNGTQQQAQ SMTTTIKPGDDWKKTLKLPPKDLRIKTSDVTSTKGNEFEDYCLKRELLMGIFEMGWEKPS PIQEESIPIALSGRDILARAKNGTGKSGAYLIPLLERLDLKKDNIQAMVIVPTRELALQV SQICIQVSKHMGGAKVMATTGGTNLRDDIMRLDDTVHVVIATPGRILDLIKKGVAKVDHV QMIVLDEADKLLSQDFVQIMEDIILTLPKNRQILLYSATFPLSVQKFMNSHLQKPYEINL MEELTLKGVTQYYAYVTERQKVHCLNTLFSRLQINQSIIFCNSSQRVELLAKKISQLGYS CFYIHAKMRQEHRNRVFHDFRNGLCRNLVCTDLFTRGIDIQAVNVVINFDFPKLAETYLH RIGRSGRFGHLGLAINLITYDDRFNLKSIEEQLGTEIKPIPSNIDKSLYVAEYHSEPVED EKP >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_5|1452_bp atgagcacggccagaacagagaaccctgttataatgggtctgtccagtcaaaatggtcag ctgagaggccctgtgaaacccactggtggccctggaggagggggcacacagacacagcaa cagatgaaccagctgaaaaacaccaacacaatcaataatggcactcagcagcaagcacag agtatgaccaccactattaaacctggtgatgactggaaaaagactttaaaactccctcca aaggatctaagaatcaaaacttcggatgtgacctccacaaaaggaaatgagtttgaagat tactgtttgaaacgggagttactgatgggaatttttgaaatgggctgggaaaagccatct cctattcaggaggagagcattcccattgctttatctggtagggatatcttagctagagca aaaaatggaacaggcaagagcggtgcctacctcattcccttacttgaacggctagacctg aagaaggacaatatacaagcaatggtgattgttcccactagagaacttgctctacaggtc agtcaaatttgcatccaggtcagcaaacacatgggaggggccaaagtgatggcaaccaca ggaggaaccaatttacgagatgacataatgaggcttgatgatacagtgcacgtggtgatt gctacccctgggagaatcctggatcttattaagaaaggagtagcaaaggttgatcatgtc cagatgatagtattggatgaggcagataagttgctgtcacaggattttgtgcagataatg gaggatattattctcacgctacctaaaaacaggcagattttactatattccgctactttc cctcttagtgtacagaagttcatgaattcccatttgcagaaaccctatgagattaacctg atggaggaactaactctgaagggagtaacccagtactacgcatatgtaactgagcgccaa aaagtacactgcctcaacacacttttctccaggcttcagataaaccagtcgatcattttc tgtaactcctctcagcgagttgaattgctagccaagaagatttctcaactgggttattct tgcttctatattcatgctaaaatgaggcaggaacatcgaaatcgtgtatttcatgatttc cgaaatggcttatgccgcaatcttgtttgcactgatctgtttacccgaggtattgatata caagctgtgaatgtggtaataaactttgatttcccaaagctggcagagacctatctccat cgtattggaagatcaggtcgctttggtcatcttggcttagccatcaacttgatcacatat gatgatcgcttcaacctgaaaagtattgaggagcagctgggaacagaaattaaacctatt ccgagcaacattgataagagcctgtatgtggcagaataccacagcgagcctgtagaagat gagaaaccttaa >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_6|116_aa MAGKGGAGTSFTGRQDGTGPPRYALTVRSPAVLSRRTLKSGAFPPQTPEAHPQARCLCAP RRGALKPAPISLQNKTQSLQLAGKARKTALHLQTKALVGDDDTVLGVKLSIANYDL >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_6|351_bp atggcaggcaaaggaggagcaggcacctccttcacagggcggcaggatggaacaggtccc ccaagatacgccctcacagtccggtcccccgccgtcctctcccggcgcacgctcaagtcc ggtgccttccccccgcagacccccgaggcgcaccctcaagctcggtgcctctgcgccccc cgcaggggcgccctcaagcctgcccccatttcccttcagaataaaacccagtcactccaa cttgcaggcaaagctagaaaaactgcgctgcatttgcaaacaaaagctcttgttggcgat gacgatactgttttgggtgtgaaactgtcaattgctaactacgatctgtga >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_7|90_aa MEIWRGFDKAFKSDTEYSQSEELESFFTWFTDHSDAGAGELGEVIKDDIWPNSLQLYLVP DTDDKEKEEDDDDEGLEDTDKGEEDERGDG >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_7|273_bp atggaaatctggaggggatttgataaagcattcaagtcagacacagaatacagccagagt gaggaattggagagcttttttacctggttcactgatcattctgatgcgggtgctggagag ttaggagaggtcatcaaagatgatatttggccaaactcattacagctctacttggttcct gatacggatgataaagaaaaagaagaagatgatgatgatgaagggttggaagatactgac aaaggagaggaggatgaaagaggagatggctaa >gi568815587r:118654715_118886251|GENSCAN_predicted_peptide_8|138_aa MRRSHCVSSPVPYCCVINYSQVPGPHPVAPTLQPRRLPSWGSRAESFGNLSRRSSVIKVQ MRQAHQAFTLSKPPVAKTSMKSFLTFGYTGTAMNYPLTLEMDLENLEDLGRNMELDRKVL WRFSYRKEAGIANQSMNA >gi568815587r:118654715_118886251|GENSCAN_predicted_CDS_8|417_bp atgcgccggtcccactgtgtcagcagccctgtcccctactgctgtgtcatcaattactct caggtgcctggcccccacccagtggcccccaccttgcagccccgaaggcttccttcctgg ggcagcagggccgagtcatttggaaacctctctcggaggagctccgtgatcaaggtgcag atgcggcaggcacaccaagccttcactctctccaagcctccagtggccaagacttccatg aagtctttcctgacctttgggtataccggcacagccatgaactacccgctaacgctggaa atggacctcgagaacctggaggacctgggtcggaatatggaactagacaggaaagtactt tggaggttttcttaccgtaaggaggctggcattgctaatcagtcaatgaatgcatag