GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:39:43 Sequence gi568815590f:43003549_43222874 : 219326 bp : 43.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4299 4381 83 0 2 53 68 91 0.862 2.88 1.02 Intr + 6757 6857 101 2 2 74 84 111 0.995 9.13 1.03 Intr + 9503 9607 105 1 0 116 53 81 0.976 7.81 1.04 Intr + 9781 9852 72 0 0 95 95 42 0.942 5.20 1.05 Intr + 14810 14946 137 1 2 93 75 48 0.353 3.37 1.06 Term + 45743 45872 130 2 1 83 42 44 0.013 -3.05 1.07 PlyA + 46022 46027 6 1.05 2.00 Prom + 46778 46817 40 -1.36 2.01 Init + 52565 52998 434 1 2 40 97 349 0.850 24.69 2.02 Intr + 55544 55629 86 2 2 70 101 88 0.985 7.76 2.03 Intr + 60553 60667 115 2 1 59 103 141 0.998 12.21 2.04 Intr + 66007 66111 105 1 0 77 111 107 0.996 11.23 2.05 Intr + 68633 68759 127 2 1 75 72 38 0.978 1.58 2.06 Intr + 73668 73816 149 2 2 97 24 117 0.508 5.23 2.07 Intr + 81162 81333 172 0 1 97 99 112 0.981 13.15 2.08 Intr + 99980 100282 303 1 0 106 70 313 0.898 28.09 2.09 Intr + 110282 110450 169 1 1 77 58 111 0.001 6.52 2.10 Intr + 113831 113883 53 0 2 51 79 29 0.001 -3.07 2.11 Term + 118559 119329 771 1 0 110 48 741 0.214 65.70 2.12 PlyA + 122213 122218 6 1.05 3.00 Prom + 136145 136184 40 -5.66 3.01 Init + 136949 137066 118 1 1 82 105 311 0.975 30.56 3.02 Intr + 143400 143515 116 1 2 74 60 67 0.864 2.67 3.03 Intr + 155027 155163 137 1 2 99 58 103 0.769 7.77 3.04 Intr + 155375 155496 122 2 2 67 50 113 0.929 5.54 3.05 Intr + 157890 157959 70 1 1 123 94 -17 0.899 0.64 3.06 Intr + 165625 165694 70 1 1 83 84 81 0.984 6.38 3.07 Intr + 167258 167391 134 1 2 99 2 73 0.085 -0.76 3.08 Intr + 168762 168838 77 0 2 84 95 22 0.134 1.66 3.09 Intr + 178597 178712 116 2 2 117 89 136 0.932 16.77 3.10 Term + 180257 180325 69 1 0 91 54 17 0.740 -3.46 3.11 PlyA + 180539 180544 6 1.05 4.02 PlyA - 181336 181331 6 1.05 4.01 Sngl - 182527 181460 1068 1 0 43 39 286 0.809 16.35 4.00 Prom - 184099 184060 40 -3.56 5.00 Prom + 186629 186668 40 -3.66 5.01 Init + 188580 188590 11 2 2 68 77 15 0.456 -2.19 5.02 Intr + 188756 188882 127 2 1 64 68 180 0.854 14.28 5.03 Intr + 190209 190295 87 2 0 66 68 163 0.997 12.27 5.04 Intr + 191935 192180 246 0 0 -72 32 591 0.886 35.16 5.05 Intr + 192882 192992 111 2 0 111 49 33 0.586 2.28 5.06 Intr + 193400 193477 78 1 0 84 89 31 0.797 2.55 5.07 Intr + 194124 194194 71 2 2 121 84 21 0.940 2.98 5.08 Intr + 194292 194404 113 0 2 97 111 45 0.911 7.72 5.09 Term + 195840 196021 182 1 2 84 45 243 0.999 17.37 5.10 PlyA + 196977 196982 6 1.05 6.04 PlyA - 197969 197964 6 -0.45 6.03 Term - 200925 200738 188 1 2 72 47 203 0.932 12.25 6.02 Intr - 203550 203348 203 2 2 131 2 103 0.737 4.93 6.01 Init - 210609 210554 56 0 2 72 72 31 0.598 0.66 6.00 Prom - 211796 211757 40 -2.96 7.02 PlyA - 212302 212297 6 1.05 7.01 Term - 216436 216354 83 2 2 110 43 89 0.805 4.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 81612 81734 123 2 0 100 41 30 0.901 -2.12 S.002 Intr - 90022 89925 98 0 2 84 88 86 0.905 7.85 S.003 Term - 110116 109964 153 1 0 48 37 196 0.886 8.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_1|209_aa XEKLHEANNELQKKRAIIEDLEPRFNNSSLKIEELQEALRKKEEEMKQMEERYKKYLEKA KSVIRTLDPKQNQGAAPEIQALKNQLQERDRLFHSLEKEYEKTKSQREMEEKYIVSAWYN MGMTLHKKAAEDRLASTGSGQSFLARQRQATSSRRSYPGHVQPATASHTGRLDPQMHQTL PAQYLWTSAGNGLEDLQVGLLIHIPQIFI >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_1|630_bp nnagagaagctgcatgaggccaataatgaactacagaagaagagagccattattgaagat ctcgagccaagatttaacaacagctccttaaaaattgaagaattacaagaagctttacga aagaaagaggaagaaatgaagcaaatggaagaacgatacaaaaaatacttagagaaagcc aaaagtgtcatccgtactttagatcctaaacagaatcaaggagcagcaccagaaatacaa gctcttaaaaatcagctccaggaacgagaccgactgttccactcattagagaaagaatat gagaaaacaaagagtcagagagagatggaagagaaatatattgttagtgcctggtacaat atgggaatgaccctgcataaaaaggcagctgaagatagactggcaagcacaggctcaggg cagtcatttctggcgaggcagaggcaagcgaccagcagcagaagatcatacccaggccac gtgcagccggccacagcaagccacactggccgcctagatccacagatgcaccagactctc cctgctcagtatctttggacttctgccgggaatggtctagaagatctccaagtgggcttg ctcattcatattcctcaaatctttatttga >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_2|827_aa MGRRVGAWGISGEPRKLAADAASTRSQHPHTPTHTPAARQDSLARTSMVGNPENRKWPRE GEGRGGASATTSAADRGEMAATEGVGEAAQGGEPGQPAQPPPQPHPPPPQQQHKEEMAAE AGEAVASPMDDGFVSLDSPSYVLYRDRAEWADIDPVPQNDGPNPVVQIIYSDKFRDVYDY FRAVLQRDERSERAFKLTRDAIELNAANYTVWHFRRVLLKSLQKDLHEEMNYITAIIEEQ PKNYQVWHHRRVLVEWLRDPSQELEFIADILNQDAKNYHAWQHRQWVIQEFKLWDNELQY VDQLLKEDVRNNSVWNQRYFVISNTTGYNDRAVLEREVQILQDRGLSKYPNLLNQLLDLQ PSHSSPYLIAFLVDIYEDMLENQCDNKEDILNKALEEIAEAVNMEKQPQNSRRGLAPREV PPAVGLLLIMALMNTLLYLCLDHFFIAPRQSTVDPTHCPYGHFRIGQMKNCSPWLSCEEL RTEVRQLKRVGEGAVKRTGPSAAGLLEFARGPLQTLFAWVPAAVAAEQRIFVNRECCCLI VPLEVLSQRSTRPLEVDPSSIANVFILGFFTVFLSEWKEHKVALSQLTSLEMKDDFLHGL QMLKSLQGTHVVTLLGYCEDDNTMLTEYHPLGSLSNLEETLNLSKYQNVNTWQHRLELAM DYVSIINYLHHSPVGTRVMCDSNDLPKTLSQYLLTSNFSILANDLDALPLVNHSSGMLVK CGHRELHGDFVAPEQLWPYGEDVPFHDDLMPSYDEKIDIWKIPDISSFLLGHIEGSDMVR FHLFDIHKACKSQTPSERPTAQDVLETYQKVLDTLRDAMMSQAREML >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_2|2484_bp atgggaagacgcgtgggagcctgggggatctcgggagagccgcgcaaactcgcggcggac gcagccagcacccgcagccagcacccgcacactcccacccataccccggcagcccgccaa gactctctggcccgcacctctatggtaggaaacccagaaaacaggaagtggcctcgagag ggggaagggaggggcggggcctccgccaccacctcagctgcggaccgaggcgagatggcg gccaccgagggggtcggggaggctgcgcaagggggcgagcccgggcagccggcgcaaccc ccgccccagccgcacccaccgccgccccagcagcagcacaaggaagagatggcggccgag gctggggaagccgtggcgtcccccatggacgacgggtttgtgagcctggactcgccctcc tatgtcctgtacagggacagagcagaatgggctgatatagatccggtgccgcagaatgat ggccccaatcccgtggtccagatcatttatagtgacaaatttagagatgtttatgattac ttccgagctgtcctgcagcgtgatgaaagaagtgaacgagcttttaagctaacccgggat gctattgagttaaatgcagccaattatacagtgtggcatttccggagagttcttttgaag tcacttcagaaggatctacatgaggaaatgaactacatcactgcaataattgaggagcag cccaaaaactatcaagtttggcatcataggcgagtattagtggaatggctaagagatcca tctcaggagcttgaatttattgctgatattcttaatcaggatgcaaagaattatcatgcc tggcagcatcgacaatgggttattcaggaatttaaactttgggataatgagctgcagtat gtggaccaacttctgaaagaggatgtgagaaataactctgtctggaaccaaagatacttc gttatttctaacaccactggctacaatgatcgtgctgtattggagagagaagtccagatt ttgcaggatcgtggtctttccaaatatcctaatctgttaaatcaattacttgatttacaa ccaagtcatagttccccctacctaattgcctttcttgtggatatctatgaagacatgcta gaaaatcagtgtgacaataaggaagacattcttaataaagcattagaggaaattgcagag gccgtcaacatggaaaagcagccccagaacagcaggagaggcctcgccccccgagaggtg ccgccagctgttgggctgctgctgatcatggccctgatgaatactctgctctacctctgc ctcgaccacttcttcatcgctcctcgacaatccactgtggaccccacacactgtccctat ggtcacttcaggataggacagatgaaaaactgctcaccttggctgtcctgcgaggagctg agaacagaagtgagacagctgaagcgtgttggggaaggagctgtaaagagaacaggaccc tcagctgcaggtctgttggagtttgctagaggtccactccagaccctgtttgcctgggta ccagcagcggtggctgcagaacagcggattttcgtgaaccgcgaatgctgctgtctgatc gttcctctggaagttttgtctcagaggagtacccggcccctggaagtggatccatcatcc atcgcaaacgtctttatccttggttttttcacagtctttctgtctgagtggaaggagcac aaagttgcactctcacagctcaccagcctggagatgaaagatgatttcctccatggactg cagatgctgaaatctctccaaggcacacatgttgtcacgctgcttggctattgtgaggat gacaacactatgcttactgaatatcaccctctaggttccttgagtaacctggaagaaaca ctaaacctttcaaagtaccaaaatgtgaacacgtggcagcacaggctggagctggccatg gactatgtcagcatcattaattacctgcaccacagccctgtgggcacacgggtcatgtgc gactccaacgacctgccgaagacactgtcccagtatctgctaacaagcaacttcagcatt ttggcaaatgacttggacgccttacccctggtgaaccacagctccgggatgctggtgaag tgcggccacagggagctgcatggggatttcgtggctccagagcaactgtggccctatgga gaggacgtgcctttccacgatgatctcatgccctcatatgatgagaagattgacatttgg aagatcccagacatctccagtttccttctggggcacattgaagggagtgatatggtccga ttccatttgtttgatattcacaaagcatgcaagagccagactccctcagaaagacccact gcccaggacgttctggagacctaccagaaggtcttggatacacttagagatgccatgatg tctcaggcaagagagatgctgtga >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_3|342_aa MSGAGRALAALLLAASVLSAALLAPGGSSGRDAQAAPPRDLDKKRHAELKMDQALLLIHN ELLWTNLTVYWKSECCYHCLFQVLVNVPQSPKAGKPSAAAASVSTQHGSILQLNDTLEEK EVCRLEYRFGEFGNYSLLVKNIHNGVSEIACDLAVNEDPVDSNLPVSIAFLIGLAVIIVI SFLRLLLSLDDFNNWISKAISSRETDRLINSARKDFLDVKVNSAASPVWKTIMILSVCID RGVVDEARASVHLDTWIALILMVFVNYGGGKYWYFKHASWNVSWDKVRIPGVLQRLGVTY FVVAVLELLFAKPVPEHCASAPVCDVPHPVTKCSHCLIPTYE >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_3|1029_bp atgagcggggcgggcagggcgctggccgcgctgctgctggccgcgtccgtgctgagcgcc gcgctgctggcccccggcggctcttcggggcgcgatgcccaggccgcgccgccacgagac ttagacaaaaaaagacatgcagagctgaagatggatcaggctttgctactcatccataat gaacttctctggaccaacttgaccgtctactggaaatctgaatgctgttatcactgcttg tttcaggttctggtaaacgttcctcagagtccaaaagcagggaagcctagtgctgcagct gcctctgtcagcacccagcacggatctatcctgcagctgaacgacaccttggaagagaaa gaagtttgtaggttggaatacagatttggagaatttggaaactattctctcttggtaaag aacatccataatggagttagtgaaattgcctgtgacctggctgtgaacgaggatccagtt gatagtaaccttcctgtgagcattgcattccttattggtcttgctgtcatcattgtgata tcctttctgaggctcttgttgagtttggatgactttaacaattggatttctaaagccata agttctcgagaaactgatcgcctcatcaattctgctaggaaggatttcttggatgtaaaa gtaaactctgcagcctctccagtctggaagaccatcatgatcctcagtgtctgtattgac agaggagtagtggatgaggccagagccagtgtccacttggatacgtggattgctcttata ctcatggtctttgtcaattatggaggaggaaaatattggtacttcaaacatgcaagttgg aatgtgtcttgggacaaggtgcgcattcctggtgtgctgcagcgattgggagtgacatac tttgtggttgctgtgttggagctcctctttgctaaacctgtgcctgaacattgtgcctcg gccccggtgtgtgatgttccccaccctgtgaccaagtgttctcattgtttaattcccacc tatgagtga >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_4|355_aa MNIDAKILNKILGNRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIHHINRTKNKNHM IISIDAEKAFDKIQQPFMLKTFNKLGIDGTCLKIIRATYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIGLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISA QNFLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSEVPFTIASKRIKYLGIQLTR DVKDLLKENYKPLLNKIKEDTNKWKNIPCLWIGRINIMKMAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQKRACIAKTILSHKNKAGGIMPPGFKLYYKATVTKTAWYW >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_4|1068_bp atgaacatcgatgcaaaaatcctcaataaaatactgggaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtcggcttcattcctgggatgcaaggctggttcaac atatgcaaatcaataaatgtaatccatcatataaacagaaccaaaaacaaaaaccacatg attatctcaatagatgcagaaaaggccttcgacaaaattcaacagcccttcatgctaaaa actttcaataaactaggtatcgatgggacctgtctcaaaataataagagctacttatgac aaacccacagccaatatcatactgaatggacaaaaactggaagcattccctttgaaaact ggcacaagacagggatgccctctctcaccactcctgttcaacataggattggaagttctg gccagggcaatcaggcaggagaaagaaataaaaggtattcaattaggaaaggaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc caaaatttccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgta caaaaatcacaagcattcctatacaccaataacagacaaacagagagccaaatcatgagt gaagtcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcctcaaggagaactacaaaccactgctcaacaaaataaaagaggac acaaacaaatggaagaacattccatgcttatggataggaagaatcaatatcatgaaaatg gcgatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcctgc attgccaagacaatcctaagccacaagaacaaagctggaggtatcatgccacctggcttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtag >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_5|341_aa MLASGYLGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEVAYDPEGI LGTINSIVMAFLGVQEEEKEEEEEEEEEEEEEEEGVEEEEDEEEEGVEEEEEEEEEKEGV EEDGVEEEEEEGLEEEEGEGVDEEEEGVEEEEEEEEGCCPDWVTKQACLTEPLSPLWRIL FGPCLEVRATEPAQAGKILLYYKARTKDILIRFTAWCCILGLISVALTKVSENEGFIPVN KNLWSLSYVTTLSSFAFFILLVLYPVVDVKGLWTGTPFFYPGMNSILVYVGHEVFENYFP FQWKLKDNQSHKEHLTQNIVATALWVLIAYILYRKKIFWKI >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_5|1026_bp atgttggccagtggttatcttggtcctgggggcattggagattttggcaagtatccaaat tgcactggaggagctgcaggctacatcgaccgcctgctgctgggagacgatcacctttac cagcacccatcttctgctgtactttaccacaccgaggtggcctatgaccccgagggcatc ctgggcaccatcaactccatcgtgatggcctttttaggagttcaggaggaggagaaggaa gaggaggaggaggaggaggaagaagaggaggaagaggaggagggagtagaggaggaggag gatgaggaggaagagggggtggaggaagaggaggaggaggaagaagagaaggagggtgtg gaggaggatggggtggaggaagaggaggaggaggggctggaggaggaggagggtgagggt gtggatgaggaggaggagggtgtggaggaggaggaggaagaggaggaggggtgctgccca gactgggtcaccaaacaggcctgtctgacggaacccttgtcacctttgtggagaattctc tttgggccctgcctggaagtaagagccacggagcctgcccaggcaggaaaaatactattg tattacaaggctcggaccaaagacatcctgattcgattcactgcttggtgttgtattctt gggctcatttctgttgctctgacgaaggtttctgaaaatgaaggctttattccagtaaac aaaaatctctggtccctttcgtatgtcactacgctcagttcttttgccttcttcatcctg ctggtcctgtacccagttgtggatgtgaaggggctgtggacaggaaccccattcttttat ccaggaatgaattccattctggtatatgtcggccacgaggtgtttgagaactacttcccc tttcagtggaagctgaaggacaaccagtcccacaaggagcacctgactcagaacatcgtc gccactgccctctgggtgctcattgcctacatcctctatagaaagaagattttttggaaa atctga >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_6|148_aa MSYHQPERISFYNKKSNERWPEHGPLHHFLPLHCQTSHTGASVTVPAQDLLTPVGSYIPS LQFGASPMSTYDHGWTGPGAQEPIGKGPSCLCKPQFSGRLPTTCIAASYYKDRRFVSPPD NANEQNPGGVVTRTAIPLSASSVTHPAL >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_6|447_bp atgagctaccaccagccagaacgcatttctttctacaataaaaagtcaaatgaaagatgg cctgaacacggccccctccatcacttcctgcccctccactgtcaaacatctcacacagga gcctctgtcacggtccccgcccaggaccttctcacccctgttggcagttacatcccatcc ctacagtttggagcttcacccatgtccacatatgaccacgggtggacaggacccggggcc caggaacccatcggcaaagggccctcctgcctgtgcaagccacaattcagtgggcgtctt cctacaacttgcatcgctgcctcctattacaaagacagacggtttgtttctcccccggac aacgccaatgaacagaatccaggtggcgtggtcacaaggacagccatcccgctcagtgcc tcctcggtgacgcaccccgccctttaa >gi568815590f:43003549_43222874|GENSCAN_predicted_peptide_7|27_aa XSPFVLMGHDQGIRAQLCLMWKDWTIS >gi568815590f:43003549_43222874|GENSCAN_predicted_CDS_7|84_bp ntcagcccctttgtcctcatgggccatgaccagggcattcgggctcagctctgcctgatg tggaaggactggacaatctcctaa