GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:23:53 Sequence gi568815579r:19545425_19763666 : 218242 bp : 50.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 113 554 442 1 1 63 30 708 0.516 58.73 1.02 Intr + 3081 3207 127 1 1 93 36 34 0.372 -1.56 1.03 Intr + 8831 8921 91 2 1 80 77 53 0.637 3.40 1.04 Term + 12129 12254 126 2 0 104 37 94 0.343 4.18 1.05 PlyA + 12327 12332 6 1.05 2.10 PlyA - 13895 13890 6 -0.45 2.09 Term - 16986 16946 41 1 2 100 46 44 0.511 -1.25 2.08 Intr - 17532 17404 129 1 0 79 66 92 0.358 6.77 2.07 Intr - 19665 19509 157 0 1 114 117 158 0.997 20.88 2.06 Intr - 24250 24025 226 0 1 86 66 206 0.973 16.19 2.05 Intr - 24875 24685 191 2 2 65 71 239 0.707 18.28 2.04 Intr - 25409 25162 248 0 2 71 58 222 0.986 14.68 2.03 Intr - 34628 34527 102 0 0 58 40 81 0.000 0.45 2.02 Intr - 53941 53868 74 0 2 112 96 92 0.931 11.45 2.01 Init - 73205 73087 119 2 2 65 53 280 0.905 19.87 2.00 Prom - 74393 74354 40 -10.84 3.03 PlyA - 75386 75381 6 -0.45 3.02 Term - 79145 78832 314 0 2 134 37 300 0.967 24.66 3.01 Init - 81851 81119 733 2 1 95 106 1330 0.515 130.44 3.00 Prom - 82125 82086 40 -11.72 4.22 PlyA - 84069 84064 6 1.05 4.21 Term - 84912 84539 374 1 2 50 47 222 0.978 9.15 4.20 Intr - 85113 85047 67 0 1 98 113 54 0.955 7.38 4.19 Intr - 88766 88379 388 1 1 106 94 389 0.999 36.09 4.18 Intr - 89279 89083 197 2 2 83 69 266 0.999 22.41 4.17 Intr - 89505 89368 138 0 0 64 73 171 0.999 13.86 4.16 Intr - 89789 89601 189 2 0 118 53 115 0.897 10.78 4.15 Intr - 90145 89991 155 2 2 106 48 179 0.907 15.49 4.14 Intr - 90297 90220 78 1 0 65 81 92 0.942 5.72 4.13 Intr - 91372 91283 90 2 0 53 91 103 0.979 7.07 4.12 Intr - 91605 91493 113 2 2 93 113 14 0.991 4.42 4.11 Intr - 92137 91941 197 1 2 89 94 205 0.999 19.41 4.10 Intr - 92633 92496 138 2 0 104 76 268 0.992 27.86 4.09 Intr - 92905 92735 171 1 0 78 86 167 0.999 15.64 4.08 Intr - 93058 92978 81 1 0 31 77 163 0.699 9.33 4.07 Intr - 94768 94661 108 1 0 97 94 127 0.962 14.68 4.06 Intr - 94936 94872 65 2 2 89 66 71 0.945 3.44 4.05 Intr - 95147 95022 126 0 0 34 77 191 0.996 13.25 4.04 Intr - 96443 96386 58 2 1 103 91 4 0.973 0.66 4.03 Intr - 96627 96552 76 2 1 80 62 55 0.965 1.62 4.02 Intr - 97195 97111 85 2 1 90 67 63 0.871 3.28 4.01 Init - 98252 98087 166 2 1 33 53 256 0.999 14.39 4.00 Prom - 99787 99748 40 -5.96 5.26 PlyA - 99801 99796 6 1.05 5.25 Term - 100366 99998 369 1 0 114 50 445 0.477 37.95 5.24 Intr - 100561 100450 112 0 1 97 78 269 0.999 27.18 5.23 Intr - 100923 100781 143 0 2 91 100 272 0.990 27.85 5.22 Intr - 101901 101705 197 1 2 56 86 403 0.992 36.03 5.21 Intr - 102104 101990 115 2 1 108 73 173 0.999 17.82 5.20 Intr - 102335 102175 161 0 2 114 96 143 0.999 17.41 5.19 Intr - 104239 104143 97 1 1 69 92 258 0.998 23.88 5.18 Intr - 104516 104317 200 0 2 76 95 379 0.914 36.47 5.17 Intr - 106373 106265 109 2 1 84 100 133 0.793 13.96 5.16 Intr - 107296 107171 126 1 0 94 55 165 0.999 14.68 5.15 Intr - 108470 108360 111 2 0 94 96 151 0.993 17.08 5.14 Intr - 108720 108545 176 1 2 102 89 220 0.999 23.16 5.13 Intr - 109276 109119 158 0 2 81 100 323 0.999 32.45 5.12 Intr - 109815 109695 121 1 1 90 99 137 0.991 14.55 5.11 Intr - 110029 109892 138 2 0 75 65 334 0.982 30.24 5.10 Intr - 110230 110104 127 1 1 60 121 288 0.976 29.55 5.09 Intr - 110509 110454 56 2 2 100 80 97 0.917 8.80 5.08 Intr - 110759 110630 130 2 1 70 44 236 0.988 17.67 5.07 Intr - 111342 111236 107 1 2 125 75 152 0.999 17.53 5.06 Intr - 111492 111423 70 0 1 76 85 147 0.999 11.95 5.05 Intr - 111725 111570 156 2 0 33 64 307 0.991 23.01 5.04 Intr - 111984 111912 73 2 1 65 103 12 0.865 -0.19 5.03 Intr - 114367 114177 191 1 2 65 89 294 0.929 25.58 5.02 Intr - 114563 114474 90 2 0 70 75 169 0.999 13.99 5.01 Init - 118242 117847 396 0 0 102 131 476 0.983 48.00 5.00 Prom - 121748 121709 40 -5.16 6.00 Prom + 128739 128778 40 -3.26 6.01 Init + 128952 128957 6 2 0 80 41 17 0.302 -3.92 6.02 Intr + 132440 132566 127 1 1 75 103 140 0.824 14.45 6.03 Intr + 133302 133362 61 1 1 93 84 21 0.994 -0.01 6.04 Intr + 133757 134763 1007 2 2 66 48 397 0.472 23.89 6.05 Intr + 153862 153916 55 2 1 92 39 96 0.385 3.14 6.06 Intr + 162234 162435 202 0 1 34 116 22 0.014 -1.11 6.07 Term + 163409 163816 408 1 0 54 33 147 0.019 1.02 6.08 PlyA + 164031 164036 6 1.05 7.05 PlyA - 165067 165062 6 1.05 7.04 Term - 167665 165928 1738 0 1 104 48 1204 0.992 105.04 7.03 Intr - 168727 168667 61 2 1 59 113 75 0.996 4.89 7.02 Intr - 169063 168937 127 1 1 56 82 218 0.922 18.25 7.01 Init - 169775 169737 39 2 0 32 97 28 0.278 -1.71 7.00 Prom - 172880 172841 40 -4.76 8.02 PlyA - 175534 175529 6 1.05 8.01 Sngl - 177250 176942 309 1 0 86 48 172 0.920 8.91 8.00 Prom - 179365 179326 40 -4.96 9.03 PlyA - 179897 179892 6 1.05 9.02 Term - 180718 180636 83 2 2 65 42 124 0.523 3.36 9.01 Init - 194310 194148 163 0 1 88 105 59 0.516 7.49 9.00 Prom - 196708 196669 40 -5.36 10.00 Prom + 202286 202325 40 -4.26 10.01 Init + 205435 205647 213 0 0 102 -12 190 0.068 7.20 10.02 Intr + 206009 206148 140 1 2 25 80 164 0.239 8.56 10.03 Intr + 206193 206430 238 0 1 2 90 97 0.591 -1.08 10.04 Intr + 206516 206613 98 1 2 77 94 107 0.979 9.01 10.05 Intr + 206718 206922 205 0 1 -7 64 220 0.681 9.30 10.06 Intr + 212782 212987 206 0 2 45 90 134 0.219 7.30 10.07 Term + 215895 216003 109 0 1 35 48 97 0.180 -1.62 10.08 PlyA + 217785 217790 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 27271 27223 49 1 1 94 58 50 0.926 1.71 S.002 Term - 197870 197689 182 0 2 44 53 150 0.908 4.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_1|261_aa MLFDQRQVDRTLVTIMPQGSCRRVAVNGLLRDYLTRHPPPVPAEDPAAFSMLAPLDPLGH NYGVYTVTDQSPRLAKEIAIGRCFDGSSDGFSREMKADAGTAVTFQCREPPAGRPSLFQR LLESPATALGDIRREMSEAAQAQARASALKPGIPPPPSWNLGYRPPPGKSIVYPPTKLGL PDHTPSVWEERGNQCQEEDQAPAPADAPSPLAKVSGHRLAGLGVQVTKPLSLIVPTCYVE TPISTHTASRQFKSLVTPVSL >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_1|786_bp atgctgttcgaccagcggcaggtggacaggacgctggtgaccattatgccccagggcagc tgccggcgcgtggccgtcaacggactccttcgggattacctgacccggcaccccccaccg gtgcccgcggaggacccagctgccttctccatgctggcccccctagaccctctgggccac aactatggcgtctacactgtcactgaccagagcccacgcttggccaaggagatcgccatt ggccgctgctttgatggttcctctgacggcttctccagagagatgaaggctgatgccggc acagccgtcaccttccagtgccgggagccaccggccggacgacccagcctcttccagagg ctgctggagtccccggcgacagcacttggtgacatccgcagggagatgagcgaggcggcg caggcacaggcccgggcctcagccctcaagccaggaatcccaccccctccctcctggaac ctgggctaccgtccgcctcctggtaaatccattgtctaccccccaaccaaattggggctg ccagaccacaccccctcagtctgggaggagagagggaaccagtgccaggaggaagaccag gcgcccgcccctgcggacgctcctagccccttagctaaggtctcaggacaccggctggcg gggctcggggtgcaggtcacaaagcctctgagcctcattgtccccacctgttatgtggag acgcctatcagcacccacacagctagcaggcagttcaagtccctggtgactcctgtgtcc ctgtaa >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_2|428_aa MAAPPRPAPSPPAPRRLDTSDVLQQIMAITDQSLDEAQARKHALNCHRMKPALFSVLCEI KEKTGLLKERFGRLSPGVNPLSQIPQFNPSPTLIPAVDVVSIRGIQDEDPPDAQLLRLDN MLLAEGVCRPEKRGRGGAVARAGTATPGGCPNDNSIEHSDYRAKLSQIRQIYHSELEKYE QACREFTTHVTNLLQEQSRMRPVSPKEIERMVGAIHGKFSAIQMQLKQSTCEAVMTLRSR LLDARTTGKSYSELEVGVVLESQHINAYEGGCFCRRKRRNFSKQATEVLNEYFYSHLNNP YPSEEAKEELARKGGLTISQVSNWFGNKRIRYKKNMGKFQEEATIYTGKTAVDTTEVGVP GNHASCLSTPSSGVQRTDASCAVCGVPPGPGKHTSHTVLLWQLPPITMQPPARDTRALVV PDASQSFA >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_2|1287_bp atggccgccccgccgcgccccgcgccatcgccccccgccccgcggcgcctcgacacgagc gacgtcctgcagcagatcatggccatcaccgaccagagcctggacgaggcacaggccaga aagcatgctctgaattgccatcggatgaagcctgctctgttcagcgtgctctgtgagatc aaggaaaagacaggactcctcaaagagagatttggccgcctctcaccaggcgtcaaccct ctgtcccagattccacagtttaatccgtctccaacgctcattccggctgtggatgtggta agcatccgtggcattcaagacgaagatccccctgacgcccagctcctgaggctggataac atgctgctggctgagggcgtgtgcaggcccgagaagagaggaagaggaggagcggtggcc agggccggcacagcaacaccaggtggctgtccaaatgacaatagcattgagcactctgac tacagggccaagctgtcccagatccgacagatttaccactctgagctagagaaatatgaa caggcctgtcgtgagttcaccacgcacgtcaccaacctcctccaggagcagagcaggatg aggcctgtctcccctaaggagattgagcgcatggtcggcgccattcacggcaagttcagc gccatccagatgcagttgaagcagagcacctgtgaggcagtgatgaccctgcgttcgcgg ctgctcgatgccagaactactggaaaatcatactcagaactagaggtgggggtggtactg gagtcccagcatattaatgcctacgaaggcggctgtttttgcaggcgcaagcggcggaat ttcagcaagcaggcgacggaagtgctgaatgagtatttttactcccatctgaacaaccct taccccagcgaagaagccaaagaagagctggccaggaagggcggcctcaccatctcccag gtctctaactggtttggcaacaaaagaatccggtataaaaagaacatggggaagtttcaa gaagaggctaccatttacacgggtaaaacggctgtggataccacggaagttggggtccca gggaaccacgccagctgcctgtcaacacctagctccggggttcagagaacggatgcctcc tgtgctgtctgtggggtcccacctggtcctgggaagcacacgtcccacactgtcctgctg tggcagctgccccccatcaccatgcagcccccagccagggacacacgagcccttgttgta cccgatgccagtcagtcctttgcctaa >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_3|348_aa MGQCYYNETIGFFYNNSGKELSSHWRPKDVVVVALGLTVSVLVLLTNLLVIAAIASNRRF HQPIYYLLGNLAAADLFAGVAYLFLMFHTGPRTARLSLEGWFLRQGLLDTSLTASVATLL AIAVERHRSVMAVQLHSRLPRGRVVMLIVGVWVAALGLGLLPAHSWHCLCALDRCSRMAP LLSRSYLAVWALSSLLVFLLMVAVYTRIFFYVRRRVQRMAEHVSCHPRYRETTLSLVKTV VIILGAFVVCWTPGQVVLLLDGLGCESCNVLAVEKYFLLLAEANSLVNAAVYSCRDAEMR RTFRRLLCCACLRQSTRESVHYTSSAQGGASTRIMLPENGHPLMDSTL >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_3|1047_bp atgggccagtgctactacaacgagaccatcggcttcttctataacaacagtggcaaagag ctcagctcccactggcggcccaaggatgtggtcgtggtggcactggggctgaccgtcagc gtgctggtgctgctgaccaatctgctggtcatagcagccatcgcctccaaccgccgcttc caccagcccatctactacctgctcggcaatctggccgcggctgacctcttcgcgggcgtg gcctacctcttcctcatgttccacactggtccccgcacagcccgactttcacttgagggc tggttcctgcggcagggcttgctggacacaagcctcactgcgtcggtggccacactgctg gccatcgccgtggagcggcaccgcagtgtgatggccgtgcagctgcacagccgcctgccc cgtggccgcgtggtcatgctcattgtgggcgtgtgggtggctgccctgggcctggggctg ctgcctgcccactcctggcactgcctctgtgccctggaccgctgctcacgcatggcaccc ctgctcagccgctcctatttggccgtctgggctctgtcgagcctgcttgtcttcctgctc atggtggctgtgtacacccgcattttcttctacgtgcggcggcgagtgcagcgcatggca gagcatgtcagctgccacccccgctaccgagagaccacgctcagcctggtcaagactgtt gtcatcatcctgggggcgttcgtggtctgctggacaccaggccaggtggtactgctcctg gatggtttaggctgtgagtcctgcaatgtcctggctgtagaaaagtacttcctactgttg gccgaggccaactccctggtcaatgctgctgtgtactcttgccgagatgctgagatgcgc cgcaccttccgccgccttctctgctgcgcgtgcctccgccagtccacccgcgagtctgtc cactatacatcctctgcccagggaggtgccagcactcgcatcatgcttcccgagaacggc cacccactgatggactccaccctttag >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_4|1019_aa MRGRGLRWAGRRGTEAAAAAAAAGNRGSAPPARDPIPIPVPAERSPGPDMDAAEPGLPPG PEGRKRYSDIFRSLDNLEISLGNVTLEMLAGDPLLSEDPEPDKTPTATVTNEASCWSGPS PEGPVPLTGEELDLRLIRTKGGVDAALEYAKTWSRYAKELLAWTEKRASYELEFAKSTMK IAEAGKVSIQQQSHMPLQYIYTLFLEHDLSLGTLAMETVAQQKRDYYQPLAAKRTEIEKW RKEFKEQWMKEQKRMNEAVQALRRAQLQYVQRSEDLRARSQGSPEDSAPQASPGPSKQQE RRRRSREEAQAKAQEAEALYQACVREANARQQDLEIAKQRIVSHVRKLVFQGDEVLRRVT LSLFGLRGAQAERGPRAFAALAECCAPFEPGQRYQEFVRALRPEAPPPPPPAFSFQEFLP SLNSSPLDIRKKLSGPLPPRLDENSAEPGPWEDPGTGWRWQGTPGPTPGSDVDSVGGGSE SRSLDSPTSSPGAGTRQLVKASSTGTESSDDFEERDPDLGDGLENGLGSPFGKWTLSSAA QTHQLRRLRGPAKCRECEAFMVSGTECEECFLTCHKRCLETLLILCGHRRLPARTPLFGV DFLQLPRDFPEEVPFVVTKCTAEIEHRALDVQGIYRVSGSRVRVERLCQAFENGRALVEL SGNSPHDVSSVLKRFLQELTEPVIPFHLYDAFISLAKTLHADPGDDPGTPSPSPEVIRSL KTLLVQLPDSNYNTLRHLVAHLFRVAARFMENKMSANNLGIVFGPTLLRPPDGPRAASAI PVTCLLDSGHQAQLVEFLIVHYEQIFGMDELPQATEPPPQDSSPAPGPLTTSSQPPPPHL DPDSQPPVLASDPGPDPQHHSTLEQHPTATPTEIPTPQSDQREDVAEDTKDGGGEVSSQG PEDSLLGTQSRGHFSRQPVKYPRGGVRPVTHQLSSLALVASKLCEETPITSVPRGSLRGR GPSPAAASPEGSPLRRTPLPKHFEITQETARLLSKLDSEAVPRATCCPDVQPEEAEDHL >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_4|3060_bp atgcgggggcgtggcctccgctgggcagggcgacgcggaaccgaggcggcggcggcggct gcggcggcaggaaatcggggctcggccccgccggcgcgcgaccccatccccatcccggtc cctgcagagcgatccccgggcccagatatggacgcagcagagccgggactccccccaggt cctgagggcaggaagaggtacagtgacatcttccggagcctggacaacctcgaaatctca ctggggaacgtgacccttgagatgctggctggagaccctctactctcagaagacccagaa cctgacaagacccctacagccactgttaccaacgaagccagctgttggagcggcccctcc ccagagggtcctgtacccctcacaggggaggaactggacttgcggctcattcggacaaag gggggtgtggacgcagccctggaatatgccaagacctggagccgctatgccaaggaactg cttgcctggactgaaaagagagccagctatgagctggagtttgctaagagcaccatgaag atcgctgaagctggcaaggtgtccattcaacagcagagccacatgcctctgcagtacatc tacaccctgtttctggagcacgatctcagcctgggaaccctggccatggagacagtggcc cagcagaaaagagactactaccagcccctcgccgccaaacggactgagattgagaagtgg cggaaggagttcaaggagcagtggatgaaggagcagaagcggatgaatgaggcggtgcag gcactgcggcgcgcccagctgcagtatgtgcaacgcagcgaggacctgcgggcacgctcc caggggtcccctgaggactcggccccccaggcctcgccgggacctagcaagcagcaggag cggcggcggcgctcgcgagaggaggcccaggccaaggcgcaggaggccgaggcgctgtac caggcctgtgtccgcgaggccaacgcgcggcagcaggacctggagatcgccaagcagcga atcgtgtcgcacgtgcgcaagctggtgtttcagggggatgaagtgctgaggcgggtgacg ctgagtctcttcgggctgcggggggcgcaggcagagcgtggcccccgcgccttcgccgcc ctggccgagtgctgtgcgccctttgagccgggccagcgctaccaggagtttgtacgggcg ctgcggcccgaggccccgccgcccccgccgcccgccttctccttccaggagttccttccc tccttgaacagctcccctctggacatcagaaagaagctctctgggcctcttcctccaagg ctggatgagaattcagctgagccaggcccttgggaggatccgggcacaggctggcgctgg caagggactccaggccccactccgggcagcgatgtggacagcgtgggtggcggcagcgag tctcggtccctggactcacccacttccagcccaggcgctggcacgaggcagctggtgaag gcttcgtccacaggcactgagtcctcagatgactttgaggagcgagaccctgacctggga gacgggctggagaatgggctgggcagccccttcgggaagtggacactgtccagcgcggct cagacccaccagctgcggcgactgcggggcccagccaagtgccgcgagtgcgaagccttc atggtcagcgggacggagtgtgaggagtgctttctgacctgccacaagcgctgcctggag actctcctgatcctctgtggacacaggcggctcccagcccggacacccctttttggggtt gacttcctgcagctacccagggacttcccggaggaggtaccctttgtggtcacgaagtgc acggctgagatagaacaccgtgccctggatgtgcagggcatttaccgggtcagcgggtcc cgggtccgtgtggagcggctgtgccaggctttcgagaatggccgagcgttggtggagctg tcggggaactcgcctcatgacgtctcgagtgtcctcaagcgatttcttcaggagctcacc gagcccgtgatccccttccacctctacgacgccttcatctctctggctaagaccttgcat gcagaccctggggacgaccctgggacccccagccccagccctgaggttatccgctcgctg aagaccctcttggtacagctgcctgactctaactacaacaccctgcggcacctggtggcc catctgttcagggtggctgcacgatttatggaaaacaagatgtctgccaacaacctgggc attgtgtttgggccgacactgctgcggccgccggacggcccgcgggcagccagcgccatc cctgtcacctgcctgctggactctgggcatcaggcccagcttgtggagttcctcatcgtg cactacgagcagatctttgggatggatgagctcccccaggccactgagcccccgccccaa gactccagcccagcccctgggcccctcacaaccagctcccaaccgccacccccgcacctt gacccagactcccagcccccagtcctagcctcagaccccggcccagacccccagcaccac agtaccctggagcagcatcccacggccacacctaccgagattccaactccacagagtgac cagagagaggacgtggctgaagacaccaaagatgggggaggggaagtgtccagccaaggc ccagaggactcactcctggggacacagtctcgtggccacttcagccgccagccagtgaag tatccccggggcggtgtgaggcctgtaacccaccagctgtccagtctggccctggtggct tccaagctgtgcgaggagacccccatcacatcagtgcccagagggagtttgcgggggcgg gggcccagccctgcagctgcctcccctgagggcagccccctgcgccgcaccccgctgccc aagcattttgagattacccaggagacagcccggctactctcgaaattggacagcgaggct gtgcccagggccacctgctgcccggacgtccagcctgaggaagccgaggaccatctctga >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_5|1242_aa MAAAAAVGNAVPCGARPCGVRPDGQPKPGPQPRALLAAGPALIANGDELVAAVWPYRRLA LLRRLTVLPFAGLLYPAWLGAAAAGCWGWGSSWVQIPEAALLVLATICLAHALTVLSGHW SVHAHCALTCTPEYDPSKATFVKVVPTPNNGSTELVALHRNEGEDGLEVLSFEFQKIKYS YDALEKKQFLPVAFPVGNAFSYYQSNRGFQEDSEIRAAEKKFGSNKAEMVVPDFSELFKE RATAPFFVFQVFCVGLWCLDEYWYYSVFTLSMLVAFEASLVQQQMRNMSEIRKMGNKPHM IQVYRSRKWRPIASDEIVPGDIVSIGRSPQENLVPCDVLLLRGRCIVDEAMLTGESVPQM KEPIEDLSPDRVLDLQADSRLHVIFGGTKVVQHIPPQKATTGLKPVDSGCVAYVLRTGFN TSQGKLLRTILFGVKRVTANNLETFIFILFLLVFAIAAAAYVWIEGTKDPSRNRYKLFLE CTLILTSVVPPELPIELSLAVNTSLIALAKLYMYCTEPFRIPFAGKVEVCCFDKTGTLTS DSLVVRGVAGLRDGKEVTPVSSIPVETHRALASCHSLMQLDDGTLVGDPLEKAMLTAVDW TLTKDEKVFPRSIKTQGLKIHQRFHFASALKRMSVLASYEKLGSTDLCYIAAVKGAPETL HSMFSQCPPDYHHIHTEISREGARVLALGYKELGHLTHQQAREVKREALECSLKFVGFIV VSCPLKADSKAVIREIQNASHRVVMITGDNPLTACHVAQELHFIEKAHTLILQPPSEKGR QCEWRSIDGSIVLPLARGSPKALALEYALCLTGDGLAHLQATDPQQLLRLIPHVQVFARV APKQKEFVITSLKELGYVTLMCGDGTNDVGALKHADVGVALLANAPERVVERRRRPRDSP TLSNSGIRATSRTAKQRSGLPPSEEQPTSQRDRLSQVLRDLEDESTPIVKLGDASIAAPF TSKLSSIQCICHVIKQGRCTLVTTLQMFKILALNALILAYSQSVLYLEGVKFSDFQATLQ GLLLAGCFLFISRSKPLKTLSRERPLPNIFNLYTILTVMLQFFVHFLSLVYLYREAQARS PEKQEQFVDLYKEFEPSLVNSTVYIMAMAMQMATFAINYKGPPFMESLPENKPLVWSLAV SLLAIIGLLLGSSPDFNSQFGLVDIPVEVSGPVEVTLGLSLMGPSYGSDPELGCRSPAHH PSLPRQFKLVIAQVLLLDFCLALLADRVLQFFLGTPKLKVPS >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_5|3729_bp atggcggcagcggcggcggtgggcaacgcggtgccctgcggggcccggccttgcggggtc cggcctgacgggcagcccaagcccgggccgcagccgcgcgcgctccttgccgccgggccg gcgctcatagcgaacggtgacgagctggtggctgccgtgtggccgtaccggcggttggcg ctgttgcggcgcctcacggtgctgccattcgccgggctgctttacccggcctggttgggt gccgcagccgctggctgctggggctggggcagcagttgggtgcagatccccgaagctgcg ctgctcgtgcttgccaccatctgcctcgcgcacgcgctcactgtcctctcggggcattgg tctgtgcacgcgcattgcgcgctcacctgcaccccggagtacgaccccagcaaagcgacc tttgtgaaggtggtgccaacccccaacaatggctccacggagctcgtggccctgcaccgc aatgagggcgaagacgggcttgaggtgctgtccttcgaattccagaagatcaagtattcc tacgatgccctggagaagaagcagtttctccccgtggcctttcctgtgggaaacgccttc tcatactatcagagcaacagaggcttccaggaagactcagagatccgagcagctgagaag aaatttgggagcaacaaggccgagatggtggtgcctgacttctcggagcttttcaaggag agagccacagcccccttctttgtatttcaggtgttctgtgtggggctctggtgcctggat gagtactggtactacagcgtctttacgctatccatgctggtggcgttcgaggcctcgctg gtgcagcagcagatgcggaacatgtcggagatccggaagatgggcaacaagccccacatg atccaggtctaccgaagccgcaagtggaggcccattgccagtgatgagatcgtaccaggg gacatcgtctccatcggccgctccccacaggagaacctggtgccatgtgacgtgcttctg ctgcgaggccgctgcatcgtagacgaggccatgctcacgggggagtccgtgccacagatg aaggagcccatcgaagacctcagcccagaccgggtgctggacctccaggctgattcccgg ctgcacgtcatcttcgggggcaccaaggtggtgcagcacatccccccacagaaagccacc acgggcctgaagccggttgacagcgggtgcgtggcctacgtcctgcggaccggattcaac acatcccagggcaagctgctgcgcaccatcctcttcggggtcaagagggtgactgcgaac aacctggagaccttcatcttcatcctcttcctcctggtgtttgccatcgctgcagctgcc tatgtatggattgaaggtaccaaggaccccagccggaaccgctacaagctgtttctggag tgcaccctgatcctcacctcggtcgtgcctcctgagctgcccatcgagctgtccctggcc gtcaacacctccctcatcgccctggccaagctctacatgtactgcacagagcccttccgg atcccctttgctggcaaggtcgaggtgtgctgctttgacaagacggggacgttgaccagt gacagcctggtggtgcgcggtgtggccgggctgagagacgggaaggaggtgaccccagtg tccagcatccctgtagaaacacaccgggccctggcctcgtgccactcgctcatgcagctg gacgacggcaccctcgtgggtgaccctctagagaaggccatgctgacggccgtggactgg acgctgaccaaagatgagaaagtattcccccgaagtattaaaactcaggggctgaaaatt caccagcgctttcattttgccagtgccctgaagcgaatgtccgtgcttgcctcgtatgag aagctgggctccaccgacctctgctacatcgcggccgtgaagggggcccccgaaactctg cactccatgttctcccagtgcccgcccgactaccaccacatccacaccgagatctcccgg gaaggagcccgcgtcctggcgctggggtacaaggagctgggacacctcactcaccagcag gcccgggaggtcaagcgggaggccctggagtgcagcctcaagttcgtcggcttcattgtg gtctcctgcccgctcaaggctgactccaaggccgtgatccgggagatccagaatgcgtcc caccgggtggtcatgatcacgggagacaacccgctcactgcatgccacgtggcccaggag ctgcacttcattgaaaaggcccacacgctgatcctgcagcctccctccgagaaaggccgg cagtgcgagtggcgctccattgacggcagcatcgtgctgcccctggcccggggctcccca aaggcactggccctggagtacgcactgtgcctcacaggcgacggcttggcccacctgcag gccaccgacccccagcagctgctccgcctcatcccccatgtgcaggtgttcgcccgtgtg gctcccaagcagaaggagtttgtcatcaccagcctgaaggagctgggctacgtgaccctc atgtgtggggatggcaccaacgacgtgggcgccctgaagcatgctgacgtgggtgtggcg ctcttggccaatgcccctgagcgggttgtcgagcggcgacggcggccccgggacagccca accctgagcaacagtggcatcagagccacctccaggacagccaagcagcggtcggggctc cctccctccgaggagcagccaacctcccagagggaccgcctgagccaggtgctgcgagac ctcgaggacgagagtacgcccattgtgaaactgggggatgccagcatcgcagcacccttc acctccaagctctcatccatccagtgcatctgccacgtgatcaagcagggccgctgcacg ctggtgaccacgctacagatgttcaagatcctggcgctcaatgccctcatcctggcctac agccagagcgtcctctacctggagggagtcaagttcagtgacttccaggccaccctacag gggctgctgctggccggctgcttcctcttcatctcccgttccaagcccctcaagaccctc tcccgagaacggcccctgcccaacatcttcaacctgtacaccatcctcaccgtcatgctc cagttctttgtgcacttcctgagccttgtctacctgtaccgtgaggcccaggcccggagc cccgagaagcaggagcagttcgtggacttgtacaaggagtttgagccaagcctggtcaac agcaccgtctacatcatggccatggccatgcagatggccaccttcgccatcaattacaaa ggcccgcccttcatggagagcctgcccgagaacaagcccctggtgtggagtctggcagtt tcactcctggccatcattggcctgctcctcggctcctcgcccgacttcaacagccagttt ggcctcgtggacatccctgtggaggtcagtggccctgtggaggtgaccctggggctcagc ttgatgggtccctcctatggcagtgaccctgagctgggctgcaggtctccagctcaccat ccctccctgccccgacagttcaagctggtcattgcccaggtcctgctcctggacttctgc ctggcgctcctggccgaccgcgtcctgcagttcttcctggggaccccgaagctgaaagtg ccttcctga >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_6|621_aa MLDSVAFEDVAVNFTQEEWALLSPSQKNLYRDVTLETFRNLASVGIQWKDQDIENLYQNL GIKLRSLVERLCGRKEGNEHRETFSQIPDCHLNKKSQTGVKPCKCSVCGKVFLRHSFLDR HMRAHAGHKRSECGGEWRETPRKQKQHGKASISPSSGARRTVTPTRKRPYECKVCGKAFN SPNLFQIHQRTHTGKRSYKCREIVRAFTVSSFFRKHGKMHTGEKRYECKYCGKPIDYPSL FQIHVRTHTGEKPYKCKQCGKAFISAGYLRTHEIRSHALEKSHQCQECGKKLSCSSSLHR HERTHSGGKLYECQKCAKVFRCPTSLQAHERAHTGERPYECNKCGKTFNYPSCFRRHKKT HSGEKPYECTRCGKAFGWCSSLRRHEMTHTGEKPFDCKQCAQTVARTKEADIFAVLALRG FPYTTRLEFCRVGSVPRSTKCWPWRQGYETSPAAAHVTRTEEKPCHCQCLSQVECARKSG VGKLGKAVERVAYVLEDISILLTFKKLLLNVCMVSWSQNELGCTRNSEMTEKLNEYETEE FRFHGYLKTSQLEMDIQIKKIKHKTDNLIAELEGASSKCLYLAENNQVLQQELLSMKTIQ QKCKEFEKNKKVLEQEVRGVS >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_6|1866_bp atgctggactcagtggcctttgaggatgtggctgtgaacttcacccaggaggagtgggct ttgctgagtccttcccagaagaatctctacagagatgtgacgctggaaaccttcaggaac ctggcctcggtcggaatccaatggaaagaccaggacattgagaatctgtaccaaaacctg gggattaagctaagaagtctggtggagagactctgtggacgtaaagaagggaatgaacac agagaaactttcagccagattcctgattgtcacctgaacaagaaaagtcaaactggagtg aaaccatgcaaatgcagcgtgtgtgggaaagtcttcctccgtcattcattcctggacagg cacatgagagctcatgctggacacaaacgatctgagtgtggtggggaatggagagagacg ccccgtaaacagaaacaacatgggaaagcctccatttcccccagtagtggtgcacggcgc acagtaacaccaactcgaaagagaccttatgaatgcaaggtgtgcgggaaagcctttaat tctcccaatttatttcaaatccatcaaagaactcacactggaaagaggtcctataaatgt agggaaatagtgagagccttcacagtttccagtttctttcgaaaacatggaaaaatgcat actggagaaaaacgctatgaatgtaaatactgtggaaaacctatcgattatcccagttta tttcaaattcatgttagaactcacactggagaaaaaccttacaaatgtaaacaatgtggt aaagccttcatttccgcaggttaccttcggacacatgaaatcagatctcacgcgctggag aaatcccaccaatgtcaggaatgtgggaaaaaactcagttgttccagttcccttcacaga catgaaagaactcatagtggaggaaaactctacgaatgtcaaaaatgtgccaaagtcttt agatgtcccacgtcccttcaagcacatgaaagagctcacactggagaaagaccttatgaa tgtaataaatgtggtaaaaccttcaattatcccagttgttttcgaagacataaaaaaact catagtggagaaaagccatatgaatgtacaaggtgtggtaaagcctttgggtggtgcagt tccctccgaagacatgaaatgactcacactggagaaaaaccctttgattgtaaacagtgt gcgcagacagttgcccggactaaggaagccgacatctttgctgtcctggcgctccgcggc ttcccgtataccacaaggctggagttctgcagggtgggttctgttccacgaagcactaag tgctggccttggaggcaaggctacgaaaccagccctgcagcagcacacgtgacacgcact gaggagaaaccttgtcattgtcagtgcctaagccaggtggaatgtgccagaaaatctggg gtgggcaaattagggaaggcagtggaaagggttgcatatgtacttgaagacatttccata ttgcttacattcaagaagctgctcttaaatgtgtgcatggtttcatggtctcaaaatgaa ctaggatgcacaagaaactcagaaatgacagaaaagttaaatgaatatgaaactgaagag tttagattccacggatatttaaaaaccagtcaacttgaaatggatattcagattaaaaag ataaaacataagactgataatcttatagcagaactggaaggtgcatcttcaaaatgtctg tatctggctgaaaacaatcaagttcttcaacaggaattattatctatgaaaacgatacaa cagaaatgtaaagaatttgagaagaataaaaaggtgttggaacaagaagtgagaggagtt agctag >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_7|654_aa MRNEMLSDMVRNKDSVSFEDVAVNFTLEEWALLDSSQKKLYEDVMQETFKNLVCLGKKWE DQDIEDDHRNQGKNRRCHMVERLCESRRGSKCGETTSQMPNVNINKETFTGAKPHECSFC GRDFIHHSSLNRHMRSHTGQKPNEYQEYEKQPCKCKAVGKTFSYHHCFRKHERTHTGVKP YECKQCGKAFIYYQPFQRHERTHAGQKPYECKQCGKTFIYYQSFQKHAHTGKKPYECKQC GKAFICYQSFQRHKRTHTGEKPYECKQCGKAFSCPTYFRTHERTHTGEKPYKCKECGKAF SFLSSFRRHKRTHSGEKPYECKECGKAFFYSASFRAHVIIHTGARPYKCKECGKAFNSSN SCRVHERTHIGEKPYECKRCGKSFSWSISLRLHERTHTGEKPYECKQCHKTFSFSSSLRE HETTHTGEKPYECKQCGKTFSFSSSLQRHERTHNAEKPYECKQCGKAFRCSSYFRIHERS HTGEKPYECKQCGKVFIRSSSFRLHERTHTGEKPYECKLCGKTFSFSSSLREHEKIHTGN KPFECKQCGKAFLRSSQIRLHERTHTGEKPYQCKQCGKAFISSSKFRMHERTHTGEKPYR CKQCGKAFRFSSSVRIHERSHTGEKPYECKQCGKAFISSSHFRLHERTHMGEKV >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_7|1965_bp atgcgtaatgaaatgctgagtgatatggtacgtaataaggactcagtctcctttgaggat gtggccgtgaacttcaccctggaggagtgggctttgctggattcttcacagaaaaagctc tatgaagatgtgatgcaggagaccttcaaaaacctggtttgtctaggaaaaaagtgggaa gaccaggacattgaagatgaccacagaaaccaggggaaaaatcgaagatgtcatatggtt gagagactctgtgaaagtagaagaggtagcaaatgtggagaaaccactagccagatgcca aatgttaatatcaacaaggaaacttttactggagcaaaaccacatgaatgcagcttttgt ggaagagacttcattcatcattcgtcccttaataggcacatgagatctcacactggacag aaaccaaatgagtatcaggaatatgaaaagcaaccatgtaaatgtaaagcagttgggaaa accttcagttatcaccactgctttcgcaaacatgaaagaactcacactggagtgaagccc tatgaatgtaaacagtgtgggaaagcctttatatattaccagccatttcaaagacatgaa aggactcatgctggacagaaaccctatgaatgtaagcaatgtggaaaaacctttatatat taccagtcttttcaaaaacatgctcatactggaaagaaaccctatgaatgtaaacagtgt gggaaagcctttatatgttaccaatcttttcaaagacacaaaaggactcacactggagag aaaccctatgaatgtaagcaatgtggtaaggctttcagttgtcccacatactttcgaact catgaaagaactcacactggagaaaaaccctacaaatgtaaagaatgtggtaaagccttc agttttctcagttcttttcgaaggcataaaaggactcatagtggagagaaaccctatgaa tgtaaagaatgtggaaaagccttcttttattctgcaagctttcgagcacatgtaataata cacactggggctcgaccttataaatgtaaagaatgtgggaaagccttcaactcttctaat tcctgtcgagtgcatgaaagaactcatattggagaaaaaccatatgaatgtaaacgatgt ggcaaatcattcagttggtccatttctcttcgattgcatgaaagaactcatactggagag aaaccttatgagtgtaaacagtgtcataaaaccttcagtttttcaagttcccttcgagaa cacgaaacaactcacactggagagaaaccctatgaatgtaaacaatgtggtaaaaccttc agtttttcaagttcccttcaaagacatgaaaggactcacaatgcagagaaaccctatgaa tgtaaacagtgtgggaaagccttcaggtgttcaagttattttcgaattcatgaaaggtca cacactggagagaaaccctatgaatgtaaacagtgtggaaaagttttcattcgttccagt tcctttcgactgcatgaaagaacacacactggagagaaaccctatgaatgtaaactatgc ggtaaaaccttcagtttttcaagttcccttcgagaacatgaaaaaattcacactggaaat aagccttttgagtgtaagcaatgtggtaaggccttccttcgttccagtcaaattcgattg catgaaaggactcacactggagagaaaccgtatcaatgtaaacaatgtggaaaagccttc atttcttccagtaaatttcgaatgcatgagagaactcacacgggagagaaaccctatcga tgtaaacaatgtgggaaagccttcagattttcaagttctgttcgaattcatgaaaggtct cacactggagagaaaccttatgaatgcaaacaatgtggaaaagccttcatttcttccagt cactttcgactgcatgaaaggactcatatgggagagaaagtctaa >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_8|102_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKTILSQKNKAGGITLPD FKLYYKATATKTAWYWYQNRDTDQWNRTEHSEIIPHIYDPDL >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_8|309_bp atggccatactgccgaaggtaatttatagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc catatagcaaagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac ttcaaactatactacaaggctacagcaaccaaaacagcatggtactggtaccaaaacaga gatacagaccagtggaacagaacggagcactcagaaataataccacacatctacgaccct gatctttga >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_9|81_aa MAQGHVWVPGPVTALITSDVSPVEWVYLGTAPTPWSHHEDEQDNRVHGQHRTSTGMQLFA SNGTKLDGENYFDELREEGFR >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_9|246_bp atggcccagggtcacgtctgggttcctggaccagtgaccgcccttataacctcagatgtc tcacctgtcgaatgggtatacttggggacagcacccaccccctggagtcaccacgaggat gaacaagacaaccgtgtacatggtcagcacagaacaagcactggaatgcagctctttgcc agcaatggaacaaagctggacggagagaattactttgatgagttgagagaagaaggcttc agatga >gi568815579r:19545425_19763666|GENSCAN_predicted_peptide_10|402_aa MLSGGFEAMPGMLPTSLRPLPDTFQPVLQPQHVPPLNTFQQPVQASDGRRARQPWPQQLF RGNQDSQGSDEAGPSARRNKHLDWLDACLVTFLAMSVPHLPEKWATSCFQEDFISKHSAG AVPRPPALPDVPQQHREKAWKQGKRKAFCMAQHLLRKENKKTASDPSRAGQREGGHHLGE WMWVSKTNTGYVAWSWEEVESMIDSFICFLKKVDDGAAAQPQGMEFTASSPSTFTGGAYN TIVKLFPDWPRQDLDSVSDLLLLSKQQRIKFPDILHVHRGALTKVTGSRRLVGGGMMECG AVIPLFITVIIAALAGLAATRIAEVMTAGSPVPGSGKPALAAGRPLVVALVVAAAAARGL WEAASRRTYNEDCCHRNEKSKCIGYNRISGLQDLSGSTQLQL >gi568815579r:19545425_19763666|GENSCAN_predicted_CDS_10|1209_bp atgctgtcaggaggcttcgaggccatgcccggtatgctgcccacctccttgagacccctt cctgacactttccagccggtgctacagccgcagcacgtgccaccactgaatacctttcag cagccggtgcaggcttcagacggcaggcgtgcccgacagccgtggcctcagcagctcttt cgtggcaaccaggacagtcagggcagtgatgaagcaggtcccagtgcacggcgcaacaag cacttggactggctggatgcgtgtctggtgacattcctggctatgtctgtgccccaccta cctgagaagtgggccacgagctgcttccaggaggacttcatctccaagcacagtgctggc gcagtgccacgtcctccagcacttcctgatgtgcctcagcagcaccgagagaaggcctgg aaacaaggcaagaggaaggccttctgtatggcacagcacctcttgaggaaagaaaataag aaaactgcctccgatccaagcagagcaggtcagcgagaaggtgggcatcacttgggggag tggatgtgggtatcgaagacgaacactggatacgtggcctggtcatgggaggaggtggag agcatgattgacagcttcatatgctttctcaagaaggtggatgacggcgctgctgctcag ccacagggcatggagttcacggcaagcagcccatcaaccttcaccggaggtgcctacaat accatcgtcaagctcttcccagattggcccagacaggacctggattctgtctcggacctg ttactgctgtccaagcagcaacgcatcaaattcccagacatcctccacgttcacagagga gctcttaccaaagtcacggggagcaggagactcgtgggaggaggaatgatggagtgtgga gctgtcatcccacttttcattacggtcatcatcgctgccctggctggcctggctgccacc agaattgctgaggtgatgactgcaggttccccagtccccggctctggaaagcctgcactg gcagctggaaggcccctggtggtggcacttgttgtggctgcagcagcagctagagggctt tgggaggcagcttcgaggagaacttacaatgaagattgttgccaccgaaatgaaaaatct aaatgcattggttataaccggatctcgggactgcaggatttaagtggcagcactcaactg cagctgtga