GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:57:36

Sequence gi568815588r:109765187_110007807 : 242621 bp : 42.23% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -    963    958    6                               1.05
 1.07 Term -   5002   4845  158  2  2   79   38    68 0.125  -1.99
 1.06 Intr -   6554   6427  128  1  2   67   83    86 0.217   5.40
 1.05 Intr -  11812  11741   72  0  0  115   78    14 0.024   0.70
 1.04 Intr -  19462  19242  221  1  2    1   70   135 0.001  -0.82
 1.03 Intr -  33169  31059 2111  2  2   37   42   531 0.039  29.77
 1.02 Intr -  34967  34753  215  1  2   14   58   129 0.003  -0.06
 1.01 Init -  50991  47900 3092  0  2   44   53  1050 0.690  88.57
 1.00 Prom -  51249  51210   40                              -5.25

 2.02 PlyA -  51331  51326    6                              -4.04
 2.01 Sngl -  52365  51349 1017  0  0   88   43   758 0.997  68.07
 2.00 Prom -  53915  53876   40                              -9.35

 3.00 Prom +  54350  54389   40                              -3.05
 3.01 Sngl +  61547  61828  282  1  0   88   40   312 0.890  21.64
 3.02 PlyA +  61960  61965    6                               1.05

 4.00 Prom +  62552  62591   40                              -6.15
 4.01 Sngl +  63909  66191 2283  2  0   70   42   564 0.852  42.88
 4.02 PlyA +  66328  66333    6                              -0.45

 5.00 Prom +  67226  67265   40                              -6.05
 5.01 Init +  79019  79378  360  1  0   62   74   282 0.287  21.32
 5.02 Intr +  86739  86892  154  2  1   78   89    65 0.458   4.32
 5.03 Intr +  87387  87515  129  1  0   75   -4   153 0.202   4.55
 5.04 Intr +  94764  94853   90  1  0   95   41   101 0.441   5.15
 5.05 Term +  94945  95051  107  2  2   91   47    76 0.378   1.39
 5.06 PlyA +  97444  97449    6                               1.05

 6.23 PlyA -  99326  99321    6                               1.05
 6.22 Term - 100126  99998  129  1  0  121   38   112 0.967   6.70
 6.21 Intr - 102254 102087  168  2  0   67   95    93 0.969   7.12
 6.20 Intr - 102955 102809  147  1  0   60   95    55 0.875   2.91
 6.19 Intr - 103526 103428   99  2  0   89   80    45 0.885   3.19
 6.18 Intr - 104843 104767   77  0  2   75  100   132 0.999  11.42
 6.17 Intr - 105718 105545  174  2  0   95  115   103 0.892  12.69
 6.16 Intr - 106675 106606   70  1  1  108  100     8 0.995   1.84
 6.15 Intr - 108391 108181  211  0  1   33  116    80 0.038   3.29
 6.14 Intr - 112681 112604   78  0  0  119   68   126 0.110  11.45
 6.13 Intr - 112872 112814   59  0  2   34  110    99 0.619   3.46
 6.12 Intr - 115745 115656   90  2  0   98   94    95 0.987  10.37
 6.11 Intr - 117456 117246  211  2  1  123   81   218 0.767  22.59
 6.10 Intr - 118962 118881   82  1  1   92   84    57 0.956   3.48
 6.09 Intr - 121155 121060   96  1  0   81   67    87 0.651   4.96
 6.08 Intr - 123120 122863  258  1  0   39   71   232 0.644  13.11
 6.07 Intr - 123409 123317   93  2  0  107  111   121 0.990  15.32
 6.06 Intr - 126640 126536  105  2  0   94   55   128 0.980   9.37
 6.05 Intr - 127889 127826   64  2  1   93  100    45 0.792   3.57
 6.04 Intr - 132873 132795   79  2  1   84   51    62 0.050   0.73
 6.03 Intr - 138285 138167  119  0  2    3   16   172 0.061  -0.16
 6.02 Intr - 142629 142505  125  1  2   98   84   158 0.726  15.78
 6.01 Init - 147073 147010   64  1  1   69   61   107 0.721   7.46
 6.00 Prom - 147367 147328   40                             -10.45

 7.00 Prom + 149089 149128   40                              -7.05
 7.01 Init + 151133 151198   66  1  0   32  107    71 0.870   4.52
 7.02 Intr + 155242 155284   43  0  1   93  102    34 0.976   2.19
 7.03 Intr + 156035 156205  171  0  0   92   82    75 0.947   6.29
 7.04 Intr + 156343 156428   86  2  2   62   59    90 0.714   2.12
 7.05 Intr + 156744 156903  160  2  1    2   67   148 0.438   2.74
 7.06 Intr + 157222 157331  110  2  2   67  100    17 0.778  -0.12
 7.07 Intr + 157939 158253  315  0  0   84    8   172 0.837   4.14
 7.08 Term + 158422 158616  195  0  0   48   43   201 0.890   8.03
 7.09 PlyA + 160057 160062    6                               1.05

 8.05 PlyA - 161259 161254    6                               1.05
 8.04 Term - 171113 170918  196  1  1   37   48   200 0.068   6.80
 8.03 Intr - 176644 176403  242  1  2   84   44    87 0.500  -0.67
 8.02 Intr - 177674 177495  180  2  0   68   47   168 0.539   9.84
 8.01 Init - 188689 188570  120  1  0   71   92    83 0.671   7.24
 8.00 Prom - 202768 202729   40                              -1.75

 9.03 PlyA - 203070 203065    6                               1.05
 9.02 Term - 204400 204236  165  1  0   79   54   112 0.720   3.83
 9.01 Init - 204917 204738  180  2  0   68   19   151 0.391   5.43
 9.00 Prom - 221941 221902   40                              -5.65

10.02 PlyA - 222027 222022    6                               1.05
10.01 Term - 240895 240774  122  2  2  106   48   228 0.043  18.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  33101  30840 2262  2  0   70   48   403 0.805  27.58
S.002 Term +  44524  44742  219  0  0   59   44   140 0.872   2.76
S.003 Term -  47552  47516   37  1  1   82   37    41 0.922  -5.77
S.004 Init +  52821  53011  191  2  2   80   96   172 0.879  13.74
S.005 Init - 108369 108181  189  0  0   99  116    91 0.814  10.38
S.006 Intr - 170196 170033  164  2  2   65   95    91 0.810   6.20
S.007 Term - 200380 200203  178  0  1   77   37   153 0.872   5.28
S.008 Init - 201599 201595    5  2  2   82   82     0 0.869  -1.68
S.009 Term - 216115 215936  180  1  0  139   41   114 0.991   8.43
S.010 Term - 238577 238409  169  1  1   38   37   123 0.801  -1.43
S.011 Intr - 240895 240771  125  1  2  106   86   186 0.883  18.66
S.012 Intr - 242107 241906  202  0  1   71   61   125 0.931   6.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_1|1998_aa MVKGSIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTLIMGDFNTPLSTLDRSTR QKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKIDHIVGSKALLSKCK RTEIITNYLSDHSAIKLEIRIKNLTQSHSTTWKLNNLLLNDYWVHNEMKAEIKMFFETNE NKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDSLTSQLKELEKQEQTHSKASRRQ EITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKG DITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEI VAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIP KPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRK SINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPT ANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELP FTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAIL PKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLY YKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWE NWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKT PKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELK QIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVR MAIIKKSGNNSDRENGTKLENTLQDIIQENFPNLAMQANIQIQEIQRTPQRYSSRRATPR 
HIIVRFTKVKMKEKMLRAAREKEIQTTIREYYKQLYTNKLENLEEMDKFLNTYTLPRLNQ EEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEG ILPNSFYEASIILIPKPGRDTTTKDNFRPISLMNIDAKILNTILAKRIQQHIKKLIHHDQ VGFIPGMQGWFNICKSINVIQHINRTKDKYHMIISIDAAKAFDKIQQPFMLKTLNKLGID GIYFKIIRTIYDKPTANIILNGQKLEAFPLKTGTRQGCSLSPLLFNIVLEVLVRAIRQEK EIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNFLKLISNFSKVSGYKINVQKSQAFLY TNNRQTESQIMSRLPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLQEIKEDTNKWKNIP CSWVGRINIVKMAILPKVIYRFNAILIKLPMNFFTELEKTTLKFIWNQKRARITKSILSQ KNKAGGITLPDFKLYYKATVTKTAWYWYQSRDIDQWNRAELSEITPYIYNYQIFDKSEKN KQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGN TIQDIGMGKDFMSKTPKAMATKAKLDKWDPIKLKSFCTAKEITIRVNRQPTKWEKIFTTY SSDKGLISRIYNELKQIYKKKIKQPHQKVGEGHEQTLLKRRHLCSQKHMKKCSSSLVIRE MQIKTTLIKKEIILSRPELTWWKPLREGLGSLPEIRDCPQLALKKQVPTIPHCRKMNSFN SLKELGSSFTSVASTLIAVLWALFAGSHMVEGAMPQLSRSHPQRSPCRNASVGAGELNVF QLLQRARNSQIEPVGQETVDSELPLCANTALRTWVVIGSCRDPISREIRGLSLGSAASLN LMGNFEQFLLSVFPISYP >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_1|5997_bp atggtaaagggatcaattcaacaagaggagctaactatcctaaatatttatgcacccaat acaggagcacccagattcataaagcaagtcctcagtgacctacaaagagacttagactcc cacacattaataatgggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagtcaacaaggatacccaggaattgaactcagctctgcaccaagcagacctaata gacatctacagaactctccaccccaaatcaacagaatatacatttttttcagcaccacac cacacctattccaaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaa agaacagaaattataacaaactatctctcagaccacagtgcaatcaaactagaaatcagg attaagaatctcactcaaagccactcaactacatggaaactgaacaacctgctcctgaat gactactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacaccacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaa tttatagcactaaatgcctacaagagaaagcaggaaagatccaaaattgacagcctaaca tcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaggatcaacaaaattgatagaccactagca agactaataaagaaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggg gatatcaccaccgatcccacagaaatacaaactaccatcagagaatactacaaacacctc 
tacgcaaataaactagaaaatctagaagaaatggatacattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt gtggcaataatcaatagtttaccaaccaaaaagagtccaggaccagatggattcacagcc gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca atagaaaaagagggaatcctccctaactcattttatgaggccagcatcattctgatacca aagccgggcagagacacaaccaaaaaagagaattttagaccaatatccttgatgaacatt gatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagctt atccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaa tcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgattatctca atagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaat aaattaggtattgatgggacgtatttcaaaataataagagctatctatgacaaacccaca gccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaaga cagggatgccctctctcaccgctcctattcaacatagtgttggaagttctggccagggca atcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtcc ctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatca caagcattcttatacaccaacaacagacaaacagagagccaaatcatgggtgaactccca ttcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaag gacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaa tggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatac tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccacatatctacaactatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagatttaaacgttagacctaaaaccataaaaaccctagaa gaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatgtccaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaa attttcgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa 
caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacaga cacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatca ctggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacagcgacagggagaatggaaccaagttggaa aacactctgcaggatattatccaggagaacttccccaatctagcaatgcaggccaacatt cagattcaggaaatacagagaacaccacaaagatactcctcgagaagagcaactccaaga cacataattgtcagattcaccaaagttaaaatgaaggaaaaaatgttaagggcagccaga gagaaagaaatacaaactaccatcagagaatactacaaacaactctacacaaataaacta gaaaatctagaagaaatggataaattcctcaacacatacactctcccaagactaaaccag gaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatcaat agcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccagagg tacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagaggga atcctccctaactcattttatgaggccagcatcatcctgataccaaagccgggcagagac acaacaacaaaagataattttagaccaatatccttgatgaacattgatgcaaaaatcctc aatacaatactggcaaaacgaatccagcagcacatcaaaaagcttatccaccatgatcaa gtgggcttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatc cagcatataaacagaaccaaagacaaataccacatgattatctcaatagatgcagcaaag gcctttgataaaattcaacaacccttcatgctaaaaactctcaataaattaggaattgat gggatatatttcaaaataataagaactatctatgacaaacccacagccaatatcatactg aatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgctctctg tcaccactcctattcaacatagtgttggaagttctggtcagggcaattaggcaggagaag gaaataaagggtattcaattaggaaaggaggaagtcaaattgtccctgtttgcagacgac atgattgtatatctagaaaaccccattgtctcagcccaaaatttccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatat accaacaatagacaaacagagagccaaatcatgagtagactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactataaaccactgctccaggaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatattgtgaaaatggccatactgcccaaggtaatttac agattcaatgccatcctcatcaaactaccaatgaatttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccgcatcaccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaagcagagatatagatcaatggaacagagcagag 
ctctcagaaataacaccatatatctacaactatcagatctttgacaaatctgagaaaaac aagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaactcaagatgg attaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcaat accattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggca acaaaagccaaacttgacaaatgggatccaattaaactaaagagcttctgcacagcaaaa gaaattaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcacaacctac tcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaa aaaatcaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaaga agacatttatgcagccaaaaacacatgaaaaaatgctcatcatcactggtcatcagagaa atgcaaatcaaaaccacattaatcaaaaaggagattattctaagtagacctgaattaacc tggtggaagcccttaagagagggactggggtctctccctgagatcagagactgtccccaa ctcgccttgaagaagcaagtccccacgattccacactgcaggaaaatgaactctttcaac agcctgaaggagcttggaagctcttttaccagtgtggccagtaccttgattgcagtctta tgggctctctttgccggtagtcacatggtagaaggggctatgccacagctatccagatca catccacagagaagcccttgtaggaatgcctcagtgggagcaggagaactaaatgtcttc cagttgctccagagggcaaggaactcacagatagaacctgtgggtcaggaaacagtggac tcagagctgcctctgtgtgccaacacagcactgaggacttgggtggtgattggcagttgc agagaccctataagccgagagatcagaggtctgagtcttggctctgctgcttctcttaac ctcatgggtaactttgaacaatttctcctttctgtgtttccaatttcttatccttaa >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_2|338_aa MGKKQNRKTGNSKTQSVSPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMNREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDVENGTKLENTLQD IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK QMLRDFVTTRPALKELLKEVLNMERNNRYQPLQNHAKM >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_2|1017_bp atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgtctctcctcct ccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggagaatgattttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata 
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaatcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagtg ctaaacatggaaaggaacaaccggtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_3|93_aa MGKKQNRKTGNSKKHSASPPPKERSSSPATEKSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNLEECITRITNTEKCLKELMEL >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_3|282_bp atggggaaaaaacaaaacagaaaaactggaaactctaaaaagcacagtgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaaaaaagctggatggagaacgactttgat gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgtaa >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_4|760_aa MDKFLNTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISFMNIDAKILNKMLA KRIQQHIKKLIHHDQVGFIPGMPGWFNIRKSINVIQHINRAKDKNHMIISIDAAKAFDKI QQHFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKSGTRQECSLSPLLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQTLLKLISNFSKV SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL LKEIKEGTNKWKNILCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFI WNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIT PHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLN VRPKTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIR 
VNRQPTKWEKIFATYSSDKGLISRIYNQLKQIYKKKINNPIKKWAKDMNRHFSKEDIYAA KKHKKKCSSSLAIREMQIKTTVRYNLTPVRMAIIKKSGNNRCWRGRGEIGTLLHCWWDCK LVQPLWKSVWRFLRDLELEIPFDPAIPLLDIYPKDYKSCC >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_4|2283_bp atggataaattcctcaacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggatctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcattctgataccaaagcctggcagagacacaaccaaaaaagag aattttagaccaatatccttcatgaacattgatgcaaaaatcctcaataaaatgctggca aaacgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgccaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga gccaaagacaaaaaccacatgattatctcaatagatgcagcaaaggcctttgacaaaatt caacaacacttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaagtggcacaagacaggaatgctctctgtcaccactcctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaggaggaagtcaaattgtccctgtttgcagacgacatgattgtatatcta gaaaaccccattgtctcagcccaaactctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaaggaaataaaagagggtacaaacaaatggaagaacattctatgctcatgggtagga agaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatc cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaaattcata tggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctgga ggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatgg tactggtaccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacg ccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaag gattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaa ctggatcccttccttacaccttatacaaaaatcaattcaagatggattaaagacttaaac gttagacctaaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacata ggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatt 
gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcaga gtgaacaggcaacctacaaaatgggagaaaatttttgcaacctactcatctgacaaaggg ctaatatccagaatctacaatcaactcaaacaaatttacaagaaaaaaataaacaacccc atcaaaaagtgggcaaaggacatgaacagacacttctcaaaagaagacatttatgcagcc aaaaaacacaagaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaacc acagtgagatacaatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaac aggtgctggagaggacgtggagaaataggaacacttttacactgttggtgggactgtaaa ctagttcaaccattgtggaagtcagtgtggcgattcctcagggatctagaactagaaata ccatttgacccagccatcccattactggatatatacccaaaggactataaatcatgctgc taa >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_5|279_aa MEIGLEREEVCSKDTCVRLFQKITKKEDDLKQLGQQQTQRKGKGSECPQRQNGQDSTKGK RKGENGQDSTKGVWERKESTQDTWEAETGTADTGCLLSARHYVEDLIQVPAHGILRSTYE RIESSLGFLFRTKSEESHRGIHRKSPRGKGKGKGGEIVDTEDKKAGATCWHAGWPSPLSP QIHTAICWQKAQEHLNAAVPALQGVQQFPMPDAQTWGLVVGVPGTLTAPAPAGLTKNLSY SPTGGPSDTPSLPSASSCTDADNTLVSLHVLLLRVCGDA >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_5|840_bp atggaaattggattggaaagggaagaggtctgtagcaaggatacttgtgtgagattattt cagaagattacaaagaaagaagacgacctgaaacaactagggcagcagcagacacaaagg aaaggaaagggttctgagtgcccacagagacagaatggacaagattccacaaaggggaag aggaagggagagaatggacaagattccaccaagggagtgtgggagaggaaggagtcaacg caggacacctgggaagctgagacaggaacagcagatactggatgcttactgagtgccagg cattatgtggaggacttgatacaggtcccagcccatggaatcctcagaagcacctatgaa agaatagaatcctcattaggtttcctgtttaggacaaagtcagaggaaagtcacagaggg atccatcggaagagtcccagagggaaagggaaagggaaaggaggtgagatcgttgacact gaagacaaaaaggctggggcgacttgctggcatgctggctggccgtctccactctctcca cagatccacactgccatctgctggcagaaggcacaggagcatctcaacgccgctgtgccc gccttgcagggcgtgcagcaattcccaatgcctgatgcccagacgtggggcttggtcgtg ggagttcctgggacgctgactgctccagcccctgcaggccttaccaaaaacctctcatac tcacccactggaggtccatcagataccccatcacttccttcagcttcctcctgcacagat gctgataatacgctggttagtcttcacgtgctcttattaagggtttgtggagatgcttga >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_6|865_aa MNVIGLKDEEVGRGKVLSVDGDGRMPPKVTSELLRQLRQAMRNSEYVTEPIQAYIIPSGD 
AHQMVVPATGAHDDYVAVGLGSSVSPLAQLSEIGALLRKGSHRIYNPIVRLDVYNFCTKV EEVAWQLARSEYIAPCDCRRAFVSGFDGSAGTAIITEEHAAMWTDGRYFLQAAKQMDSNW TLMKMGLKDTPTQEDWLVSVLPEGSRVGVDPLIIPTALGHGRTTFLKVLDFQGPAPCPAA MGVRLMAGPQCVSSDYWKKMAKVLRSAGHHLIPVKENLVDKIWTDRPERPCKPLLTLGLD YTGISWKDKVADLRLKMAERNVMWFVVTALDEIAWLFNLRGSDVEHNPVFFSYAIIGLET IMLFIDGDRIDAPSVKEHLLLDLGLEAEYRIQVHPYKSILSELKALCADLSPREKVWVSD KASYAVSETIPKDHRCCMPYTPICIAKAVKNSAESEGMRRAHVPKGGVTEISAADKAEEF RRQQADFVDLSFPTISSTGPNGAIIHYANCELFAKMAVLFYMPTSIVWGFSLVHMLACVI FDLKLILTGNQFVLFLERDGTTDVTRTMHFGTPTAYEKECFTYVLKGHIAVSAAVFPTGT KGHLLDSFARSALWDSGLDYLHGTGHGVGSFLNVHEGPCGISYKTFSDEPLEAGMIVTDE PGYYEDGAFGIRIENVVLVVPVKTKYNFNNRGSLTFEPLTLVPIQTKMIDVDSLTDKETR RKIVPALLGYYCGHSGAVKPFTKPGAQQCLLAQLRGLWVLTPSDPGLLNIPTHRTHMGLA DTGWVMQLVDGYGKGNSAYQHNVWCVSSAGRVLPVSRGSCPAPCDWLNNYHLTCRDVIGK ELQKQGRQEALEWLIRETQPISKQH >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_6|2598_bp atgaatgtgatcggcctgaaggatgaggaagtaggaagagggaaagtgttgagtgttgat ggggacggcagaatgcctccaaaggtgacttcagagctgcttcggcagctgagacaagcc atgaggaactctgagtatgtgaccgaaccgatccaggcctacatcatcccatcgggagat gctcatcagatggtggttcccgcaactggagcacatgatgattatgtagcagttggtttg ggcagttctgtaagtcccctggcacagttgtctgaaattggggccctcctccgaaagggt tctcacagaatttataatccaattgtgagactagatgtgtacaacttctgtacaaaagta gaagaggttgcttggcagcttgcaaggagtgagtatattgctccatgtgactgtcggcgg gcttttgtctctggattcgatggctctgcgggcacagccatcatcacagaagagcatgca gccatgtggactgacgggcgctactttctccaggctgccaagcaaatggacagcaactgg acacttatgaagatgggtctgaaggacacaccaactcaggaagactggctggtgagtgtg cttcctgaaggatccagggttggtgtggaccccttgatcattcctacagctctgggccat ggcaggaccacctttctgaaggttctggactttcaaggccctgctccctgccctgctgca atgggcgttcggctcatggctgggccacaatgtgtctcctcagattattggaagaaaatg gccaaagttctgagaagtgccggccatcacctcattcctgtcaaggagaacctcgttgac aaaatctggacagaccgtcctgagcgcccttgcaagcctctcctcacactgggcctggat tacacaggcatctcctggaaggacaaggttgcagaccttcggttgaaaatggctgagagg aacgtcatgtggtttgtggtcactgccttggatgagattgcgtggctatttaatctccga ggatcagatgtggagcacaatccagtatttttctcctacgcaatcataggactagagacg 
atcatgctcttcattgatggtgaccgcatagacgcccccagtgtgaaggagcacctgctt cttgacttgggtctggaagccgaatacaggatccaggtgcatccctacaagtccatcctg agcgagctcaaggccctgtgtgctgacctctccccaagggagaaggtgtgggtcagtgac aaggccagctatgctgtgagcgagaccatccccaaggaccaccgctgctgtatgccttac acccccatctgcatcgccaaagctgtgaagaattcagctgagtcagaaggcatgaggcgg gctcacgttcccaaaggtggtgtgacagagatctcagctgctgacaaagctgaggagttt cgcaggcaacaggcagactttgtggacctgagcttcccaacaatttccagtacgggaccc aacggcgccatcattcactacgcgaactgcgaactatttgcaaagatggctgtgctattt tacatgcccaccagcattgtatggggcttctcacttgtccacatgcttgcctgtgtcata tttgacttaaagcttattttgacgggaaaccaatttgtcctttttttggaaagggatggc accacagatgtgacgcggacaatgcattttgggacccctacagcctacgagaaggaatgc ttcacatatgtcctcaagggccacatagctgtgagtgcagccgttttcccgactggaacc aaaggtcaccttcttgactcctttgcccgttcagctttatgggattcaggcctagattac ttgcacgggactggacatggtgttgggtcttttttgaatgtccatgagggtccttgcggc atcagttacaaaacattctctgatgagcccttggaggcaggcatgattgtcactgatgag cccgggtactatgaagatggggcttttggaattcgcattgagaatgttgtccttgtggtt cctgtgaagaccaagtataattttaataaccggggaagcctgacctttgaacctctaaca ttggttccaattcagaccaaaatgatagatgtggattctcttacagacaaagagacaagg agaaagatagtaccagcccttctcggttactactgtggtcactctggggcagtgaaaccc ttcacaaagcctggggcacaacagtgcttacttgcccagctcagagggttatgggtctta actccttcagaccctggactgctaaacattcccactcacaggacccacatgggccttgct gataccgggtgggttatgcagcttgtagatggatatggaaaaggaaactcagcctatcag cacaatgtctggtgtgtctcttcagcaggaagagtccttccagtgtccaggggctcttgt ccagccccttgcgactggctcaacaattaccacctgacctgcagggatgtgattgggaag gaattgcagaaacagggccgccaggaagctctcgagtggctcatcagagagacgcaaccc atctccaaacagcattaa >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_7|381_aa MSSLPEEKQEGQRSVIVSGDGEGYKSDSHRISHYFIHQETKKQRSNDWLKASQLTADRAR TSVPASQHPAQCSYPSQNFFDLHLLNFMMSHKVVHGNHTCVAFSMGQNKEQDAQSHLVDE TRPRTVITCHKEKVSLRMKRILNKSEPRDDKEKSEANVGGAPGFSHTRSQLNPILGSSFQ MEVLHPSHILPQGGSGFLLTLNKLWQRQIQVKAECSSSAGIRVQEPDSGRPVPTDPPETN RISPGLMDPFPRLLFRGSGERGPRQTAAGRAPHPRVPARRGATRAAAQRGLHAARTPAAG GVPSPYSRWLSGGCHSAAPPTLPDLGFAAAAPAAQSGACPSTPVRIASPCSERGKDAVLL 
SVFIFVRFCRHSSKRKKGFAE >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_7|1146_bp atgtctagtttacctgaggaaaagcaagaaggtcagcgcagtgtgattgtatcaggggat ggggagggatacaagtcagattctcaccgaatttcccattacttcatacaccaggaaacc aagaaacaaagatcaaatgactggctcaaggcctcacagctaactgctgacagagctaga accagcgttccagcctcccaacatccagctcagtgctcctacccttcccagaacttcttc gacctacaccttctcaacttcatgatgagccacaaagtggttcatgggaatcacacatgt gtagcgttcagcatggggcaaaacaaggaacaagatgctcaatcacacctagtggatgag acaaggcctagaactgtcataacctgccataaggagaaagtctccctgagaatgaagcgg atactgaataaatcagagccaagagatgataaagagaagtctgaagccaatgttggtggt gctcccggattcagccacactagaagccagctcaaccccattttaggaagctccttccag atggaagttctccacccctcccacatacttccacagggtgggtctggttttctgctcacc cttaacaagctgtggcaacggcaaatccaggtcaaggccgagtgcagcagttccgcgggc atacgcgtccaggaaccagactcgggccgccccgtgcccaccgaccctccggaaacgaac cggatctcgcccggcctcatggaccccttcccccggctcctgttccggggctccggcgag cgcggccctcgccagactgcggcgggccgggctcctcacccgcgcgtccctgcccgccga ggtgcaacacgcgcggccgcgcagcgagggctgcacgctgcccggacgccggctgccggc ggagtgccctcaccttactcgcggtggctttctggaggctgccattcggcggccccgccc acactcccggacctggggttcgcggccgcagcgcctgcggcccagagcggggcctgccct tccacaccagtccgcattgcctctccctgcagcgagcggggaaaggacgcagtgctgctg agtgtcttcatttttgttcgtttttgtcgtcattcctcaaagaggaaaaagggctttgcc gaataa >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_8|245_aa MRPKTLRNSAARSRKEPGFLNHHMEGSCLQLGELSSVGYWWPFGHPGPAQGHLPVAVWAS WVVAGISIALYLPPHPEGRYEGPFKRHTKTFRSAQCADCLRENPKEGVSAAWQREERVHG FSGMLQLMPLINTPGASATAALLLCNHFSVLCLRHALWNREPFGRCKRIPGNGSKALEEF KGIGAEVVCRGAHFEDVQYPQTVEQDVRQSRGTKEGEKEAQRRCMDEAEGWAEPCPAQQL KVFSL >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_8|738_bp atgaggcccaagaccctaaggaacagtgcagccaggagtcggaaggagcctgggtttctg aatcatcacatggaggggagctgtcttcagctgggagagctgtcttcagttggatactgg tggccgtttgggcatcctgggccagcccaaggacatcttccagtggccgtttgggcatcc tgggtggtagcaggtatctccattgcactctatttgccacctcaccctgaaggaaggtac gagggccccttcaaacgccatacaaagaccttccggagtgcccagtgtgcagattgcctg 
agagaaaatcccaaagaaggtgtttctgcggcttggcaaagggaggagagagttcatggg ttctcaggaatgctacagttaatgcctctaattaacaccccaggtgcttctgctacggct gcactgctgctttgtaatcatttctctgtactctgcctcagacatgcgctctggaaccgt gagccatttgggaggtgcaagaggattccaggaaatggcagcaaggccttagaggagttt aaaggtattggggcagaggtggtgtgtagaggggctcattttgaggatgtgcaatatcct cagactgtggaacaggatgtgcgacagtcccgaggcacaaaggaaggtgagaaagaggcc cagagaaggtgcatggatgaagcggagggttgggcagagccttgcccagcacaacagctt aaggtttttagcctttga >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_9|114_aa MCWDAEPDAQDSPRTFQLGFGRRWLTQKWKKVACYSTVRGSGSRQPVLYAERELDPLTEE TNGSLGFLGIESERKWGEEEDVNELARKSLWRRLVGGCLRERSTRTAAGYEWSP >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_9|345_bp atgtgttgggatgctgagcctgatgcccaggattctcccagaactttccagttgggcttt ggccgtagatggcttactcagaaatggaaaaaggtagcatgctacagcacagtgcgtggt tcagggtcaagacagccagttctttatgcagaaagagaactggatcccttaactgaagag accaatggttctttgggctttcttgggatagaatcagagaggaagtggggagaggaggaa gacgtgaacgaactagcgagaaagagtctttggagaagattggtgggcggctgtttacgt gagagaagcacaagaacagccgcaggatatgagtggagtccttga >gi568815588r:109765187_110007807|GENSCAN_predicted_peptide_10|40_aa XVDSAMLHRNLLQPQTNSSSSSGGGGSSSSSISSSSFSQC >gi568815588r:109765187_110007807|GENSCAN_predicted_CDS_10|123_bp ngtgtggattctgcaatgctgcacagaaacctactgcagcctcagaccaacagcagcagc agcagcggcggcggcggcagcagcagcagcagcattagcagcagcagcttctctcaatgc tga