GENSCAN 1.0	Date run: 5-Nov-116	Time: 01:22:59

Sequence gi568815587r:66585874_66821240 : 235367 bp : 47.12% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -     22     17    6                               1.05
 1.02 Term -   7276   4593 2684  2  2   99   39  2272 0.824 208.80
 1.01 Init -   7543   7495   49  1  1   51   92     1 0.516  -2.07
 1.00 Prom -  10198  10159   40                              -9.75

 2.00 Prom +  12073  12112   40                              -5.86
 2.01 Init +  12773  12782   10  1  1   42   73     0 0.144  -5.53
 2.02 Intr +  13243  13380  138  2  0  133   66   117 0.996  14.44
 2.03 Intr +  13586  13763  178  0  1   40  109   239 0.998  20.18
 2.04 Intr +  14616  14676   61  0  1  107  114    41 0.997   7.34
 2.05 Intr +  19466  19543   78  1  0   49  116   144 0.915  13.05
 2.06 Intr +  19616  19719  104  1  2   70  105   102 0.999   9.07
 2.07 Term +  19829  19982  154  2  1   87   53   157 0.999   9.49
 2.08 PlyA +  23244  23249    6                               1.05

 3.00 Prom +  27480  27519   40                              -7.36
 3.01 Init +  30848  31184  337  1  1   80   92   652 0.982  62.24
 3.02 Intr +  38341  39805 1465  2  1  107   79  1412 0.998 130.98
 3.03 Intr +  40588  40791  204  1  0  110   25   226 0.135  16.82
 3.04 Intr +  52120  52249  130  1  1   33   81   101 0.379   4.60
 3.05 Intr +  52796  52879   84  1  0   65   80    88 0.279   5.72
 3.06 Intr +  53827  54250  424  0  1   99   61   701 0.712  61.94
 3.07 Term +  57577  58259  683  2  2   68   42   728 0.720  60.02
 3.08 PlyA +  58516  58521    6                               1.05

 4.03 PlyA -  58977  58972    6                               1.05
 4.02 Term -  83418  82751  668  1  2   76   42   719 0.730  60.09
 4.01 Init -  91206  90795  412  0  1   83   61   726 0.495  66.08
 4.00 Prom -  96548  96509   40                              -5.06

 5.39 PlyA -  96928  96923    6                               1.05
 5.38 Term - 100231  99998  234  1  0   82   33   288 0.941  19.02
 5.37 Intr - 100567 100525   43  0  1  108   83    12 0.989   0.94
 5.36 Intr - 101294 101121  174  1  0   75   41   185 0.999  11.55
 5.35 Intr - 101774 101554  221  2  2  105   92   224 0.993  21.60
 5.34 Intr - 102061 101995   67  0  1   68  117    30 0.513   2.81
 5.33 Intr - 102438 102296  143  0  2   17   80   217 0.989  12.95
 5.32 Intr - 102976 102780  197  2  2   24   77   150 0.900   6.63
 5.31 Intr - 103307 103223   85  2  1  113   80    97 0.834  10.79
 5.30 Intr - 104070 103932  139  2  1  112   90   206 0.972  23.67
 5.29 Intr - 104410 104166  245  1  2  101   97   477 0.997  46.20
 5.28 Intr - 105785 105411  375  2  0   66   41   663 0.975  54.61
 5.27 Intr - 106867 106663  205  0  1   95   74   411 0.982  39.60
 5.26 Intr - 107227 107097  131  1  2  109   90   283 0.983  30.09
 5.25 Intr - 107693 107313  381  2  0  101  100   464 0.949  44.01
 5.24 Intr - 107988 107899   90  0  0  126   94   131 0.999  17.69
 5.23 Intr - 108490 108266  225  1  0   84   68   304 0.998  26.18
 5.22 Intr - 110667 110404  264  0  0  118   69   545 0.974  53.31
 5.21 Intr - 112912 112766  147  1  0   50  105   319 0.991  30.23
 5.20 Intr - 113245 113119  127  0  1   29   78   100 0.779   3.78
 5.19 Intr - 113735 113533  203  2  2   89   25   338 0.993  25.68
 5.18 Intr - 115409 114653  757  1  1   83   67  1039 0.895  92.77
 5.17 Intr - 115848 115711  138  2  0   98   94   209 0.994  22.08
 5.16 Intr - 119595 118725  871  1  1   93   80  1341 0.988 124.20
 5.15 Intr - 119964 119811  154  0  1   81  100   297 0.998  29.85
 5.14 Intr - 121945 121643  303  1  0   95   76   723 0.837  68.59
 5.13 Intr - 122426 122268  159  2  0   81   91   426 0.999  42.38
 5.12 Intr - 123146 123029  118  1  1  105   73   248 0.999  25.47
 5.11 Intr - 124896 124709  188  0  2  135   58   397 0.999  39.89
 5.10 Intr - 125156 125044  113  0  2  103   79   192 0.999  19.90
 5.09 Intr - 127873 127758  116  0  2   92   81   142 0.999  13.99
 5.08 Intr - 128298 128218   81  2  0   98   97   124 0.999  13.05
 5.07 Intr - 128534 128443   92  2  2  105   72   135 0.998  12.39
 5.06 Intr - 129522 129172  351  0  0  102   37   466 0.858  38.42
 5.05 Intr - 130108 129957  152  2  2   49   93   215 0.995  17.98
 5.04 Intr - 132755 132570  186  0  0   41  100   263 0.772  22.46
 5.03 Intr - 135389 135211  179  1  2  125   94   268 0.788  30.66
 5.02 Intr - 141792 141620  173  0  2  113  -13    82 0.177  -0.66
 5.01 Init - 142651 142583   69  1  0   95   -9   143 0.524   4.35
 5.00 Prom - 155422 155383   40                              -4.26

 6.00 Prom + 155624 155663   40                              -5.06
 6.01 Init + 158870 159046  177  1  0   68   49   275 0.796  18.86
 6.02 Intr + 162506 162644  139  1  1   83   94     1 0.282   0.24
 6.03 Term + 173170 173273  104  2  2   93   28    82 0.404   1.14
 6.04 PlyA + 173328 173333    6                               1.05

 7.03 PlyA - 173716 173711    6                               1.05
 7.02 Term - 175980 175702  279  0  0    5   43   253 0.943   7.95
 7.01 Init - 176496 176107  390  0  0   36   77   231 0.865  11.66
 7.00 Prom - 182146 182107   40                              -5.26

 8.00 Prom + 182741 182780   40                              -7.86
 8.01 Init + 185597 185843  247  1  1   82  -21   283 0.768  14.58
 8.02 Intr + 206936 207021   86  0  2   48   91    55 0.178   1.34
 8.03 Intr + 210408 210491   84  2  0   54  121    13 0.023   1.22
 8.04 Intr + 214768 214791   24  0  0  127   94   -21 0.017   0.62
 8.05 Intr + 228763 228820   58  0  1   88  113    12 0.762   2.16
 8.06 Intr + 230180 230326  147  0  0   56   98    88 0.378   6.81
 8.07 Term + 231476 231585  110  0  2   13   53    96 0.248  -2.73
 8.08 PlyA + 232048 232053    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  40588  40795  208  1  1  110   49   238 0.864  18.91
S.002 Term - 212181 212110   72  0  0   79   53    79 0.880   1.41
S.003 Init - 213669 213619   51  0  0   86   89    65 0.843   5.56

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_1|910_aa
MAPGPNPSAHSFSSSQVRAHARPCFVSHRVAAAQAHSGRRAPGCLSNPSAFCSWTVAAAG
SMMEPPKPEPELQRFYHRLLRPLSLFPTRTTSPEPQKRPPQEGRILQSFPLAKLTVASLC
SQVAKLLAGSGIAAGVPPEARLRLIKVILDELKCSWREPPAELSLSHKNNQKLRKRLEAY
VLLSSEQLFLRYLHLLVTMSTPRGVFTESATLTRLAASLARDCTLFLTSPNVYRGLLADF
QALLRAEQASGDVDKLHPVCPAGTFKLCPIPWPHSTGFAQVQCSNLNLNYLIQLSRPPEF
LNEPGRMDPVKELKSIPRLKRKKPFHWLPSIGKKREIDISSSQMVSLPSYPVAPTSRASP
SPFCPELRRGQSMPSLREGWRLADELGLPPLPSRPLTPLVLATESKPELTGLIVAEDLKQ
LIKKMKLEGTRYPPLDSGLPPLLGVVTRHPAAGHRLEELEKMLRNLQEEEASGQWDPQPP
KSFPLHPQPVTITLKLRNEVVVQAAAVRVSDRNFLDSFHIEGAGALYNHLAGELDPKAIE
KMDIDNFVGSTTREVYKELMSHVSSDHLHFDQGPLVEPAADKDWSTFLSSAFLRQEKQPQ
IINPELVGLYSQRANTLQSNTKKMPSLPSLQATKSWEKWSNKASLMNSWKTTLSVDDYFK
YLTNHETDFLHVIFQMHEEEVPVEIVAPARESLEIQHPPPLLEDEEPDFVPGEWDWNTVL
EHRLGAGKTPHLGEPHKILSLQKHLEQLWSVLEVPDKDQVDMTIKYSSKARLRQLPSLVN
AWERALKPIQLREALLARLEWFEGQASNPNRFFKKTNLSSSHFLEENQVRSHLHRKLNLM
ESSLVSLLEEIELIFGEPVIFKGRPYLDKMKSDKVEMLYWLQQQRRVRHLVSALKDPHQS
TLFRSSAASL >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_1|2733_bp atggccccaggccccaacccaagtgcccactccttctcctcttcccaagtgcgtgcgcac gcaaggccctgcttcgtcagccaccgggtggcggccgcgcaggcgcactcgggccgtcgg gctcctggttgccttagtaacccctcggctttctgttcctggacggtggcggccgccggc tctatgatggagcccccgaagcccgagcctgagctccagcggttttaccaccggctgctg cgtccgctgtcgctcttccccactaggacgacgtccccagagcctcagaagcgccccccg caggagggccggattctgcagtccttccctctggcgaagctgacggtggcgtcgctgtgc agccaggtggccaagctgctggccggcagcgggatagcagcgggagtgcctcctgaggcc cgactacgtctcatcaaggtcatcctggacgagctgaagtgcagctggcgggagccgccc gccgaacttagtctgagccacaaaaacaaccagaagctgcggaagcggctcgaggcctac gtgctgctgagcagcgagcagctcttcttgcgctacctgcacctgctggtgaccatgtcg actcccaggggggtcttcactgaatcagccaccctcacccggttggccgccagcctcgcc agggactgcacactcttccttactagtcccaacgtctaccgtggcctgcttgccgacttc caggccctgctgagggcagagcaggcctctggggatgtggacaagctgcaccctgtctgc cccgctgggacgttcaagctgtgccctatcccctggcctcacagcactggcttcgcccaa gtgcagtgctctaacctcaacctgaactacctcatccaactcagccgtccaccagagttt ctcaatgagccaggaaggatggatccagtgaaggaattgaagtccatccctcggttgaag aggaaaaagcctttccactggctgccctccataggaaagaagagagaaatcgacatcagt tcctcacagatggtgtcgctgcccagctatcctgtggcccccaccagcagggcttccccc tcgcctttctgccctgagctccggagaggccaatccatgccctccctgcgtgagggctgg aggctggcagatgagttgggccttcctccactcccatctcgccccttaaccccgctggtc ttggctacagagagcaaaccagagctgactgggctcatcgtggctgaggatctgaagcag ttgataaagaagatgaagttggaggggactcgctacccaccactggactcaggcctgcct cctctcctgggggttgtgacccgtcacccagctgcagggcatcgcctggaggagctggag aagatgttgaggaacctccaggaggaagaagcctctgggcagtgggacccccagcccccc aaatcctttccacttcacccacagccagtgaccattactttgaagcttagaaatgaggtc gtggtccaggcggctgccgtacgggtctctgatagaaacttcttagactctttccacatt gagggggccggagccctgtataaccatctggctggtgaactggatcccaaagccattgaa aaaatggatattgataactttgttggcagtactaccagggaggtctacaaggagttgatg agccatgtctcttctgaccacttacattttgatcaagggcccctagttgagcctgcagca gataaagactggtcgaccttcctgtcctcagcctttctacgtcaagaaaaacagcctcaa atcatcaaccctgagctggttggactttactcccagagagcaaacactttacagtccaat 
actaagaagatgccctccctcccatcactccaagctaccaaaagctgggagaagtggtca aacaaggcctccttgatgaactcatggaaaaccaccttgtctgtggatgactacttcaag tacctcaccaaccatgaaacagatttccttcatgtcatctttcaaatgcatgaagaagag gttcctgtggagattgtggcccctgccagagagtccctagagattcagcaccctccccca ttgctagaagatgaagaaccagactttgtgccaggagagtgggattggaacactgtgcta gagcacaggctaggagctgggaagacaccccacctgggagaaccccacaaaattctgagc ctgcagaagcatctggaacaactgtggtctgtgcttgaggtccctgacaaggaccaggtg gacatgaccattaaatatagctccaaagcccgcctgaggcagctgccttcattggtgaat gcctgggagcgggccctgaagcccattcagctgcgggaggcattgctggcgagactagag tggtttgagggacaagcttccaatcccaaccgcttcttcaaaaagaccaacttgagctcc agtcacttcctggaggagaatcaggtccgaagccatctccacaggaagctcaacttaatg gagtcttctttggtttccctcctggaggagatagagttaatctttggcgagccagtgatc ttcaaggggcggccctacctggacaagatgaagagtgacaaagtggagatgctctattgg ctgcaacagcagcggcgggttcgccacctggtctcggccctgaaggatccccaccagtca accctgttcaggagctcagcagccagcctttag >gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_2|240_aa MFQGVQDVEVHLEDQMVLVHTTLPSQEVQALLEGTGRQAVLKGMGSGQLQNLGAAVAILG GPGTVQGVVRFLQLTPERCLIEGTIDGLEPGLHGLHVHQYGDLTNNCNSCGNHFNPDGAS HGGPQDSDRHRGDLGNVRADADGRAIFRMEDEQLKVWDVIGRSLIIDEGEDDLGRGGHPL SKITGNSGERLACGIIARSAGLFQNPKQICSCDGLTIWEERGRPIAGKGRKESAQPPAHL >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_2|723_bp atgtttcaaggtgtccaggatgtggaggtgcacttggaggaccagatggtcttggtacac accactctacccagccaggaggtgcaggctctcctggaaggcacggggcggcaggcggta ctcaagggcatgggcagcggccagttgcagaatctgggggcagcagtggccatcctgggg gggcctggcaccgtgcagggggtggtgcgcttcctacagctgacccctgagcgctgcctc atcgagggaactattgacggcctggagcctgggctgcatggactccacgtccatcagtac ggggaccttacaaacaactgcaacagctgtgggaatcactttaaccctgatggagcatct catgggggcccccaggactctgaccggcaccgcggagacctgggcaatgtccgtgctgat gctgacggccgcgccatcttcagaatggaggatgagcagctgaaggtgtgggatgtgatt ggccgcagcctgattattgatgagggagaagatgacctgggccggggaggccatccctta tccaagatcacagggaactccggggagaggttggcctgtggcatcattgcacgctccgct ggccttttccagaaccccaagcagatctgctcttgcgatggcctcaccatctgggaggag cgaggccggcccatcgctggcaagggccgaaaggagtcagcgcagccccctgcccacctt tga 
>gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_3|1108_aa MKIFVGNVDGADTTPEELAALFAPYGTVMSCAVMKQFAFVHMRENAGALRAIEALHGHEL RPGRALVVEMSRPRPLNTWKIFVGNVSAACTSQELRSLFERRGRVIECDVVKDYAFVHME KEADAKAAIAQLNGKEVKGKRINVELSTKGQKKGPGLAVQSGDKTKKPGAGDTAFPGTGG FSATFDYQQAFGNSTGGFDGQARQPTPPFFGRDRSPLRRSPPRASYVAPLTAQPATYRAQ PSVSLGAAYRAQPSASLGVGYRTQPMTAQAASYRAQPSVSLGAPYRGQLASPSSQSAAAS SLGPYGGAQPSASALSSYGGQAAAASSLNSYGAQGSSLASYGNQPSSYGAQAASSYGVRA AASSYNTQGAASSLGSYGAQAASYGAQSAASSLAYGAQAASYNAQPSASYNAQSAPYAAQ QAASYSSQPAAYVAQPATAAAYASQPAAYAAQATTPMAGSYGAQPVVQTQLNSYGAQASM GLSGSYGAQSAAAATGSYGAAAAYGAQPSATLAAPYRTQSSASLAASYAAQQHPQAAASY RGQPGNAYDGAGQPSAAYLSMSQGAVANANSTPPPYERTRLSPPRASYDDPYKKAVAMSK RYGSDRRLAELSDYRRLSESQLSFRRSPTKSSLDYRRLPDAHSDYARYSGSYNDYLRAAQ MHSGYQRRIAASCRAYFWKAITEVLSPPVRGLILGELRDSQGYSVDCWPKEKSTAAAAAI LAFCQKRPRREEEALLVSVRALVRMVKLFIGNLPREATEQEIRSLFEQYGKVLECDIIKN YGFVHIEDKTAAEDAIRNLHHYKLHGVNINVEASKNKSKTSTKLHVGNISPTCTNKELRA KFEEYGPVIECDIVKDYAFVHMERAEDAVEAIRGLDNTEFQGKRMHVQLSTSRLRTAPGM GDQSGCYRCGKEGHWSKECPIDRSGRVADLTEQYNEQYGAVRTPYTMSYGDSLYYNNAYG ALDAYYKRCRAARSYEAVAAAAASVYNYAEQTLSQLPQVQNTAMASHLTSTSLDPYDRHL LPTSGAAATAAAAAAAAAAVTAASTSYYGRDRSPLRRATAPVPTVGEGYGYGHESELSQA SAAARNSLYDMARYEREQYADRARYSAF >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_3|3327_bp atgaagatattcgtgggcaacgtcgacggggcggatacgactccggaggagctggcagcc ctctttgcgccctacggcacggtcatgagctgcgccgtcatgaaacagttcgccttcgtg cacatgcgcgagaacgcgggcgcgctgcgcgccatcgaagccctgcacggccacgagctg cggccggggcgcgcgctcgtggtggagatgtcgcgcccaaggcctcttaatacttggaag attttcgtgggcaatgtgtcggctgcatgcacgagccaggaactgcgcagcctcttcgag cgccgcggacgcgtcatcgagtgtgacgtggtgaaagactacgcgtttgttcacatggag aaggaagcagatgccaaagccgcaatcgcgcagctcaacggcaaagaagtgaagggcaag cgcatcaacgtggaactctccaccaagggtcagaagaaggggcctggcctggctgtccag tctggggacaagaccaagaaaccaggggctggggatacggccttccctggaactggtggc ttctctgccaccttcgactaccagcaggcttttggcaacagcactggtggctttgatggg caagcccgtcagcccacaccacccttctttggtcgcgaccgcagccctctgcgccgttca cctccccgagcctcttatgtggctcctctgacggcccagccagctacctaccgggcccag 
ccgtccgtgtcactgggagctgcctacagggcccagccttctgcctctttgggtgttggc tatcggactcagcccatgacagcccaggcagcctcttaccgcgctcagccctctgtctcc cttggggcaccatacaggggccagctggctagtcctagctcccagtctgctgcagcttct tcactcggcccatatggtggagcccagccctcagcctcggccctttcctcctatgggggt caggcagctgcagcttcttcgctcaactcctatggggctcagggttcctcccttgcctcc tatggtaaccagccatcctcttacggcgcccaggctgcctcttcctatggggttcgtgca gctgcttcttcctacaacacccagggagcagcttcctccttaggctcctacggggctcag gcagcctcctatggggcccagtctgcagcctcctcactagcttatggagcccaggcagct tcatataatgcccagccctcggcctcttacaatgcccagtctgccccatatgctgcacag caggctgcttcctactcttcccaacctgctgcctatgtggcacagccagccacagctgct gcctatgccagccagccagcagcctacgccgcacaagccactaccccaatggctggctcc tatggggcccagccggttgtgcagacccagctgaatagttacggggcccaagcatcaatg ggcctttcaggctcctatggggctcagtcggctgctgcggccactggctcctatggtgcc gcagcagcctacggggcccaaccttctgccaccctggcagctccttaccgcactcagtca tcagcctcattggctgcttcctatgctgcccagcagcatccccaggctgctgcctcctac cgcggccagccaggcaatgcctacgatggggcaggtcagccgtctgcagcctacctgtcc atgtcccagggggccgttgccaacgccaacagcaccccgccgccctatgagcgtacccgc ctctccccaccccgggccagctacgacgatccctacaaaaaggctgtcgccatgtcgaaa aggtatggttccgaccggcgtttagccgagctctctgattaccgccgtttatcagagtcg cagctttcgttccgccgctcgccgacaaagtcctcgctggattaccgtcgcctgcccgat gcccattccgattacgcacgctattcgggctcctataatgattacctgcgggcggctcag atgcactctggctaccagcgccgcattgcagcctcctgccgggcctacttctggaaggcc attacagaggtcctttcaccacctgtgagaggtttgattttgggggagctccgggactct caaggctacagtgtggactgctggcccaaggagaagagcactgctgcggccgccgccatt ttagcgttttgtcagaagcgtccgcgccgcgaggaggaggccctgctggtttctgtgcgg gctcttgtcaggatggtgaagctgttcatcggaaacctgccccgggaggctacagagcag gagattcgctcactcttcgagcagtatgggaaggtgctggaatgtgacatcattaagaat tacggctttgtgcacatagaagacaagacggcagctgaggatgccatacgcaacctgcac cattacaagcttcatggggtgaacatcaacgtggaagccagcaagaataagagcaaaacc tcaacaaagttgcatgtgggcaacatcagtcccacctgcaccaataaggagcttcgagcc aagtttgaggagtatggtccggtcatcgaatgtgacatcgtgaaagattatgccttcgta cacatggagcgggcagaggatgcagtggaggccatcaggggccttgataacacagagttt 
caaggcaaacgaatgcacgtgcagttgtccaccagccggcttaggactgcgcccgggatg ggagaccagagcggctgctatcggtgcgggaaagaggggcactggtccaaagagtgtccg atagatcgttcaggccgcgtggcagacttgaccgagcaatataatgagcaatacggagca gtgcgtacgccttacaccatgagctatggggattcattgtattacaacaacgcgtacgga gcgctcgatgcctactacaagcgctgccgtgctgcccggtcctatgaggcagtggcagct gcagctgcctccgtgtataattacgcagagcagaccctgtcccagctgccacaagtccag aatacagccatggccagtcacctcacctccacctctctcgatccctacgatagacacctg ttgccgacctcaggagctgctgccacagctgctgctgcagcagcagccgctgctgctgtt actgcagcttccacttcatattacgggcgggatcggagccccctgcgtcgcgctacagcc ccagtccccactgttggagagggctacggttacgggcatgagagtgagttgtcccaagct tcagcagccgcgcggaattctctgtacgacatggcccggtatgagcgggagcagtatgcc gatcgggcgcggtactcagccttttaa >gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_4|359_aa MVKLFIGNLPREATEQEIRSLFEQYGKVLECDIIKNYGFVHIEDKTAAEDAIRNLHHYKL HGVNINVEASKNKSKASTKLHVGNISPTCTNQELRAKFEEYGPVIECDIVKDYAFVHMER AEDAVEAIRGLDNTEFQGKRMHVQLSTSRLRTAPGMGDQSGCYRCGKEGHWSKECPVDRT GRVADFTEQYNEQYGAVRTPYTMGYGESMYYNDAYGALDYYKRYRVRSYEAVAAAAAASA YNYAEQTMSHLPQVQSTTVTSHLNSTSVDPYDRHLLPNSGAAATSAAMAAAAATTSSYYG RDRSPLRRAAAMLPTVGEGYGYGPESELSQASAATRNSLYDMARYEREQYVDRARYSAF >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_4|1080_bp atggtgaagctgttcatcggaaaccttccccgggaggctacagagcaggagattcgctca ctcttcgagcagtatgggaaggtgctggaatgtgacatcattaagaattacggctttgtg cacatagaagacaagacggcagctgaggatgccatacgcaacctgcaccattacaagctt catggggtgaacatcaacgtggaagccagcaagaataagagcaaagcttcaaccaagtta cacgtgggtaacatcagccccacttgtaccaaccaagagcttcgagccaagtttgaggag tatggtccggtcatcgaatgtgacatcgtgaaagattatgccttcgtacacatggagcgg gcagaggatgcagtggaggccatcaggggccttgacaacacagagtttcaaggcaaaaga atgcatgtgcagttgtccacaagccggcttcggactgcccctggtatgggagaccagagt ggctgctatcggtgtgggaaagaagggcactggtccaaagagtgcccagtagatcgtacg ggtcgtgtggcagactttactgagcagtataatgaacaatatggagcagttcgaacacct tacaccatgggctacggggaatccatgtattacaacgatgcatatggagcactcgactac tataagcgataccgggtccgctcttatgaggcagtagcagcggcggcagcggcttctgca tacaactacgcagagcagaccatgtcccatctgcctcaagtccaaagcacaactgtgacc 
agccacctcaactctacttctgttgatccctatgacagacacctattgccaaactctggc gctgctgccacttcagctgctatggctgctgctgcagccaccacttcctcctactatgga agggacaggagcccactgcgtcgtgctgcagccatgctccccacagttggagagggctac ggttatgggccagagagtgaattatctcaggcttccgcagctacacggaattctctgtat gacatggcccggtatgaacgggagcagtatgtggaccgagcccggtactcagccttttaa >gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_5|2631_aa MALGGEGATRAPPRAPRGRRLQMCVGEGHAVGCGGRNWPGEKADASGGGEAAACGVSVAR KLIGLCGPPRTELPSKACSRGSRKPPTTMSSTLSPTDFDSLEIQGQYSDINNRWDLPDSD WDNDSSSARLFERSRIKALAAGSMNGNGVSRVHGNGLHEASEFYYEAVEGAHNPGGLLLS PAAFINPAQYASVLEGRFKQLQDEREAVQKKTFTKWVNSHLARVTCRVGDLYSDLRDGRN LLRLLEVLSGEILPKPTKGRMRIHCLENVDKALQFLKEQKVHLENMGSHDIVDGNHRLTL GLVWTIILRFQVPQHTVTQGVVPALALHRKPFLLAHDTTNTKMEDLEPYVAAAMAALSKH QISGLPSFSRIQDISVETEDNKEKKSAKDALLLWCQMKTAGYPNVNVHNFTTSWRDGLAF NAIVHKHRPDLLDFESLKKCNAHYNLQNAFNLAEKELGLTKLLDPEDVNVDQPDEKSIIT YVATYYHYFSKMKALAVEGKRIGKVLDHAMEAERLVEKYESLASELLQWIEQTIVTLNDR QLANSLSGVQNQLQSFNSYRTVEKPPKFTEKGNLEVLLFTIQSKLRANNQKVYTPREGRL ISDINKAWERLEKAEHERELALRTELIRQEKLEQLAARFDRKAAMRETWLSENQRLVSQD NFGLELAAVEAAVRKHEAIETDIVAYSGRVQAVDAVAAELAAERYHDIKRIAARQHNVAR LWDFLRQMVAARRERLLLNLELQKVFQDLLYLMDWMEEMKGRLQSQDLGRHLAGVEDLLQ LHELVEADIAVQAERVRAVSASALRFCNPGKEYRPCDPQLVSERVAKLEQSYEALCELAA ARRARLEESRRLWRFLWEVGEAEAWVREQQHLLASADTGRDLTGALRLLNKHTALRGEMS GRLGPLKLTLEQGQQLVAEGHPGASQASARAAELQAQWERLEALAEERAQRLAQAASLYQ FQADANDMEAWLVDALRLVSSPELGHDEFSTQALARQHRALEEEIRSHRPTLDALREQAA ALPPTLSRTPEVQSRVPTLERHYEELQARAGERARALEAALALYTMLSEAGACGLWVEEK EQWLNGLALPERLEDLEVVQQRFETLEPEMNTLAAQITAVNDIAEQLLKANPPGKDRIVN TQEQLNHRWQQFRRLADGKKAALTSALSIQNYHLECTETQAWMREKTKVIESTQGLGNDL AGVLALQRKLAGTERDLEAIAARVGELTREANALAAGHPAQAVAINARLREVQTGWEDLR ATMRRREESLGEARRLQDFLRSLDDFQAWLGRTQTAVASEEGPATLPEAEALLAQHAALR GEVERAQSEYSRLRALGEEVTRDQADPQCLFLRQRLEALGTGWEELGRMWESRQGRLAQA HGFQGFLRDARQAEGVLSSQEYVLSHTEMPGTLQAADAAIKKLEDFMSTMDANGERIHGL LEAGRQLVSEGNIHADKIREKADSIERSKFQSFTRGCRSRHKKNQDAAQQFLGRLRDNRE QQHFLQDCHELKLWIDEKMLTAQDVSYDEARNLHTKWQKHQAFMAELAANKDWLDKVDKE 
GRELTLEKPELKALVSEKLRDLHRRWDELETTTQAKARSLFDANRAELFAQSCCALESWL ESLQAQLHSDDYGKDLTSVNILLKKQQMLEWEMAVREKEVEAIQAQAKALAQEDQGAGEV ERTSRAVEEKFRALCQPMRERCRRLQASREQHQFHRDVEDEILWVTERLPMASSMEHGKD LPSVQLLMKKNQARRGSRGPRKERGTGVKGGLREHVGLGQDFQHFLFCGHGQTLQKEIQG HEPRIADLRERQRALGAAAAGPELAELQEMWKRLGHELELRGKRLEDALRAQQFYRDAAE AEAWMGEQELHMMGQEKAKDELSAQAEVKKHQVLEQALADYAQTIHQLAASSQDMIDHEH PESTRISIRQAQVDKLYAGLKELAGERRERLQEHLRLCQLRRELDDLEQWIQEREVVAAS HELGQDYEHVTMLRDKFREFSRDTSTIGQERVDSANALANGLIAGGHAARATVAEWKDSL NEAWADLLELLDTRGQVLAAAYELQRFLHGARQALARVQHKQQQLPDGTGRDLNAAEALQ RRHCAYEHDIQALSPQVQQVQDDGHRLQKAYAGDKAEEIGRHMQAVAEAWAQLQGSSAAR RQLLLDTTDKFRFFKAVRELMLWMDEVNLQMDAQERPRDVSSADLVIKNQQGIKAEIEAR ADRFSSCIDMGKELLARSHYAAEEISEKLSQLQARRQETAEKWQEKMDWLQLVLEVLVFG RDAGMAEAWLCSQEPLVRSAELGCTVDEVESLIKRHEAFQKSAVAWEERFCALEKLTALE EREKERKRKREEEERRKQPPAPEPTASVPPGDLVGGQTASDTTWDGFLSLQPLLGQQRLE HSSFPEGPGPGSGDEANGPRGERQTRTRGPAPSAMPQSRSTESAHAATLPPRGPEPSAQE QMEGMLCRKQEMEAFGKKAANRSWQNVYCVLRRGSLGFYKDAKAASAGVPYHGEVPVSLA RAQGSVAFDYRKRKHVFKLGLQDGKEYLFQAKDEAEMSSWLRVVNAAIATASSASGEPEE PVVPSTTRGMTRAMTMPPVSPVGAEGPVVLRSKDGREREREKRFSFFKKNK >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_5|7896_bp atggcgctgggcggggaaggggcgactcgggccccgcccagagccccgcggggccgccgg ctccagatgtgtgtgggagaggggcacgcggttggctgtggtggccggaactggcccggg gagaaagcagacgcttcgggcggcggggaagcggccgcgtgcggcgtttctgtggccagg aagctgatagggctgtgcggtccgccccgcacggaactacccagcaaggcctgctcgcgg gggagcaggaagccgcctaccaccatgagcagcacgctgtcacccacagactttgacagc ttggaaatccagggccagtacagtgacatcaacaaccgctgggaccttcctgactcggac tgggacaatgacagcagctcggcccgcctctttgagaggtctcgcattaaggctctggca gctggcagcatgaatgggaacggagtgagcagggtgcacgggaacggcctgcatgaggcc agcgagttttactatgaggctgtggaaggggcccacaaccccgggggcctcctgctttca ccagctgctttcatcaaccctgctcagtatgccagcgtgctggaaggacgcttcaaacag ctgcaagatgaacgagaagctgtgcagaagaaaaccttcaccaagtgggtaaactcgcac ctggcccgggtcacgtgccgggtgggggacctgtacagcgacctccgggacggacgcaac ctgctgaggctcctcgaggtgctctcgggagagatactgccaaagcctacaaagggccgc 
atgcggatccactgcctggagaacgtggacaaggcactgcagttcctcaaggagcagaaa gtgcacttggaaaacatgggctcccatgacattgtggacggaaaccaccgactgaccctt gggctggtctggaccatcatccttcgattccaggtaccccagcacactgtcacacagggt gtggttcctgccctggctctgcaccgcaagccgttcctgctggcccatgacacaacaaac acaaagatggaggatctagaaccttatgtagcagctgccatggcggcgctatccaagcac cagatcagtggactcccctctttcagcaggatccaagacatcagtgtggagacagaagac aacaaggagaagaagtcagccaaggatgccctgcttctgtggtgccagatgaagactgca ggttatcccaacgtcaatgtacacaacttcaccaccagctggagagatggactagctttc aacgccatcgtgcataaacaccggccagacctgctggattttgagtctctgaagaagtgt aatgcacactataatctgcagaatgcattcaatctggctgaaaaggaactgggacttacc aagctgctggatcccgaagacgtgaatgtggaccagccagatgagaagtcaatcattacc tatgtggctacttactaccattacttctccaagatgaaggccctggccgtggaaggcaag agaattggcaaggtgctggaccatgccatggaggcagagcgcctggtggagaaatacgag tccctggcctcggagctgctgcagtggatcgagcaaacgatcgtgaccctcaatgaccgg cagttggccaactcccttagcggggtccagaaccagctgcagtccttcaactcctaccgc accgtggagaagccgcccaagtttaccgagaaagggaacttggaagtgctgctcttcacc atccagagcaagcttcgggccaacaaccagaaggtctacacgccccgcgagggccggctc atctcggacatcaacaaggcttgggagcggctggagaaggcggagcacgagcgtgagctg gccctgcgcaccgagctcatccgccaggagaagctggagcagctggccgcccgcttcgac cgcaaggctgccatgcgggagacctggctcagcgagaaccagcgcctcgtgtcccaggac aactttgggctggagctggcagctgtcgaggcagcagtacggaagcacgaagccattgag acggacatcgtggcctacagcggccgggtgcaggcagtggacgccgtggctgcagagctg gccgccgagcgctaccacgacatcaagcgcatcgccgctcggcagcacaacgtggcacgg ctctgggacttcttgcggcagatggtggccgcccggcgggagcggctcctcctcaacctg gagctgcagaaggtgttccaggacctgctctacctcatggactggatggaagagatgaag ggccggctgcagtctcaggacctgggcaggcacctagcaggagtggaggacctgctgcag ctgcacgagctggtggaggcagacatcgccgtgcaggccgagagggtgcgggccgtcagc gcctctgccctgcgcttctgcaacccagggaaagagtatagaccttgcgacccgcagctg gtgtcggagcgggtggccaagctagagcagagctatgaggcactgtgcgagttggcagcg gcgcggcgggcccggctggaggaatcacggcggctctggcgtttcctctgggaggtgggt gaagctgaggcctgggtgcgggagcagcagcacctcctggcctcagccgacacgggccga gacctgaccggtgccctccgcctgctcaacaagcacacagccctgcggggcgagatgagc 
ggccggctggggcccctgaagctcaccctggagcagggccagcagttggtggccgagggt caccctggggcaagccaggcctctgcccgtgcagctgaactccaagcccagtgggagcgg ctagaggccctggccgaggagcgtgcccagcggctggcccaagccgccagcctctaccag ttccaggccgatgcaaacgacatggaggcctggttggttgacgcactgcgcctggtgtcc agccccgagctggggcacgacgagttctccacgcaggctctagccaggcagcatcgggcc ctggaggaggagattcgaagccaccggccaaccctggacgccttgagggaacaggcagca gccctgccccccacactgagccgcacgcccgaggtgcagagccgggtgcccaccctggag cggcactacgaggagctgcaggcccgggcaggcgagcgagcgcgggccttggaggcagcc ctggcgctctacaccatgctcagcgaggccggggcctgtggactctgggtggaggagaag gagcagtggctcaacgggctggccctgcctgaacgcctggaggacctggaggtcgtgcag cagaggttcgagaccctggagcctgaaatgaacacccttgcagcacaaatcaccgcggtg aatgacattgccgagcagttactgaaggccaaccccccaggcaaagaccgcattgtcaac acccaggagcagctcaaccacaggtggcagcagtttcggcgtctggcagacggcaagaag gcagctctcacctcagccctgagcatccagaactaccacttagagtgcacggagacccag gcctggatgagagagaagaccaaagtcatcgagtccacccagggcctaggcaacgatctg gctggggtgctggccctgcagcgcaagctggccggcacggagcgggacctggaggccatc gccgcccgggtgggcgaactgactcgagaggcaaatgccctggctgccggccatcccgct caggcagtggccatcaacgcccggctgagagaggtgcagaccggctgggaggacctcagg gccaccatgcggcgtcgagaagagtcgctgggggaggcgcggcggctgcaggacttcttg cgcagcttggatgacttccaggcctggctaggccgcactcagactgctgtggcctctgaa gaagggccggccaccctgcctgaggcagaggccctcctggcccaacatgcagccctgcgg ggagaggtggagcgggcccagagcgagtatagccggctgcgagccctgggcgaggaggtg acccgggaccaggctgacccccagtgcctcttcctacgacagcgactggaggccctggga actggctgggaggagctgggccgaatgtgggagagccggcaaggtcgcctggcccaggcc cacggcttccagggattcctgcgggatgctcgtcaggctgagggcgtgctcagcagccag gaatatgttctgtctcacacggagatgccagggacactccaggctgctgatgctgccatt aaaaaactggaggacttcatgagcaccatggacgccaatggggaacggatccacgggctc ctggaggctggccgccagctggtatctgaaggcaacatccacgccgacaagattcgggaa aaggcagactccattgagaggagcaaattccagagtttcacaagagggtgtcgttccagg cacaagaagaatcaagacgcagcgcagcaatttctgggccgtcttcgggacaaccgggag cagcagcatttcctgcaagattgtcacgagctgaagctctggatcgacgagaagatgctg acagcccaggacgtgtcctatgacgaggcccgcaacctgcatactaagtggcagaagcac 
caggcattcatggccgagctggctgccaacaaagactggctggacaaggtggacaaggaa gggcgagagctcacccttgagaagccagagctgaaagccctggtgtcggagaagctgaga gacctgcacaggcgctgggacgagctggagaccaccacccaagccaaggcccgcagcctc tttgatgccaaccgagctgagctgtttgcccagagctgctgtgccctggagagctggctg gagagcctgcaggcccagctgcactcggatgactacggcaaggacctcaccagcgtcaac atcctgctcaagaagcagcagatgctggaatgggagatggctgtgagagagaaggaggtg gaggcaatccaggcccaggccaaagcactggcccaggaggaccagggtgcaggggaggtg gagagaacctcgagggccgtggaggagaagttcagggccttgtgccagcccatgcgggaa cgctgccggcgcctgcaggcttctcgcgagcagcaccagttccaccgcgatgtggaagat gagattttgtgggtgacagagcggctgcccatggccagctccatggagcatggcaaggac ctgcccagcgtccagcttctcatgaagaaaaaccaggctaggagagggagcagaggaccc aggaaggagagaggcacaggggtgaagggtggtctccgggagcacgtggggctggggcag gactttcagcattttctcttctgtggccatgggcagaccctgcagaaagagattcagggc catgagccccggatcgcggacctgagggagcggcagcgtgctctaggtgcagcagcagca ggtccagagctggctgagctgcaggaaatgtggaaacgcctgggccacgagctggaactt cgagggaagcgactggaggatgccctgcgagcccagcagttctaccgcgatgccgccgag gcggaggcctggatgggcgagcaggaattacacatgatgggccaggagaaggccaaggat gagctgagtgcccaggcagaggtgaagaagcaccaggtgctggagcaagccctggccgac tacgcgcagaccatccaccagctggcggccagcagccaggacatgattgaccacgagcac ccagagagcactcggatatccatccgccaagcccaggtggacaagctgtatgccggcctg aaggagctggctggagagcggcgggagcgcctgcaggagcacctccggctgtgccagctc cgccgcgagctggatgacctggaacagtggatccaggagcgcgaggtggtggcggcctcc cacgagctgggccaggactacgagcatgtgactatgctccgagacaaattccgagagttc tcccgggacacaagcaccatcggtcaggagcgcgtagatagcgccaatgcgctggccaat gggctcattgctgggggccatgctgcacgggccaccgtggccgagtggaaggacagtctc aacgaggcctgggctgacctgcttgagctgctggacacacggggtcaggtgctggccgcg gcgtacgagctgcagcgcttcctgcacggggcacgccaagccctggcgcgggtgcagcac aagcagcagcagcttccggacgggactggccgcgacctcaacgctgccgaggccctgcag cgccgacactgtgcctacgagcatgacattcaggccctcagcccccaggtccagcaggtg caggacgacggccaccggctccagaaggcctacgctggagacaaggctgaggagatcggc cgccacatgcaggccgtggccgaggcctgggcccagcttcagggaagctctgccgcccgc cggcagctgctgctggacaccacagacaagttccgcttcttcaaggctgtccgggaactg 
atgctctggatggatgaggtcaacctgcagatggatgcccaggagcgtccccgggatgtg tcctccgcggatctagtcatcaagaaccagcaaggcatcaaggcagagatagaggcccgg gcagaccgcttctcctcctgcatcgacatggggaaggagctgctggccaggagccactat gcggccgaggagatctcagagaagctgtctcagctgcaggcacggcgccaggagacagct gagaagtggcaggagaagatggactggcttcagctggttttggaggtgcttgtgtttgga agagatgcagggatggcagaggcctggctctgcagccaggagccactggtgcgcagcgct gagctgggttgcacggtcgacgaagttgagagcctcatcaagcggcacgaggccttccag aagtcagcagtggcctgggaggagcgattctgtgcgctggagaagcttactgcgctagag gagcgggagaaggagcgaaagagaaagagggaggaggaggagcggcggaaacagccgcct gctcccgaacccacagccagtgtgcctccaggggacctggtgggcggccagacagcttct gacaccacctgggacggattcttgtccttgcagcccctgctgggacaacagagacttgag cacagcagcttccccgaagggccgggacctggctcaggggacgaagccaatgggccccgg ggagagaggcagacccggactcggggcccggccccatctgcaatgccccagagcaggtct accgagtcagcccatgctgccaccctgccgcctcgaggcccagagccatctgcccaggag cagatggaggggatgctgtgccgcaagcaggagatggaggccttcgggaagaaggctgcc aacaggtcctggcagaacgtgtactgtgtcctgcggcgtgggagcctcggcttttacaag gatgccaaggcagccagcgcgggagtgccataccacggagaagtgcctgtcagcctggcc agggcccagggcagcgtcgcctttgattaccgaaagcgcaaacatgtcttcaagctgggc ttacaggatggaaaagaatatttattccaggccaaggatgaggcagagatgagctcgtgg ctacgggtggtgaatgcagccattgccacagcgtcttctgcctctggagagcctgaagag ccggtggtgcccagcaccacccggggcatgacccgggccatgaccatgcccccagtgtca cccgtcggggctgaggggcctgttgtgctccgcagcaaagacggcagagaacgagagcga gaaaaacgcttcagcttctttaagaagaacaagtag >gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_6|139_aa MGSRCRAWAWTRAWALAEFQARAEEGAAAAAAAAGGYPSTGRCRCSLRGMEGTAVAVFEI LRFLIIHWKCDIDVSKGALLEGQLVISIEGLNSKHQANALHCVTTIASAGSLFGGMVLKK FLKGKELPNFHETEYLPDN >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_6|420_bp atgggctcgcgctgccgggcgtgggcgtggactcgggcgtgggcactggcggagttccaa gcccgggctgaggagggggcggcggcggcggcggcggcggcgggcgggtacccttcgact gggcgttgccgctgttccctgcgcggcatggaggggacggccgtggccgtgttcgagatt ttgagatttttaataattcactggaagtgtgacatagatgtatcaaagggagcattgcta gaagggcagctagtgatttccatagaaggattaaattctaagcaccaggcaaatgctctt 
cattgtgtaacaactattgcttctgcaggaagcctttttggtggcatggtcctcaagaag ttcctaaaaggtaaagaattacctaacttccatgagactgaatatcttcccgacaattaa >gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_7|222_aa MLGFCVAGFHTPGARPAFPAAPGRRPGCAPALQAAQFARVFCGRAPCGLAGPAPPAVGVA CEGDKALIDFLSDEIQEERKIQKHKTLPKMSGVWELELNGTEAKLVRKVAGGKKSLSLSI LTTASHQHLMRMKLDKKRRLSDIFSIREVSFWSTSEFEWKDTNYTLNTDSLDWALYDHLM DFLADQGVDNIFADELVELRTAPEHQAYITFLEDLKSFVKSQ >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_7|669_bp atgctgggcttctgcgtcgccggcttccacacccccggggcccgccccgcctttccggca gctcctggccggcgccccggctgtgcgcccgcccttcaggctgctcagtttgcgcgcgtg ttctgcggccgggccccctgcggcctcgcgggccctgcacctccggctgtgggtgtggcg tgcgaaggagacaaagctttgattgatttcctgagtgatgaaattcaggaggaaaggaaa atccagaagcataaaacccttcctaagatgtctggagtttgggagctggaactgaatggg acagaagctaaattagtgcggaaagttgctggggggaaaaaatcactgtcactttcaata ttaacaacagcatcccaccaacatttgatgaggatgaagctggacaagaagaggaggctg agtgacatcttctctatcagggaagttagcttttggtccaccagcgagtttgaatggaag gatactaactatacactcaacacagattccctggactgggccttatatgaccacctaatg gatttccttgcggaccaaggggtggacaacatttttgcagatgagttggtggagcttcgc acagccccggagcaccaggcgtacattacttttcttgaagacctcaaaagttttgtcaag agccagtag >gi568815587r:66585874_66821240|GENSCAN_predicted_peptide_8|251_aa MLALQLQEILCTLTLLSQFVEEVASAFQCHIIVVKIEAQNERDVGGLQIQIDQAIDSCLQ LDEIILMSLVGKKELTTKTVLADIISPLYDTPYNHQGYKALKVFQEVVLEVVDEKPRTLM TDCLVIKHFLRKIIMVHPKVRFHFSVKILVQRISNVPNLSTLGLYKDLVLPDVSYQVESS EEDQSQTMDPQGQTLLLFLFVDFHSAFPVQQMEIWERLSMIEIKQWFHFLVHETHASESL DDDDDDDDDDF >gi568815587r:66585874_66821240|GENSCAN_predicted_CDS_8|756_bp atgctggctttgcaacttcaagagatcctctgcacccttacacttctcagccaatttgtg gaagaagtggccagtgccttccagtgccacatcattgtggtcaaaatagaagcccagaat gagagagatgtaggaggcctacagatacaaattgaccaggcgattgacagctgcctccaa ctcgatgaaataattctgatgagtctagttggcaagaaggagctaaccacaaaaacagtg ttggctgatataattagccctctctatgatacgccctataatcaccaaggttacaaagct ttgaaagtgttccaagaagttgttttggaagtggttgatgaaaagcccagaaccttgatg acagattgtctggttataaagcattttttacgtaaaatcatcatggtgcaccctaaggtc 
agatttcatttcagtgtaaagatacttgttcagaggatttctaatgtacctaatctgtca actctgggcctttacaaagatttggtgcttccagatgtgagttatcaggtggaatccagt gaggaggatcagtctcagactatggatcctcaaggacaaactctgctgctttttctcttt gtggatttccacagtgcatttccagtccagcaaatggaaatctgggaaaggttatctatg atagagataaaacagtggtttcattttctggtccatgaaacacatgcatcagaatcactg gatgatgatgatgatgatgatgatgatgatttttga