GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:29:00 Sequence gi568815581r:2286904_2500712 : 213809 bp : 48.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 5727 5649 79 2 1 74 107 89 0.952 8.95 1.05 Intr - 6074 5968 107 2 2 46 80 93 0.981 3.31 1.04 Intr - 10450 10340 111 1 0 122 72 157 0.999 18.08 1.03 Intr - 11152 10960 193 0 1 24 82 241 0.999 16.59 1.02 Intr - 13761 12003 1759 1 1 51 80 1358 0.832 118.62 1.01 Init - 16817 16730 88 2 1 106 105 112 0.994 14.28 1.00 Prom - 17365 17326 40 -7.76 2.00 Prom + 19247 19286 40 -10.05 2.01 Sngl + 19845 20681 837 2 0 72 49 645 0.997 54.75 2.02 PlyA + 21131 21136 6 1.05 3.00 Prom + 27995 28034 40 -6.16 3.01 Init + 28658 28825 168 1 0 66 87 99 0.881 7.13 3.02 Intr + 30967 31093 127 0 1 98 75 128 0.999 12.75 3.03 Intr + 31923 32026 104 1 2 138 94 57 0.998 11.19 3.04 Intr + 34403 34522 120 1 0 29 98 107 0.984 6.49 3.05 Intr + 34639 34713 75 0 0 54 105 23 0.542 0.31 3.06 Term + 36233 36469 237 1 0 74 43 243 0.601 14.57 3.07 PlyA + 36614 36619 6 -1.75 4.14 PlyA - 37103 37098 6 -0.45 4.13 Term - 37471 37293 179 2 2 128 39 105 0.999 7.45 4.12 Intr - 37926 37786 141 1 0 113 90 70 0.998 10.02 4.11 Intr - 42572 42440 133 2 1 62 71 61 0.235 2.02 4.10 Intr - 43722 43612 111 0 0 92 94 0 0.257 1.58 4.09 Intr - 44206 44044 163 0 1 55 55 84 0.310 1.78 4.08 Intr - 45456 45266 191 0 2 69 57 236 0.353 16.98 4.07 Intr - 46221 46058 164 1 2 74 110 171 0.999 17.59 4.06 Intr - 46813 46654 160 1 1 104 56 152 0.997 13.16 4.05 Intr - 47993 47569 425 0 2 92 94 221 0.987 16.69 4.04 Intr - 48491 48357 135 0 0 105 92 21 0.944 4.74 4.03 Intr - 48827 48608 220 2 1 102 66 82 0.995 5.27 4.02 Intr - 49237 49134 104 2 2 77 81 108 0.998 8.89 4.01 Init - 49524 49428 97 0 1 91 86 70 0.996 7.58 4.00 Prom - 51882 51843 40 -6.66 5.00 Prom + 55211 55250 40 1.94 5.01 Init + 56621 56717 97 1 1 73 89 92 0.093 8.27 5.02 Intr + 74734 74896 163 2 1 81 99 122 0.504 11.63 5.03 Intr + 75206 75367 162 2 0 105 60 199 0.999 17.89 5.04 Intr + 75935 76002 68 2 2 80 71 138 0.592 9.85 5.05 Intr + 76086 76231 146 1 2 85 61 171 0.999 14.10 5.06 Intr + 76562 76696 135 1 0 48 74 203 0.995 15.66 5.07 Intr + 77156 77280 125 1 2 58 59 166 0.992 10.08 5.08 Intr + 77693 77760 68 2 2 72 80 93 0.978 5.45 5.09 Intr + 77994 78154 161 1 2 67 88 67 0.944 4.31 5.10 Intr + 78312 78438 127 2 1 134 79 203 0.996 24.25 5.11 Intr + 80368 80502 135 2 0 63 14 184 0.933 9.04 5.12 Intr + 84359 84512 154 0 1 126 117 36 0.996 9.43 5.13 Intr + 85287 85351 65 0 2 96 68 84 0.534 5.56 5.14 Intr + 85440 85585 146 1 2 111 45 177 0.996 15.70 5.15 Intr + 86050 86178 129 0 0 53 94 242 0.827 22.19 5.16 Intr + 86428 86610 183 0 0 1 50 264 0.980 13.88 5.17 Intr + 88589 88972 384 1 0 58 36 429 0.866 29.75 5.18 Intr + 89234 89358 125 1 2 90 51 191 0.947 14.98 5.19 Intr + 89830 89912 83 1 2 100 77 116 0.993 10.98 5.20 Intr + 90056 90165 110 0 2 129 94 168 0.994 21.50 5.21 Intr + 90954 91050 97 2 1 52 94 143 0.999 10.88 5.22 Intr + 92133 92300 168 1 0 63 68 311 0.215 26.52 5.23 Intr + 92529 92613 85 1 1 89 63 146 0.427 11.08 5.24 Intr + 93788 94002 215 2 2 60 75 13 0.388 -4.44 5.25 Term + 94463 94557 95 0 2 57 54 124 0.660 3.89 5.26 PlyA + 97131 97136 6 1.05 6.09 PlyA - 97192 97187 6 1.05 6.08 Term - 100746 99998 749 1 2 124 55 732 0.999 67.12 6.07 Intr - 101146 100954 193 1 1 47 93 412 0.995 36.67 6.06 Intr - 107251 107140 112 0 1 103 80 274 0.999 28.48 6.05 Intr - 107443 107402 42 0 0 103 89 42 0.898 3.16 6.04 Intr - 108551 107972 580 0 1 84 84 291 0.991 20.06 6.03 Intr - 108837 108687 151 0 1 105 70 10 0.295 0.64 6.02 Intr - 109079 108992 88 1 1 97 50 15 0.201 -1.43 6.01 Init - 113013 112859 155 0 2 81 11 194 0.671 8.46 6.00 Prom - 122773 122734 40 -6.06 7.13 PlyA - 122790 122785 6 -0.45 7.12 Term - 124384 124190 195 1 0 87 47 93 0.856 2.51 7.11 Intr - 127568 127492 77 0 2 78 110 57 0.568 6.13 7.10 Intr - 133693 133102 592 1 1 86 11 430 0.390 27.27 7.09 Intr - 134001 133828 174 0 0 108 77 123 0.999 13.34 7.08 Intr - 134677 134615 63 1 0 72 50 76 0.447 1.11 7.07 Intr - 151295 151206 90 2 0 90 100 -1 0.008 1.49 7.06 Intr - 154656 154587 70 2 1 95 60 41 0.019 1.18 7.05 Intr - 168711 168631 81 2 0 96 101 19 0.063 2.75 7.04 Intr - 177447 177305 143 0 2 99 115 47 0.841 7.65 7.03 Intr - 180973 180858 116 2 2 83 93 48 0.564 4.97 7.02 Intr - 186761 186621 141 0 0 48 77 53 0.292 0.52 7.01 Intr - 190982 190783 200 1 2 105 44 98 0.542 6.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 38479 38401 79 1 1 97 115 42 0.821 7.20 S.002 Init + 113802 113870 69 2 0 30 71 159 0.991 7.45 S.003 Term + 114766 114879 114 0 0 116 43 89 0.909 5.77 S.004 Init + 146221 146265 45 0 0 72 108 45 0.838 5.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_1|779_aa MAEGLERVRISASELRGILATLAPQAGSRENMKELKEARPRKDNRRPDLEIYKPGLSRLR NKPKIKEPPGSEEFKDEIVNDRDCSAVENGTQPVKDVCKELNNQEQNGPIDPENNRGQES FPRTAGQEDRSLKIIKRTKKPDLQIYQPGRRLQTVSKESASRVEEEEVLNQVEQLRVEED ECRGNVAKEEVANKPDRAEIEKSPGGGRVGAAKGEKGKRMGKGEGVRETHDDPARGRPGS AKRYSRSDKRRNRYRTRSTSSAGSNNSAEGAGLTDNGCRRRRQDRTKERPRLKKQVSVSS TDSLDEDRIDEPDGLGPRRSSERKRHLERNWSGRGEGEQKNSAKEYRGTLRVTFDAEAMN KESPMVRSARDDMDRGKPDKGLSSGGKGSEKQESKNPKQELRGRGRGILILPAHTTLSVN SAGSPESAPLGPRLLFGSGSKGSRSWGRGGTTRRLWDPNNPDQKPALKTQTPQLHFLDTD DEVSPTSWGDSRQAQASYYKFQNSDNPYYYPRTPGPASQYPYTGYNPLQYPVGPTNGVYP GPYYPGYPTPSGQYVCSPLPTSTMSPEEVEQHMRNLQQQELHRLLRVADNQELQLSNLLS RDRISPEGLEKMAQLRAELLQLYERCILLDIEFSDNQNVDQILWKNAFYQVIEKFRQLVK DPNVENPEQIRNRLLELLDEGSDFFDSLLQKLQVTYKFKLEDYMDGLAIRSKPLRKTVKY ALISAQRCMICQGDIARYREQASDTANYGKARSWYLKAQHIAPKNGRPYNQLALLAVYT >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_1|2337_bp atggcggaagggctggagcgtgtgcggatctccgcgtcggagctgcgcgggatcctggct actctggccccgcaggccgggagcagagaaaacatgaaggaattaaaggaggccaggccg cgcaaagataacaggcgtccagatctggaaatctataagcctggcctttctcggctaagg aacaagcccaaaatcaaggaaccccctgggagtgaggaattcaaagatgaaattgttaat gaccgagattgctctgctgttgaaaatggtacacagcccgttaaagatgtctgcaaggaa ctgaacaaccaagagcagaatggtcctatagacccagaaaataatcggggacaagaatcc tttcctaggactgctggacaagaggatcgtagtctaaaaattatcaaaagaacaaagaaa cccgacctgcagatctatcagcctggacgacgtttgcagactgttagcaaagaatccgcc agtcgggtggaggaggaagaagtcctcaaccaggtagaacaactgagagtagaggaagat gagtgtaggggaaatgttgcgaaggaggaagttgcgaataaaccagacagggctgagata gaaaagagcccaggtggtgggagagtaggggctgcaaaaggagaaaaaggaaagaggatg ggaaaaggggagggggtgagggaaacccacgacgacccagcccgcgggaggccgggctcc gcaaagcgctactcccgctcagacaaacgaaggaatcgctaccgcacgcgcagcaccagc tcagctggcagcaacaacagcgctgagggagctggcctgacggataatggatgtcgccgc cgccgacaggataggaccaaggagaggccacgactgaagaagcaagtgtctgtgtcctca accgattccttagacgaggacagaattgatgagcctgatggattaggacccaggagaagt tcagaaaggaagagacatttagaaagaaactggtctggccgtggggagggtgagcagaaa aacagtgctaaagaatatcgaggcactcttcgtgtcactttcgatgcagaagccatgaac aaagagtctcccatggtgaggtcagccagggatgatatggatagaggaaagcctgacaaa ggcttgagcagtgggggcaaaggctctgagaagcaggagtccaaaaacccgaaacaagaa cttcggggtcgtggtcgtggcattctgattttgcctgcccataccaccctatctgtcaat tcagcaggttctccagagtccgcgcctttgggacctcggcttttgtttggatctggtagt aagggatctcggagttggggccgtggaggcaccacacgccgattgtgggacccaaacaat cctgatcagaaacctgctctaaagactcagacgccccagctacatttcttggacactgat gatgaagtcagccctacatcttggggtgactcacgccaggctcaggcatcttactataag tttcaaaactctgacaacccctattattacccccggacaccaggccctgcctcccagtat ccctatacgggctataaccctctacagtacccagtgggccctacgaatggtgtgtaccca gggccttactacccaggctacccgactccgtcaggacagtatgtgtgtagccctctacct accagcaccatgagtcccgaggaggtagagcagcacatgaggaacctgcagcaacaggag ctgcacaggcttctccgggtggctgacaaccaggaactgcagctcagcaacctgctctcc agggaccgcatcagtccggagggcctggagaagatggcgcaactcagagctgaactgctg cagctatatgagcgctgtattctattagatattgagttctctgataatcagaatgtggat cagatcctgtggaagaatgctttctatcaggtgattgagaagttcaggcaacttgtcaag gatccgaatgttgagaacccagaacagattcggaacagacttttggagctcttggatgag ggtagtgacttctttgatagtttgcttcagaagctgcaggttacttacaagttcaaactg gaagactacatggatggtcttgccattcgcagcaagccattacgcaagacagtaaaatat gccttgatcagtgcccagcgatgcatgatatgccaaggagatattgctaggtaccgggag caagccagtgatacagcgaattatgggaaagcacgcagttggtacctgaaggcccagcac attgctcccaagaatgggcgcccctataaccagttggctttgctggcagtgtatacg >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_2|278_aa MDTAKSKSESPKELVQLKKLFIGGLSFETTNESLRSHSEQWGTLADCVVMRDPNTKHSRG FGFATYATEEEVDAATNARPHKVDGRVVEPKRAVSREDSQRPDAHLTMEKIFVGGINEDT EEYHLSDYFEQYGKTEVIEIMTDQGSGKKRGFAFVTFDDHDSVDKTVIQKYHTVNGHNCE IRKGLSKQEMVSASSSQRGQSTSGNFGGGRGGGFGRNDNFGCEGNFSGHGGFGGSCGGGG YRAVGMAIMDLVMMEAILEVEATMILAITTISLQFLDP >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_2|837_bp atggacaccgccaagtctaagtcagagtctcctaaagagctcgtacagctgaagaagctc ttcattggagggctgagctttgaaacaaccaatgagagcctgaggagccattctgagcaa tggggaacactcgcggactgtgtggtaatgagagatccaaacaccaagcactccaggggc tttggatttgccacatatgccactgaggaggaggtggatgcagccacgaatgcaaggcca cacaaggtggatggaagagttgtggaaccaaagagagctgtctcaagagaagattctcaa agaccagatgcccacttaactatggaaaagatatttgttggtggcattaacgaagacact gaagaatatcacctaagtgattattttgaacagtatggaaaaactgaagtgattgaaatc atgactgaccaaggcagtggcaagaaaaggggctttgcctttgtaacctttgatgatcat gactccgtggataagactgtcattcagaaataccatactgtgaatggccacaactgtgaa attaggaaaggcctgtcaaagcaagagatggtgagtgcttcatccagccaaagaggtcaa agtacttctggaaactttggtggtggtcgtggaggtggtttcggtaggaatgacaacttt ggttgtgaaggaaatttcagtggtcatggtggctttggtggcagctgtggtggtggtgga tatcgtgcagtggggatggctataatggatttggtaatgatggaagcaattttggaggtg gaagctacaatgattttggcaattacaacaattagtcttcaatttttggacccatga >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_3|276_aa MCAQYCISFADVEKAHINIRDSIHLTPVLTSSILNQLTGRNLFFKCELFQKTGSFKIRGA LNAVRSLVPDALERKPKAVVTHSSGNHGQALTYAAKLEGIPAYIVVPQTAPDCKKLAIQA YGASIVYCEPSDESRENVAKRVTEETEGIMVHPNQEPAVIAGQGTIALEVLNQVPLVDAL VVPVGGGGMLAGIAITVKALKPSVKVYAAEPSNADDCYQSKLKGKLMPNLYPPETIADGV KSSIGLNTWPIIRDLVDDIFTVTEDEIKVRLQQEKK >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_3|831_bp atgtgtgctcagtattgcatctcctttgctgatgttgaaaaagctcatatcaacattcga gattctatccacctcacaccagtgctaacaagctccattttgaatcaactaacagggcgc aatcttttcttcaaatgtgaactcttccagaaaacaggatcttttaagattcgtggtgct ctcaatgccgtcagaagcttggttcctgatgctttagaaaggaagccgaaagctgttgtt actcacagcagtggaaaccatggccaggctctcacctatgctgccaaattggaaggaatt cctgcttatattgtggtgccccagacagctccagactgtaaaaaacttgcaatacaagcc tacggagcgtcaattgtatactgtgaacctagtgatgagtccagagaaaatgttgcaaaa agagttacagaagaaacagaaggcatcatggtacatcccaaccaggagcctgcagtgata gctggacaagggacaattgccctggaagtgctgaaccaggttcctttggtggatgcactg gtggtacctgtaggtggaggaggaatgcttgctggaatagcaattacagttaaggctctg aaacctagtgtgaaggtatatgctgctgaaccctcaaatgcagatgactgctaccagtcc aagctgaaggggaaactgatgcccaatctttatcctccagaaaccatagcagatggtgtc aaatccagcattggcttgaacacctggcctattatcagggaccttgtggatgatatcttc actgtcacagaggatgaaattaaggtgaggctccagcaagaaaagaaatag >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_4|740_aa MAAHRPGPLKQQNKAHKGGRHRGRGSAQRDGKGRLALKTLSKKVRKELSRVDQRHRASQL RKQKKEAVLAEKRQLGGKDGPPHQVLVVPLHSRISLPEAMQLLQDRDTGTVHLNELGNTQ NFMLLCPRLKHRWFFTSARPGDLHVVLDMAKVADTILFLLDPLEGWDSTGDYCLSCLFAQ GLPTYTLAVQGISGLPLKKQIDTRKKLSKAVEKRFPHDKLLLLDTQQEAGMLLRQLANQK QQHLAFRDRRAYLFAHAVDFVPSEENNLVGTLKISGYVRGQTLNVNRLLHIVGYGDFQMK QIDAPGDPFPLNPRGIKPQKDPDMAMEICATDAVDDMEEGLKVLMKADPGRQESLQAEVI PDPMEGEQTWPTEEELSEAKDFLKESSKVVKKVPKGTSSYQAEWILDGGSQSGGEGDEYE YDDMEHEDFMEEESQDESSEEEEEYETMTIGESVHDDLYDKKVDEEAEAKMLEKYKQERL EEMFPDEVDTPRDVAARIRFQKYRGLKSFRTSPWDPKENLPQDYARIFQFQNFTNTRKSI FKEVEEKEVEGAEVGWYVTLHVSEVPVSVVECFRQGTPLIAFSLLPHEQKMSVLNMVVRR DPGNTEPVKAKEELIFHCGFRRFRASPLFSQHTAGMHSLIATGHLMSVDPDRMVIKRVVL SGHPFKIFTKMAVVRYMFFNRGTHGHMKCSFDGKLKSQDTVLMNLYKRVFPKWTYDPYVP EPVPWLKSEISSTVPQGGME >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_4|2223_bp atggcggcccaccgccccggcccgctcaagcagcagaataaagctcataaaggcggacgg catcggggtcggggatctgcacagcgggacggcaagggccgtctggcactgaaaacccta agcaagaaggtgagaaaagaactcagcagagtcgaccagaggcatcgcgccagccagctc cgaaagcagaagaaggaggcggttctggcagagaagagacagctgggtggcaaggatggc cctcctcatcaggtactggtggtgcccctgcacagcagaatttccctgccagaggccatg cagctgcttcaagatagggacactggaacagtacacttgaatgaattgggaaacacccag aactttatgctgctgtgcccccgcttgaaacatcggtggtttttcacctcagcaaggcca ggggatctgcacgttgtgttagacatggctaaagtagctgataccatcctgttcctcctt gatccactagaaggctgggacagcaccggtgattactgtctttcctgcctctttgctcag ggccttccgacctatacactagctgtccaggggatttctggcctcccactgaagaaacaa atagataccaggaagaagctaagtaaagcagtggagaagcgctttccgcatgacaaactc ctcttgttagacactcaacaggaggcagggatgctgcttaggcagttggctaaccagaag caacagcatcttgcttttcgagatcggcgggcctacctatttgcccatgctgttgatttt gttcctagtgaagagaataacttggtgggcaccttgaaaatttcaggctatgttcgaggg cagactctgaatgtcaataggttgctgcatatcgttggatatggtgatttccagatgaaa cagatagatgcccccggagaccctttccctttaaatcctagaggaattaaaccccaaaag gacccagacatggcaatggagatttgtgctacggatgctgtagatgatatggaagaaggt cttaaagtcctaatgaaggcagaccctggtagacaggaatccttgcaagcagaggttatc ccagatccaatggagggagagcaaacctggcccactgaggaggagctgagcgaggcaaag gatttcttgaaggaaagttctaaggtggtaaagaaggtccccaaaggaacatccagttac caagctgaatggattttggatggtggcagccaaagtggtggggaaggagatgaatatgaa tatgatgatatggaacatgaggattttatggaggaggaatctcaggatgagagtagtgaa gaagaggaagaatatgaaactatgactattggggagtctgtgcatgatgatctgtatgat aagaaagtagatgaagaagctgaggcaaaaatgttggagaaatataaacaagaaagactg gaagagatgtttccagatgaagtggacacgccccgtgatgtggctgctcgaattcgattt cagaaatacagaggccttaagagcttccggacatctccatgggatcctaaggaaaacctt cctcaagattatgctcgaatatttcagtttcagaactttactaacactaggaaaagcatc tttaaagaggttgaagaaaaagaggttgaaggagctgaggttggctggtatgtcacactt catgtctctgaagtccccgtctcagtggtcgagtgcttcaggcaaggaacacccttgatt gcattttctttactacctcatgaacagaagatgtcagtattgaatatggtggtgaggcgt gaccctggcaacactgaacctgtgaaagccaaggaagagctcatatttcactgtggattc aggcgcttccgagcctcacctttattctctcagcacactgcaggaatgcacagcctcatt gctacaggccatcttatgtcagtagatccagacagaatggtcatcaagagagttgttctg agtggtcatcctttcaaaatttttactaagatggcagtagtacgttacatgttcttcaac agaggtacccatggccacatgaaatgcagctttgatgggaagctaaaatctcaagacaca gtactgatgaacctgtataaacgagtcttccccaaatggacttatgatccatatgtacca gaaccagtaccctggctgaaaagtgagatttcttcaacagtgcctcaagggggcatggag taa >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_5|1141_aa MLSMCPQVKQIMEEAVTRKFVHEDSSHIIALCGAVEACLLHQLRRRAAGFLRSDKMAALF TKVGKTCPVAGEICHKVQELQQQAEGRKPSGVSQEALRRQGSASGKAPALSPQALKHVWV RTALIEKVLDKVVQYLAENCSKYYEKEALLADPVFGPILASLLVGPCALEYTKLKTADHY WTDPSADELVQRHRIRGPPTRQDSPAKRPALGIRKRHSSGSASEDRLAACARECVESLHQ NSRTRLLYGKNHVLVQPKEDMEAVPGYLSLHQSAESLTLKWTPNQLMNGTLGDSELEKSV YWDYALVVPFSQVVCIHCHQQKSGGTLVLVSQDGIQRPPLHFPQGGHLLSFLSCLENGLL PRGQLEPPLWTQQGKGKVFPKLRKRSSIRSVDMEEMGTGRATDYVFRIIYPGHRHEHITI NYHHLAASRAASVDDDEEEEDKLHAMLSMICSRNLTAPNPMKDAGDMIEMQGFGPSLPAW HLEPLCSQGSSCLSCSSSSSPHATPSHCSCIPDRLPLRLLCESMKRQIVSRAFYGWLAHC RHLSTVRTHLSALVHHSVIPPDRPPGASAGLTKDVWSKYQKDKKNYKELELLRQVYYGGI EHEIRKDVWPFLLGHYKFGMSKKEMEQVDAVVAARYQQVLAEWKACEVVVRQREREAHPA TRTKFSSGSSIDSHVQRLIHRDSTISNDVFISVDDLEPPEPQDPEDSRPKPEQEAGPGTP GTAVVEQQHSVEFDSPDSGLPSSRNYSVASGIQSSLDEGQSVGFEEEDGGGEEGSSGPGP AAHTLREPQDPSQEKPQAGELEAGEELAAVCAAAYTIELLDTVALNLHRIDKDVQRCDRN YWYFTPPNLERLRDVMCSYVWEHLDVGYVQGMCDLLAPLLVTLDNDQLAYSCFSHLMKRM SQNFPNGGAMDTHFANMRSLIQILDSELFELMHQNGDYTHFYFCYRWFLLDFKRELLYED VFAVWEVIWAARHISSEHFVLFIALALVEAYREIIRDNNMDFTDIIKFFNERAEHHDAQE ILRIARDLVHKVQMLIENKLGLALEHGGPARGWGLVQQPPKSGRLSAFEKHDKGPRSPSA PSTACPAQPCPRALHSLSGGQTTPLSILVVRCNEKLELDYATAADGNGAAPVGVALTPST V >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_5|3426_bp atgctgtccatgtgtccgcaggtgaagcaaatcatggaggaggctgtcaccaggaagttt gtgcatgaagacagcagccacatcattgctttatgtggtgcagtggaggcttgcctcttg catcagctgagacgccgtgccgctggcttcctgcgcagtgacaagatggcagccctgttc accaaggtggggaagacgtgcccagtggcgggggagatttgccacaaggtacaggagctg cagcaacaagcagagggcaggaaaccctcaggggtcagccaggaggccctgcggagacag ggctcagccagcgggaaggccccggccctcagccctcaggccttgaaacacgtatgggta cgcacggcgctcatcgagaaagttctggacaaggtcgtgcaatacctggcggaaaactgc agcaagtactacgagaaggaggcactgctggcagaccctgtgttcggcccgatcctggcc tctcttctagtgggaccctgtgccttggaatacactaagctcaagacagccgatcactac tggactgacccctctgctgatgagctggtccagcggcaccgcatccggggtccacctact cgccaggactcccctgcaaagcgcccagccctggggatccggaaacggcactcaagcggc agcgcgtcggaggacaggctggctgcctgtgcccgcgagtgtgtggagtccctgcaccag aactcacggacgcggctgctctatggcaagaaccacgtgctggtgcagccgaaggaggat atggaggcggtccctggctacctctccctgcaccagtctgcagagagcctgactctgaag tggacccccaaccagctcatgaatgggactctgggggactccgagctggaaaagagcgtt tactgggactatgccctcgtggtgcccttcagccaggtcgtgtgcatccactgccaccag caaaagagcggtggcacgcttgtgctggtgagccaggatggcatccagaggccgccgctg catttcccacagggaggacacctgctgtcctttctgtcctgtctggagaatgggctgctg cctcggggacagctagagcccccgctgtggacccagcaagggaaggggaaagtgttcccc aagctacggaaacgaagcagcattcgctccgtggatatggaggagatgggcacggggcgg gccaccgactatgtgttccggatcatctaccccggccacaggcacgagcacatcactatt aactaccaccacctagcggccagccgcgcggcctcggtggacgatgatgaggaagaggag gataaactgcacgcgatgctctcaatgatctgctcgcggaacctcacagctcccaatccg atgaaagacgctggtgacatgatcgagatgcagggctttgggcccagcctgccagcctgg cacctggagcccctgtgcagtcagggctcctcctgcctctcctgctcctccagcagctcc ccacatgcaacccccagccactgtagctgcatccccgaccggttgccgctcaggctactg tgtgagagtatgaagaggcagatcgtgtcccgggccttctacggctggctggcacactgc cgccacctgtccacggtgcggacccacctgtcggcgctggtgcaccatagcgttatccca cctgaccggcccccgggggcctccgcgggcctcaccaaggacgtgtggagcaagtatcag aaggacaaaaagaactacaaagagctggagctgctgcggcaagtttactacggaggcata gagcacgagatccgcaaggacgtctggccctttctgcttggccactacaagttcggcatg agcaagaaggagatggagcaggtggacgcagtggtggcagcaaggtaccagcaggtgttg gcagagtggaaggcctgcgaggtggtggtgaggcagcgggagcgggaggcccacccagcc acacgcaccaagttctcctcaggcagcagcatcgacagccacgtgcagcgcctcatccac cgagactccaccatcagcaacgatgtgtttatctcagtggatgatctggaacccccggag ccccaggaccctgaagattccagaccaaaacctgagcaggaagcaggacccgggactccg ggcaccgccgtggtggagcagcagcattccgtggagttcgactctccagactcaggactg ccctcctctcgcaattactccgtggcctcgggcatccagtcaagcctagatgaggggcag agcgtgggcttcgaagaggaggacggcggtggggaggaaggctccagtgggcccggccct gcagctcacactttgagggagccccaggatcccagccaggagaagcctcaggccggagaa ctggaggccggagaggagcttgcggctgtgtgtgcggctgcctacactatagaattactg gacactgtggccttaaacctgcaccgcatagacaaggatgtgcagaggtgtgaccgcaac tactggtacttcacgccccccaacctcgagaggctcagagacgtcatgtgcagctacgtg tgggagcacctggacgtgggctatgtgcagggcatgtgcgatctgctggcgcctctcctg gtcaccctcgacaatgatcagctggcctacagctgcttcagccacctcatgaagaggatg agccagaacttccccaacgggggtgccatggacacccactttgccaacatgcgctccctc atccagatcctggactcagagctgtttgagctgatgcatcagaatggagactacacccac ttctacttctgttatcgctggttcctgctggattttaagagagaactgctgtatgaggat gtgtttgctgtgtgggaggtgatctgggcagccaggcacatctcatcggagcactttgtc ctgttcatcgccctcgccctggtggaggcctaccgagagatcatccgtgacaacaacatg gacttcactgacatcatcaagtttttcaatgaacgtgctgagcatcacgatgcccaggag atcctgcggattgcccgggacctcgtccacaaggtgcagatgctcatagagaacaagcta ggccttgccctggaacatggaggccctgcccggggctggggcctagtccagcagccacca aagtctggcagactttctgcatttgagaaacatgacaaagggccccgcagcccttctgca cccagcacagcctgcccagcccagccctgcccccgggcactgcacagcctgtctgggggc cagaccaccccattgtccatccttgttgtccgctgcaatgagaagctggagcttgactat gccactgcagctgatggcaacggagctgcccctgtgggtgtggctctcaccccgagcact gtctga >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_6|689_aa MARGSLGVSPRAPLAAPRFSQCGMWRVCAFQTRSRLECCVCLRSEPVGYTGWHCLPSRKC LVNIVKGLPRKLGSVSTPPPLGRLSCPQPGLSAPVTVSPRDCPTIPWWRGTGLLDLFTAV PPALSPVPGWAEEQERLRLEQEREQEQKKANSLARLAHTLPVEEPRMEAPPLPLSPPAPP PAPPPPLATPAPLTVIPIPVVTNSPQPLPPPPPLPAAAQPLPLAPRQPALVGAPGLSIKE PAPLPSRPQVPTPAPLLPDSKATIPPNGSPKPLQPLPTPVLTIAPHPGVQPQLAPQQPPP PTLGTLKLAPAEEVKSSEQKKRPGGIGTREVHNKLEKNRRAHLKECFETLKRNIPNVDDK KTSNLSVLRTALRYIQSLKRKEKEYEHEMERLAREKIATQQRLAELKHELSQWMDVLEID RVLRQTGQPEDDQASTSTASEGEDNIDEDMEEDRAGLGPPKLSHRPQPELLKSTLPPPST TPAPLPPHPHPHPHSVALPPAHLPVQQQQPQQKTPLPAPPPPPAAPAQTLVPAPAHLVAT AGGGSTVIAHTATTHASVIQTVNHVLQGPGGKHIAHIAPSAPSPAVQLAPATPPIGHITV HPATLNHVAHLGSQLPLYPQPVAVSHIAHTLSHQQVNGTAGLGPPATVMAKPAVGAQVVH HPQLVGQTVLNPVTMVTMPSFPVSTLKLA >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_6|2070_bp atggcccgcgggtcgctgggggtcagcccccgggcccccctcgctgcgccccgctttagc cagtgtggaatgtggcgtgtctgcgcgttccaaacccgctctcgcctggaatgttgcgtg tgcctgcgttccgagcccgttggttacaccgggtggcactgcctgccctctagaaagtgc ttggtaaacattgttaaaggccttccacggaagctgggatctgtcagcacacccccccct ctgggccgcctgtcctgcccgcagcccggactgtcagcccccgtgactgtcagcccccgt gactgtcctactattccctggtggcggggaacaggtctgcttgatttgttcactgctgtg cccccggcactttcccccgtgcctggctgggctgaggagcaggagcggcttcgcttggag caggagcgagagcaggaacagaagaaggccaatagcctggccaggctggcacataccctt cctgtggaggaaccccgcatggaggcgccacccctgcctctgtctccaccggctcccccg ccggcacccccaccaccacttgccacccctgccccactgactgtcatccctatccctgtg gtgaccaactcccctcagcctctacccccacccccacccttgcccgcggcagcccagcct ctgcccctggcgcctcgtcagccggccctggttggcgcccccggactcagcattaaggag cctgcccccctgcccagcaggccgcaggtgcccacccctgctcccctactgccggactcg aaggccaccattccacccaatggcagccccaagcctttgcagcccctccccacgcctgtc ctgaccatagcgccacaccctggagtccagcctcagctggccccccagcagccgccccca cccacgctggggaccctgaagttggcaccagctgaagaagtcaaatccagtgaacagaag aagaggcccggggggatcggaaccagagaagtccacaacaaattggagaagaacaggagg gcccatctgaaagagtgctttgagaccctgaagcggaacatccccaacgtggatgacaag aagacgtccaatctgagcgtgctgcggacggcgctgcggtacatccagtccctgaagagg aaggagaaggaatatgagcatgaaatggagcggctggcacgtgagaagattgccacgcag cagcggctggcagagctcaagcacgagctgagccagtggatggacgtactggagattgac cgcgtgctgcggcagacgggccagcccgaggatgaccaggcctccacctccaccgcctct gagggtgaggacaacatagacgaggatatggaggaggaccgggcgggcctgggcccacct aagctgagccatcgtccccagccggagctgctgaagtccaccctgccaccccccagcacc acccctgcgcctctgcctccacacccacaccctcacccccactccgtggccctacctcct gcccacctccccgtgcagcagcagcagccacagcagaagacccctctgccagcccctcct cccccaccggctgcccctgcccagacactggtgccagctccagcccatctggtggcgacg gctgggggtggctccacggtcatcgcccacacagccaccactcacgcttcagtcatccag actgtgaaccacgttctgcaggggccaggcggcaagcacatcgcccacatcgccccctcg gcccccagccctgcggtgcaactggcgcctgccacaccccccattgggcacatcactgtg caccctgccaccctcaaccatgtggcccacctgggctcccagctgcccttgtacccgcag cccgtggcagtgagccacatcgcccacaccctctcgcaccagcaagtcaacggcacggcc ggcctggggcccccggctactgtcatggcaaagccggccgtgggggctcaggtggtgcac cacccccagctggtgggccagaccgtgctcaaccctgtgaccatggtcaccatgccctcc ttcccagtcagcacactcaagctggcttga >gi568815581r:2286904_2500712|GENSCAN_predicted_peptide_7|647_aa XLNFKDPEAVRALTCTLLREDFGLSIDIPLERLIPTVPLRLNYIHWVEDLIGHQDSDKST LRRGIDIGTGASCIYPLLGATLNGWYFLATEVDDMCFNYAKKNVEQNNLSDLIKVVKVPQ KTLLMDALKEESEIIYDFCMCNPPFFANQLEAKGVNSRNPRRPPPSSVNTGGITEIMAEG GELEFVKRIIHDSLQLKKRLRIDMAALLIGVCSEVSGRKDWIQQLAPRWYSCMLGKKCSL APLKEELRIQGVPKVTYTEFCQGRTMRWALAWSFYDDVTVPVHEGRLAVGGVTSDSQNKG SASPPSKRRKLEKPRKPITFVVLASVMKELSLKASPLRSETAEGIVVVTTWIEKILTDLK VQHKRVPCGKEEVSLFLTAIENSWIHLRRKKRERVRQLREVPRAPEDVIQALEEKKPTPK ESGNSQELARGPQERTPCGPALREGEAAAVEGPCPSQESLSQEENPEPTEDERSEEKGGV EVLESCQGSSNGAQDQEASEQFGSPVAERGKRLPGVAGQYLFKCLINVKKEVDDALVEMH WVEGQNRDLMNQLCTYIREQEAGKSTDFDVQTWVPVQFHSLHRGPSHLLREESEGVYPWA CLLFLPLLADLIHASFFNHSSYIHCAQIQTHPELPASSFLTRRPMAV >gi568815581r:2286904_2500712|GENSCAN_predicted_CDS_7|1944_bp nnccttaattttaaagaccccgaagcagtcagagctctgacgtgtactctcctaagggaa gattttggactttctattgatattccattggagagactaattcccacagttcccttgaga ctcaactatattcactgggtagaagatctgatcggtcaccaggattctgacaaaagtact ctccgaagaggaattgacataggcacgggggcatcttgcatctaccccttacttggagca accttgaatggctggtatttcctcgcaacagaagtggatgatatgtgtttcaactatgca aagaaaaatgtggaacagaataacttatctgatctcataaaagtggtgaaagtgccacag aagacactcctgatggatgctcttaaagaagaatctgagataatctatgacttttgcatg tgcaaccctcccttttttgccaatcaattggaagccaagggagtaaactcacgaaatcct cgaagacctccgcctagttctgttaatacaggaggcatcacagagatcatggcagaagga ggtgaattagagtttgttaaaaggatcatccatgacagtctacaacttaaaaaaagatta aggattgatatggcagccttgttgattggtgtctgctctgaggtctcagggagaaaagat tggatccagcagcttgcgccacgatggtatagctgcatgctgggaaagaaatgcagcctg gcgcctctgaaggaggagcttcgcatacaaggggttcccaaagtaacgtacactgaattc tgtcaaggtcggacaatgagatgggccttagcttggagtttttatgatgatgtcacagta ccagtacacgaaggacgtttagcagttggaggagttacatctgatagccagaacaaaggc agtgcgtcaccaccaagtaagcgaagaaaattagagaaaccgagaaaacccataacattc gtggtgctggcgtccgtgatgaaggaattatccctcaaagcatcacctctgcgctcggag acggcggaaggcatagtcgttgtcacgacatggattgaaaaaattctcactgatttgaag gtccagcataaacgagttccctgtggaaaagaggaagtcagccttttcctaacggccata gaaaactcctggattcatttaaggagaaagaaaagagagcgtgtgagacagctgagagaa gttccccgagctcctgaggacgtcattcaggccttggaagagaaaaagcccacccccaaa gagtctggcaatagccaagaactggccaggggcccccaggagaggaccccctgtgggcct gctctgcgggaaggcgaggctgccgctgtggagggcccgtgcccgagccaggagtccctg tcccaggaggaaaacccggaacccacggaggatgaaaggagtgaggaaaagggaggggtg gaggttttggaaagttgtcaaggctctagcaacggagcccaggaccaagaggcttctgag cagttcggcagcccagtggctgaaagggggaaacgtctcccaggagtggccggacagtac ctgtttaagtgtttgataaacgttaagaaggaggtggacgatgccttagtggagatgcac tgggttgagggccagaacagggatctgatgaaccagctttgcacctacatacgggagcaa gaggcaggaaagagcacagacttcgacgttcagacctgggttcccgtccagttccattcc ttgcacaggggtccttcccacctcttaagggaggagtccgaaggagtctacccatgggcc tgcttgctctttctcccactgctggctgacctcatccacgccagtttcttcaaccactcc tcatacattcattgtgcccagatccagacccatcctgaacttccagcatcctcattcctc acccgccggcccatggccgtgtga