GENSCAN 1.0 Date run: 16-Jul-119 Time: 15:57:12 Sequence gi568815588r:24484415_24821899 : 337485 bp : 41.50% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1531 1634 104 2 2 66 61 111 0.536 5.17 1.02 Intr + 3245 3275 31 1 1 132 93 5 0.395 2.19 1.03 Intr + 6367 6516 150 2 0 55 34 111 0.073 1.61 1.04 Intr + 6577 6721 145 2 1 39 93 33 0.267 -2.68 1.05 Intr + 10086 10190 105 0 0 96 66 116 0.519 8.61 1.06 Intr + 10733 10782 50 2 2 94 111 35 0.916 3.81 1.07 Intr + 16965 17131 167 1 2 90 66 115 0.731 8.26 1.08 Intr + 28845 29020 176 2 2 90 89 220 0.503 20.12 1.09 Intr + 35709 35839 131 0 2 115 92 200 0.995 22.52 1.10 Intr + 37368 37515 148 1 1 68 45 188 0.849 10.87 1.11 Intr + 39909 40350 442 0 1 80 65 555 0.591 44.93 1.12 Intr + 43522 43705 184 0 1 78 111 211 0.997 20.84 1.13 Intr + 47416 47579 164 2 2 38 47 199 0.957 9.57 1.14 Intr + 47791 47904 114 0 0 83 83 63 0.498 5.02 1.15 Intr + 48656 48823 168 1 0 102 91 192 0.999 20.12 1.16 Intr + 50633 50860 228 1 0 70 92 200 0.385 15.74 1.17 Intr + 52360 52479 120 0 0 84 92 25 0.311 2.27 1.18 Intr + 57189 57296 108 2 0 44 72 68 0.513 0.36 1.19 Intr + 58279 58356 78 0 0 48 113 107 0.940 8.03 1.20 Intr + 58469 60067 1599 1 0 109 65 1695 0.952 157.12 1.21 Intr + 60567 60689 123 2 0 122 68 113 0.948 12.56 1.22 Intr + 61413 61689 277 2 1 108 103 67 0.443 6.37 1.23 Intr + 67191 67327 137 1 2 38 37 100 0.147 -0.73 1.24 Intr + 67926 68084 159 2 0 39 84 98 0.234 3.76 1.25 Intr + 72642 72761 120 2 0 75 96 76 0.425 6.87 1.26 Intr + 83653 83910 258 0 0 61 40 149 0.699 4.14 1.27 Intr + 86384 86453 70 1 1 62 73 68 0.480 0.54 1.28 Intr + 87245 87475 231 0 0 97 86 144 0.813 11.92 1.29 Term + 89557 89885 329 2 2 58 48 183 0.555 5.19 1.30 PlyA + 91309 91314 6 1.05 2.18 PlyA - 91496 91491 6 1.05 2.17 Term - 101692 99998 1695 1 0 104 37 1573 0.988 141.44 2.16 Intr - 107269 107075 195 1 0 82 42 136 0.718 7.09 2.15 Intr - 107598 107473 126 0 0 77 113 145 0.914 15.86 2.14 Intr - 110625 110536 90 0 0 64 95 44 0.789 1.97 2.13 Intr - 111629 111474 156 2 0 109 95 70 0.957 9.09 2.12 Intr - 112468 112326 143 2 2 -8 57 120 0.891 -1.55 2.11 Intr - 113169 113033 137 2 2 70 67 143 0.947 9.69 2.10 Intr - 113595 113531 65 0 2 75 78 58 0.988 0.10 2.09 Intr - 116516 116232 285 2 0 76 90 234 0.996 19.01 2.08 Intr - 117689 117564 126 2 0 90 67 129 0.976 10.96 2.07 Intr - 118708 118600 109 0 1 86 8 104 0.612 1.57 2.06 Intr - 123187 123085 103 2 1 54 87 91 0.997 3.91 2.05 Intr - 123489 123331 159 1 0 110 100 78 0.992 10.24 2.04 Intr - 136955 135059 1897 2 1 92 71 1355 0.972 120.57 2.03 Intr - 139274 139136 139 1 1 136 62 30 0.958 4.75 2.02 Intr - 166541 166465 77 2 2 111 61 66 0.277 3.59 2.01 Init - 166783 166709 75 1 0 94 53 89 0.531 7.14 2.00 Prom - 168108 168069 40 -8.55 3.00 Prom + 168524 168563 40 -8.85 3.01 Sngl + 170084 171313 1230 1 0 60 55 387 0.649 28.15 3.02 PlyA + 173631 173636 6 1.05 4.00 Prom + 188712 188751 40 -5.25 4.01 Init + 192141 192230 90 2 0 52 92 121 0.807 9.44 4.02 Term + 194825 194893 69 1 0 75 47 75 0.658 -0.94 4.03 PlyA + 196603 196608 6 1.05 5.00 Prom + 197399 197438 40 -4.85 5.01 Init + 211948 212068 121 0 1 82 71 154 0.529 13.50 5.02 Term + 212738 212832 95 0 2 81 54 132 0.945 6.11 5.03 PlyA + 213452 213457 6 1.05 6.00 Prom + 228027 228066 40 -5.65 6.01 Init + 237188 237330 143 1 2 87 47 113 0.296 6.75 6.02 Intr + 239957 240353 397 2 1 85 59 299 0.181 20.36 6.03 Term + 240816 240947 132 2 0 24 49 143 0.687 1.11 6.04 PlyA + 241038 241043 6 1.05 7.05 PlyA - 241683 241678 6 1.05 7.04 Term - 281422 281091 332 2 2 9 46 282 0.565 9.93 7.03 Intr - 302579 302462 118 2 1 83 18 121 0.011 3.72 7.02 Intr - 306414 306216 199 2 1 0 37 190 0.013 3.53 7.01 Init - 307666 307539 128 1 2 101 41 41 0.047 0.38 7.00 Prom - 307847 307808 40 -4.05 8.00 Prom + 311183 311222 40 -4.15 8.01 Init + 314821 314874 54 0 0 72 100 39 0.304 2.84 8.02 Intr + 322004 322198 195 1 0 45 53 95 0.098 0.39 8.03 Intr + 323262 323358 97 2 1 60 100 -1 0.061 -3.04 8.04 Intr + 332140 332271 132 2 0 90 49 117 0.174 7.70 8.05 Term + 332951 333015 65 0 2 90 48 68 0.566 0.07 8.06 PlyA + 334981 334986 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 300989 301114 126 1 0 87 65 115 0.967 9.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_1|2038_aa XNNSFIHLVTQAKSLGVILIAVSPTLRLDTQTQRTVSLNVSVFLHWGMSDEKPPDSVSLM QRVSFLVALQVTCPERGHPGSLHSLPHRYKKASQAYRSATLSMSALKKNRMLFGSCLSLH PKTITWWELRVEAEYSLSQGTCRRERMQAMEKQIASLTGLVQSALFKGPITSYSKDASSE KMMKTTANRNHTDSAGTPHVSGGKMLSALESTVPPSQPPPVGTSAIHMSLLEMRRSVAEL RLQLQQMRQLQLQNQELLRAMMKKAELEISGKVMETMKRLEDPVQRQRVLVEQERQKYLH EEEKIVKKLCELEDFVEDLKKDSTAASRLVTLKDVEDGAFLLRQVGEAVATLKGEFPTLQ NKMRAILRIEVEAVRFLKEEPHKLDSLLKRVRSMTDVLTMLRRHVTDGLLKGTDAAQAAQ YMAMEKATAAEVLKSQEEAAHTSGQPFHSTGAPGDAKSEVVPLSGMMVRHAQSSPVVIQP SQHSVALLNPAQNLPHVASSPAVPQEATSTLQMSQAPQSPQIPMNGSAMQSLFIEEIHSV SAKNRAVSIEKAEKKWEEKRQNLDHYNGKEFEKLLEEAQANIMKSIPNLEMPPATGPLPR GDAPVDKVELSEDSPNSEQDLEKLGGKSPPPPPPPPRRSYLPGSGLTTTRSGDVVYTGRK ENITAKLQQYEMQITSYFPCGHHKIENEGRRGKRKKDYLPKVTEASSEDAGPSPQTRATK YPAEEPASAWTPSPPPVTTSSSKDEEEEEEEGDKIMAELQAGQGAAVAPSKDSCGTTFCA PHLQASEENTGSEIEGLIAQRRWQLAFVSSFLSKRINQTHPGQARTGMTKEQQGDQAFQK CSFMDVNSNSHAEPSRADSHVKDTRSGATVPPKEKKHEERAAVPWNRQGITSVSVMSGPE ATDTPWERPGLQNLEFFHEDVRKSDVEYENGPQMEFQKVTTGAVRPSDPPKWERGMENSI SDASRTSEYKTEIIMKENSISNMSLLRDSRNYSQETVPKASFGFSGISPLEDEINKGSKI SGLQYSIPDTENQTLNYGKTKEMEKQNTDKCHVSSHTRLTESSVHDFKTEDQEVITTDFG QVVLRPKEARHANVNPNEDGESSSSSPTEENAATDNIAFMITETTVQVLSSGEVHDIVSQ KGEDIQTVNIDARKEMTPRQEGTDNEDPVVCLDKKPVIIIFDEPMDIRSAYKRLSTIFEE CDEELERMMMEEKIEEEEEEENGDSVVQNNNTSQMSHKKVAPGNLRTGQQVETKSQPHSL ATETRNPGGQEMNRTELNKFSHVDSPNSECKGEDATDDQFESPKKKFKFKFPKKQLAALT QAIRTGTKTGKKTLQVVVYEEEEEDGTLKQHKEAKRFEIARSQPEDTPENTVRRQEQPSI ESTSPISRTDEIRKNTYRTLDSLEQTIKQLENTISEMSPKALVDTSCSSNRDSVASSSHI AQEASPRPLLVPDEGPTALEPPTSIPSASRKGSSGAPQTSRMPVPMSAKNRPGTLDKPGK QSKLQDPRQYRQANGSAKKSGGDFKPTSPSLPASKIPALSPSSGKSSSLPSSSGDSSNLP NPPATKPSIASNPLSPQTGPPAHSASLIPSVSNGSLKFQSLTHTEGNVRKNNPKSTSEKR VPGPPKVHVHLAPSQVPTLEANVRPDGTRKPQVVNYNSRALSWDCREGFTCQKDHTPDKP SQAQPRSYPYQAPPPPGLATPLQHLVETKADSGGEWWSVNYQVQESMSVLHTEDPKAGPA KSQTYGMYTSSCTIVLSLANVGFSATLAYQVILIWRGKPAAFSWPGSVKHTCLSPGQSPD APRCVDSLLRKQEERVESSSGKGVFLETADLNIRQERQKWTHSHGAASVKVQALKRTACP WVTLGGHLWIPSRQSLSRQSGTFPVFFSYGNVRCFWQGGGVEKNEQDLYIFGFLTGEQGF EKLAQLLSQGLGSAAHDRKPEVILVETREVDFSYPSSPEVGGPQCHLFGIHCARLSFRDS CVFQGDVEAGLCPVCGKRKAYRSPCPLRVHAVARVCVKPLHLVPWPHLAVKEAEKCGL >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_1|6117_bp ngtaacaacagcttcattcatctagtcactcaggccaaaagtcttggtgtcatcttgatt gctgtctccccaaccctgcgtttggacacccagacccagagaacagtatccttaaatgtg tctgtcttcctgcactgggggatgtcagatgagaagccacctgactcagtatctctgatg cagcgtgtctctttcctggtggccttgcaggtgacatgccctgaacgaggccatccaggg agccttcactcccttcctcacagatacaaaaaggcaagccaagcataccgttctgcaact ctgtcaatgtcagccttaaagaagaataggatgttgtttgggagttgtctctctctgcac cctaaaactattacctggtgggagctcagggtagaagctgagtactcactgagccaagga acatgccgaagagagaggatgcaagccatggagaaacagattgccagtttaactggcctt gttcagtctgcgctttttaaagggcccattacaagttatagcaaagatgcgtctagcgag aaaatgatgaaaaccacagccaacaggaaccacacagatagtgcaggaacgccccatgtg tctggtgggaagatgctcagtgctctggagtccacggtgcctcccagccagcctccacct gtgggcacctcagccatccacatgagcctgcttgagatgaggcggagcgtggcggaactc aggctccagctccagcagatgcggcagctccagctgcagaaccaggagttgctgagggca atgatgaagaaggccgagctggaaatcagtggcaaagtgatggaaacaatgaagagactg gaggatcccgtgcagcgacagcgcgtcctagtggagcaagagagacaaaaatatcttcat gaggaagagaagatcgtcaagaagttgtgcgagttggaagactttgttgaagacttgaag aaggactccacggcagccagccgattggttactctgaaagacgtggaagacggggctttc ctcctgcgtcaagtgggagaggctgtagctaccctgaaaggagaatttccaaccttacaa aacaagatgcgagccatcctgcgcatagaagtggaggccgtgcggtttctgaaggaggag ccacacaagctggacagtctcctgaagcgtgtgcgcagcatgacagacgtcctgaccatg ctgcggagacatgtcactgatgggctcctgaaaggcacggacgcagcccaagccgcacag tacatggctatggaaaaggccacagccgcagaagtcctgaagagtcaggaggaggcagcc cacacctccggccagcccttccacagcacaggtgcccctggcgatgcgaagtcggaagtg gtgcctttgtccggcatgatggttcgccacgcgcagagctcccctgtggtcatccagccc tcccagcactccgtggccctgctgaaccctgctcagaacttgcctcacgtggccagctcc ccagccgtcccccaggaagcaacctccactctgcagatgtcgcaggctccgcagtcccca cagatacccatgaatgggtctgccatgcagagcttgttcattgaagaaatccacagtgtg agtgccaagaacagggcagtgtctatcgagaaagcagaaaagaaatgggaggaaaaaagg caaaatctggatcactataatgggaaagagtttgagaagctcctagaagaagctcaggcc aatatcatgaagtcaataccaaatctggagatgccgccagccacaggcccactgccaagg ggagatgccccagtggacaaggtggaactttcagaagattctccaaattcggaacaggac ttggaaaagctggggggaaagtcgccccctcctcctccgccacctcctcgtcgaagctac ctgccaggatcgggactcaccaccacgaggtcaggcgatgtggtctacaccggcagaaag gagaacatcaccgctaagctgcaacaatatgaaatgcaaatcacaagctatttcccatgt gggcatcataagattgaaaatgagggtaggagggggaaacgcaagaaagattatcttcca aaggtaactgaggcaagcagtgaagatgctggaccaagcccacagaccagagctacaaaa tatccagcagaggagcctgcttcagcctggaccccatccccaccgcctgtcaccacctcc tcctcaaaggatgaggaggaagaagaagaagaaggagacaaaataatggcagaactccag gcaggacaaggagcagctgtggccccttctaaggacagttgtggaaccactttctgtgct ccacatctccaggcatcagaagaaaacacaggctctgaaatcgagggtctaatagctcaa aggagatggcaattggcatttgtaagcagctttctaagtaagcggataaaccaaacacat ccggggcaagctaggactggtatgactaaggaacagcaaggagaccaggcattccagaag tgttcctttatggatgtaaattcaaacagtcatgctgagccatcccgggctgacagtcac gttaaagacactaggtcgggcgccacagtgccacccaaggagaagaagcatgaagaaagg gcagcagtgccttggaaccgacaaggcatcactagtgttagtgttatgtcagggcctgag gctactgatacaccttgggaaaggccaggcctccagaatttggaatttttccatgaagat gtacggaaatctgatgttgaatatgaaaatggcccccaaatggaattccaaaaggttacc acaggggctgtaagacctagtgaccctcctaagtgggaaagaggaatggagaatagtatt tctgatgcatcaagaacatcagaatataaaactgagatcataatgaaggaaaattccata tccaatatgagtttactcagagacagtagaaactattcccaggaaactgtgcctaaggcc agtttcggtttctctggcattagtccattagaagatgaaataaacaaagggtctaaaatc tcaggcctgcaatactctatacctgacaccgagaaccagacgctgaattacggaaagaca aaggagatggaaaagcaaaatacggataagtgtcacgtttcctctcacactagactaaca gaatcaagcgtgcatgattttaaaacagaagatcaagaggttatcacgacagattttggc caagttgttctaagacccaaggaggcaaggcatgctaacgtgaaccctaatgaggatgga gaatcaagttcaagttctcccactgaagaaaatgcagccactgacaatattgccttcatg attaccgaaaccactgtccaggttctttccagtggggaggtgcatgatattgttagccaa aagggagaagacatacagacggttaatatcgatgccagaaaagagatgaccccccgacaa gaagggactgacaatgaggatccagtcgtgtgcctggacaagaaaccagtgatcatcatt ttcgatgagcccatggacatccggtctgcctataagagactttcaactatctttgaggaa tgtgatgaggaattagagagaatgatgatggaggaaaagatagaggaggaggaagaggag gaaaatggggattctgtagtccagaataataacacttcccagatgtctcataagaaggtg gccccaggcaatcttagaaccggacaacaggtggaaacaaagtcacagccacactccctg gccacagagaccagaaacccaggaggacaggaaatgaacagaacggagctgaacaagttc agccacgtggattctccaaattcggaatgcaagggtgaggacgcgaccgatgaccagttt gaaagccccaagaaaaagtttaaattcaaattccctaagaagcaactcgccgctctcact caagccattcgcaccggaactaaaacagggaagaagactttgcaagtggtagtctatgaa gaagaggaagaggatggcaccctgaaacagcacaaagaagccaagcgcttcgaaatcgct aggtctcaacctgaagacacccctgaaaacacagtgaggaggcaagagcagcccagcatc gagagtacatctccgatttcaagaactgatgaaattagaaaaaacacctacagaacattg gatagcctggagcagaccattaaacagctcgaaaatacaatcagtgaaatgagtcccaaa gccctagttgatacctcatgttcttccaacagagattctgttgcaagttcatcccacata gcccaagaggcctctccccgacccttgctagttccggatgaaggtcccactgccctagag ccccctacgtcgataccttcagcttcacgtaagggctccagcggggccccacagacgagc aggatgcctgtccccatgagtgccaagaacagacccggaaccctggacaaacccggcaag cagtccaaactgcaggatccccgccaatatcgtcaggctaatggaagtgctaagaaatct ggtggggactttaagcctacttccccctccttacctgcttctaagattccagccctttct cccagctctgggaaaagcagttctctgccctcttctagtggtgacagctctaacctccct aatccacctgctactaaaccatcgattgcttctaaccctctcagcccccaaacaggacca cctgctcactctgcctccctcatcccttctgtctctaatggctctttgaagtttcagagc ctcactcatacagaaggtaatgtccgaaagaataacccgaaaagcacctctgagaagaga gtcccaggccctcccaaggtgcatgtccatctagctccaagtcaggtgcccaccttagaa gcaaatgttcgccctgatggaacgaggaagccccaagtggtcaactacaactctagggcc ctgagctgggactgcagggaaggcttcacctgccagaaagaccacaccccagacaagcca agccaagcccagcctaggtcctacccgtaccaggccccacctcctcctggccttgccacg cccctccagcacttagtggaaactaaagcagacagtgggggtgagtggtggtcagtaaat tatcaagtgcaggaatcaatgtctgtattacacacagaggaccccaaggcaggacctgcc aaaagtcagacgtatggtatgtatacatccagttgtaccatcgtcctgagccttgccaat gtcgggttctcagccacactggcataccaggtcatcttaatttggagaggcaaacctgca gccttctcttggcctgggagtgtgaagcacacgtgcttatctccaggccaaagccctgat gcccccagatgtgttgattcattgttgaggaagcaggaggagagggttgagagtagttct gggaaaggggtcttcctggagacagctgatctcaatatccggcaggaaaggcagaaatgg acacacagtcacggtgctgccagtgtgaaggttcaggcattaaagcgcactgcttgccca tgggtgacactggggggacatctctggattccatcacgtcagtcgctgtccaggcagtcg ggcacatttcccgtattctttagctacggcaatgtcaggtgcttctggcaaggtggaggc gttgagaaaaatgaacaagatctttacatttttggttttttaactggagaacaaggattt gaaaaattagctcagttgctgtctcaagggctaggttcagctgcacatgacagaaaaccc gaagtaatactggttgaaacgagagaagttgatttctcatatccaagcagcccagaggta ggcggcccacagtgtcatctatttgggatccactgtgcaaggctctcgtttagagattcc tgcgtgtttcaaggggatgtggaagccgggctgtgtccagtctgtggtaagaggaaagca tatagaagcccatgccccctgcgtgtgcacgcagtggctagagtttgcgttaaaccactt catctagtcccgtggccacacctagctgtgaaggaggctgagaagtgtggtctctaa >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_2|1858_aa MVMDPKGGSWGCQSQSSPRQVLQKEPFEIDGSSVQLVSLSAITLETGNASRWYIIKLHKD VVGNAVPDSTVFSHPDTIFLLQKSIGMILISMEKIDLAYSQDAYLKGNEAYSGNARNIPE PPPICYPWLPSAPSAMAQPVEISPPDSSLSKQQTSTPVLTQPGRAYRMEIQVPPSPTDVA KSNTAVCVCNESVRTVIVPSEKVVDLLSNRNNHTGPSHRTEEVRYGVSEQTSLKTVSRTT SPPLSIPTTHLIHQPAGSRSLEPSGILLKSGNYSGHSDGISSSRSQAVEAPSVSVNHYSP NSHQHIDWKNYKTYKEYIDNRRLHIGCRTIQERLDSLRAASQSTTDYNQVVPNRTTLQGR RRSTSHDRVPQSVQIRQRSVSQERLEDSVLMKYCPRSASQGALTSPSVSFSNHRTRSWDY IEGQDETLENVNSGTPIPDSNGEKKQTYKWSGFTEQDDRRGICERPRQQEIHKSFRGSNF TVAPSVVNSDNRRMSGRGVGSVSQFKKIPPDLKTLQSNRNFQTTCGMSLPRGISQDRSPL VKVRSNSLKAPSTHVTKPSFSQKSFVSIKDQRPVNHLHQNSLLNQQTWVRTDSAPDQQVE TGKSPSLSGASAKPAPQSSENAGTSDLELPVSQRNQDLSLQEAETEQSDTLDNKEAVILR EKPPSGRQTPQPLRHQSYILAVNDQETGSDTTCWLPNDARREVHIKRMEERKASSTSPPG DSLASIPFIDEPTSPSIDHDIAHIPASAVISASTSQVPSIATVPPCLTTSAPLIRRQLSH DHESVGPPSLDAQPNSKTERSKSYDEGLDDYREDAKFDLMQRSPKVYMAILDLLPELQLV SAAVTWMFNQQLKIADSQKSSEDSGSRKDSSSEVFSDAAKEGWLHFRPLVTDKGKRVGGS IRPWKQMYVVLRGHSLYLYKDKREQTTPSEEEQPISVNACLIDISYSETKRKNVFRLTTS DCECLFQAEDRDDMLAWIKTIQESSNLNEEDTGVTNRDLISRRIKEYNNLMSKAEQLPKT PRQSLSIRQTLLGAKSEPKTQSPHSPKEESERKLLSKDDTSPPKDKGTWRKGIPSIMRKT FEKKPTATGTFGVRLDDCPPAHTNRYIPLIVDICCKLVEERGLEYTGIYRVPGNNAAISS MQEELNKGMADIDIQDDIHDLPEHHYETLKFLSAHLKTVAENSEKNKMEPRNLAIVFGPT LVRTSEDNMTHMVTHMPDQYKIVETLIQHHDWFFTEEGAEEPLVSIVCQHLYSVVSFGTQ DPGAPTMSTSCFAIVRFAPRKQVSSSVSDLMNIQGSWGSGKDQYSRELLVSSIFAAASRK RKKPKEKAQPSSSEDELDNVFFKKENVEQCHNDTKEESKKESETLGRKQKIIIAKENSTR KDPSTTKDEKISLGKESTPSEEPSPPHNSKHNKSPTLSCRFAILKESPRSLLAQKSSHLE ETGSDSGTLLSTSSQASLARFSMKKSTSPETKHSEFLANVSTITSDYSTTSSATYLTSLD SSRLSPEVQSVAESKGDEADDERSELISEGRPVETDSESEFPVFPTALTSERLFRGKLQE VTKSSRRNSEGSELSCTEGSLTSSLDSRRQLFSSHKLIECDTLSRKKSARFKSDSGSLGD AKNEKEAPSLTKVFDVMKKGKSTGSLLTPTRGESEKQEPTWKTKIADRLKLRPRAPADDM FGVGNHKVNAETAKRKSIRRRHTLGGHRDATEISVLNFWKVHEQSGERESELSAVNRLKP KCSAQDLSISDWLARERLRTSTSDLSRGEIGDPQTENPSTREIATTDTPLSLHCNTGSSS STLASTNRPLLSIPPQSPDQINGESFQNVSKNASSAANAQPHKLSETPGSKAEFHPCL >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_2|5577_bp atggtgatggatccaaagggcggcagctggggctgtcagtcacagtcttcccctcggcaa gttcttcagaaggagccttttgagattgatggaagctctgtgcagctggtcagtctctca gccattactttggaaacaggaaacgcctcaaggtggtatataataaaactacacaaggat gttgtgggaaatgctgtaccagacagtacagtattctctcatccagacacaattttccta ttgcagaaaagtattggtatgattctaatcagcatggaaaagatagacttggcatattct caagatgcctacctgaaaggcaacgaagcttatagcggcaatgcccgcaatatacctgaa cctccaccaatctgctatccctggctgccatctgccccatcagccatggcacagccagtt gaaatatctcctcctgactcatcattgagcaaacagcaaaccagtacaccagtactgaca caacctggtagggcctatagaatggaaatacaagtgcctccatcaccaacagatgttgca aaatcaaacacagcagtgtgtgtttgcaatgaaagtgtaaggactgtcattgtgccttct gagaaggttgtagatttgttatccaatagaaacaaccatacaggtccttcacatagaact gaagaagtgaggtatggcgtgagtgagcagacctctttaaaaacagtgtcaagaaccaca tcaccaccattatcaattcccaccactcatctaattcatcagcctgcaggctccagatca ctggaaccttctggaattttacttaagtctggaaattacagtggacattctgatggaatc tcaagcagcagatctcaagctgtggaggctccctctgtatctgttaatcactattcgcca aattcccatcagcacatagactggaaaaactataaaacttacaaagagtatattgataac agacgattgcacataggttgtcggacaatacaagaaagattagatagtttaagagcagca tctcaaagcacgacagattataaccaggtcgtccccaaccgcactactttgcagggacga cgtcgaagcacctctcatgatcgagtgccccagtctgtccagatacggcaacgcagtgtg tcccaagaaagactggaagattctgtgctaatgaagtattgtccaagaagtgcatctcaa ggagcactgacgtctccatctgttagttttagtaatcatagaactcgttcatgggattat attgagggacaggatgaaaccttagaaaatgtcaattctggaactccaatacctgattcc aatggagagaaaaaacagacttacaagtggagtgggtttactgaacaggatgatagacga ggtatttgtgaaagacctaggcagcaagaaattcataaatcttttcgaggttccaatttt actgtggctccaagcgttgttaattctgataacaggcgaatgagtggtagaggagtggga tctgtgtcgcagtttaaaaaaattccaccagatctaaaaacattgcagtcaaacagaaat tttcagactacttgtggaatgtcactgcctcggggtatttcacaagacaggtcacctctt gtgaaagtccgaagtaattctctgaaagctccttccacgcatgtcacaaaaccatcattt agccagaaatcatttgtttctatcaaagaccaaagaccagtaaatcacttgcatcagaac agtctgttgaatcagcagacatgggtaaggactgacagtgcccccgatcagcaagtggag actgggaaatccccctctttatctggagcctctgccaagcctgcccctcagtcgagtgaa aacgctggtacttcagatttagaactacctgtcagtcaaaggaatcaagatttaagttta caagaggctgaaactgagcaatcagatactttagataataaagaagctgtcatcctaagg gaaaaacctccatctggacgccagacaccgcagcctttaaggcatcagtcttacatcttg gcagtaaatgaccaggagaccgggtcagacactacctgctggctgcccaatgatgcacgt cgagaggtccacataaaaagaatggaggaaagaaaagcctcgagtaccagtccgcctggc gattctttggcttccatcccatttatagatgaaccaactagccctagcattgatcatgat attgcacatatccctgcctctgctgttatatcagcctctacctctcaggtcccctccata gcaacagttcctccttgcctcacaacttcagctccattaattcgccgtcagctctcacat gaccacgaatctgttggccctcctagcctggatgctcagcccaactcaaagacagaaaga tcaaaatcatatgatgagggtctggatgattacagagaagatgcaaaatttgatcttatg cagaggtctcccaaagtttacatggctatccttgacctccttcctgaactgcagctggtg tcagctgccgtcacttggatgtttaatcagcaactcaagatcgcagacagccaaaagtca tcagaagactctgggtccagaaaagattcttcctcagaggtcttcagtgatgctgccaag gaagggtggcttcatttccgaccccttgtcaccgataagggcaagcgagttggtggaagt attcggccatggaaacagatgtatgttgtccttcggggtcattcactttacctgtacaaa gataaaagagagcagacgactccgtctgaggaagagcagcccatcagtgttaatgcttgc ttgatagacatctcttacagtgagaccaagaggaaaaatgtgtttcgactcaccacgtcc gactgtgaatgcctgtttcaggctgaagacagagatgatatgctagcttggatcaagacg atccaggagagcagcaacctaaacgaagaggacactggagtcactaacagggatctaatt agtcgaagaataaaagaatacaacaatctgatgagcaaagcagaacagttgccaaaaaca cctcgccagagtctcagcatcaggcaaactttgcttggtgctaaatcagagccaaagact caaagcccacactctccgaaggaagagtcggaaaggaaacttctcagtaaagatgatacc agtcccccaaaagacaaaggcacatggagaaaaggcattccaagtatcatgagaaagaca tttgagaaaaagccaactgctacaggaactttcggcgtccgactagatgactgcccacca gctcatactaatcggtatattccattaatagttgacatatgttgcaaattagttgaagaa agaggtcttgaatatacaggtatttatagagttcctggaaataatgcagccatctcaagt atgcaagaagaactcaacaagggaatggctgatattgatatacaagatgatattcacgat ttgcctgaacatcattatgaaacacttaagttcctttcagctcatctgaagacagtggca gaaaattcagaaaaaaataagatggaaccaagaaacctagcaatagtgtttggtcccacc cttgttcgaacatcagaagacaacatgacccacatggtcacccacatgcctgaccagtac aagattgtagaaacgctcatccagcaccatgactggtttttcacagaagaaggtgctgaa gagcctcttgtaagtattgtctgtcagcatttgtactcagtagtttcatttgggacacaa gatcctggcgcgccaacaatgtcaacttcctgctttgctattgtccgctttgcgccccgg aagcaggtctctagctcagtttctgatctaatgaatatacaaggttcttggggatctgga aaggatcagtatagcagggaactgcttgtgtcctccatctttgcagctgctagtcgcaag aggaagaagccgaaagaaaaagcacagcctagcagctcagaagatgaactggacaatgta ttttttaagaaagaaaatgtggaacagtgtcacaatgatactaaagaggagtccaaaaaa gaaagtgagacactgggcagaaaacagaagatcatcattgccaaagaaaacagcactagg aaagaccccagcacgacaaaagatgaaaagatatcactaggaaaagagagcacgccttct gaagaaccctcaccaccacacaactcaaaacacaacaagtcaccaactctcagctgtcgc tttgccatcctgaaagagagccccaggtcacttctggcacagaagtcctcccaccttgaa gagacaggctctgactctggcactttgctcagcacgtcttcccaggcctccctggcaagg ttttccatgaagaaatcaaccagtccagaaacgaaacatagcgagtttttggccaacgtc agcaccatcacctcagattattccaccacatcgtctgctacatacttgactagcctggac tccagtcgactgagccctgaggtgcaatccgtggcagagagcaagggggacgaggcagat gacgagagaagcgaactcatcagtgaagggcggcctgtggaaaccgacagcgagagcgag tttcccgtgttccccacagccttgacttcagagaggcttttccgaggaaaactgcaagaa gtgactaagagcagccggagaaattctgaaggaagtgaattaagttgcaccgagggaagt ttaacatcaagtttagatagccggagacagctcttcagttcccataaactcatcgaatgt gatactctttccaggaaaaaatcagctagattcaagtcagatagtggaagtctaggagat gccaagaatgagaaagaagcaccttcgttaactaaagtgtttgatgttatgaaaaaagga aagtcaactgggagtttactgacacccaccagaggcgaatccgaaaaacaggaacccaca tggaaaacgaaaatagcagatcggttaaaactgagacccagagcccctgcggatgacatg tttggagtagggaatcacaaagtgaatgccgagactgctaaaaggaaaagcatccggcgc agacatacactaggagggcacagagatgctaccgaaatcagcgttttgaatttttggaaa gtgcatgagcagagcggggagagagaatctgaactttcagctgtaaaccggttaaaacca aaatgctcagcccaggacctttccatctcagactggctggccagggaacgcctacgcacc agtacctctgaccttagcagaggagaaatcggagatccccagacagagaacccaagcaca cgagaaatagccacgaccgacacacctttgtctcttcattgcaacacaggcagttcttcc agcaccttggcttcaacaaacaggccccttctttccataccaccacagtcacctgaccaa ataaacggagaaagcttccagaacgtgagcaaaaatgctagttctgcagcgaatgcccaa cctcataaactgtctgaaaccccaggcagtaaagcagagtttcatccctgtctttaa >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_3|409_aa MSELPFTIVSKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNAIPIKLTMTFFTELEKTTLKFIWNQKRACIAKTILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEASEITPHIYNHLIFDKPDKNKKWGKDSLFN KRCWENWLAIRRKLKLDPFLTPYTKINSTWIKDLNVRPKTIKILEENLGNTIQDIGMGKD FVSKTPKAMATKGKIDKWDLIKLKSFCTSKETTIRVNRQSTEWEEIFTIYPSDKGLTSRI YKELKQIYKKKSNNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSSRLEERTGPAGPEGKE QPPALASQSAEIAASARPPPRLGSEECLCLAAHRLGCEEPLCLAAQSGK >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_3|1230_bp atgagtgaactcccattcacaattgtttcaaagagaataaaatacctgggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaa gaggacacaaacaaatggaagaacattccatgctcttggataggaagaatcaatattgtg aaaatggccatactgcccaaagtaatttatagattcaatgccatccccatcaaactaaca atgactttcttcacagaactggaaaaaactactttaaaattcatatggaaccaaaaaaga gcctgcattgccaagacaatcctaagccaaaagaacaaggctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagaggcctcagaaataacaccacatatctacaac catctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaaaggtgctgggaaaactggctagccatacgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaacatggattaaagacttaaatgttagacctaaaacc ataaaaatcctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcgtgtctaaaacaccaaaagcaatggcaacaaaaggcaaaattgacaaatgggatcta attaaactaaagagcttctgcacatcaaaagaaactaccatcagagtgaacagacaatct acagaatgggaggaaatttttacaatctacccatctgacaaagggctaacatccagaatc tacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaagtgggca aaagatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa aaatgctcatcatcgcggctggaggagcggacgggccccgcggggcccgagggcaaggag cagccgcctgccttggcctcccaaagtgccgagattgcagcctctgcccggccgccaccc cgtctgggaagtgaggagtgtctctgcctggccgcccatcgtctgggatgtgaggagccc ctctgcctggctgcccagtctggaaagtga >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_4|52_aa MRVENYPYNVHYSGDGYAKSPDFTPTQYTQRPRKPKIKAPADSVFGEGLLPK >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_4|159_bp atgagggtggaaaattacccgtacaatgttcactattcaggtgatggttatgctaaaagc ccagacttcacccctacgcaatatacccagaggcccaggaagcccaagatcaaggcaccg gcagattcagtgtttggtgaaggcctacttcctaaatag >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_5|71_aa MREKNLLKKLQAPTLKTHTGRRREPVFANEWEKPQHSGAVGKGESDDPDGAKLIKPRETI QRKRTAGGIRC >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_5|216_bp atgagggaaaagaatctcctgaagaaattacaggcaccaaccctcaaaactcacacagga cgaagaagagagcccgttttcgccaatgagtgggaaaaacctcagcattcaggggcagta gggaaaggtgaaagtgatgacccagatggggcaaaactcattaagccaagggaaaccatc cagaggaaaaggacagctggaggtattcgctgctga >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_6|223_aa MRTFQTAGHVLAKETRESLETDVGQASPRYWLSWGAPANRLKAPGSPYGKAMKLRRDVGL PDKGVSPHASQQKSLPVVGRGKLAKRCSFSSFPFINSAERLSEFGAASGTVERRLRFSVR HTPGHSGSGAAAQPPPSPPNSRALSQTPAARSAPPCLTTSPAANYPIGRRDERTRTNQGL GSGDKKHSLQLEYLRLSFYIFLKVEGAKPNLQLNPPLQFEGRR >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_6|672_bp atgaggacattccaaactgcgggacatgttcttgcaaaagaaacacgggagtccctggaa acagacgtgggccaggcttctcctcgctactggttaagctggggcgctcccgccaacagg ctcaaagccccgggaagcccctacgggaaggcaatgaagctgcgacgagacgtgggactt ccagataagggagtgtcccctcacgcctcacagcaaaaatctcttcctgtggtgggaagg gggaagttagccaagcgttgctcattcagttctttcccttttataaattctgcagagcgg ctgagcgaatttggcgcggctagcggaactgtggagaggcggctgcgcttctctgtccgc cacactccaggtcacagcggctccggagccgcggcacaaccgccaccctccccacccaat tcccgggccttgtcacagaccccggcagctcgctccgccccgccgtgcctcacaacatct cccgctgcaaattacccaatcgggagacgcgacgagagaacgcgaaccaatcaggggctg gggagcggggacaaaaaacactccctgcagctggagtacctgcgccttagtttttacatc ttccttaaagtggaaggtgccaaaccaaatctacaactaaatcctccgcttcaattcgag ggaagacgttga >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_7|258_aa MAGAEDLYPKQINAKTSHHIPSLAMSRSTFASHLQVGAIILYKKGLQLATERIRCPNKES AGMWSQPRTALRSRLCLATRPLRGFKIIEGLEAARQIKEINPTEQLCNMEQQQQQTFQKF LSLLLFADPLMETRDWRNFRKRLTLLDPEWGIAEKKPKNMEATLELSVRQRLEQLEGSEE EMKMWESLEPPRDLLNGFDKNADSDISNMVQAEVVSDGDKELVGNWSKCDSRYVLAKRLQ HFASALEICGTLNLREMI >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_7|777_bp atggctggagctgaagacctttatcctaagcaaattaatgcaaaaacatcacaccacatc cccagcttggccatgtcaagatccacttttgcttctcacttacaagtgggagctataatt ctctataaaaaaggccttcagctggccaccgagagaatccgctgtccgaacaaagagtcc gcagggatgtggtcccagccacgaactgctctcaggagccggctctgcctcgctaccagg cccttgcgtggtttcaaaataattgagggtttggaagcagcaaggcagattaaagagatt aacccaacagagcaactctgcaacatggagcagcagcagcagcagaccttccagaagttt ctctctctccttttatttgctgaccctcttatggaaacgcgggactggagaaatttcagg aagagattaaccctgttagacccagagtggggcattgctgaaaagaaacccaaaaatatg gaagcaactttggaactgagtgtcaggcagaggttggaacagttagagggctcagaagaa gagatgaaaatgtgggaaagtttggaacctcctagagacttgttgaatggctttgacaaa aatgctgatagcgatataagcaatatggtccaggctgaggtggtctcagatggagataag gaacttgttgggaactggagcaaatgtgactctcgttatgttttagcaaaaagactgcag cattttgcctctgccctagagatttgtggaactttgaacttgagagagatgatttag >gi568815588r:24484415_24821899|GENSCAN_predicted_peptide_8|180_aa MAKAWWLMLVIPVLWEAKLQLKSPTTTKDIVVLGMLGLMLGCLCQHCPETASAIMMDLPT DTALVLVLGWDLQPLVGQAPPEQPGQQSETLSQKQNKTIPGTVAHACNHSALRSQDMGSC YIVQAEYKVFRTSFRLESEGSGLVERAYRESQEKVGLVDGSRDFLHFLAHVRFFRLHSQQ >gi568815588r:24484415_24821899|GENSCAN_predicted_CDS_8|543_bp atggccaaggcgtggtggctcatgcttgtaattccagtgctttgggaggccaagctgcag ctgaagtcacccacaaccactaaagacatcgttgttctagggatgctggggctgatgttg gggtgtctctgccaacactgtcctgagactgccagtgccattatgatggacttgcccaca gatacagccctagttctcgttctgggctgggatctgcaaccccttgtggggcaggcacca ccagagcagcctggacaacaaagtgagaccctatctcaaaaacaaaacaaaacaattcct ggcactgtggctcatgcctgcaatcacagcgctttgagaagccaagatatggggtcttgc tacatcgtccaggctgaatacaaggttttccgaacaagcttcagactagaatcagaaggc tctgggctagtggagagggcgtacagggagagtcaggagaaagtcggtctggtggatggt tccagagacttcttgcatttcttggctcatgtccgcttcttccgtcttcacagccagcag tga