GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:02:23 Sequence gi568815593r:176429031_176632472 : 203442 bp : 49.99% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 18104 18246 143 2 2 45 86 82 0.741 3.70 1.02 Term + 18580 18971 392 2 2 69 51 329 0.578 22.45 1.03 PlyA + 20906 20911 6 1.05 2.00 Prom + 24723 24762 40 -2.16 2.01 Init + 43465 43530 66 0 0 63 95 34 0.322 0.67 2.02 Intr + 48860 48958 99 1 0 75 92 11 0.404 0.51 2.03 Intr + 50158 50226 69 0 0 129 60 39 0.165 4.58 2.04 Intr + 62139 62209 71 2 2 93 60 -9 0.026 -5.22 2.05 Intr + 63164 63302 139 2 1 127 82 36 0.551 7.37 2.06 Intr + 64969 65054 86 0 2 110 123 49 0.487 9.22 2.07 Intr + 65154 65245 92 0 2 74 98 -70 0.410 -7.76 2.08 Intr + 67456 67633 178 2 1 52 82 230 0.951 17.78 2.09 Intr + 69884 70055 172 2 1 86 95 175 0.988 17.95 2.10 Intr + 70973 71116 144 1 0 126 70 92 0.866 11.68 2.11 Intr + 76343 76376 34 1 1 87 86 16 0.202 -0.90 2.12 Term + 80235 80428 194 1 2 86 42 104 0.472 3.18 2.13 PlyA + 82690 82695 6 1.05 3.12 PlyA - 84612 84607 6 1.05 3.11 Term - 91124 91013 112 1 1 122 43 104 0.954 7.33 3.10 Intr - 100060 100002 59 1 2 74 33 50 0.088 -4.22 3.09 Intr - 100357 100258 100 0 1 91 87 114 0.876 11.71 3.08 Intr - 100614 100493 122 0 2 100 68 222 0.998 20.69 3.07 Intr - 100788 100701 88 2 1 112 16 159 0.932 11.07 3.06 Intr - 101176 101052 125 1 2 104 95 136 0.999 15.28 3.05 Intr - 101713 101552 162 1 0 90 89 201 0.992 20.57 3.04 Intr - 101991 101818 174 0 0 91 101 183 0.960 20.04 3.03 Intr - 102600 102433 168 0 0 120 117 84 0.999 14.64 3.02 Intr - 103163 102974 190 1 1 66 96 128 0.912 10.99 3.01 Init - 103442 103336 107 2 2 46 97 63 0.680 0.76 3.00 Prom - 104849 104810 40 -12.01 4.00 Prom + 107259 107298 40 -2.56 4.01 Init + 108878 108934 57 1 0 99 68 6 0.248 1.02 4.02 Intr + 113452 114469 1018 0 1 111 11 434 0.289 28.23 4.03 Intr + 116568 116661 94 1 1 53 111 41 0.421 1.92 4.04 Intr + 124738 124912 175 1 1 59 11 121 0.015 1.44 4.05 Intr + 136308 136374 67 2 1 80 82 39 0.126 0.98 4.06 Intr + 136642 136713 72 2 0 60 92 76 0.765 4.58 4.07 Intr + 139648 139787 140 2 2 104 88 213 0.996 23.08 4.08 Intr + 139930 139980 51 0 0 90 109 105 0.989 11.90 4.09 Intr + 142183 142272 90 0 0 86 46 128 0.991 8.59 4.10 Intr + 145053 145142 90 2 0 48 113 166 0.971 15.29 4.11 Intr + 146054 146179 126 1 0 89 56 216 0.999 19.38 4.12 Intr + 146250 146396 147 2 0 97 50 161 0.869 13.63 4.13 Intr + 146476 146551 76 0 1 10 39 163 0.912 2.69 4.14 Intr + 146694 146809 116 1 2 87 101 242 0.987 25.57 4.15 Intr + 146922 147173 252 2 0 114 33 449 0.933 39.63 4.16 Intr + 148369 148524 156 0 0 44 94 283 0.993 24.71 4.17 Intr + 148607 148768 162 1 0 118 94 228 0.812 26.57 4.18 Intr + 149004 149065 62 2 2 98 82 33 0.868 1.23 4.19 Intr + 149335 149578 244 1 1 44 64 525 0.569 43.20 4.20 Intr + 152313 152552 240 2 0 85 94 415 0.601 39.55 4.21 Intr + 155160 155229 70 2 1 66 91 101 0.969 6.95 4.22 Intr + 155380 155985 606 2 0 99 91 941 0.979 87.92 4.23 Intr + 156924 156995 72 1 0 53 99 92 0.986 6.18 4.24 Intr + 157763 157812 50 0 2 101 114 67 0.994 9.00 4.25 Intr + 160001 160152 152 1 2 80 -43 191 0.900 4.16 4.26 Intr + 160300 160408 109 1 1 126 82 38 0.989 7.29 4.27 Intr + 160432 160586 155 0 2 54 35 208 0.552 10.97 4.28 Intr + 161048 161116 69 2 0 118 109 29 0.951 6.30 4.29 Intr + 161223 161300 78 0 0 113 56 80 0.970 5.97 4.30 Intr + 161326 161455 130 1 1 36 105 108 0.674 8.00 4.31 Intr + 161533 161657 125 0 2 80 64 143 0.956 10.48 4.32 Term + 162180 162297 118 0 1 114 53 124 0.937 9.51 4.33 PlyA + 162399 162404 6 -3.64 5.07 PlyA - 162420 162415 6 -10.49 5.06 Term - 163630 162480 1151 2 2 50 38 1563 0.981 139.58 5.05 Intr - 164153 164071 83 1 2 88 85 32 0.800 2.18 5.04 Intr - 170498 167812 2687 2 2 123 -5 1754 0.377 156.96 5.03 Intr - 179558 179439 120 2 0 34 99 50 0.555 1.39 5.02 Intr - 179946 179820 127 2 1 11 53 104 0.601 -0.12 5.01 Init - 181147 180969 179 1 2 42 107 160 0.927 10.03 5.00 Prom - 187736 187697 40 -6.76 6.09 PlyA - 188326 188321 6 -0.45 6.08 Term - 190241 190098 144 2 0 57 37 82 0.555 -2.09 6.07 Intr - 192273 192184 90 0 0 110 94 126 0.983 15.59 6.06 Intr - 197486 197368 119 0 2 80 121 75 0.988 10.18 6.05 Intr - 197731 197690 42 2 0 124 79 38 0.798 4.71 6.04 Intr - 198518 198350 169 2 1 100 -15 109 0.303 1.32 6.03 Intr - 199764 199598 167 1 2 41 56 81 0.486 -0.12 6.02 Intr - 200633 200504 130 2 1 87 72 253 0.805 23.87 6.01 Init - 201140 201051 90 2 0 55 34 103 0.188 0.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:176429031_176632472|GENSCAN_predicted_peptide_1|178_aa XTSSQSLFCLTGSTLQGDEGGAQAAESLAENHTGGQWKSVYQTQASLEETRGQCSASAER QGSRHGKADAGPGFQRRSHGGGRQAVRTTGGAMNGRGSGEPSPDLPASDPEAAMERWPKS AHSFRFEGGVGNAAPVSRALRNLPRDGQELLETLGQGPATKRLVFRNGCVSRETSKSI >gi568815593r:176429031_176632472|GENSCAN_predicted_CDS_1|537_bp nnaaccagctctcaaagccttttctgtctcaccggatcaaccctgcaaggggacgagggt ggggcgcaggcggccgagagccttgctgagaatcacacaggtggtcagtggaaaagcgtg taccagacccaggcctctctggaagagacaaggggccagtgcagcgctagtgcggagaga caaggctccaggcacggaaaagcagatgccgggccggggttccagcgccgcagtcacggc ggtggacggcaagctgtgcggacgactgggggcgccatgaatggtcgaggatccggggag ccgagtccagatcttcctgcaagtgaccccgaagcggcaatggaacggtggccgaagtct gcccacagcttccgcttcgaaggcggcgtcggaaatgccgcgccagtatccagagccctt cggaacctcccgagagatggtcaggagcttttggaaactttaggtcaggggccagcgacc aaaaggctggtatttcggaacggctgcgtctcccgagaaacctccaagagtatctga >gi568815593r:176429031_176632472|GENSCAN_predicted_peptide_2|447_aa MVLNFWAQVILLPWPPRMLGLQVEISLFALKLTGCLEEFLEDRDYILFIFCPSNIDLTGI ESMDQCRHTLEQHNWNIEGKYARHVLLILMQLYHPTSLFNVRFALRFIRPDPRSRVTDPV GDIVSFMHSFEEKYGRAHPVFYQGTYSQALNDAKRELRFLLVYLHGDDHQDSDEFCRNTL CAPEVISLINTRMLFWACSTNKPEGYRVSQALRENTYPFLAMIMLKDRRMTVVGRLEGLI QPDDLINQLTFIMDANQTYLVSERLEREERNQTQVLRQQQDEAYLASLRADQEKERKKRE ERERKRRKEEEVQQQKLAEERRRQNLQEEKERKLECLPPEPSPDDPESVKIIFKLPNDSR VERRFHFSQSLTDFNCALSEPEQVKYRLHILSLNASTICVTPKDFWDFDETCEGEDTEKP VICKHLLLFPHHLWDISAVVSKWQIIN >gi568815593r:176429031_176632472|GENSCAN_predicted_CDS_2|1344_bp atggtcttgaacttctgggcccaagtgatcctcttaccttggcctcccagaatgctggga ttacaggttgagatcagtctctttgccctgaagcttactggttgtttagaggagtttctt gaggatagagactacatcttattcattttctgtccctccaacatagatctcactggcatc gaatctatggatcagtgtcgccataccttggaacagcataactggaacatagagggaaaa tatgcacggcatgtattactaattttaatgcagctgtatcatccaaccagcttgtttaat gttaggtttgctcttcgttttatacggcctgaccctcgcagccgggtcactgaccccgtt ggggacattgtttcatttatgcactcttttgaagagaaatatgggagggcacaccctgtc ttctaccagggaacgtacagccaggcacttaacgatgccaaaagggagcttcgctttctt ttggtttatcttcatggagatgatcaccaggactctgatgagttttgtcgcaacacactc tgtgcacctgaagttatttcactaataaacactaggatgctcttctgggcatgctctaca aacaaacctgagggatacagggtctcacaggctttacgagagaacacctatccattcctg gccatgattatgctgaaggatcgaaggatgactgtggtgggacggctagaaggcctcatt caacctgatgacctcattaaccaactgacatttatcatggatgctaaccagacttacctg gtgtcagaacgcctagaaagggaagaaagaaaccagacccaagtgctgagacaacagcag gatgaggcctacctggcctctctcagagctgaccaggagaaagaaagaaagaaacgggag gagcgggagcgtaagcggcggaaggaggaggaggtgcaacagcaaaagttggcagaggag agacggcggcagaatttacaggaggaaaaggaaaggaagttggaatgcctgccccctgaa ccttcccctgatgaccctgaaagtgtcaagatcatcttcaaattacctaatgattctcga gtagagagacgattccacttttcacagtctctaacagattttaactgtgccttgtcagag ccagagcaggtcaaatacaggctgcacattttgtcacttaatgccagtacaatctgtgtt actcctaaggacttttgggattttgatgagacctgcgagggagaagacactgagaagcca gtgatctgcaagcatttgctcttgtttccacatcacctctgggatatttcagctgttgtt tccaaatggcaaatcatcaactaa >gi568815593r:176429031_176632472|GENSCAN_predicted_peptide_3|468_aa MRPWALAVTRWPPSAPVGQRRFSAGPGSTPGQLWGSPGLEGPLASPPARDERLPSQQPPS RPPHLPVEERRASAPAGGSPRMLHPATQQSPFMVDLHEQVHQGPVPLSYTVTTVTTQGFP LPTGQHIPGCSAQQLPACSVMFSGQHYPLCCLPPPLIQACTMQQLPVPYQAYPHLISSDH YILHPPPPAPPPQPTHMAPLGQFVSLQTQHPRMPLQRLDNDVDLRGDQPSLGSFTYSTSA PGPALSPSVPLHYLPHDPLHQELSFGVPYSHMMPRRLSTQRYRLQQPLPPPPPPPPPPPY YPSFLPYFLSMLPMSPTAMGPTISLDLDVDDVEMENYEALLNLAERLGDAKPRGLTKADI EQLPSYRFNPDSHQSEQTLCVVCFSDFEARQLLRVLPCNHEFHTKCVDKWLKANRTCPIC RADASEVPREAESDKLIHIHGPLLLFLPEMKALERGHPGETLILKTRG >gi568815593r:176429031_176632472|GENSCAN_predicted_CDS_3|1407_bp atgcgaccatgggctctggcagtgactaggtggccaccctccgcccccgtgggccagcgg cgattctctgcgggacctggcagcaccccgggccagctctggggaagccctggcctcgag ggccccctggccagcccgcctgcccgggatgagcgcttaccctcccagcagccgccgtcc cgacctccacacctccccgtagaggagcgccgagcctcggctcctgccggcgggagcccc cgaatgctgcacccagccacccagcagagcccgttcatggttgatctccacgagcaggtg caccagggacctgtccctctgtcctacacggtcaccacagtgacgacccaaggcttcccc ttgcctacaggccagcacatccctggctgcagtgcccagcagctcccagcatgctccgtg atgttcagtgggcagcattaccccctctgctgcctcccgcccccgcttatccaggcgtgc accatgcagcagctgcctgtgccctatcaggcctacccccacctcatctccagtgaccac tacatcctgcaccccccaccaccggccccacccccccagcccacccacatggcgcccctg gggcagtttgtgtctctgcagacccagcaccctcggatgcccctccagcggctcgacaac gacgtggacctgcgtggggaccagccctccctgggcagcttcacctactccacctctgcg cctggcccagccctttccccgtcggtgcccctgcactacctgccccacgatccgctgcac caggagctgtcctttggtgtgccatattctcacatgatgccacggagactgagcacccag agataccgcctgcagcagccactgcccccgccgcccccacccccacccccaccaccctac taccccagcttcctgccctacttcctctcgatgctgccaatgtcaccaacagcaatgggg cccaccatcagcctggacctggacgtggatgatgtggagatggagaactatgaggccctc ctgaacctggccgagcggctgggagatgccaagccccggggtctcaccaaagcagacata gagcagctcccgtcgtaccgctttaacccggacagccatcagtcggagcagacgctgtgt gtggtctgcttcagtgacttcgaggcgcggcagctgctccgagtcctcccctgcaaccat gagttccacaccaagtgtgttgacaagtggttgaaggccaaccggacgtgtcccatctgc cgggccgacgcctccgaggtgcccagggaggctgaatcagacaagctgattcacatccat gggcctctgctgctgtttctacctgagatgaaggctctggagagagggcaccctggagag accctcatattgaagactcgtggctga >gi568815593r:176429031_176632472|GENSCAN_predicted_peptide_4|1722_aa MARPPHFSLFILHGVFKKLFSLLEPPGSAKGAGTVTLFSDENRGLQECGPWKIEEDKSTR IPHCPSILLVLLPSLPSPRHQLDSSLVAFRQRKLCDLGQDTDSLRTPQRSDIQNAVRTAL KCSGLFPGPAAAPGQGLGRSGSPAAGTGGVGEMFRTDMARPKNCSASPARGAGRLGPSRS WDSPGVAVGTCAGRRARGGSEFPGAPRVGAFGPSGRGEERRSSRCRARDCELPPCGRRQR PGGKTPRAFLRRPAAAPYRRVLAGPAPGALGAGSGPGVTWGRGGAPPPPGGRRVAPAAAQ ARGAAAARAPPAAAPLRPGAGEPSADLTTRARGAHTGCAASRTPAPAPREWSCLDRTTQQ AYEVESALGPFWTRTPIVWEVVYLAQAPSSATCVSTFISTHIISMSVPTYHHQCHPYPRG IPILILDPASMTTSLTALDSSAMMSLSRVPADVMAQLWLSCFLLPALVVSVAANVAPKFL ANMTSVILPEDLPVGAQAFWLVAEDQDNDPLTYGMSGPNAYFFAVTPKTGEVKLASALDY ETLYTFKVTISVSDPYIQVQREMLVIVEDRNDNAPVFQNTAFSTSINETLPVGSVVFSVL AVDKDMGSAGMVVYSIEKVIPSTGDSEHLFRILANGSIVLNGSLSYNNKSAFYQLELKAC DLGGMYHNTFTIQCSLPVFLSISVVDQPDLDPQFVREFYSASVAEDAAKGTSVLTVEAVD GDKGINDPVIYSISYSTRPGWFDIGADGVIRVNGSLDREQLLEADEEVQLQVTATETHLN IYGQEAKVSIWVTVRVMDVNDHKPEFYNCSLPACTFTPEEAQVNFTGYVDEHASPRIPID DLTMVVYDPDKAGVVAWGSNGTFLLSLGGPDAEAFSVSPERAVGSASVQVLVRVSALVDY ERQTAMAVQVVATDSVSQNFSVAMVTIHLRDINDHRPTFPQSLYVLTVPEHSATGSVVTD SIHATDPDTGAWGQITYSLLPGNGADLFQVDPVSGTVTVRNGELLDRESQAVYYLTLQAT DGGNLSSSTTLQIHLLDINDNAPVVSGSYNIFVQEEEGNVSVTIQAHDNDEPGTNNSRLL FNLLPGPYSHNFSLDPDTGLLRNLGPLDREAIDPALEGRIVLTVLVSDCGEPVLGTKVNV TITVEDINDNLPIFNQSSYNFTVKEEDPGVLVGVVKAWDADQTEANNRISFSLSGSGANY FMIRGLVLGAGWAEGYLRLPPDVSLDYETQPVFNLTVSAENPDPQGGETIVDVCVNVKDV NDNPPTLDVASLRGIRVAENGSQHGQVAVVVASDVDTSAQLEIQLVNILCTKAGVDVGSL CWGWFSVAANGSVYINQSKAIDYEACDLVTLVVRACDLATDPGFQAYSNNGSLLITIEDV NDNAPYFLPENKTFVIIPELVLPNREVASVRARDDDSGNNGVILFSILRVDFISKDGATI PFQGVFSIFTSSEADVFAGSIQPVTSLDSTLQGTYQVTVQARDRPSLGPFLEATTTLNPP TPSASPQGPWCLISCTPPSQLFTVDQSYRSRLQFSTPKEEVGANRQAINAALTQATRTTV YIVDIQDIDSAARARPHSYLDAYFVFPNGSALTLDELSVCEGEQEAWGVPRSLGPKWSSV ARMIRNDQDSLTQLLQLGLVVLGSQESQESDLSKQLISVIIGLGVALLLVLVIMTMAFVC VRKSYNRKLQAMKAAKEARKTAAGVMPSAPAIPGTNMYNTER >gi568815593r:176429031_176632472|GENSCAN_predicted_CDS_4|5169_bp atggcccggccgccccacttctccttgtttatactacacggcgtctttaaaaagctgttc tcactgctggaacccccaggttctgcaaagggggcaggaaccgtcacattgttcagtgat gaaaaccgagggctgcaggagtgcggaccctggaagatcgaggaagacaagtctacccga atcccccactgtccctccatcttactcgtcctccttccctcacttcccagcccccggcac cagctcgactcgagcttagtggctttcaggcaacggaagctgtgtgaccttgggcaagac acggactctctccggaccccgcagcgttccgacatccagaatgcagtaaggacggcgctg aagtgcagcggtctgttccccggacctgcagccgctccagggcagggcttaggccgctct ggctccccggcggccggcacagggggcgtcggggaaatgtttaggacggacatggcgagg ccaaagaactgcagcgcgtctccggctcgcggtgccggacgcctgggtcccagccgcagc tgggactcgccgggggtggcggtcgggacgtgcgcggggcgcagggcacgcgggggctcg gagttccccggggctccccgagttggggcgtttggacctagcggacggggagaagagcgg cgcagctcccgctgccgggcccgggactgcgagctcccgccgtgcgggcgccggcagagg cctggcgggaagaccccgcgtgcgttcctccgccggcccgcggccgccccctaccgccgc gtgctcgccggccctgcgcccggggcgctcggcgcagggtccgggcccggcgttacctgg ggccgcggcggggctccacccccacccggcggccggcgcgtcgctcccgctgcagcccag gctcgcggtgcggctgcggcccgcgcgccgcctgccgcggcccctctccggcccggtgca ggggaaccgtccgcggacctcaccacccgggcgcgcggggcccacaccggctgcgcagct tccaggacccccgccccggccccgcgcgaatggagctgcctggaccgcaccacgcaacag gcctatgaggtggagtctgcattaggcccattttggacaaggacaccaatagtgtgggag gtggtgtaccttgcccaagcccccagcagcgccacctgcgtctccaccttcatcagcaca cacatcatcagcatgtccgtccctacttaccaccaccagtgccacccctacccccgaggg atccctatcctgatattggaccctgcctccatgaccacctcactcactgcccttgactcc agcgctatgatgtcactctccagggtccctgcggatgtgatggcccagctatggctgtcc tgcttcctccttcctgccctcgtggtgtctgtggcagccaacgtggccccgaagttccta gccaacatgacgtcagtgatcctgcctgaggacctgcctgtgggtgcccaggccttctgg ttggtagcggaagaccaggacaatgaccctctgacctatgggatgagcggccccaatgcc tacttcttcgctgtcactccgaaaactggggaagtgaagctggccagcgctctggactac gagacactctacacattcaaagtcaccatctccgtgagcgacccctacatccaggtgcag agggagatgctggtgattgtggaagatagaaacgacaacgcacccgttttccagaacacc gctttctccaccagcatcaacgagaccctgcccgtgggcagtgtggtgttctccgtgctg gccgtggataaagacatggggtctgcaggcatggtcgtgtactccatagagaaggtcatc cctagcactggggacagcgagcatctcttccggatcctggccaatggctccatagtcctc aatggcagcctcagctacaacaacaagagcgctttctaccagctggagctgaaggcctgt gacttgggcggcatgtaccacaacaccttcaccatccagtgctccctgcctgtcttcctg tccatctccgtggtggaccagcctgaccttgacccccagtttgtcagggagttttactcg gcctctgtggctgaggatgcagccaagggaacctcggtgctgacggtggaggctgtggat ggcgacaaaggcatcaatgaccctgtgatctacagcatctcctactccacgcggcccggc tggtttgacatcggggcagatggggtgatcagggtcaacggctccctggaccgtgagcag ctgctggaggcggatgaggaggtgcagctgcaggtcacggccaccgagacacacctcaac atctacgggcaggaggccaaggtgagcatctgggtgacagtgagagtgatggacgtcaat gaccacaaacctgagttttacaactgcagcctcccagcctgcaccttcacccccgaagag gcccaagtgaacttcactggctacgtggacgagcatgcctccccccgcatccccatcgat gacctcaccatggtggtctacgacccggacaaggcaggcgtggtggcgtggggcagcaat ggcaccttcctgttgtcgctggggggccccgatgcagaagccttcagcgtctccccggag cgggcagtgggctcagcctccgttcaggtgctggtgagagtatccgcgctggtggactac gagaggcagacggcgatggcggtgcaggttgtggccacagactccgtcagccagaacttc tccgtcgccatggtgaccatccaccttagagacattaatgaccacaggcccacgtttccc cagagcttgtacgtcctcacggtgccagagcacagcgccaccggctctgtggtcaccgac agcatccacgccacggacccagacacgggcgcgtggggccaaattacctacagcctgctc ccaggaaatggggcagacctcttccaagtggatcccgtctcagggacggtgacggtgagg aacggtgagctgctggaccgggagagccaggccgtgtactacctgacgctgcaggccaca gacggcgggaacctgtcctcctccaccacactgcagatccacctgctggacatcaacgac aatgcacccgtggttagcggctcctacaacatcttcgtccaggaggaggagggcaatgtc tccgtgaccatccaggcccacgacaatgatgagccgggcaccaacaacagccgtctgctc ttcaacctgctgcctggcccctacagccacaacttctccttggaccctgacacagggctc ctcagaaacctggggcccctggacagagaggccatcgaccccgccctggagggccgcatt gtgctgacagtgcttgtgtctgactgcggcgagcctgtcctcggcaccaaagtcaatgtc accatcactgtggaggacatcaatgataacctgcccatcttcaatcagtccagctacaac tttacggtgaaggaggaggatccaggagtgctagtgggcgtggtgaaggcctgggacgcg gaccagacggaagccaacaaccgcatcagcttcagcctgtcggggagtggtgccaactac ttcatgatccgaggcttggtgctgggggctgggtgggctgagggctacctccggctgccc ccggacgtgagcctggattacgagacacagcccgtcttcaacttgacagtgagtgctgag aacccagacccccaggggggtgagaccatagtagacgtctgcgtgaatgtgaaagacgtg aacgacaatccccccaccctggatgtagcctcactccggggcatccgtgtggctgagaat ggctcacagcacggccaggtggctgtggtggttgcctcggatgtggacaccagtgcccag ctggagatacagcttgtgaacattctctgcaccaaggccggggtcgatgtgggcagccta tgctggggctggttctcagtggcggccaacggctctgtgtacatcaaccagagcaaagcc atcgactacgaggcctgtgacctggtcacgctggttgtgcgggcctgtgacctagccacg gaccccggcttccaggcctacagcaacaatggaagcctcctcattaccattgaggacgtg aatgacaatgcaccctattttctgcctgagaataagacttttgtgatcatccctgaactc gtgctgcccaaccgggaggtggcttctgtccgggccagagacgatgattcagggaacaat ggcgtcatcctgttctccatcctccgagtagacttcatctctaaggacggggccaccatc cctttccagggtgtcttctcgatcttcacctcctccgaggccgacgtgttcgctgggagc attcagccggtgaccagcctcgactccactctccaaggcacctaccaagtgacagtccag gccagggacagaccttccttgggtcctttcctggaagccaccaccaccctgaatccccca acgccctctgccagcccccagggtccctggtgcctcatctcctgcacacccccttcccag ctcttcaccgtggaccagagttaccgctcgcggctgcagttctccacaccgaaggaggag gtgggcgccaacagacaggcgattaatgcggctcttacccaggcaaccaggactacagta tacattgtggacattcaggacatagattctgcagctcgggcccgacctcactcctacctc gatgcctactttgtcttccccaatgggtcagccctgacccttgatgagctgagtgtgtgt gagggtgagcaggaagcctggggtgtgccaaggtccctcgggcctaagtggagttctgtg gccaggatgatccggaatgatcaggactcgctgacgcagctgctgcagctggggctggtg gtgctgggctcccaggagagccaggagtcagacctgtcgaaacagctcatcagtgtcatc ataggattgggagtggctttgctgctggtccttgtgatcatgaccatggccttcgtgtgt gtgcggaagagctacaaccggaagcttcaagctatgaaggctgccaaggaggccaggaag acagcagcaggggtgatgccctcagcccctgccatcccagggactaacatgtacaacact gagcggtga >gi568815593r:176429031_176632472|GENSCAN_predicted_peptide_5|1448_aa MPGAGRGAGARSRRWGRRRRVCGSSRQPGRRQPRPLGRRRRRRREPERISPRSRSSGTAW RLRCSEGTTREGDEGQQTEILRHTQMPDEVPQAFVGLVSTEEGVHTHNKLFLSRTQEPGP GPRVAQAGTYIHKQSVHSFACQESPSKETLEAHGASISGTPEATTSGKPEPVSSVKTEPK SSDDRNPMFLEKMDFKSSKQADSTSIGKEDPGSSRKADPMFTGKAEPEILGKGDPVAPGR MDPMTVRKEDLGSLGKVDPLCSSKTYTVSPRKEDPGSLRKVDPVSSDKVDPVFPRKEEPR YSGKEHPVSSEKVAPTSAEKVDLVLSGKRDPGPSGKADPMPLESMDSASTGKTEPGLLGK LIPGSSGKNGPVSSGTGAPGSLGRLDPTCLGMADPASVGNVETVPATKEDSRFLGKMDPA SSGEGRPVSGHTDTTASAKTDLTSLKNVDPMSSGKVDPVSLGKMDPMCSGKPELLSPGQA ERVSVGKAGTVSPGKEDPVSSRREDPISAGSRKTSSEKVNPESSGKTNPVSSGPGDPRSL GTAGPPSAVKAEPATGGKGDPLSSEKAGLVASGKAAPTASGKAEPLAVGKEDPVSKGKAD AGPSGQGDSVSIGKVVSTPGKTVPVPSGKVDPVSLGKAEAIPEGKVGSLPLEKGSPVTTT KADPRASGKAQPQSGGKAETKLPGQEGAAAPGEAGAVCLKKETPQASEKVDPGSCRKAEP LASGKGEPVSLGKADSAPSRKTESPSLGKVVPLSLEKTKPSSSSRQLDRKALGSARSPEG ARGSEGRVEPKAEPVSSTEASSLGQKDLEAAGAERSPCPEAAAPPPGPRTRDNFTKAPSW EASAPPPPREDAGTQAGAQACVSVAVSPMSPQDGAGGSAFSFQAAPRAPSPPSRRDAGLQ VSLGAAETRSVATGPMTPQAAAPPAFPEVRVRPGSALAAAVAPPEPAEPVRDVSWDEKGM TWEVYGAAMEVEVLGMAIQKHLERQIEEHGRQGAPAPPPAARAGPGRSGSVRTAPPDGAA KRPPGLFRALLQSVRRPRLCWFTSGSAFPPPAGFPITAIKGWKQIEPPPPSPPSPSPRLP PSSPSPPPPSSQLPSSLPSAPSPSPQHQHHHHHYHHNHYHHHHHHHLNTNTTTTTVIIHH HHHHHHLNTNTTTTIIIITITTTITITTITITTITSTPTPPPPPSPSPPSPSPPSSPQHQ HHHHHHHCHHHHHHHHHHQHHHHLNTNTTTTIIIVIITIITTITITTITITTTITTTTTI TATTTMTTMTTTITIATTITHHHYHHHHHLNTNTTTTTTVIITIITTITITTITITSTPT PSSPPSPSQPSPSPQHQHHHHHHHHHNHHHHLNTNTTTTTTIVIITIITTITITTITITS TPTPSSPPSLLSSSPLSPLLPSPSPHYHHHHHHNYHYYHDHTITIVITIITTNTITTTTV TTCTTTTT >gi568815593r:176429031_176632472|GENSCAN_predicted_CDS_5|4347_bp atgcccggggcggggcgcggggcgggagcgcggagccggcgctggggacggcggcgccga gtctgcgggtcctcccgccagcccggccggaggcagccgaggccgctcgggcggcggcgg cggcggcggcgggagccggagcgcatctcgcccaggagccggagcagcggcactgcctgg agacttcgctgctctgagggcactacgagagaaggggacgaggggcagcagactgagatt ctcagacacactcagatgcctgacgaagtccctcaagcctttgttggtcttgtcagcaca gaagagggtgtacatacgcacaacaagctcttcttgagcagaacacaggagccaggtcct ggccccagagtggcccaagctggtacatacatccacaagcagtctgttcactcctttgca tgccaggagtcaccctccaaggagacattggaggcacatggagcctccatctcagggaca ccagaagccaccacgtctgggaagccagagcctgtgtcctccgtgaaaactgagcccaaa tcctcagatgacagaaatcccatgttcttagagaagatggatttcaagtcctcaaagcag gccgattccacttccataggaaaggaggatcctgggtcctcacggaaggcagatcccatg tttacaggaaaggcagagcctgaaatcttgggaaagggggatcctgtggctcctggaagg atggatcccatgactgtaagaaaggaagatcttggatccctgggaaaagtagatcctttg tgctccagcaagacgtatacagtgtcaccgaggaaggaggatcctgggtctttgagaaag gtggatcctgtgtcctcagacaaagtggaccctgtattcccaagaaaggaggagcccagg tattcaggaaaagagcatcctgtgtcctcagaaaaggtcgctcctacatctgcagaaaag gtagatcttgtattgtcgggaaagagagatcctgggccctcgggaaaggcagatcccatg cccttggaaagcatggattctgcgtccacaggaaagacagagccggggctcctgggcaag ctgattccaggctcatcaggcaagaatgggcctgtatcctctgggaccggggctcctggg tccttgggaaggctggatcccacatgcttggggatggcagatcccgcatctgtgggaaat gtagaaactgtgcctgccacaaaagaggactcccggttcctgggaaagatggaccctgcc tcctcaggagaggggcgtcctgtgtctggccacacggatactacggcttcagcaaagaca gatctcacatctttgaaaaatgtggatcccatgtcttcaggcaaggtggatccagtttct ctgggaaagatggaccccatgtgctcaggaaagccagagctcttgtctcctggacaggca gagcgtgtgtctgtgggaaaggcaggaactgtatccccaggaaaagaggacccggtgtcc tccagaagggaggaccccatatctgctggaagtagaaagacatcatctgaaaaagtgaat cctgagtcttcaggaaagacaaaccctgtgtcttcaggtccaggcgatcccaggtccttg gggacagcaggtcccccatctgcagtaaaggctgagccagcgacggggggaaaaggagat cccctgtcctcggagaaggcaggtctggtggcctctggaaaggcggctcccacagcctca gggaaggccgagcccctcgcggtgggcaaggaggaccctgtgagcaagggaaaggcagac gctggcccctctggacaaggggactctgtgtctataggtaaagtggtctcaactccagga aaaacagtcccggtgccctcggggaaggtggatcccgtgtccctgggaaaagcagaagct atcccagagggaaaagtgggttctctgcctctagagaaggggagtcctgttaccaccaca aaggcggatcccagggcctcggggaaagcacagccgcagtctggtggcaaagcagaaaca aagctccctgggcaagagggcgctgcagcaccaggggaagcaggggctgtgtgtttgaaa aaggagacaccacaggcctcagagaaggtggatcctggatcctgcagaaaagcagagccc cttgcctcagggaagggagagcctgtgtccctggggaaagccgactctgcaccttccaga aaaacggagtccccatccttggggaaggtggtccccctgagtctggagaagaccaagccg tcctcctcctccaggcagttagaccgcaaagccctcggctcagcccggtctcccgagggt gccaggggcagtgaaggccgcgtggagccgaaggccgagcccgtgtccagcaccgaggcc tccagtctcggccagaaagacctggaagccgctggggccgagagaagcccctgcccagag gccgcagcgcccccgccggggccgcggactcgcgacaacttcaccaaggcgccgtcgtgg gaggcgagcgccccgccgccgccgcgcgaggacgcgggcactcaggcgggcgcgcaggcc tgcgtctcagtggccgtgagccccatgtctccgcaggacggcgctgggggctcggccttc agcttccaggcggcgccgcgcgcgcccagcccgccctcgcgccgagatgcgggcctgcag gtgtcgctgggcgccgccgagacgcgctccgtggccactgggcccatgacacctcaagcc gccgcgccgcccgccttccccgaagtgcgggtgcggcccggctcagcgctggcggccgct gtagcgcccccggagccggctgagcccgtgcgagacgtgagctgggacgagaagggcatg acgtgggaggtatacggcgccgccatggaggtggaggtgctgggcatggccatccagaag catctggagcgacagatcgaggagcacggccgccaaggggcgcccgcgccgccgcccgcc gcccgtgccggccccggccgttcgggctcggtgcgcaccgcgcccccagatggcgccgcc aagcgtccgcccggcctgttccgcgcgctgctgcagagtgtgcgccggccgcgactatgc tggttcacctctggatctgctttcccacccccagctggctttcctataacggccataaag ggctggaagcagattgaaccaccacctccatcaccaccatcaccatcaccacgactacca ccatcatcaccatcaccaccaccaccatcatcacaactaccatcatcactaccatcagca ccatcaccatcacctcaacaccaacaccaccaccaccactatcatcataatcactatcac caccaccatcaccatcacctcaacaccaacaccaccaccaccaccgtcatcatccatcat caccaccatcaccatcacctcaacaccaacaccaccaccaccatcatcatcataaccatc accaccaccatcaccatcactaccatcaccatcaccaccatcacctcaacaccaacacca ccaccaccaccatcaccatcaccaccatcaccatcaccaccatcatcacctcaacaccaa caccaccaccaccatcatcattgtcatcatcaccatcatcaccaccatcaccatcagcac catcatcacctcaacaccaacaccaccaccaccatcatcattgtcatcatcaccatcatc accaccatcaccatcactaccatcaccatcaccaccaccatcaccaccaccaccacgata actgccaccactaccatgaccaccatgaccaccaccatcaccattgccaccaccatcacc caccaccactaccatcaccaccatcacctcaacaccaacaccaccaccaccaccaccgtc atcatcaccatcatcaccaccatcaccatcacaaccatcaccatcacctcaacaccaaca ccatcatcaccaccatcaccatcacaaccatcaccatcacctcaacaccaacaccatcat caccaccatcaccatcacaaccatcaccatcacctcaacaccaacaccaccaccaccacc accatcgtcatcatcaccatcatcaccaccatcaccatcacaaccatcaccatcacctca acaccaacaccatcatcaccaccatcattgttgtcatcatcaccactgtcaccattgcta ccatcaccatcacctcactaccatcaccaccaccaccataactaccactactaccatgac cacaccatcaccattgtcatcaccatcatcaccaccaacaccatcaccaccaccaccgtc actacctgtaccaccaccaccacctag >gi568815593r:176429031_176632472|GENSCAN_predicted_peptide_6|316_aa MRVSVHPGVCEYAPVFAAARAAVGTCVRAHAARMDVFMKGLSMAKEGVVAAAEKTKQGVT EAAEKTKEGVLYVGILQMLGFHSEQERMVPDLLMVIQSISAVRVDIQWLPEENASSLSNK RGAGLQKSQPRSGPLAFPAAPRHFEVIKLLPSEEGPANATGASYNLHITFRPTLSLTPPT LARRRGSKTREGVVQGVASVAEKTKEQASHLGGAVFSGAGNIAAATGLVKREEFPTDLKP EEVAQEAAEEPLIEPLMEPEGESYEDPPQRPRPHPDWLLAWLCRRGPQTRLRKTLPYSKP QCDLNSCRKPASPSSL >gi568815593r:176429031_176632472|GENSCAN_predicted_CDS_6|951_bp atgcgcgtcagtgtgcaccccggagtatgtgagtatgctccggtgtttgcagctgcccgg gcggctgtggggacgtgtgtccgcgcccatgccgccaggatggacgtgttcatgaagggc ctgtccatggccaaggagggcgttgtggcagccgcggagaaaaccaagcagggggtcacc gaggcggcggagaagaccaaggagggcgtcctctacgtcggcatcctgcagatgctgggg tttcacagtgaacaggaaagaatggttcctgatctcctgatggtgatacagagcatttca gcagtgcgagttgacatccagtggctccctgaagagaatgcctcatccctgagtaataag aggggggcagggctgcagaaaagccagcccagatcaggccctctggccttcccagcggcc cccagacattttgaagttatcaagttgctaccctccgaggaaggcccagccaatgccacg ggtgccagttacaatctgcacatcacctttagacccaccctgagcctgacaccccctact ttggccagaagaaggggaagcaagacccgagaaggtgtggtacaaggtgtggcttcagtg gctgaaaaaaccaaggaacaggcctcacatctgggaggagctgtgttctctggggcaggg aacatcgcagcagccacaggactggtgaagagggaggaattccctactgatctgaagcca gaggaagtggcccaggaagctgctgaagaaccactgattgagcccctgatggagccagaa ggggagagttatgaggacccaccccagaggccccgcccccacccggactggctcttggcg tggctatgccgcagaggcccccaaactcgcctgcgcaagaccctgccctattctaaacct cagtgcgacctcaattcgtgccgcaagcctgcctcgccctccagcctctag