GENSCAN 1.0	Date run: 7-Nov-116	Time: 00:37:53

Sequence gi568815596r:218881670_219085075 : 203406 bp : 49.88% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    492    754  263  0  2   97   55   313 0.934  26.03
 1.02 Intr +   8315   8694  380  0  2   77   94   490 0.867  43.18
 1.03 Term +  11105  11602  498  1  0   99   53   931 0.967  85.22
 1.04 PlyA +  12238  12243    6                               1.05

 2.05 PlyA -  12622  12617    6                               1.05
 2.04 Term -  12900  12759  142  2  1  113   43    38 0.462  -0.80
 2.03 Intr -  15594  15477  118  1  1  110   81    34 0.860   4.42
 2.02 Intr -  17131  16992  140  0  2   50   73    67 0.253   1.51
 2.01 Init -  19314  19247   68  0  2   89   80    31 0.307   2.94
 2.00 Prom -  19994  19955   40                              -7.16

 3.00 Prom +  23705  23744   40                              -4.56
 3.01 Init +  25198  25314  117  0  0   95  100    42 0.586   6.31
 3.02 Term +  27367  27453   87  0  0   27   55   155 0.538   3.76
 3.03 PlyA +  32590  32595    6                               1.05

 4.06 PlyA -  32967  32962    6                               1.05
 4.05 Term -  43642  43450  193  0  1   34   40   195 0.777   6.19
 4.04 Intr -  45585  45494   92  0  2   93   79    27 0.916   1.09
 4.03 Intr -  47100  46949  152  1  2   61   77    48 0.136   0.88
 4.02 Intr -  72048  71928  121  0  1  -63   35   233 0.716   2.97
 4.01 Init -  72284  72060  225  2  0   94   92   281 0.861  27.57
 4.00 Prom -  74764  74725   40                              -8.86

 5.00 Prom +  74908  74947   40                              -9.55
 5.01 Sngl +  78152  79255 1104  1  0   72   50  2252 0.999 214.72
 5.02 PlyA +  83908  83913    6                               1.05

 6.05 PlyA -  84143  84138    6                               1.05
 6.04 Term - 100587  99998  590  1  2  107   42  1050 0.910  96.98
 6.03 Intr - 100854 100782   73  0  1   32   91    94 0.880   3.08
 6.02 Intr - 102636 102527  110  1  2  126   73    80 0.622  10.30
 6.01 Init - 103831 103774   58  1  1   78   61   122 0.853   7.97
 6.00 Prom - 108219 108180   40                              -8.76

 7.44 PlyA - 108546 108541    6                               1.05
 7.43 Term - 108730 108583  148  0  1   84   44   256 0.815  18.17
 7.42 Intr - 109325 109183  143  2  2   86   75   170 0.999  14.65
 7.41 Intr - 110574 110433  142  2  1   89   89   211 0.999  21.66
 7.40 Intr - 111511 111347  165  0  0   72   67   217 0.941  17.08
 7.39 Intr - 114106 113851  256  2  1   56   95    60 0.510   0.00
 7.38 Intr - 114552 114369  184  0  1   72    2   135 0.578   2.66
 7.37 Intr - 118454 118422   33  2  0   99   80    29 0.304   1.62
 7.36 Intr - 119361 119244  118  2  1  112   61    18 0.445   1.97
 7.35 Intr - 119583 119489   95  0  2   95   48    36 0.731  -0.94
 7.34 Intr - 120144 119884  261  0  0   16   43   211 0.529   7.08
 7.33 Intr - 121352 121310   43  1  1  114   63    87 0.967   7.04
 7.32 Intr - 121603 121466  138  0  0   93  110   279 0.999  30.18
 7.31 Intr - 122786 122283  504  1  0   91  105   658 0.997  59.89
 7.30 Intr - 123893 123765  129  1  0   63   81    86 0.728   5.21
 7.29 Intr - 124554 124352  203  0  2   69  119    69 0.972   6.18
 7.28 Intr - 124840 124796   45  1  0   84  100    15 0.643   0.91
 7.27 Intr - 127485 127378  108  0  0   79  100   241 0.996  24.88
 7.26 Intr - 127791 127678  114  0  0   44   97    46 0.683   1.74
 7.25 Intr - 128416 128273  144  1  0  104   78   141 0.951  15.18
 7.24 Intr - 129035 128877  159  2  0  102   75   109 0.971  11.18
 7.23 Intr - 129327 129136  192  0  0  101  101   159 0.922  18.19
 7.22 Intr - 131700 131590  111  0  0   48   82   106 0.958   6.58
 7.21 Intr - 131916 131850   67  2  1  104   81    63 0.961   6.11
 7.20 Intr - 132375 132199  177  2  0   67   35   102 0.505   1.83
 7.19 Intr - 137510 137382  129  1  0  121   60   127 0.991  13.01
 7.18 Intr - 138050 137837  214  0  1  100   99   261 0.999  26.17
 7.17 Intr - 139611 139483  129  1  0   52   67   111 0.964   6.07
 7.16 Intr - 140261 140111  151  2  1   48    5   266 0.991  14.04
 7.15 Intr - 140660 140502  159  2  0   35   65   149 0.864   7.48
 7.14 Intr - 141762 141538  225  0  0   82   70   261 0.995  21.88
 7.13 Intr - 142591 142346  246  1  0  101   97   261 0.999  25.96
 7.12 Intr - 144490 144353  138  1  0   86  121   105 0.967  14.26
 7.11 Intr - 146340 145981  360  0  0  101   93   559 0.894  53.12
 7.10 Intr - 146732 146532  201  2  0   38   92   372 0.999  32.08
 7.09 Intr - 147999 147734  266  1  2   74  109   384 0.996  36.33
 7.08 Intr - 148539 148317  223  0  1   99   71   246 0.982  21.70
 7.07 Intr - 149165 149020  146  0  2   60   66   226 0.868  17.60
 7.06 Intr - 149636 149437  200  1  2   86   68   252 0.892  21.99
 7.05 Intr - 149989 149820  170  1  2   70   60   103 0.979   4.44
 7.04 Intr - 150903 150801  103  2  1  108   91   146 0.904  17.08
 7.03 Intr - 153995 153811  185  2  2   60   81   108 0.876   5.89
 7.02 Intr - 156909 156706  204  0  0  104  110   203 0.935  23.50
 7.01 Init - 157167 157105   63  0  0  101   96    -5 0.510   2.94
 7.00 Prom - 160850 160811   40                              -3.46

 8.04 PlyA - 165071 165066    6                               1.05
 8.03 Term - 174196 173538  659  2  2  132   48   543 0.982  48.82
 8.02 Intr - 176025 175764  262  0  1   71  105   443 0.999  41.36
 8.01 Init - 178798 178484  315  1  0   76   99   616 0.872  56.83
 8.00 Prom - 193133 193094   40                              -5.06

 9.04 PlyA - 193653 193648    6                               1.05
 9.03 Term - 194786 194712   75  2  0  106   33    25 0.239  -3.36
 9.02 Intr - 195695 195577  119  0  2   78  113    26 0.843   4.28
 9.01 Intr - 196537 196420  118  1  1  122   93   101 0.832  14.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_1|380_aa XSAPNDILDLRLPPEPVLNANTVCLTLPGLSRRQMEVCVRHPDVAASAIQGIQIAIHECQ HQFRDQRWNCSSLETRNKIPYESPIFSRGFRESAFAYAIAAAGVVHAVSNACALGKLKAC GCDASRRGDEEAFRRKLHRLQLDALQRGKGLSHGVPEHPALPTASPGLQDSWEWGGCSPD MGFGERFSKDFLDSREPHRDIHARMRLHNNRVGRQAVMENMRRKCKCHGTSGSCQLKTCW QVTPEFRTVGALLRSRFHRATLIRPHNRNGGQLEPGPAGAPSPAPGAPGPRRRASPADLV YFEKSPDFCEREPRLDSAGTVGRLCNKSSAGSDGCGSMCCGRGHNILRQTRSERCHCRFH WCCFVVCEECRITEWVSVCK >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_1|1143_bp nngtcagcacccaatgacattctggacctccgcctccccccggagcccgtgctcaatgcc aacacagtgtgcctaacattgccaggcctgagccggcggcagatggaggtgtgtgtgcgt caccctgatgtggctgcctcagccatacagggcatccagatcgccatccacgaatgccaa caccaattcagggaccagcgctggaactgctcaagcctggagactcgcaacaagatcccc tatgagagtcccatcttcagcagaggtttccgagagagcgcttttgcctacgccatcgca gcagctggcgtggtgcacgccgtgtccaatgcgtgtgccctgggcaaactgaaggcctgt ggctgtgatgcgtcccggcgaggggacgaggaggccttccgtaggaagctgcaccgctta caactggatgcactgcagcgtggtaagggcctgagccatggggtcccggaacacccagcc ctgcccacagccagcccaggcctgcaggactcctgggagtggggcggctgcagccccgac atgggcttcggggagcgcttttctaaggactttctggactcccgggagcctcacagagac
atccacgcgagaatgaggcttcacaacaaccgagttgggaggcaggcagtgatggagaac atgcggcggaagtgcaagtgccacggcacgtcaggcagctgccagctcaagacgtgctgg caggtgacgcccgagttccgcaccgtgggggcgctgctgcgcagccgcttccaccgcgcc acgctcatccggccgcacaaccgcaacggcggccagctggagccgggcccagcgggggca ccctcgccggctccgggcgctcccgggccgcgccgacgggccagccccgccgacctggtc tacttcgaaaagtctcccgacttctgcgagcgcgagccgcgcctggactcggcgggcacc gtgggccgcctgtgcaacaagagcagcgccggctcggatggctgcggcagcatgtgctgc ggccgcggccacaacatcctgcgccagacgcgcagcgagcgctgccactgccgcttccac tggtgctgtttcgtggtctgcgaagagtgccgcatcaccgagtgggtcagcgtctgcaag tga >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_2|155_aa MADSRAVAEKLQDKPGTSCCDRKTRRGIPLSKWAVTYVIMFLFKDFYVGRGEIVPQNLRD EAETTGKMGGRSHPGEECALLAHPCHGNGTPPPPRTAAARPGPAGGEKRQLLLGTATRPG LWQRAQAQGGRLELWHQSPSRHPHCLQPVQAPSLT >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_2|468_bp atggctgattccagagctgtggcagagaaattacaagataagcctggaacatcttgctgt gacagaaagacgcggcgaggaattccattgagcaaatgggcggtgacttatgtaatcatg ttcctatttaaggacttttacgttgggcgcggagagatagtgccgcagaacctgcgggac gaagctgagacaactggaaaaatgggagggcgctcccaccctggggaagaatgtgccctc cttgcgcacccttgccatggtaacggcacacccccacccccgcgcaccgcagcggcccgg cctggccccgcgggcggcgagaaaagacagttgctgctgggcacggccactcgtcctggc ttgtggcagagggctcaggcacaaggtgggcggttagagctctggcaccagagcccttct cgacatccccactgcttacaaccagtccaagccccatctctgacttag >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_3|67_aa MDQGAGREAASDGEMSGIGSSPDHVGSNPTRLLLRLSLKARMPRVSASGEKTRYREIGGT YAFTGRK >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_3|204_bp atggaccagggggcaggcagagaggcagcaagtgacggtgagatgagtggcattggcagc agccccgaccatgtgggcagcaaccccacccggcttcttctcagactctcactgaaggcg aggatgcccagggtgagcgccagcggcgagaagacgcgctatcgcgagatcggtgggacc tacgcgtttacaggccggaagtga >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_4|260_aa MGMTCVVQRQISEMNQNISRLQAETEGLKGQGASLEAAIADAEQWGELAIKDANTKLSEL EAAMQRAKQDMARSWKLALDIEIATYRKLLEGEESRLESGMQNVSIHKKTTSGYAGRFKA PSSSMGSGNSSRLSLLLLPSIQQLEVYQFVTCGLSLIVQGCNDCSMVQTQRPPDLRVLRL 
DVFGYKTRKLLLNWLKQAQSRHLASIMIMIMIIIIIITADAKHLPSLLSGRLIPTALLAD GKPLTGSEVWGITKKLVLTM >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_4|783_bp atggggatgacctgcgttgtgcaaagacagatctccgagatgaaccagaacatcagcagg ctccaggctgagactgagggcctcaaaggccagggggcttccctggaggccgccatcgca gatgccgagcagtggggggagctggccattaaagatgccaacaccaagctgtctgagctg gaggccgccatgcagcgggccaagcaggacatggcacgcagctggaagctggccctggac atcgagatcgccacctacaggaagctgctggagggcgaggagagccgtctggagtctggg atgcagaacgtgagtatccataagaagaccaccagtggctatgcaggtaggttcaaggca ccatcctcctcaatgggtagcggcaatagctccagattgagcctactgcttttaccctcc atacagcagctagaagtgtatcagtttgtcacatgtggtctgagtttaatagtccagggc tgcaatgactgttccatggtccaaacacaaaggcctcccgaccttagagttctgcgtctg gatgtttttggctacaaaaccagaaaactccttctcaactggcttaaacaggctcagagt cggcatttagccagcatcatgatcatgatcatgatcatcatcatcatcatcactgcagat gccaaacaccttccatctcttctttctgggaggctcattcccacagctcttctggctgat ggtaaacctttaactggttctgaagtttggggcatcaccaagaagcttgtgttgaccatg tga >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_5|367_aa MGTVLSLSPASSAKGRRPGGLPEEKKKAPPAGDEALGGYGAPPVGKGGKGESRLKRPSVL ISALTWKRLVAASAKKKKGSKKVTPKPASTGPDPLVQQRNRENLLRKGRDPPDGGGTAKP LAVPVPTVPAAAATCEPPSGGSAAAQPPGSGGGKPPPPPPPAPQVAPPVPGGSPRRVIVQ ASTGELLRCLGDFVCRRCYRLKELSPGELVGWFRGVDRSLLLQGWQDQAFITPANLVFVY LLCRESLRGDELASAAELQAAFLTCLYLAYSYMGNEISYPLKPFLVEPDKERFWQRCLRL IQRLSPQMLRLNADPHFFTQVFQDLKNEGEAAASGGGPPSGGAPAASSAARDSCAAGTKH WTMNLDR >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_5|1104_bp atgggcacagtgctgtctctttcgcctgcctcctcggccaagggccggaggcccggcggg ctgcccgaggagaagaagaaggcgccgcccgcgggggacgaggcgctggggggctacggg gcgccgccagtgggcaagggcggcaaaggcgagagccgactcaagcggccgtccgtgctc atctcggcgctcacctggaagcgcctggtggccgcgtccgccaagaagaagaaaggcagc aagaaggtgacacccaagccggcatccacgggccccgaccccctggtccagcaacgcaac cgcgagaaccttctccgcaagggccgggatccccccgacggcggcggcaccgccaagccc ctggcggtgccagtgcccaccgtgcccgcggctgccgccacctgcgagccaccgtcgggg ggcagcgcggccgctcagccgccgggctcgggcgggggaaagcctccgccgccgcctccc 
ccagccccgcaggtggcgccgccggtgcctggcggctcgccgcggcgggtcatcgtgcag gcgtccaccggcgagctgctgcgctgtctgggcgacttcgtgtgccgacgctgctatcgc ctcaaggagctgagcccgggcgagctggtgggctggttccgcggtgtggaccgctcgctg ctgctgcagggctggcaagaccaggccttcattacgcctgcaaacctggtgttcgtgtac ctgctgtgccgcgagtcgctgcgtggggacgagctggcgtcggccgccgagctgcaggcc gccttcctcacctgcctctacctcgcctactcctacatgggcaacgagatctcctaccca ctcaagcccttcctcgtggagcccgacaaggagcgcttctggcagcgctgcctgcgcctc atccagcggctcagcccgcagatgctgcggctcaacgccgacccccacttcttcacgcag gtctttcaagacctcaagaacgagggcgaggccgccgccagcggcgggggcccaccgagc gggggcgcgcccgccgcctcctcggccgccagggacagctgcgcggccggaaccaagcac tggactatgaacctggaccgctag >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_6|276_aa MARAHPPRVSPAAGRGGLPDPVGDGLFKDGKNPSWGPLSPAVQKGEMAGALGGGGEFHKT VHLESYVLFSGSRISIVQKRGSGQIQLWQFLLELLADRANAGCIAWEGGHGEFKLTDPDE VARRWGERKSKPNMNYDKLSRALRYYYDKNIMSKVHGKRYAYRFDFQGLAQACQPPPAHA HAAAAAAAAAAAAQDGALYKLPAGLAPLPFPGLSKLNLMAASAGVAPAGFSYWPGPGPAA TAAAATAALYPSPSLQPPPGPFGAVAAASHLGGHYH >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_6|831_bp atggcccgggcccatcccccgcgcgtctccccggctgcggggcgcggggggctgccggat cccgtcggagacggtctcttcaaggacgggaagaacccgagctgggggccgctgagcccc gcggttcagaaaggtgagatggccggcgcgctgggcggcggaggcgagttccacaaaacc gtgcatctggaaagctacgtgctcttcagtggaagccgcatttccattgtgcaaaagcgc ggcagcggacagatccagctgtggcagtttctgctggagctgctggctgaccgcgcgaac gccggctgcatcgcgtgggagggcggtcacggcgagttcaagctcacggacccggacgag gtggcgcggcggtggggcgagcgcaagagcaagcccaacatgaactacgacaagctgagc cgcgccctgcgctactactacgacaagaacatcatgagcaaggtgcatggcaagcgctac gcctaccgcttcgacttccagggcctggcgcaggcctgccagccgccgcccgcgcacgct catgccgccgccgcagctgctgccgccgccgcggccgcccaggacggcgcgctctacaag ctgcccgccggcctcgccccgctgcccttccccggcctctccaaactcaacctcatggcc gcctcggccggggtcgcgcccgccggcttctcctactggccgggcccgggccccgccgcc accgctgccgccgccaccgccgcgctctaccccagtcccagcttgcagcccccgcccggg cccttcggggccgtggccgcagcctcgcacttggggggccattaccactag >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_7|2396_aa 
MEGPSGVGLVPKSLFGVPSLRLHTQSAPFGLCPKDMMLTQAPSSVVRSRNSRNHTVNSGG SCLSASTVAIPAINDSSAAMSACSTISAQPASSMDTQMHSPKKQERVNKRVIWGIEVAEE LHWKGWELGKETTRNLVLKNRSLKLQKMKYRPPKTKFFFTVIPQPIFLSPGITLTLPIVF RPLEAKEYMDQLWFEKAEGMFCVGLRATLPCHRLICRPPSLQLPMCAVGDTTEAFFCLDN VGDLPTFFTWEFSSPFQMLPATGLLEPGQASQIKVTFQPLTAVIYEVQATCWYGAGSRQR SSIQLQAVAKCAQLLVSIKHKCPEDQDAEGFQKLLYFGSVAVGCTSERQIRLHNPSAVNA PFRIEISPDELAEDQAFSCPTAHGIVLPGEKKCVSVFFHPKTLDTRTVDYCSIMPSGCAS KTLLKVVGFCRGPAVSLQHYCVNFSWVNLGERSEQPLWIENQSDCTAHFQFAIDCLESVF TIRPAFGTLVGKARMTLHCAFQPTHPIICFRRVACLIHHQDPLFLDLMGTCHSDSTKPAI LKPQHLTWYRTHLARGLTLYPPDILDAMLKEKKLAQDQNGALMIPIQDLEDMPAPQYPYI PPMTEFFFDGTSDITIFPPPISVEPVEVDFGACPGPEAPNPVPLCLMNHTKGKIMVVWTR RSDCPFWVTPESCDVPPLKSMAMRLHFQPPHPNCLYTVELEAFAIYKVLQSYSNIEEDCT MCPSWCLTVRARGHSYFAGFEHHIPQYSLDVPKLFPAVSSGEPTYRSLLLVNKDCKLLTF SLAPQRGSDVILRPTSGLVAPGAHQIILICTYPEGSSWKQHTFYLQCNASPQYLKEVSMY SREEPLQLKLDTHKSLYFKPTWVGCSSTSPFTFRNPSRLPLQFEWRVSEQHRKLLAVQPS RGLIQPNERLTLTWTFSPLEETKYLFQVGMWVWEAGLSPNANPAATTHYMLRLVGVGLTS SLSAKEKELAFGNVLVNSKQSRFLVLLNDGNCTLYYRLYLEQGSPEAVDNHPLALQLDRT EGSMPPRSQDTICLTACPKQRSQYSWTITYSLLSHRDNKAGEKQELCCVSLVAVYPLLSI LDVSSMGSAEGITRKHLWRLFSLDLLNSYLERDPTPCELTYKVPTRHSMSQIPPVLTPLR LDFNFGAAPFKAPPSVVFLALKNSGVVSLDWAFLLPSDQRIDVELWAEQAELNSTELHQM RVQDNCLFSISPKAGSLSPGQEQMVELKYSHLFIGTDHLPVLFKVSHGREILLNFIGVTV KPEQKYVHFTSTTHQFIPIPIGDTLPPRQIYELYNGGSVPVTYEVQTDVLSQVQEKNFDH PIFCCLNPKGEIQPGSTARVLWIFSPIEAKTYTVDVPIHILGWNSALIHFQGVGYNPHMM GDTAPFHNISSWDNSSIHSRLVVPGQNVFLSQSHISLGNIPVQSKCSRLLFLNNISKNEE IAFSWQPSPLDFGEVSVSPMIGVVAPEETVPFVVTLRASVHASFYSADLVCKLYSQQLMR QYHKELQEWKDEKVRQEVEFTITDMKVKKRTCCTACEPARKYKTLPPIKNQQSVSRPASW KLQTPKEEVSWPCPQPPSPGMLCLGLTARAHATDYFLANFFSEFPCHFLHRELPKRKAPR EESETSEEKSPNKWGPVSKQKKQLLVDILTTIIRGLLEDKNFHEAVDQSLVEQVPYFRQF WNEQSTKFMDQKNSLYLMPILPVPSSSWEDGKGKQPKEDRPEHYPGLGKKEEGEEEKGEE EEEELEEEEEEEEETEEEELGKEEIEEKEEERDEKEEKVSWAGIGPTPQPESQESMQWQW QQQLNVMVKEEQEQDEKEAIRRLPAFANLQEALLENMIQNILVEASRGEVVLTSRPRVIA LPPFCVPRSLTPDTLLPTQQAEVRAEASGALCSTELAEDQDQEITEGDRQAPGPPLPPRD 
EPLAQTGPERFVRSARVRQGRPLSTSPGAGLIATQAPAAATATAISTVRGLGPPQPPVPV PGLLVMLGKMQALTELEPRLSLKTSETVRLTEKDKPPGTAVHHLPQTSESHILASISWGG GGVSQCRAGLQLSGQRGLLLPSGGRIQNCRIEMGEGGKARSDSSPLTPARAGDTEGKDWD TKGPELARAPLCAPALSNPELQRWESKLPRPPRQLPLENFLYTPFPVDIWPTPDHLVRSL VQEVLLSAYYASSTVLGLRITTENKTQSLPLQMELTLCGMSSAPAPGPAPASLTLWDEED FQGRRCRLLSDCANVCERGGLPRVRSVKVENGVWVAFEYPDFQGQQFILEKGDYPRWSAW SGSSSHNSNQLLSFRPVLCANHNDSRVTLFEGDNFQGCKFDLVDDYPSLPSMGWASKDVG SLKVSSGAWVAYQYPGYRGYQYVLERDRHSGEFCTYGELGTQAHTGQLQSIRRVQH >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_7|7191_bp atggaggggcctagtggggtgggccttgtccccaagtccctctttggagtgccttccctg aggctccatactcagagtgctccctttggactgtgtcccaaggacatgatgctcacccag gctccaagctccgtcgtgaggtccaggaacagcaggaaccacaccgtgaactctggtgga tcctgcctgagtgccagcacagtggccatccctgccatcaacgacagcagtgcagccatg agtgcctgcagcaccatcagcgcccagcccgcaagctccatggacactcagatgcactcc ccaaagaagcaggagagagtgaacaagagggtcatctggggcattgaggtggctgaggag ctgcattggaaaggctgggagctaggaaaggagaccacaaggaatctggttctgaaaaat cgatccttgaaactccagaagatgaagtacaggccccccaagaccaagttcttcttcacg gtcatccctcagcccatcttcctgagcccaggcataaccctcacgctccccatcgtcttc cggcctctggaggcgaaggagtacatggaccagctgtggtttgagaaagcggaggggatg ttctgtgtcggcctacgggccaccctgccctgccacaggctgatctgccgcccaccatcc ctgcagctgcccatgtgtgctgtgggagatacgactgaggcctttttctgcctggataat gtgggggacctgcccaccttcttcacctgggagttctccagcccattccagatgctgccc gccacggggctcctggagccaggccaggcctctcagatcaaggtgacctttcagcccctt acagccgtcatctacgaggtgcaggccacgtgctggtacggggcgggcagccggcagagg agcagcatccagctgcaggctgtggccaagtgcgcccagctgctggtgagcataaagcac aagtgcccggaggaccaggatgccgagggcttccagaagctgttgtactttggctctgtt gctgtgggctgcacctcggagaggcagatcaggctacacaacccgtcggcggtaaatgcc cccttcaggattgaaatttccccggatgaactggccgaagaccaggccttctcatgcccc acggcccatggcatcgtgcttccgggagagaagaaatgtgtgtcggtgttcttccacccc aagactctggacaccagaactgtggactactgctccatcatgccttctggctgtgcctcc aagaccctgcttaaagtcgttggtttctgtagaggccctgctgtgtccctgcagcactac tgtgtcaacttcagctgggtcaaccttggggagcgctccgagcagcccctgtggattgag 
aaccaatcggactgcacggcccacttccagtttgccatcgactgcttggagagtgtcttt accatcaggcctgcctttgggacgctggtgggcaaggcccgtatgaccctgcactgtgcc ttccagcccactcaccccatcatctgctttcggcgtgtggcctgtctcatccaccaccag gacccactgttcctggacctgatggggacctgccactcggacagcaccaagccagccatc ctgaagcctcagcacctcacctggtaccgcacacacctggcccggggcctgacgctctac ccccctgacatcctggatgccatgctgaaggagaagaagctggcacaggaccagaacggg gctctcatgattcccatccaggatctggaggacatgccggccccgcagtacccttatatc ccccccatgaccgagttcttcttcgacggcaccagcgacataaccatcttccccccgccc atcagtgtagagcctgtcgaggtagacttcggtgcctgcccagggcctgaggcccccaac cctgtacccctgtgcctgatgaaccacaccaagggcaagatcatggtggtctggacgcga aggtctgactgccccttctgggtgactccagagagctgcgacgtgcccccactcaagtcc atggccatgcgcctgcacttccagccgcctcaccccaactgcctttacacggtggagctc gaagccttcgccatctataaggtcctgcagagctacagtaatattgaggaggactgcacc atgtgcccatcctggtgcctgacggtgcgggcacgaggccacagctatttcgctggcttt gagcaccacatcccccagtattccctagatgtccccaagctatttccagcagtgtcctcc ggtgagcccacctaccgcagcctgctcctggtcaacaaagactgcaagctgctgaccttc agcctggccccccagagaggctcagacgtcatccttcggcccacttcgggccttgtggca cccggggcccaccagatcatcctcatctgcacctaccctgagggcagctcctggaagcag cacactttctatctgcagtgcaatgcttccccccagtatctcaaggaggtgagcatgtac agccgggaggagccactgcagctgaagctggacacccacaaaagcctctacttcaagccc acctgggtgggctgctcctccaccagccccttcaccttccgcaacccctcgcgtctgccc ctgcagttcgagtggagggtctctgagcagcatcgaaagctgctggctgtccagccctcc agggggctaatccagcccaacgagagacttacgctgacgtggaccttcagccctttggag gagaccaagtacctgttccaagtggggatgtgggtctgggaagccggcctgtccccaaat gccaaccccgctgccaccacccactacatgctccggctggtgggcgttgggctcaccagc agcctctctgcaaaggaaaaggagctggcctttgggaatgtgctggtgaacagcaagcag tccaggttccttgtcctcctgaatgacggcaactgcaccctctattaccgcctctacctg gagcagggcagccctgaggccgttgacaaccaccccctcgctctgcagctggaccgaaca gaggggagcatgccaccccggtcccaggacaccatctgcctgactgcctgtcccaagcag cggtcccagtactcctggaccatcacctactctctcctttcccacagagataacaaggct ggggagaagcaggagctgtgctgcgtctccctggtggccgtgtaccccttgctttccatc ctggatgtcagctccatgggcagtgctgagggtatcacccggaagcacctgtggcgcctc 
ttctctctggacctgcttaacagttacttggagcgtgaccccaccccctgtgagctcacc tacaaggtgcccacccggcacagcatgagccagatcccccccgtcctcacccctttaagg cttgacttcaatttcggggccgcaccattcaaggccccaccttccgtggtattcctggcc ctgaagaacagcggagtggtgtccctggactgggccttcctccttccaagtgaccagcgg attgacgtggagctctgggcagagcaagcagagttgaattccactgagctccaccagatg cgcgtgcaggacaattgcctcttctccatcagccccaaggctgggagcctgagtcctggg caggagcagatggtggagttaaaatacagccacctgttcatcggtactgatcacctccca gtgctcttcaaggtgtcccatggccgggagatcctgctaaatttcataggtgtgacagtg aagccggagcagaagtatgtgcacttcacctctactacccaccagttcatccccattccc attggtgacacgctacccccacggcagatttatgagctgtataatggtggctcagtgccc gtgacatatgaggtccagaccgatgtcctgtcacaggttcaggaaaaaaattttgatcac cccatcttttgctgcctcaaccccaaaggggagatccagccaggcagcactgcccgggtc ttgtggatcttctcacctatcgaggccaagacctacacggtggacgtgcccatacacatc ctgggatggaactcggccctcatccacttccagggagtgggctacaacccccatatgatg ggggacacagccccattccacaacatctcctcgtgggacaacagttccatacactctagg ctggtggtgcctggacagaatgtcttcctgtcccagtctcatatttccctgggaaacata cctgtgcagagcaagtgcagccgcctgctcttcctcaacaacatctccaagaacgaggaa attgccttctcctggcagccaagtcctctagattttggggaggtgtctgtgagtcccatg ataggggtggtggctcctgaagagacggtcccatttgtggtgaccttgagggcctctgtg catgccagcttctacagtgcagacctggtatgcaagctgtactcgcagcagctcatgagg cagtatcacaaggagctgcaggagtggaaggacgagaaggtgcggcaggaagtggagttc accatcaccgacatgaaagtgaagaagagaacatgctgcacagcctgtgaacctgcgagg aagtacaagacactgcctcccatcaagaaccagcagtctgtcagccggcctgccagctgg aaactgcagaccccaaaggaggaggtgtcctggccctgcccccagccaccctcgccaggc atgctctgcctgggccttactgcccgagcccatgccaccgactactttctggctaacttc ttctcagagtttccctgccactttttgcaccgggagctgccaaagaggaaggcccccagg gaagagtcagagacttctgaggaaaaatcccctaacaagtggggccctgtttccaagcag aagaagcagctcctggttgacattctcaccacaataatcaggggcctgctggaagacaag aacttccatgaggctgtggaccaaagcctggtggagcaggtgccgtacttccgccaattc tggaatgagcagtcaactaagttcatggaccagaaaaacagcctgtacttaatgccaatc ctgcctgtaccctccagcagctgggaggatgggaagggcaagcagccgaaggaagacaga ccagagcactatccagggttgggaaagaaggaagagggggaggaggagaagggtgaagag 
gaagaagaagagttggaggaggaagaggaggaagaagaggagacagaagaggaggagttg ggcaaggaggagatagaggagaaggaggaggagagggatgagaaggaagagaaagtgagc tgggcgggcatcgggcccacaccacagcctgagtcccaggagtccatgcaatggcagtgg caacagcagctgaatgtcatggtgaaggaggagcaagaacaggacgagaaggaggccatc agaaggctcccggccttcgccaacctgcaggaggcgctgctggagaacatgatccagaac atcctggtggaggcgagccgcggggaggtggtactcacctcgcggccacgcgtcatcgcc ctgccgccgttctgcgtgcccaggagtctgaccccggacacgctgctgccgacgcagcaa gcagaggtgagggcggaggctagcggggcgctgtgcagcactgagctcgcggaagaccag gaccaggagatcaccgagggcgaccgccaggccccgggccctccgctcccgccccgcgac gagcccctcgcacaaaccggacctgagcgttttgttcgttcggctcgcgtgaggcagggg cggcctctcagcaccagcccgggggccggcctgatcgccacgcaggcacctgccgccgcc accgccaccgccatctcaaccgtacgggggctaggccctccccagcctcctgtcccggtt cctgggctcctggtcatgctggggaagatgcaggccctcacggagttggagccaaggctg agtctaaaaacgtctgagacagtcagactgactgaaaaggacaagcccccaggcacagcg gttcaccaccttcctcaaacctcagaatcccacatcctcgcttccatcagctggggtggg ggcggggtctcccagtgccgggcaggcctgcagctttcgggccagcgcggcctgctcctg ccctctggtggccgaatccagaattgtcggatagagatgggggaaggagggaaggcgaga tctgattcttcacccctcacccctgcccgggctggtgacactgaaggcaaagactgggac accaagggtccagaactggctcgtgccccactctgtgctcctgctctcagcaacccagaa cttcagaggtgggagagcaagctgccaagacccccccgccaacttccattggagaatttt ctctacaccccttttcccgtggacatttggcccacgcctgaccaccttgttcgttcactc gttcaagaagttttattgagtgcctactatgcatccagcactgtgctaggtttgaggata acaactgagaacaaaacacagtccctgccccttcaaatggagcttacactctgcggcatg agcagcgcccccgcgccgggcccggcgcccgccagcctcacgctctgggacgaggaggac ttccagggccgtcgctgtcggctgctaagcgactgtgcgaacgtctgcgagcgcggaggc ctgcccagggtgcgctcggtcaaggtggaaaacggcgtttgggtggcctttgagtacccc gacttccagggacagcagttcattctggagaagggagactatcctcgctggagcgcctgg agtggcagcagcagccacaacagcaaccagctgctgtccttccggccagtgctctgcgcg aaccacaatgacagccgtgtgacactgtttgagggggacaacttccaaggctgcaagttt gacctcgttgatgactacccatccctgccctccatgggctgggccagcaaggatgtgggt tccctcaaagtcagctccggagcgtgggtggcctaccagtacccaggctaccgaggctac cagtatgtgttggagcgggaccggcacagcggagagttctgtacttacggtgagctcggc 
acacaggcccacactgggcagctgcagtccatccggagagtccagcactag >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_8|411_aa MSPARLRPRLHFCLVLLLLLVVPAAWGCGPGRVVGSRRRPPRKLVPLAYKQFSPNVPEKT LGASGRYEGKIARSSERFKELTPNYNPDIIFKDEENTGADRLMTQRCKDRLNSLAISVMN QWPGVKLRVTEGWDEDGHHSEESLHYEGRAVDITTSDRDRNKYGLLARLAVEAGFDWVYY ESKAHVHCSVKSEHSAAAKTGGCFPAGAQVRLESGARVALSAVRPGDRVLAMGEDGSPTF SDVLIFLDREPHRLRAFQVIETQDPPRRLALTPAHLLFTADNHTEPAARFRATFASHVQP GQYVLVAGVPGLQPARVAAVSTHVALGAYAPLTKHGTLVVEDVVASCFAAVADHHLAQLA FWPLRLFHSLAWGSWTPGEGVHWYPQLLYRLGRLLLEEGSFHPLGMSGAGS >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_8|1236_bp atgtctcccgcccggctccggccccgactgcacttctgcctggtcctgttgctgctgctg gtggtgccggcggcatggggctgcgggccgggtcgggtggtgggcagccgccggcgaccg ccacgcaaactcgtgccgctcgcctacaagcagttcagccccaatgtgcccgagaagacc ctgggcgccagcggacgctatgaaggcaagatcgctcgcagctccgagcgcttcaaggag ctcacccccaattacaatccagacatcatcttcaaggacgaggagaacacaggcgccgac cgcctcatgacccagcgctgcaaggaccgcctgaactcgctggctatctcggtgatgaac cagtggcccggtgtgaagctgcgggtgaccgagggctgggacgaggacggccaccactca gaggagtccctgcattatgagggccgcgcggtggacatcaccacatcagaccgcgaccgc aataagtatggactgctggcgcgcttggcagtggaggccggctttgactgggtgtattac gagtcaaaggcccacgtgcattgctccgtcaagtccgagcactcggccgcagccaagacg ggcggctgcttccctgccggagcccaggtacgcctggagagtggggcgcgtgtggccttg tcagccgtgaggccgggagaccgtgtgctggccatgggggaggatgggagccccaccttc agcgatgtgctcattttcctggaccgcgagcctcacaggctgagagccttccaggtcatc gagactcaggaccccccacgccgcctggcactcacacccgctcacctgctctttacggct gacaatcacacggagccggcagcccgcttccgggccacatttgccagccacgtgcagcct ggccagtacgtgctggtggctggggtgccaggcctgcagcctgcccgcgtggcagctgtc tctacacacgtggccctcggggcctacgccccgctcacaaagcatgggacactggtggtg gaggatgtggtggcatcctgcttcgcggccgtggctgaccaccacctggctcagttggcc ttctggcccctgagactctttcacagcttggcatggggcagctggactccgggggagggt gtgcattggtacccccagctgctctaccgcctggggcgtctcctgctagaagagggcagc ttccacccactgggcatgtccggggcagggagctga >gi568815596r:218881670_219085075|GENSCAN_predicted_peptide_9|103_aa KLPEACSIGDGKPFVMNLQDLYMAVTTQEVQVGQKHQGAGDPHTSNSASLQGIDSQCVNQ 
PEQLVSSAPTLSAPEKESTGTSGPLQRPQLSKVKRKKPRGLFS >gi568815596r:218881670_219085075|GENSCAN_predicted_CDS_9|312_bp aaactgccagaggcatgcagcattggtgatggaaagccctttgtcatgaatctgcaggat ctgtatatggcagtcaccacacaagaggtccaagtgggacagaagcatcaaggcgctgga gatcctcatacctcaaacagtgcttccctgcaaggaatcgatagccaatgtgtaaaccag ccagaacaactggtctcctcagccccaaccctctcagcacctgagaaagagtccacgggt acttcaggccctctgcagagacctcagctgtcaaaggtcaagaggaagaagccaaggggt ctcttcagttaa