GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:51:15 Sequence gi568815583r:43261076_43469913 : 208838 bp : 42.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 362 473 112 2 1 69 36 89 0.370 1.26 1.02 Intr + 4062 4112 51 2 0 100 94 48 0.932 4.79 1.03 Term + 5967 6059 93 2 0 82 53 138 0.970 6.55 1.04 PlyA + 6388 6393 6 1.05 2.02 PlyA - 6774 6769 6 1.05 2.01 Sngl - 7259 6948 312 2 0 68 55 289 0.929 19.28 2.00 Prom - 11371 11332 40 -0.05 3.18 PlyA - 11766 11761 6 1.05 3.17 Term - 15695 15380 316 1 1 59 53 280 0.823 15.03 3.16 Intr - 15920 15787 134 2 2 102 83 131 0.990 12.52 3.15 Intr - 18202 18042 161 2 2 51 57 113 0.690 3.29 3.14 Intr - 18876 18550 327 1 0 129 96 345 0.903 34.15 3.13 Intr - 21011 20769 243 0 0 112 48 290 0.948 23.95 3.12 Intr - 21545 21442 104 1 2 97 89 103 0.992 10.20 3.11 Intr - 23877 23739 139 1 1 84 81 185 0.997 16.00 3.10 Intr - 25200 25126 75 1 0 69 49 74 0.482 0.17 3.09 Intr - 26382 26205 178 0 1 87 105 141 0.982 14.27 3.08 Intr - 26594 26466 129 2 0 -18 91 107 0.328 0.37 3.07 Intr - 29518 28950 569 2 2 75 63 155 0.081 3.27 3.06 Intr - 31879 31634 246 2 0 94 91 172 0.843 14.61 3.05 Intr - 32556 32374 183 1 0 123 117 232 0.997 28.44 3.04 Intr - 41590 41456 135 2 0 93 70 71 0.298 5.42 3.03 Intr - 47608 47425 184 1 1 65 74 55 0.082 0.24 3.02 Intr - 52327 52221 107 2 2 94 93 31 0.104 3.21 3.01 Init - 65307 65238 70 0 1 52 67 62 0.121 1.76 3.00 Prom - 66281 66242 40 -7.35 4.02 PlyA - 66726 66721 6 1.05 4.01 Sngl - 69414 67354 2061 0 0 85 49 1221 0.978 111.88 4.00 Prom - 69969 69930 40 -9.75 5.00 Prom + 70610 70649 40 -6.25 5.01 Init + 73133 73138 6 1 0 89 113 10 0.664 3.96 5.02 Intr + 74621 74751 131 1 2 69 89 129 0.964 9.67 5.03 Intr + 75551 75611 61 2 1 90 95 50 0.966 3.72 5.04 Intr + 79190 79343 154 1 1 52 59 120 0.763 4.22 5.05 Intr + 79734 79824 91 1 1 74 33 31 0.000 -5.67 5.06 Intr + 84779 84905 127 2 1 72 93 76 0.998 6.26 5.07 Intr + 85941 86021 81 2 0 79 84 45 0.811 2.12 5.08 Intr + 87832 87970 139 0 1 94 82 65 0.738 5.62 5.09 Term + 98628 98815 188 1 2 68 48 96 0.023 0.27 5.10 PlyA + 99270 99275 6 -0.45 6.08 PlyA - 99966 99961 6 1.05 6.07 Term - 100866 99998 869 1 2 52 46 681 0.625 52.14 6.06 Intr - 103307 102840 468 0 0 80 83 590 0.998 49.85 6.05 Intr - 105733 105035 699 2 0 69 83 795 0.858 67.62 6.04 Intr - 108052 107848 205 1 1 54 77 153 0.993 8.65 6.03 Intr - 108822 108521 302 1 2 -2 89 229 0.459 9.53 6.02 Intr - 109896 109483 414 1 0 -23 70 415 0.799 21.75 6.01 Init - 111857 111791 67 2 1 43 92 33 0.747 0.49 6.00 Prom - 112320 112281 40 -8.25 7.00 Prom + 113747 113786 40 -9.45 7.01 Init + 114950 115053 104 1 2 31 44 150 0.710 4.66 7.02 Intr + 115055 115151 97 2 1 83 94 105 0.909 9.69 7.03 Intr + 115449 115550 102 2 0 73 94 97 0.523 8.25 7.04 Intr + 119009 119088 80 1 2 65 106 35 0.640 0.43 7.05 Intr + 122228 122429 202 2 1 86 77 175 0.754 14.57 7.06 Intr + 124716 124881 166 2 1 67 70 160 0.995 10.71 7.07 Intr + 125131 125255 125 2 2 126 68 159 0.995 17.08 7.08 Intr + 134032 134082 51 0 0 66 90 65 0.629 2.69 7.09 Intr + 134508 134613 106 2 1 65 46 71 0.795 -0.53 7.10 Intr + 136139 136246 108 0 0 103 47 115 0.987 8.24 7.11 Intr + 136963 137104 142 2 1 76 87 97 0.795 6.89 7.12 Intr + 138969 139146 178 0 1 102 62 69 0.992 4.70 7.13 Intr + 140641 140775 135 0 0 52 95 60 0.811 2.94 7.14 Intr + 142608 142724 117 2 0 96 100 23 0.835 4.04 7.15 Intr + 143338 143477 140 0 2 96 91 112 0.566 10.64 7.16 Term + 145780 145816 37 1 1 110 39 12 0.071 -5.67 7.17 PlyA + 145995 146000 6 1.05 8.19 PlyA - 146169 146164 6 1.05 8.18 Term - 146495 146308 188 0 2 116 43 165 0.990 11.47 8.17 Intr - 147013 146868 146 0 2 105 77 106 0.996 10.21 8.16 Intr - 148021 147822 200 1 2 88 119 110 0.995 11.33 8.15 Intr - 148666 148572 95 2 2 67 94 141 0.999 11.36 8.14 Intr - 152259 152044 216 1 0 121 86 267 0.890 27.35 8.13 Intr - 154734 154519 216 1 0 83 119 283 0.999 28.45 8.12 Intr - 155341 155150 192 2 0 71 82 211 0.999 17.34 8.11 Intr - 159660 159230 431 2 2 104 95 458 0.999 40.53 8.10 Intr - 160099 159950 150 0 0 67 82 89 0.699 4.56 8.09 Intr - 161051 160780 272 2 2 107 46 175 0.989 10.62 8.08 Intr - 167093 166941 153 2 0 107 75 182 0.959 18.15 8.07 Intr - 171602 171119 484 1 1 86 94 384 0.986 30.90 8.06 Intr - 177341 177249 93 1 0 34 115 57 0.435 1.16 8.05 Intr - 180508 180451 58 2 1 85 91 70 0.672 4.02 8.04 Intr - 184733 184635 99 0 0 27 72 89 0.466 0.36 8.03 Intr - 185515 185312 204 2 0 119 106 196 0.999 22.75 8.02 Intr - 186410 186291 120 0 0 119 97 54 0.990 8.95 8.01 Intr - 196143 194817 1327 0 1 107 89 1102 0.485 98.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 81919 82000 82 2 1 104 82 83 0.921 7.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_1|85_aa XPRAQCPGSLDADNSQRKFHSQPSGASAKTQTDLTSLQGQNVMSSTLGDEKIKVQVFLGS VAELKGGVEADELSQPYAVWAFDSS >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_1|258_bp nngccacgtgctcagtgccctgggagcttggatgctgacaacagtcaaagaaagtttcat tcccaaccttctggagcctctgctaaaacacagactgacctcactagtctccaaggacaa aatgtcatgtcatccacacttggagatgagaaaatcaaggttcaggtttttcttggcagt gttgctgaattgaaaggaggtgtggaggctgatgaactgtcacagccttatgccgtgtgg gcctttgatagctcttga >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_2|103_aa MIPQNQKAIANYLKSWNETLTSRLATLPENPPAIDWTYYKANVAKAGLVDDFEKKFNVLK LPVPEDKYTAQVDAEEKEDGKTCAEWVSLSKARLENMRNSWRR >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_2|312_bp atgataccccaaaaccaaaaggccattgctaattatctgaaatcctggaatgagaccctc acctccaggctggctactttacctgagaatccaccagctattgactggacttactacaag gccaatgtggccaaggcaggcttggtggatgactttgagaagaagtttaatgtcctgaag cttcctgtaccagaggataaatatactgcccaggtggatgctgaagaaaaagaagatggg aaaacttgtgctgagtgggtgtctctctcaaaggccaggttggagaatatgagaaacagc tggagaagatga >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_3|1099_aa MTESWETGLITAGKTSGKAISGPDLHMCECPAGQRSSGAEHERVLMELRHRAISSHFRKI ILAVVWCMNHMGPKCSNVRHRPGEVMVAWPAGGWDRDGKKERESDYVQPEELTKFGYNII ALSYPTPIWSGEARPNWLRSNRVPGLEVTEAQGNSFKPSFAIHITLATLRLESVDLQSSR NNKEHHTQEMGVKRLTVRRGQPFYLRLSFSRPFQSQNDHITFVAETGPKPSELLGTRATF FLTRVQPGNVWSASDFTIDSNSLQVSLFTPANAVIGHYTLKIEISQGQGHSVTYPLGTFI LLFNPWSPELEKATLKFIWNPKRACIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYW YQNRDIDQWNRTEPSERMLHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLD PFLTPYTKINSRWIKDLNVRPETIKTLEENLGNTIQDIGMGKDFMSKTPKEMATKAKIDK WDLIKLKSFCTARETTIRFEEDIIDICFEILNKSLYHLKNPAKDCSQRNDVVYVCRVVSA MINSNDDNGVLQGNWGEDYSKGVSPLEWKGSVAILQQWSARGGQPVKYGQCWVFASVMCT GARVTGPLRTGGAAPAKLPMATSSPVMRCLGVPTRVVSNFRSAHNVDRNLTIDTYYDRNA EMLSTQKRDKIWNFHVWNECWMIRKDLPPGYNGWQVLDPTPQQTSSGLFCCGPASVKAIR EGDVHLAYDTPFVYAEVNADEVIWLLGDGQAQEILAHNTSSIGKEISTKMVGSDQRQSIT SSYKYPEGSPEERAVFMKASRKMLGPQRASLPFLDLLESGGLRDQPAQLQLHLARIPEWG QDLQLLLRIQRVPDSTHPRGPIGLVVRFCAQALLHGGGTQKPFWRHTVRMNLDFGKETQW PLLLPYSNYRNKLTDEKLIRVSGIAEVEETGRSMLVLKDICLEPPHLSIEVSERAEVGKA LRVHVTLTNTLMVALSSCTMVLEGSGLINGQIAKDASLPPLSHPPQERLSSVVVVSGSLL PTQRGTRSPLPPAGSPRGSHRLFLTFSLGTLVAGHTLQIQLDLYPTKAGPRQLQVLISSN EVKEIKGYKDIFVTVAGAP >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_3|3300_bp atgactgagtcctgggaaacagggcttattacagctgggaagacatctggcaaggcgatc agtgggccagatttgcacatgtgtgagtgcccagcaggtcagagaagctctggggcagag catgagcgagtcctgatggagttgagacacagggctatcagctcgcacttcaggaagatc attctggctgttgtgtggtgtatgaatcatatgggaccaaaatgcagcaatgtgaggcac agaccaggtgaggtgatggtggcctggccagctggagggtgggacagagatggaaagaag gagagggaatcagattacgtgcagcctgaagaactaacaaagtttggttataacataatt gctctttcttacccaacacccatatggtcaggggaagccagaccgaactggcttagatcg aatcgggtcccaggccttgaagtgacagaggcacaagggaacagcttcaagccaagcttt gccattcacatcactttggcaaccttgcggcttgagtctgtcgacctgcagagctccagg aacaacaaggagcaccacacgcaggagatgggcgtcaagcggctcactgtgcgccgcggc cagcccttctacctccggctgagcttcagccgacccttccagtcccagaacgaccacatc acctttgtggctgagaccggacccaagccgtcagagctgctggggacccgagccacattc ttcctcacccgggtccagcccgggaatgtctggagcgcttctgatttcaccattgactcc aactctctccaagtttcccttttcacaccagccaatgcagttattggccattacactctg aaaatagagatctctcagggccaaggtcacagtgtgacttacccgctgggaactttcatc ctactttttaacccttggagtccagaattggaaaaagctactttaaagttcatatggaac ccaaaaagagcctgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatc acgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaagaatgctgcat atctacaaccatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattcc ctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaattggac cccttccttacaccttatacaaaaattaattcaagatggattaaagacttaaatgttaga cctgaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatg ggcaaggacttcatgtctaaaaccccaaaagaaatggcaacaaaagccaaaatcgacaaa tgggatctaattaaactaaagagcttctgcacagcaagagaaactaccatcagatttgaa gaggacatcatagacatctgctttgagatcctgaacaagagcctgtatcacttaaagaac ccggccaaagactgttcccagcggaacgacgtggtgtatgtgtgcagggtggtgagtgcc atgatcaacagcaacgatgacaatggcgtgctgcaggggaactggggcgaggactactcc aaaggggtcagtcctctggagtggaagggcagtgtggccatcctacagcagtggtcagcc aggggcgggcagcctgtgaagtacggacagtgctgggtcttcgcctctgttatgtgcacc ggcgctcgagttacagggccactgcgcactgggggagctgcccctgccaagcttcctatg gccacttccagcccagtaatgagatgcttaggtgttccaacccgtgttgtttccaatttc cgttccgcgcacaacgtggataggaacttgaccatcgatacgtactatgaccgaaatgcc gagatgctgtcaactcagaaacgagacaaaatatggaacttccacgtctggaatgagtgc tggatgatccggaaagatctcccaccaggatacaacgggtggcaggttctggaccccact ccccagcagaccagcagtgggctgttctgctgtggccctgcctctgtgaaggccatcagg gaaggggatgtccacctggcctatgacaccccttttgtgtatgccgaggtgaacgccgat gaagtcatttggctccttggggatggccaggcccaggaaatcctggcccacaacaccagt tccatcgggaaggagatcagcactaagatggtggggtcagaccagcgccagagcatcacc agctcctacaagtacccagaaggatcccctgaggagagagctgtcttcatgaaggcttct cggaaaatgctgggcccccaaagagcttctttgcccttcctggatctcctggagtctggg ggtcttagggatcagccagcgcagctgcagcttcacctggccaggatacccgagtggggc caggacctgcagctgctgctgcgtatccagagggtgccagacagcacccaccctcggggg cccatcggactggtggtgcgcttctgtgcacaggccctgctgcatgggggtggtacccag aagcccttctggaggcacacagtgcggatgaacctggactttgggaaggagacacagtgg ccgctcctcctgccctacagcaattacagaaacaagctaacggacgaaaagctcatccgc gtgtctggcatcgcggaggttgaagagacagggaggtccatgctggtcctaaaagatatc tgtctggagcctccccacttgtctattgaggtgtctgagagggctgaggtgggcaaggcg ctgagagtccatgtcaccctcaccaacaccttaatggtggctctgagcagctgcacgatg gtgctggaaggaagcggcctcatcaatgggcagatagcaaaggatgcctctctgccccca ctttctcaccctcctcaggaaaggctgagctcagtggtggtggtgagtgggagcctctta cccacccagaggggaaccagatcacccctgccgccagcaggaagtcctcgaggctctcac aggctgtttctgactttcagccttgggactctggtggccggacacaccctccaaattcaa ctggacctctacccgaccaaagctggaccccgccagctccaggttctcatcagcagcaac gaggtcaaggagatcaaaggctacaaggacatcttcgtcactgtggctggggctccctga >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_4|686_aa MGPRSRERRAGAVQNTNDSSALSKRSLAARGYVQDPFAALLVPGAARRAPLIHRGYYVRA RAVRHCVRAFLEQIGAPQAALRAQILSLGAGFDSLYFRLKTAGRLARAAVWEVDFPDVAR RKAERIGETPELCALTGPFERGEPASALCFESADYCILGLDLRQLQRVEEALGAAGLDAA SPTLLLAEAVLTYLEPESAAALIAWAAQRFPNALFVVYEQMRPQDAFGQFMLQHFRQLNS PLHGLERFPDVEAQRRRFLQAGWTACGAVDMNEFYHCFLPAEERRRVENIEPFDEFEEWH LKCAHYFILAASRGDTLSHTLVFPSSEAFPRVNPASPSGVFPASVVSSEGQVPNLKRYGH ASVFLSPDVILSAGGFGEQEGRHCRVSQFHLLSRDCDSEWKGSQIGSCGTGVQWDGRLYH TMTRLSESRVLVLGGRLSPVSPALGVLQLHFFKSEDNNTEDLKVTITKAGRKDDSTLCCW RHSTTEVSCQNQEYLFVYGGRSVVEPVLSDWHFLHVGTMAWVRIPVEGEVPEARHSHSAC TWQGGALIAGGLGASEEPLNSVLFLRPISCGFLWESVDIQPPITPRYSHTAHVLNGKLLL VGGIWIHSSSFPGVTVINLTTGLSSEYQIDTTYVPWPLMLHNHTSILLPEEQQLLLLGGG GNCFSFGTYFNPHTVTLDLSSLSAGQ >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_4|2061_bp atgggcccccggagccgtgagcgtcgggcaggcgcggtacagaacaccaacgacagcagc gccctcagcaagcgttccctggccgcgcgcgggtacgtgcaggacccctttgccgcgttg ctggttccgggcgcggcgcgccgcgcaccgctcattcaccgaggctactacgtccgcgca cgcgccgtgaggcactgcgtgcgcgcttttttggagcagattggcgcgccccaggccgcg cttcgcgcgcagatcttgtctctcggcgctggcttcgactcgctctattttcgcttaaaa accgcgggccgcctggcccgggctgcagtctgggaggtggattttccggacgtggcgcgg cgcaaagcagaaaggattggagagacgccagagctgtgcgcgttaaccgggcctttcgag aggggggagcccgcgtccgcgctgtgctttgagagcgcagactactgcatcctgggtctg gacttgcggcagctccagcgagtggaggaggccctgggcgccgcggggctcgacgcagcc tcacccactctgctcctggccgaggcggtgctgacctacctcgagccggagagtgccgcg gccctcatcgcctgggcagcccagcgttttcctaatgcccttttcgtggtctatgagcag atgaggcctcaagacgcctttggccagttcatgctgcaacattttcggcagctaaactcc cccctgcatggcctggagcgttttcctgacgtggaggcgcagcggcgccgcttccttcaa gctggctggaccgcctgcggtgccgtggacatgaatgaattctatcactgctttcttccc gcagaagaacgccggcgggtggaaaatattgaaccctttgacgaatttgaggagtggcat ctgaagtgcgcccattatttcattctggcagcttctaggggagacaccctctcccacacc ctagtgtttccatcctcagaggcatttcctcgcgtaaatcctgcttcgccttcaggggta ttccctgccagcgtagtcagtagcgagggccaggtcccaaacctgaagagatatggccac gcctctgtcttcttgagcccagacgttattctcagtgcaggaggatttggagagcaggag gggcggcactgccgagtgagccagtttcacttgctctcaagagattgtgactctgaatgg aaaggcagccaaataggcagttgtgggactggagttcagtgggatggacgcctttatcac accatgacaagactctcagagagtcgggttctggttctgggagggagactgtccccagta agtccagccttgggggttctccagcttcatttttttaagagtgaggataataacactgag gacctgaaagtgacaataacaaaggctggccgaaaggatgattccactttgtgttgttgg cggcattcaacaacagaagtgtcctgtcagaatcaggaatatttgtttgtgtatgggggt cgaagcgtggtggaacctgtactaagtgactggcatttcctccatgtagggacaatggct tgggtcaggatcccagtggagggagaagtacctgaagcccggcattctcacagtgcctgc acttggcaagggggagcccttattgctggaggtctcggggcttctgaggagccattgaac tctgtgctctttctgagaccaatctcttgtggattcctctgggagtcagtagacatccag cctcccattaccccaaggtactcccacacagctcatgtgctcaatggaaagctgttactg gttggagggatctggattcattcctcctcatttcctggagtgactgtgatcaatttgact acaggattgagctctgagtatcagattgacacaacatatgtgccatggccattaatgtta cacaaccatactagtatccttcttcctgaagagcaacagctcctgctccttggaggtggt gggaactgcttttcctttggtacctacttcaacccccatacagtcacattagacctttct tccttaagtgctgggcagtaa >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_5|325_aa MKELHAHLNGSISSHTMKKLIAQKPDLKIHDQMTVIDKGKKRTLEECFQMFQTIHQLTSS PEDILMVTKDVIKEFADDGVKYLELRSTPRRENATGRIYTSQQMYCPFPMCFDHSHVFVF TLRKGYLLLMFALLQAEADSCFLHHGHLYLIAVDRRGGPLVAKETVKLAEEFFLSTEGTV LGLDLSGDPTVGQAKDFLEPLLEAKKAGLKLALHLSEIPNQKKETQILLDLLPDRIGHGT FLNSGEGGSLDLVDFVRQHRIPLAALLCGTSGPQETDEIQMRLDRKKTNLNPVNVLFYSN PSSRTQKKLRNPGFTLQYKVFIVSA >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_5|978_bp atgaaggaacttcatgcccacttgaatggatccattagttctcataccatgaagaaatta atagcccagaagccagatcttaaaatccacgatcagatgactgtgattgacaagggaaag aaaagaactttggaagaatgtttccagatgtttcaaactattcatcagcttactagtagc cctgaagatattctaatggtcacaaaagatgtcataaaagaatttgcagatgacggcgtc aagtacctggaactaaggagcacacccagaagagaaaatgctactggtagaatttatact tctcagcaaatgtactgtccttttcctatgtgctttgatcactcacatgtttttgttttc actctgagaaaaggatacctgctccttatgtttgcattgttacaagcagaggctgattct tgcttcctgcatcatgggcacttgtatttgatagcagttgacagaagaggtggcccttta gtagccaaggagactgtaaaacttgccgaggagttcttcctttctactgagggtacagtt cttggccttgacctcagtggagaccctactgtaggacaagcaaaagacttcttggaacct cttttagaagctaagaaagcaggtctgaagttagcattgcatctttcagagattccaaac caaaaaaaagaaacacaaatactcctggatctgcttcctgacagaatcgggcatggaaca tttctcaactccggtgagggaggatccctggatctggtggactttgtgaggcaacatcgg ataccactggctgcgttgctctgtggaacttctggaccacaagagacagacgaaattcag atgagactggacagaaagaaaaccaacctaaatcctgtaaatgtattattctattccaac ccttcttctaggactcagaagaaactaagaaatccagggttcacacttcagtataaggtt ttcattgtttctgcataa >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_6|1007_aa MNEEDHWSSLPGAIYAGERLKTGSDPEWVRLARRQGCEGVEGQLERETLPGVGWRAPSGR LSRVRGLWRDRTTAPGYPAGGSGDKRGQNPAAAAPWAESSASANRQHLEPLADTRARPQE ARYYCSSDSVSPAQCDSPQAPMVLALEPKPASLLARGFPTALRENGTNSETFRQRFRRFH YQEVAGPREAFSQLWELCCRWLRPEVRTKEQIVELLVLEQFLTVLPGEIQNWVQEQCPEN GEEAVTLVEDLEREPGRPRSSVTVSVKGQEVRLEKMTPPKSSQELLSVRQESVEPQPRGV PKKERARSPDLGPQEQMNPKEKLKPFQRSGLPFPKSGVVSRLEQGEPWIPDLLGSKEKEL PSGSHIGDRRVHADLLPSKKDRRSWVEQDHWSFEDEKVAGVHWGYEETRTLLAILSQTEF YEALRNCHRNSQVYGAVAERLREYGFLRTLEQCRTKFKGLQKSYRKVKSGHPPETCPFFE EMEALMSAQVIALPSNGLEAAASHSGLVGSDAETEEPGQRGWQHEEGAEEAVAQESDSDD MDLEATPQDPNSAAPVVFRSPGGVHWGYEETKTYLAILSETQFYEALRNCHRNSQLYGAV AERLWEYGFLRTPEQCRTKFKSLQTSYRKVKNGQAPETCPFFEEMDALVSVRVAAPPNDG QEETASCPVQGTSEAEAQKQAEEADEATEEDSDDDEEDTEIPPGAVITRAPVLFQSPRGF EAGFENEDNSKRDISEEVQLHRTLLARSERKIPRYLHQGKGNESDCRSGRQWAKTSGEKR GKLTLPEKSLSEVLSQQRPCLGERPYKYLKYSKSFGPNSLLMHQVSHQVENPYKCADCGK SFSRSARLIRHRRIHTGEKPYKCLDCGKSFRDSSNFITHRRIHTGEKPYQCGECGKCFNQ SSSLIIHQRTHTGEKPYQCEECGKSFNNSSHFSAHRRIHTGERPHVCPDCGKSFSKSSDL RAHHRTHTGEKPYGCHDCGKCFSKSSALNKHGEIHAREKLLTQSAPK >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_6|3024_bp atgaatgaagaagaccactggtcttcgctcccaggggcaatctatgcgggagaaaggcta aaaacagggtcggaccccgagtgggtgaggctggcccggaggcaaggatgcgagggggtt gaggggcagctcgagagggagacgctcccaggggtgggctggagggcgccctcagggcgc cttagccgggtcagaggcctctggagggaccgcacgacggctccgggttacccggccgga gggagcggagacaaaagaggccagaatcccgcagccgctgctccttgggctgagagctct gcgtccgcgaaccggcagcacctggagcctctggccgacacccgagcccgtcctcaggaa gcccgttactactgctcctcggattccgtctctcctgcccaatgtgactctccccaagcc cccatggtcctcgctttggagccaaaaccagcttctctccttgcgcgaggttttccgaca gctctaagagagaatggcactaactctgagaccttccgacagcgtttcaggagattccat taccaggaggtggctgggccgcgggaggctttcagccaactctgggaactttgctgtcgg tggctaaggccggaggtgcgcaccaaggagcagattgtagaactgttggtgctagagcag ttcctgaccgtcttacctggggagatccagaattgggtacaggaacaatgtccagaaaat ggagaggaggcagtgactctcgtggaagatttagaaagagagcctggaagacctagatct tcggtcacagtctctgtgaaggggcaggaagtgcgcttggagaagatgacacccccgaaa tcatcacaagagttattaagtgttcggcaggagtcagtggaaccccagcccaggggtgta cccaagaaagagagggcaagaagcccagacctgggaccacaggagcagatgaacccaaag gagaagctcaaaccttttcaaaggagcggattgccatttcctaaatccggtgtggtctcc aggttggagcaaggagagccatggatcccagatctgctgggctctaaggagaaagaactt ccaagtggcagccacataggagacagacgagtgcatgctgatctgttaccatccaagaaa gatagaagaagctgggtggaacaggatcactggagctttgaagatgagaaggtggcaggt gtgcactggggctatgaagagaccagaacgctcctcgcaattctcagccagactgagttt tatgaggctctcagaaactgccataggaacagccaagtgtatggggctgtggctgagcgg ctcagggaatatggcttcctccggaccctggaacagtgtcggaccaagttcaaaggtctc cagaagagctatcggaaagtcaagagcggccacccacctgagacctgccccttctttgaa gagatggaagccctgatgagtgctcaggtcattgccctgcccagtaatggcctggaagca gcagcctctcactctggcctggtaggcagcgatgctgagactgaagagccagggcagagg ggctggcagcatgaggagggagcagaagaggctgtggctcaggagtctgacagtgatgac atggatctagaggcgaccccccaggaccccaactcagctgcacctgttgtgttcagaagc ccaggtggtgtacactggggctatgaagagaccaagacttaccttgcaattcttagtgag acccagttttatgaagccctccggaactgtcaccgcaacagccagctgtatggagcagtg gctgagaggttatgggaatatggctttcttaggaccccagaacagtgtcggaccaagttt aaaagcctgcaaaccagctatcggaaagttaagaatggccaggcaccagagacctgtccc ttctttgaagagatggatgctttggtgagtgtccgggttgctgccccacccaatgatggc caggaagagactgcttcttgccccgtccaggggaccagtgaggctgaagctcagaagcaa gctgaggaagcagacgaggccacagaggaagattctgatgatgatgaagaggatactgag atacccccaggggctgtcataacccgtgctccagtgttattccaaagcccccgtggtttt gaagctggatttgagaatgaagataattcaaaacgggatatttctgaggaagtacaactg cataggacattacttgcaagatctgaaaggaaaattccccggtatcttcatcagggtaaa ggcaatgagagtgactgtagatcaggaagacagtgggcaaagacctcaggggagaaaaga ggaaaactgacactcccggagaagagcttaagtgaagtcctaagtcaacagagaccttgc ttgggagagagaccctataaatatctcaaatacagcaaaagctttggtccaaactccctt ctcatgcatcaggtatcccaccaggtggaaaatccatataaatgtgctgattgtgggaaa agcttcagtcggagtgcacgactcattagacaccggagaatccacactggagagaaacct tataaatgtcttgactgtggaaaaagtttccgtgacagttcaaatttcatcacccatagg agaatccacacaggagagaaaccttatcaatgtggtgagtgtgggaaatgcttcaatcag agctcaagccttatcattcaccagagaacccacacaggagaaaagccctatcaatgtgaa gagtgtggaaaaagcttcaataacagttctcattttagtgcacatcggaggatacacaca ggagagagaccccatgtgtgtcctgactgtggaaagagtttcagtaagagttctgactta cgtgcacatcatagaacccacacaggagagaaaccctatgggtgtcatgactgtggtaag tgcttcagtaaaagctctgcccttaataagcacggagaaatccatgcacgggaaaagctt ctgacacagtcagctcccaagtaa >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_7|629_aa MKRLKHLKVGAATGGVFGSVPLPAGIAGLPFPPPHETSVLNRLCRLGTDYIRFTEFIEQY TGHVQQQGQGGLHGIYLRAFCTGLDSVLQPYRQALLDLEQEIHGCQILETVYKHSCGGLP PVRSALEKILAVCHGVMYKQLSAWMLHGLLLDQHEEFFIKQGPSSGNVSAQPEEDEEDLG IGGLTGKQLRELQDLRLIEEENMLAPSLKQFSLRVEILPSYIPVRVAEKILFVGESVQMF ENQNVNLTRKGSILKNQEDTFAAELHRLKQQPLFSLVDFEQVVDRIRSTVAEHLWKLMVE ESDLLGQLKIIKDFYLLGRGELFQAFIDTAQHMLKTPPTAVTEHDVNVAFQQSAHKVLLD DDNLLPLLHLTIEYHGKEHKADATQAREGPSRETSPREAPASGWAALGLSYKVQWPLHIL FTPAVLEKYNVVFKYLLSVRRVQAELQHCWALQMQRKHLKSNQTDAIKWRLRNHMAFLVD NLQYYLQVDVLESQFSQLLHQINSTRDFESIRLAHDHFLSNLLAQSFILLKPVFHCLNEI LDLCHSFCSLVSQNLGPLDERGAAQLSILVKGFSRQSSLLFKILSSVRNHQINSDLAQLL LRLDYNKYYTQAGGTLGSSKSWGVPTGEL >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_7|1890_bp atgaagcgtttgaagcacctgaaagttggggcagcgactggcggtgttttcggcagtgtt cctcttcctgcaggtatcgcaggacttccctttcctccaccccatgagaccagtgtcctg aatcgactctgccggctcggcacagactatattcgcttcactgagttcattgaacagtac acgggccatgtgcaacagcagggccaaggtgggttacatggaatctacctgcgggccttc tgcacagggctggattctgttttgcagccttatcgccaagcactgcttgatttggaacaa gagattcatggttgtcaaatcctggaaacagtctacaaacacagctgtggggggttgcct cctgttcgaagtgcactggaaaaaatcctggccgtttgtcatggggtcatgtataaacag ctctcagcctggatgctccatggactcctcttggaccagcatgaagaattctttatcaaa caggggccatcttctggtaatgtcagtgcccagccagaagaggacgaggaggatctgggc attgggggactgacaggaaaacaactgagagaactgcaggacttgcgcctgattgaggaa gagaacatgctggcaccatctctgaagcagttttccctacgagtggagattttgccatcc tacattccagtgagggttgctgaaaaaatcctatttgttggagaatctgtccagatgttt gagaatcaaaatgtgaacctgactagaaaaggatccattttgaaaaaccaggaagacact tttgctgcagagctgcaccgtctcaagcagcagccactcttcagcttggtggactttgaa caggtggtggatcgcattcgcagcactgtggctgagcatctctggaagttgatggtagaa gaatccgatttactgggtcagctgaagatcattaaagacttttaccttctgggacgtgga gaactgtttcaggccttcattgacacagctcaacacatgttgaaaacaccacccactgca gtaactgagcatgatgtgaatgtggcctttcaacagtcagcacacaaggtattgctagat gatgacaaccttctccctctgttgcacttgacaatcgagtatcacggaaaggagcacaaa gcagatgctactcaggcaagagaagggccttctcgggaaacttctccccgggaagcccct gcatctggctgggcagccctaggtctttcctacaaagtacagtggccactacatattctc ttcaccccagctgtcctggaaaagtacaatgttgtttttaagtacttactgagtgtgcgc cgggtgcaagctgagctgcagcactgctgggccctacaaatgcagcgcaagcacctcaag tcgaaccagactgatgcaatcaagtggcgcctaagaaatcacatggcatttttggtggat aatcttcagtactatctccaggtagatgtgttggagtctcagttctcccagctgcttcat cagatcaattctacccgagactttgaaagcatccgattggctcatgaccacttcctgagc aatttgctggctcaatcctttatcctattgaaacctgtgtttcactgcctgaatgaaatc ctagatctctgtcacagtttttgttcgctggtcagtcagaacctaggcccactggatgag cgtggagccgcccagctgagcattctcgtgaagggctttagccgccagtcttcactcctg ttcaagattctctccagtgttcggaatcatcagatcaactcagatttggctcaactactg ttacgactagattataacaaatactatacccaggctggtggaactctgggcagctctaag agttggggagtacccacaggtgagctgtga >gi568815583r:43261076_43469913|GENSCAN_predicted_peptide_8|1547_aa DIFIPSPSLEEQSNDGKKDGDMHSSSLTVECSKTSEIEPKNSPEDLGLSLTGDSCKLMLS TSEYSQSPKMESLSSHRIDEDGENTQIEDTEPMSPVLNSKFVPAENDSILMNPAQDGEVQ LSQNDDKTKGDDTDTRDDISILATGCKGREETVAEDVCIDLTCDSGSQAVPSPATRSEAL SSVLDQEEAMEIKEHHPEEGSSGSEVEEIPETPCESQGEELKEENMESVPLHLSLTETQS QGLCLQKEMPKKECSEAMEVETSVISIDSPQKLAILDQELEHKEQEAWEEATSEDSSVVI VDVKEPSPRVDVSCEPLEGVEKCSDSQSWEDIAPEIEPCAENRLDTKEEKSVEYEGDLKS GTAETEPVEQDSSQPSLPLVRADDPLRLDQELQQPQTQEKTSNSLTEDSKMANAKQLSSD AEAQKLGKPSAHASQSFCESSSETPFHFTLPKEGDIIPPLTGATPPLIGHLKLEPKRHST PIGISNYPESTIATSDVMSESMVETHDPILGSGKGDSGAAPDVDDKLCLRMKLVSPETEA SEESLQFNLEIFNQKVRSLVVGLEMRDEFEERVGRTMVPGEYGKPATGERKNGSTAVAES VASPQKTMSVLSCICEARQENEARSEDPPTTPIRGNLLHFPSSQGEEEKEKLEGDHTIRQ SQQPMKPISPVKDPVSPASQKMVIQGPSSPQGEAMVTDVLEDQKEGRSTNKENPSKALIE RPSQNNIGIQTMECSLRVPETVSAATQTIKNVCEQGTSTVDQNFGKQDATVQTERGSGEK PVSAPGDDTESLHSQGEEEFDMPQPPHGHVLHRHMRTIREVRTLVTRVITDVYYVDGTEV ERKVTEETEEPIVECQECETEVSPSQTGGSSGDLGDISSFSSKASSLHRTSSGTSLSAMH SSGSSGKGAGPLRGKTSGTEPADFALPSSRGGPGKLSPRKGVSQTGTPVCEEDGDAGLGI RQGGKAPVTPRGRGRRGRPPSRTTGTRETAVPGPLGIEDISPNLSPDDKSFSRVVPRVPD STRRTDVGAGALRRSDSPEIPFQAAAGPSDGLDASSPGNSFVGLRVVAKWSSNGYFYSGK ITRDVGAGKYKLLFDDGYECDVLGKDILLCDPIPLDTEVTALSEDEYFSAGVVKGHRKES GELYYSIEKEGQRKWYKRMAVILSLEQGNRLREQYGLGPYEAVTPLTKAADISLDNLVEG KRKRRSNVSSPATPTASSSSSTTPTRKITESPRASMGVLSGKRKLITSEEERSPAKRGRK SATVKPGAVGAGEFVSPCESGDNTGEPSALEEQRGPLPLNKTLFLGYAFLLTMATTSDKL ASRSKLPDGPTGSSEEEEEFLEIPPFNKQYTESQLRAGAGYILEDFNEAQCNTAYQCLLI ADQHCRTRKYFLCLASGIPCVSHVWVHDSCHANQLQNYRNYLLPAGYSLEEQRILDWQPR ENPFQNLKVLLVSDQQQNFLELWSEILMTGGAASVKQHHSSAHNKDIALGVFDVVVTDPS CPASVLKCAEALQLPVVSQEWVIQCLIVGERIGFKQHPKYKHDYVSH >gi568815583r:43261076_43469913|GENSCAN_predicted_CDS_8|4644_bp gacatttttattccttccccaagtctggaagaacaatcaaatgatgggaagaaagatgga gatatgcatagttcatctttgacagttgagtgttctaaaacttcagagattgaaccaaag aattcccctgaggatcttgggctatctttgacaggggattcttgcaagttgatgctttct acaagtgaatatagtcagtccccaaagatggagagcttgagttctcacagaattgatgaa gatggagaaaacacacagattgaggatacggaacccatgtctccagttctcaattctaaa tttgttcctgctgaaaatgatagtatcctgatgaatccagcacaggatggtgaagtacaa ctgagtcagaatgatgacaaaacaaagggagatgatacagacaccagggatgacattagt attttagccactggttgcaagggcagagaagaaacggtagcagaagatgtttgtattgat ctcacttgtgattcggggagtcaggcagttccgtcaccagctactcgatctgaggcactt tctagtgtgttagatcaggaggaagctatggaaattaaagaacaccatccagaggagggg tcttcagggtctgaggtggaagaaatccctgagacaccttgtgaaagtcaaggagaggaa ctcaaagaagaaaatatggagagtgttccgttgcacctttctctgactgaaactcagtcc caagggttgtgtcttcaaaaggaaatgccaaaaaaagaatgctcagaagctatggaagtt gaaaccagtgtgattagtattgattcccctcaaaagttggcaatacttgaccaagaattg gaacataaggaacaggaagcttgggaagaagctacttcagaggactccagtgttgtcatt gtagatgtgaaagagccatctcccagagttgatgtttcttgtgaacctttggagggagtg gagaagtgctcagattcccagtcatgggaggatattgctccagaaatagaaccatgtgct gagaatagattagacaccaaggaagaaaagagtgtagaatatgaaggagatctgaaatca gggactgcagaaacagaacctgtagagcaagattcttcacagccttccttacctttagtg agagcagatgatcctttaagacttgaccaggagttgcagcagccccaaactcaggagaaa acaagtaattcattaacagaagactcaaaaatggctaatgcaaagcagctaagctcagat gcagaggcccagaagctggggaagccctctgcccatgcctcacaaagcttctgtgaaagt tctagtgaaaccccatttcatttcactttgcctaaagaaggtgatatcatcccaccattg actggtgcaaccccacctcttattgggcacctaaaattggagcccaagagacacagtact cctattggtattagcaactatccagaaagcaccatagcaaccagtgatgtcatgtctgaa agcatggtggagacccatgatcccatacttgggagtggaaaaggggattctggggctgcc ccagacgtggatgataaattatgtctaagaatgaaactggttagtcctgagactgaggcg agtgaagagtctttgcagttcaacctggaaatctttaaccagaaagtgaggtcattggta gtggggctggagatgagggatgaatttgaggagcgagtaggcagaacgatggtgccaggt gaatatggaaagcctgcaactggtgaaagaaaaaatggatctactgctgttgctgagtct gttgccagtccccagaagaccatgtctgtgttgagctgtatctgtgaagccaggcaagag aatgaggctcgaagtgaggatccccccaccacacccatcagggggaacttgctccacttt ccaagttctcaaggagaagaggagaaagaaaaattggagggtgaccatacaatcaggcag agtcaacagcctatgaagcccattagtcctgtcaaggaccctgtttctcctgcttcccag aagatggtcatacaagggccatccagtcctcaaggagaggcaatggtgacagatgtgcta gaagaccagaaagaaggacggagtactaataaggaaaatcctagtaaggccttgattgaa aggcccagccaaaataacataggaatccaaaccatggagtgttccttgagggtcccagaa actgtttcagcagcaacccagactataaagaatgtgtgtgagcaggggaccagtacagtg gaccagaactttggaaagcaagatgccacagttcagactgagagggggagtggtgagaaa ccagtcagtgctcctggggatgatacagagtcgctccatagccagggagaagaagagttt gatatgcctcagcctccacatggccatgtcttacatcgtcacatgagaacaatccgggaa gtacgcacacttgtcactcgtgtcattacagatgtgtattatgtggatggaacagaagta gaaagaaaagtaactgaggagactgaagagccaattgtagagtgtcaggagtgtgaaact gaagtttccccttcacagactgggggctcctcaggtgacctgggggatatcagctccttc tcctccaaggcatccagcttacaccgcacatcaagtgggacaagtctctcagctatgcac agcagtggaagctcagggaaaggagccggaccactcagagggaaaaccagcgggacagaa cccgcagattttgccttacccagctcccgaggaggcccaggaaaactgagtcctagaaaa ggggtcagtcagacagggacgccagtgtgtgaggaggatggtgatgcaggccttggcatc agacagggagggaaggctccagtcacgcctcgtgggcgtgggcgaaggggccgcccacct tctcggaccactggaaccagagaaacagctgtgcctggccccttgggcatagaggacatt tcacctaacttgtcaccagatgataaatccttcagccgtgtcgtgccccgagtgccagac tccaccagacgaacagatgtgggtgctggtgctttgcgtcgtagtgactctccagaaatt cctttccaggctgctgctggcccttctgatggcttagatgcctcctctccaggaaatagc tttgtagggctccgtgttgtagccaagtggtcatccaatggctacttttactctgggaaa atcacacgagatgtcggagctgggaagtataaattgctctttgatgatgggtacgaatgt gatgtgttgggcaaagacattctgttatgtgaccccatcccgctggacactgaagtgacg gccctctcggaggatgagtatttcagtgcaggagtggtgaaaggacataggaaggagtct ggggaactgtactacagcattgaaaaagaaggccaaagaaagtggtataagcgaatggct gtcatcctgtccttggagcaaggaaacagactgagagagcagtatgggcttggcccctat gaagcagtaacacctcttacaaaggcagcagatatcagcttagacaatttggtggaaggg aagcggaaacggcgcagtaacgtcagctccccagccacccctactgcctccagtagcagc agcacaacccctacccgaaagatcacagaaagtcctcgtgcctccatgggagttctctca ggcaaaagaaaacttatcacttctgaagaggaacggtcccctgccaagcgaggtcgcaag tctgccacagtaaaacctggtgcagtaggggcaggagagtttgtgagcccctgtgagagt ggagacaacaccggtgaaccctctgccctggaagagcagagagggcctttgcctctcaac aagaccttgtttctgggctacgcatttctccttaccatggccacaaccagtgacaagttg gccagccgctccaaactgccagatggtcctacaggaagcagtgaagaagaggaggaattt ttggaaattcctcctttcaacaagcagtatacagaatcccagcttcgagcaggagctggc tatatccttgaagatttcaatgaagcccagtgtaacacagcttaccagtgtcttctaatt gcggatcagcattgtcgaacccggaagtacttcctgtgccttgccagtgggattccttgt gtgtctcatgtctgggtccatgatagttgccatgccaaccagctccagaactaccgtaat tatctgttgccagctgggtacagccttgaggagcaaagaattctggactggcaaccccgt gaaaatcctttccagaatctgaaggtactcttggtatcagaccaacagcagaacttcctg gagctctggtctgagatcctcatgactggtggtgcagcctctgtgaagcagcaccattca agtgcccataacaaagatattgctttaggggtatttgatgtggtggtgacggacccctca tgcccagcctcggtgctgaagtgtgctgaagcattgcagctgcctgtggtgtcacaagag tgggtgatccagtgcctcattgttggggagagaattggattcaagcagcatccaaaatat aaacacgattatgtttctcactaa