GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:04:00

Sequence gi568815593f:149994302_150195045 : 200744 bp : 49.19% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    446    441    6                               1.05
 1.01 Sngl -   2047    482 1566  1  0   62   38   986 0.901  86.38
 1.00 Prom -   2684   2645   40                              -8.46

 2.00 Prom +   4993   5032   40                              -5.96
 2.01 Init +   5184   5186    3  2  0   76  101     0 0.569   0.10
 2.02 Intr +   6188   6878  691  1  1   60  114   473 0.749  38.20
 2.03 Intr +  10550  10688  139  0  1   94   71    96 0.940   8.02
 2.04 Intr +  12172  12346  175  1  1   55   95   194 0.999  16.74
 2.05 Intr +  15810  16307  498  2  0  110  110   461 0.985  43.68
 2.06 Intr +  17954  18052   99  1  0  104   65    31 0.738   2.71
 2.07 Intr +  24265  24396  132  0  0   93  116   -11 0.834   3.04
 2.08 Intr +  29961  30379  419  2  2   80  110   297 0.980  23.92
 2.09 Intr +  32405  32580  176  2  2   87   79     5 0.895  -0.92
 2.10 Intr +  32719  32816   98  2  2   79  116    57 0.983   7.33
 2.11 Intr +  36440  36538   99  1  0   55  111    84 0.990   7.71
 2.12 Intr +  38153  38302  150  1  0   64   78    92 0.986   6.16
 2.13 Intr +  42335  42636  302  1  2   73   80   355 0.828  28.73
 2.14 Intr +  43099  43226  128  1  2   59   94   136 0.998  11.62
 2.15 Intr +  46447  46578  132  2  0  115   98   122 0.999  16.52
 2.16 Intr +  47484  47668  185  1  2  102  107   235 0.999  26.31
 2.17 Intr +  51165  51384  220  2  1   86   95   257 0.998  24.07
 2.18 Intr +  53323  53456  134  2  2  106  113   153 0.997  19.96
 2.19 Intr +  54268  54384  117  0  0  104  110    85 0.953  12.96
 2.20 Intr +  55951  56160  210  0  0  105   89   113 0.998  12.21
 2.21 Term +  57424  57891  468  0  0  103   46   723 0.998  64.57
 2.22 PlyA +  58809  58814    6                               1.05

 3.22 PlyA -  58821  58816    6                               1.05
 3.21 Term -  59923  59768  156  1  0  107   55   225 0.999  19.03
 3.20 Intr -  60129  60021  109  2  1   89  100    97 0.941  11.29
 3.19 Intr -  61035  60936  100  1  1  112   66    99 0.999   9.27
 3.18 Intr -  61836  61725  112  0  1   31   85   149 0.945   8.85
 3.17 Intr -  62040  61918  123  0  0   60  101   200 0.999  19.28
 3.16 Intr -  63083  62986   98  0  2  107   82   132 0.998  14.23
 3.15 Intr -  63291  63203   89  2  2  103  100    83 0.995  10.61
 3.14 Intr -  65561  65399  163  0  1   92   68   177 0.999  15.13
 3.13 Intr -  66671  66561  111  0  0   83   94   219 0.999  22.35
 3.12 Intr -  67381  67190  192  2  0   77   65   166 0.475  12.66
 3.11 Intr -  67548  67422  127  0  1  138   77   311 0.995  35.25
 3.10 Intr -  74029  73914  116  2  2  125   99   202 0.997  25.17
 3.09 Intr -  75762  75572  191  2  2   75   94   222 0.999  20.73
 3.08 Intr -  76001  75881  121  0  1  121   97    70 0.999  10.75
 3.07 Intr -  76270  76155  116  0  2  106   76    35 0.940   4.19
 3.06 Intr -  79192  79000  193  2  1   90   81   189 0.911  16.95
 3.05 Intr -  83134  82975  160  1  1   74   87   184 0.892  16.46
 3.04 Intr -  83938  83811  128  2  2   30   86   182 0.394  12.60
 3.03 Intr -  86035  85751  285  2  0  102  100   345 0.999  34.41
 3.02 Intr -  86723  86466  258  0  0  116  100   250 0.995  26.43
 3.01 Init -  92126  92078   49  2  1   91   95   111 0.987  11.22
 3.00 Prom -  93870  93831   40                              -8.26

 4.00 Prom +  95476  95515   40                              -3.16
 4.01 Sngl + 100001 100747  747  1  0  110   45   855 0.999  79.59
 4.02 PlyA + 100785 100790    6                               1.05

 5.25 PlyA - 102352 102347    6                               1.05
 5.24 Term - 106843 106792   52  0  1   79   43    66 0.147  -1.90
 5.23 Intr - 119162 118960  203  2  2  107   86   142 0.671  13.98
 5.22 Intr - 121645 121546  100  0  1  119   49   142 0.823  13.51
 5.21 Intr - 123486 123317  170  0  2   83   91   182 0.683  16.74
 5.20 Intr - 124551 124446  106  2  1   97   95    89 0.611  10.72
 5.19 Intr - 125265 125166  100  1  1  116   76   186 0.998  19.37
 5.18 Intr - 125822 125711  112  2  1  102   59   154 0.896  13.85
 5.17 Intr - 126709 126587  123  1  0   64   75   196 0.997  16.68
 5.16 Intr - 127021 126861  161  2  2   69   90   197 0.788  17.71
 5.15 Intr - 127739 127579  161  1  2   82   68   358 0.999  32.83
 5.14 Intr - 128900 128741  160  0  1  109   81   289 0.999  29.35
 5.13 Intr - 130059 129949  111  1  0   76   71   154 0.589  12.85
 5.12 Intr - 130530 130426  105  1  0   29   82   114 0.971   5.09
 5.11 Intr - 131276 131144  133  2  1   80   52   306 0.997  26.42
 5.10 Intr - 132313 132219   95  2  2   88  121   130 0.993  15.98
 5.09 Intr - 135667 135456  212  0  2   82   74   415 0.988  38.06
 5.08 Intr - 136361 136238  124  0  1   85  105   108 0.729  11.84
 5.07 Intr - 137793 137678  116  2  2  143   19   159 0.719  14.59
 5.06 Intr - 138641 138449  193  0  1   92  105   259 0.999  26.55
 5.05 Intr - 139707 139285  423  1  0   58   62   656 0.620  53.84
 5.04 Intr - 140715 140449  267  1  0   92   84   267 0.999  24.20
 5.03 Intr - 141577 141254  324  2  0  100  110   381 0.989  37.25
 5.02 Intr - 142752 142707   46  0  1  118   98    37 0.931   5.68
 5.01 Init - 143909 143793  117  2  0   86   85    47 0.378   2.79
 5.00 Prom - 150894 150855   40                              -7.76

 6.00 Prom + 156218 156257   40                              -7.76
 6.01 Init + 159094 159138   45  0  0   74   99    12 0.170   1.58
 6.02 Intr + 160089 160164   76  2  1   31   93    53 0.235  -0.81
 6.03 Intr + 163097 163264  168  0  0   95   80    31 0.513   2.92
 6.04 Intr + 166604 166715  112  0  1  103   87    26 0.912   3.44
 6.05 Term + 167416 167524  109  1  1  129   49    57 0.805   3.88
 6.06 PlyA + 168046 168051    6                               1.05

 7.00 Prom + 171389 171428   40                              -9.26
 7.01 Init + 172576 173020  445  0  1   92   91   492 0.940  46.08
 7.02 Intr + 175484 175572   89  0  2   22   87    28 0.272  -4.21
 7.03 Intr + 181054 181126   73  0  1   72   23   154 0.043   6.28
 7.04 Intr + 185932 186050  119  2  2   72   78    39 0.457   1.48
 7.05 Intr + 186097 186236  140  0  2   73  100    47 0.477   3.66
 7.06 Intr + 187955 188157  203  2  2   98   28    13 0.478  -4.77
 7.07 Intr + 188467 188612  146  2  2   99   89   333 0.969  34.50
 7.08 Term + 189173 189379  207  1  0  117   41   298 0.999  25.34
 7.09 PlyA + 190175 190180    6                               1.05

 8.00 Prom + 190198 190237   40                              -2.96
 8.01 Init + 194296 194369   74  0  2   95   44    43 0.562   1.18
 8.02 Intr + 195720 196059  340  0  1   80   97   185 0.601  14.08
 8.03 Intr + 200427 200610  184  2  1   93   64   351 0.921  32.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 181054 181155  102  0  0   72   49   178 0.899  10.58

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_1|521_aa
MANKGNKKRRQFSLEEKMKVVGAVDSGKRKGDVAKEFGITPSTLSTFLKDRTKFEEKVRE
ASVGPQRKRMRSALYDDIDKAVFAWFQEIHAKNILVTGSVIRKKALNLANMLGYDNFQAS
VGWLNRFRDRHGIALKAVCREDSDRLMNGLGIDKINEWHAGEIIKLIADYSPDDIFNADE
TGVFFQLLPQHTLAAKGDHCRGGKKAKQRLTALFCCNASGTEKMRPLIVGRSASPHCLKN
IHSLPCDYRANQWAWMTRDLFNEWLMQVDARMKRAERRILLLIDNCSAHNMLPHLERIQV
GYLPSNCTAVLQPLNLGIIHTMKVLYQSHLLKQILLKLNSSEDQEEVDIKQAIDMIAAAW
WSVKPSTVVKCWQKAGIVPMEFAECDTESAASEPDIAIEKLWHTVAIATCVPNEVNFQDF
VTADDDLIISQDTDIIQDMVAGENTSEAGSEDEGEVSLPEQPKVTITEAISSVQKLRQFL
STCVDIPDAIFGQLNGIDEYLMKRVTQTLIDSKITDFLQTK

>gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_1|1566_bp
atggcaaacaaggggaacaagaagcgtcggcagttctctctggaggagaaaatgaaagtt
gtgggagctgtagactcaggcaagaggaaaggtgatgtggcaaaagaatttggtatcact
ccctctactttatctacattcttaaaggatcgcaccaaatttgaagaaaaggtgcgggag
gcatccgtgggaccccagcggaaaaggatgaggagcgctctttatgatgacattgataag
gctgtttttgcttggtttcaagaaatccatgccaaaaacattcttgtgactggttctgtc
attcggaaaaaagcactaaacttggccaacatgcttggctatgacaattttcaagcaagt
gtgggctggctgaacagatttagagatcgccacggaattgctttgaaagcagtctgtaga
gaagatagtgacaggttaatgaatggtctaggaatagataagattaatgagtggcatgca
ggggaaattataaaactgattgctgactacagcccagatgatatctttaatgctgatgag
acaggagtgtttttccagttgcttccccagcacacacttgctgctaaaggagaccactgt
agagggggcaagaaagcaaagcagcggttgacagcactcttttgttgcaatgcctcgggg
actgaaaaaatgagaccattgattgttggtaggtcagccagcccacactgcctcaagaac
attcattccctcccttgtgattaccgagccaaccagtgggcttggatgacaagggatctg
tttaatgagtggctgatgcaagtggatgccaggatgaagagggcggaacgccggatcctc
ttgctcatagacaactgctctgctcataacatgcttccacacttggaaaggattcaggtt
gggtatctgccctccaactgtactgctgtcctgcagccactgaatcttggcataattcac
accatgaaagtactgtaccagagccaccttctaaaacagatcctcctcaagctcaacagc
agtgaggatcaagaagaggtggacatcaagcaggccatcgacatgattgctgcagcgtgg
tggtcagtcaagccatccacagtggtgaaatgttggcagaaggcaggcatcgtccctatg gaatttgcagaatgtgacacagaatcagcagccagtgaaccagacattgccattgaaaag ttgtggcacacagtggctattgccacctgtgtcccaaatgaagtaaatttccaggacttt gttactgcagatgatgatctcattatctctcaggacacagacatcatccaggacatggtg gctggcgaaaataccagtgaagcaggaagtgaagatgaaggggaggtatctttaccagag caaccaaaagtcaccatcacagaagccatatcaagtgtacagaaacttagacagttcctt tccacttgtgtagacattcctgatgccatttttggacaattaaatggcatagatgaatat ttaatgaaaagagtgacacaaacccttattgattccaaaattacagatttcctccaaaca aaataa >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_2|1524_aa MQCPGAERPGRPTAGSHSFLLRPGPLAGSSPFALLDPLQAFEQFVWVRSQARAGLLRLRQ GSHAVTRCRPLPVRREGRRDGSPWRSVVCRYCRCSRQTGASVTTVSLPSSSSSPGLDPRG PRQASVRSLRSEPVLLFLPFRTPYRDSEEGKREGLSRLRAVCRRAGPRGRGSFSPRDARA SPRLHFLVAAVTTGAASRRQRGARVRQPSPSSSRRAKRLRECERRSLHAPPAMDASYDGT EVTVVMEEIEEAYCYTSPGPPKKKKKYKIHGEKTKKPRSAYLLYYYDIYLKVQQELPHLP QSEINKKISESWRLLSVAERSYYLEKAKLEKEGLDPNSKLSALTAVVPDIPGFRKILPRS DYIIIPKSSLQEDRSCPQLELCVAQNQMSPKGPPLVSNTAPETVPSHAGMAEQCLAVEAL AEEVGALTQSGAVQEIATSEILSQDVLLEDASLEVGESHQPYQTSLVIEETLVNGSPDLP TGSLAVPHPQVGESVSVVTVMRDSSESSSSAPATQFIMLPLPAYSVVENPTSIKLTTTYT RRGHGTCTSPGCSFTYVTRHKPPKCPTCGNFLGGKWIPKEKPAKVKVELASGVSSKGSVV KRNQQPVTTEQNSSKENASKLTLENSEAVSQLLNVAPPREVGEESEWEEVIISDAHVLVK EAPGNCGTAVTKTPVVKSGVQPEVTLGTTDNDSPGADVPTPSEGTSTSSPLPAPKKPTGA DLLTPGSRAPELKGRARGKPSLLAAARPMRAILPAPVNVGRGSSMGLPRARQAFSLSDKT PSVRTCGLKPSTLKQLGQPIQQPSGPGEVKLPSGPSNRTSQVKVVEVKPDMFPPYKYSCT VTLDLGLATSRGRGKCKNPSCSYVYTNRHKPRICPSCGVNLAKDRTEKTTKAIEVSSPLP DVLNATEPLSTAQREIQRQSTLQLLRKVLQIPENESELAEVFALIHELNSSRLILSNVSE ETVTIEQTSWSNYYESPSTQCLLCSSPLFKGGQNSLAGPQECWLLTASRLQTVTAQVKMC LNPHCLALHSFIDIYTGLFNVGNKLLVSLDLLFAIRNQIKLGEDPRVSINVVLKSVQEQT EKTLTSEELSQLQELLCNGYWAFECLTVRDYNDMICGICGVAPKVEMAQRSEENVLALKS VEFTWPEFLGSNEVNVEDFWATMETEVIEQVAFPASIPITKFDASVIAPFFPPLMRGAVV VNTEKDKNLDVQPVPGSGSALVRLLQEGTCKLDEIGSYSEEKLQHLLRQCGIPFGAEDSK DQLCFSLLALYESVQNGARAIRPPRHFTGGKIYKVCPHQVVCGSKYLVRGESARDHVDLL ASSRHWPPVYVVDMATSVALCADLCYPELTNQMWGRNQGCFSSPTEPPVSVSCPELLDQH 
YTVDMTETEHSIQHPVTKTATRRIVHAGLQPNPGDPSAGHHSLALCPELAPYATILASIV DSKPNGVRQRPIAFDNATHYYLYNRLMDFLTSREIVNRQIHDIVQSCQPGEVVIRDTLYR LGVAQIKTETEEEGEEEEVAAVAE >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_2|4575_bp atgcagtgcccgggggctgagcggccggggcgtcccaccgccggcagccacagcttcctc ctccgcccgggacctctggccggaagcagtccgttcgccttactagacccgctccaagcc tttgagcagttcgtgtgggtccgctcccaagctagggctgggctgttgcgcttgcgccag gggtcgcatgcagtcacgcgctgccgaccgcttccggtgcgccgcgagggccgccgggac gggtctccctggcgatccgtggtgtgccgctactgccggtgcagccgccaaaccggtgcc tcggtgacgaccgtgtccctgccgtcttcctcgagctccccggggcttgacccccggggc cctcggcaggcatcggtgaggagcctgcggagcgaacctgtgctcctattcttgcccttc aggaccccatatcgcgactccgaggaggggaagcgagaggggctgtcgcgactccgcgcc gtgtgtcgccgggcggggccgcggggccggggctccttcagcccccgggatgcgcgcgcg agccctcgcctccacttccttgttgctgctgtcacgactggagccgcctctcgccgacag cggggagcgcgagtgcgccagccatccccctcgtccagccgccgggccaagcgcctccgg gaatgtgagcggcgcagcttgcacgctcctccggccatggacgcatcatatgatggtact gaggtaactgtcgtgatggaggaaattgaggaagcctattgttacacctctcctgggcca cccaagaagaagaaaaagtataaaatacatggagaaaagacaaagaaacccaggtctgct taccttctgtactattacgacatctacctgaaagtgcagcaggagctcccccacctccct cagtctgagatcaataagaagattagtgagagttggaggcttctcagcgtggccgagagg agttactacttggagaaagccaaactagagaaggaaggtttggatcctaactctaagctc tctgcactgactgctgtggttccggacatcccaggtttccgcaagatcctcccacgctca gattatatcatcatccccaagagcagcctgcaggaggaccggagctgccctcagctagag ctatgtgtggctcagaaccagatgtccccgaaaggacctcctcttgtgtccaacactgcc ccggagacagtgcccagccatgcaggcatggcagagcagtgcctggctgtggaggccctg gctgaggaggtgggagcccttacccagtcaggtgctgtacaggagattgccacctcagag atcctcagccaggatgtgctcctagaggacgcttccctagaagtaggggagagccaccaa ccttaccagacaagcctggtaattgaagagaccttggtgaatggctcaccagacctcccc actggaagcctggctgtgccccacccccaggttggggagagtgtatcagtggtaacagtc atgagggattccagtgagagtagctcctctgcaccagccacacagttcatcatgttgcct ctgcctgcctactcggttgtggagaaccccacctccatcaaactgaccactacatatacc cgccggggccatgggacatgcaccagcccagggtgctcctttacatatgtcaccaggcac aagccacctaagtgccctacctgtggtaacttcctaggagggaagtggatcccaaaggaa 
aagccagccaaagtaaaagtggaattggcttctggcgtctcttccaaaggctctgtggtg aaaagaaatcagcaacctgtcaccactgagcaaaattcctctaaggaaaatgcctccaaa ctgactctggagaattcggaagctgtaagccagctcctgaacgtagctcctcccagagaa gtaggtgaggagagtgagtgggaggaagtgatcatctccgatgcccatgttttggttaag gaagctcccgggaattgtggtacagcagtcactaagacgccagtcgtcaaaagtggtgtg cagcctgaggtcactctggggacaactgacaatgacagtcctggagcagacgtaccaaca ccatccgaggggacaagtacctccagtccactccctgctcctaaaaaacctacaggagct gacctgcttacccctgggtccagagctccagagcttaaaggcagagcacggggcaagccc tcattactggctgcagcaagacccatgagagcaattttgccagccccagttaacgtgggg cgaggcagcagcatgggactgcccagggccaggcaggccttttccctgagtgataagact ccctctgtgaggacttgtggtctgaagccaagcacactgaagcagctgggccagcccatt caacagccatctggccctggtgaggtgaagctaccaagtggcccatccaacaggacttct caggtgaaagttgtggaggtcaagcccgatatgtttcctccatataagtacagctgcact gtcacattggatttgggcctggctacatcaagaggccggggaaagtgcaagaatccctct tgtagctatgtctacaccaacaggcacaaacctcgaatttgtcccagctgtggtgttaac cttgccaaagaccggactgagaaaaccaccaaggctatcgaggtgagctcaccactccca gatgtactgaatgccacagagcccctgagcacagcccagagggagatccagcgccagtcc acactgcagctgctgcgcaaagtcctgcagattcctgagaatgagtcagagctggctgag gtcttcgccttgattcatgaactcaacagctctcgacttatcttgtccaacgtgagtgag gagacagtcaccatcgagcaaacctcttggtcgaattattatgagtctccgtccacgcag tgccttctctgtagcagcccattattcaaagggggacaaaactccctggctgggccccag gagtgctggctgctgacagccagccgtctgcagacagtgactgcccaggtgaagatgtgt ctgaacccccattgtctggccctgcacagcttcatagacatctacacaggtctctttaat gtggggaacaagctgctggtaagcctggacttgctttttgcaatcagaaatcagatcaag ctcggagaggaccccagagtgtccatcaatgttgttctgaagtcggtgcaggagcagaca gagaagactctgacctcggaggagctgagccagctgcaggagctgctgtgcaatggctat tgggcctttgagtgcctcactgtccgagactacaatgacatgatctgtggcatctgtggt gtggcccccaaagtggaaatggctcagaggagtgaagagaatgtgctagcactgaagagc gtggagttcacctggcctgaattcctgggctctaatgaggtaaatgtggaggacttttgg gccacgatggagacagaggtgattgagcaggtggcatttcctgccagcatccctatcacc aaatttgatgcgtctgttattgcccccttcttcccaccactcatgagaggagctgtggtc gtcaacactgagaaagacaaaaacctggatgtgcagccagtacctggcagtggcagtgcc 
ttggtgaggctgctccaggagggcacctgcaagcttgatgagattggctcctacagtgaa gagaagctgcagcacctgctaaggcagtgtggaatcccctttggggcagaagactccaag gaccagctctgcttctccttgttggccctctacgaatctgtacagaatggagctagagct atacggcccccacgtcacttcacaggtggtaaaatctacaaggtgtgcccccatcaggtg gtctgcggctccaagtatcttgtgcgaggtgagagtgcccgtgaccatgtggacctgctt gcctcttcccgccactggccgcctgtctatgtggtagatatggccacgtcagtggccctg tgtgctgacctctgctacccagagctgactaaccagatgtgggggaggaaccagggctgt ttctctagccccacagagccacctgtgagtgtgtcctgcccagagctcttggaccagcat tatactgtggacatgacagaaactgagcactctatccagcacccagtcaccaagactgcc acgcggcgcatcgtccatgcaggcctacagcccaatcctggtgaccccagtgctgggcac cactccttggccctgtgccctgaattggcaccttacgcaaccatcctggcctccatcgtg gacagcaaaccaaacggtgtccgccagcggcccattgccttcgacaatgccactcactat tacctctacaaccgcctcatggacttcctcaccagccgcgaaattgtcaatcgtcagatc catgacattgtacagagctgccagcctggtgaggtggtcattcgtgacaccctctaccgc cttggggttgctcagatcaagacagagacagaggaggagggtgaggaagaggaggtggcc gcagtggcagaataa >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_3|998_aa MGPGVLLLLLVATAWHGQGIPVIEPSVPELVVKPGATVTLRCVGNGSVEWDGPPSPHWTL YSDGSSSILSTNNATFQNTGTYRCTEPGDPLGGSAAIHLYVKDPARPWNVLAQEVVVFED QDALLPCLLTDPVLEAGVSLVRVRGRPLMRHTNYSFSPWHGFTIHRAKFIQSQDYQCSAL MGGRKVMSISIRLKVQKGPPALTLVPAELVRIRGEAAQIVCSASSVDVNFDVFLQHNNTK LAIPQQSDFHNNRYQKVLTLNLDQVDFQHAGNYSCVASNVQGKHSTSMFFRVVESAYLNL SSEQNLIQEVTVGEGLNLKVMVEAYPGLQGFNWTYLGPFSDHQPEPKLANATTKDTYRHT FTLSLPRLKPSEAGRYSFLARNPGGWRALTFELTLRYPPEVSVIWTFINGSGTLLCAASG YPQPNVTWLQCSGHTDRCDEAQVLQVWDDPYPEVLSQEPFHKVTVQSLLTVETLEHNQTY ECRAHNSVGSGSWAFIPISAGAHTHPPDEFLFTPVVVACMSIMALLLLLLLLLLYKYKQK PKYQVRWKIIESYEGNSYTFIDPTQLPYNEKWEFPRNNLQFGPVGVAGSPWALGQRPFGA QGLKGPVCVAGKTLGAGAFGKVVEATAFGLGKEDAVLKVAVKMLKSTAHADEKEALMSEL KIMSHLGQHENIVNLLGACTHGGPVLVITEYCCYGDLLNFLRRKAEAMLGPSLSPGQDPE GGVDYKNIHLEKKYVRRDSGFSSQGVDTYVEMRPVSTSSNDSFSEQDLDKEDGRPLELRD LLHFSSQVAQGMAFLASKNCIHRDVAARNVLLTNGHVAKIGDFGLARDIMNDSNYIVKGN ARLPVKWMAPESIFDCVYTVQSDVWSYGILLWEIFSLGLNPYPGILVNSKFYKLVKDGYQ MAQPAFAPKNIYSIMQACWALEPTHRPTFQQICSFLQEQAQEDRRERDYTNLPSSSRSGG 
SGSSSSELEEESSSEHLTCCEQGDIAQPLLQPNNYQFC >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_3|2997_bp atgggcccaggagttctgctgctcctgctggtggccacagcttggcatggtcagggaatc ccagtgatagagcccagtgtccctgagctggtcgtgaagccaggagcaacggtgaccttg cgatgtgtgggcaatggcagcgtggaatgggatggccccccatcacctcactggaccctg tactctgatggctccagcagcatcctcagcaccaacaacgctaccttccaaaacacgggg acctatcgctgcactgagcctggagaccccctgggaggcagcgccgccatccacctctat gtcaaagaccctgcccggccctggaacgtgctagcacaggaggtggtcgtgttcgaggac caggacgcactactgccctgtctgctcacagacccggtgctggaagcaggcgtctcgctg gtgcgtgtgcgtggccggcccctcatgcgccacaccaactactccttctcgccctggcat ggcttcaccatccacagggccaagttcattcagagccaggactatcaatgcagtgccctg atgggtggcaggaaggtgatgtccatcagcatccggctgaaagtgcagaaagggccccca gccttgacactggtgcctgcagagctggtgcggattcgaggggaggctgcccagatcgtg tgctcagccagcagcgttgatgttaactttgatgtcttcctccaacacaacaacaccaag ctcgcaatccctcaacaatctgactttcataataaccgttaccaaaaagtcctgaccctc aacctcgatcaagtagatttccaacatgccggcaactactcctgcgtggccagcaacgtg cagggcaagcactccacctccatgttcttccgggtggtagagagtgcctacttgaacttg agctctgagcagaacctcatccaggaggtgaccgtgggggaggggctcaacctcaaagtc atggtggaggcctacccaggcctgcaaggttttaactggacctacctgggacccttttct gaccaccagcctgagcccaagcttgctaatgctaccaccaaggacacatacaggcacacc ttcaccctctctctgccccgcctgaagccctctgaggctggccgctactccttcctggcc agaaacccaggaggctggagagctctgacgtttgagctcacccttcgataccccccagag gtaagcgtcatatggacattcatcaacggctctggcacccttttgtgtgctgcctctggg tacccccagcccaacgtgacatggctgcagtgcagtggccacactgataggtgtgatgag gcccaagtgctgcaggtctgggatgacccataccctgaggtcctgagccaggagcccttc cacaaggtgacggtgcagagcctgctgactgttgagaccttagagcacaaccaaacctac gagtgcagggcccacaacagcgtggggagtggctcctgggccttcatacccatctctgca ggagcccacacgcatcccccggatgagttcctcttcacaccagtggtggtcgcctgcatg tccatcatggccttgctgctgctgctgctcctgctgctattgtacaagtataagcagaag cccaagtaccaggtccgctggaagatcatcgagagctatgagggcaacagttatactttc atcgaccccacgcagctgccttacaacgagaagtgggagttcccccggaacaacctgcag tttgggcctgtgggggttgcagggagcccatgggcccttggacagaggccctttggtgcc cagggacttaagggacctgtgtgcgtggcaggtaagaccctcggagctggagcctttggg 
aaggtggtggaggccacggcctttggtctgggcaaggaggatgctgtcctgaaggtggct gtgaagatgctgaagtccacggcccatgctgatgagaaggaggccctcatgtccgagctg aagatcatgagccacctgggccagcacgagaacatcgtcaaccttctgggagcctgtacc catggaggccctgtactggtcatcacggagtactgttgctatggcgacctgctcaacttt ctgcgaaggaaggctgaggccatgctgggacccagcctgagccccggccaggaccccgag ggaggcgtcgactataagaacatccacctcgagaagaaatatgtccgcagggacagtggc ttctccagccagggtgtggacacctatgtggagatgaggcctgtctccacttcttcaaat gactccttctctgagcaagacctggacaaggaggatggacggcccctggagctccgggac ctgcttcacttctccagccaagtagcccagggcatggccttcctcgcttccaagaattgc atccaccgggacgtggcagcgcgtaacgtgctgttgaccaatggtcatgtggccaagatt ggggacttcgggctggctagggacatcatgaatgactccaactacattgtcaagggcaat gcccgcctgcctgtgaagtggatggccccagagagcatctttgactgtgtctacacggtt cagagcgacgtctggtcctatggcatcctcctctgggagatcttctcacttgggctgaat ccctaccctggcatcctggtgaacagcaagttctataaactggtgaaggatggataccaa atggcccagcctgcatttgccccaaagaatatatacagcatcatgcaggcctgctgggcc ttggagcccacccacagacccaccttccagcagatctgctccttccttcaggagcaggcc caagaggacaggagagagcgggactataccaatctgccgagcagcagcagaagcggtggc agcggcagcagcagcagtgagctggaggaggagagctctagtgagcacctgacctgctgc gagcaaggggatatcgcccagcccttgctgcagcccaacaactatcagttctgctga >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_4|248_aa MEGVEEKKKEVPAVPETLKKKRRNFAELKMKRLRKKFAQKMLRKARRKLIYEKAKHYHKE YRQMYRTEIRMARMARKAGNFYVPAEPKLAFVIRIRGINGVSPKVRKVLQLLRLRQIFNG TFVKLNKASINMLRIVEPYIAWGYPNLKSVNELIYKRGYGKINKKRIALTDNALIARSLG KYGIICVEDLIHEIYTVGKRFKEANNFLWPFKLSSPRGGMKKKTTHFVEGGDAGNREDQI NRLIRRMN >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_4|747_bp atggagggtgtagaagagaagaagaaggaggttcctgctgtgccagaaacccttaagaaa aagcgaaggaatttcgcagagctgaagatgaagcgcctgagaaagaagtttgcccaaaag atgcttcgaaaggcaaggaggaagcttatctatgaaaaagcaaagcactatcacaaggaa tataggcagatgtacaggactgaaattcgaatggcgaggatggcaagaaaagctggcaac ttctatgtacctgcagaacccaaattggcgtttgtcatcagaatcagaggtatcaatgga gtgagcccaaaggttcgaaaggtgttgcagcttcttcgccttcgtcaaatcttcaatgga acctttgtgaagctcaacaaggcttcgattaacatgctgaggattgtagagccatatatt 
gcatgggggtaccccaatctgaagtcagtaaatgaactaatctacaagcgtggttatggc aaaatcaataagaagcgaattgctttgacagataacgctttgattgctcgatctcttggt aaatacggcatcatctgcgtggaggatttgattcatgagatctatactgttggaaaacgc ttcaaagaggcaaataacttcctgtggcccttcaaattgtcttctccacgaggtggaatg aagaaaaagaccacccattttgtagaaggtggagatgctggcaacagggaggaccagatc aacaggcttattagaagaatgaactaa >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_5|1237_aa MPSPPKAHALRPLSSGVLPANAAGRLLAYLAPGGLLGPEDTMRLPGAMPALALKGELLLL SLLLLLEPQISQGLVVTPPGPELVLNVSSTFVLTCSGSAPVVWERMSQEPPQEMAKAQDG TFSSVLTLTNLTGLDTGEYFCTHNDSRGLETDERKRLYIFVPDPTVGFLPNDAEELFIFL TEITEITIPCRVTDPQLVVTLHEKKGDVALPVPYDHQRGFSGIFEDRSYICKTTIGDREV DSDAYYVYRLQVSSINVSVNAVQTVVRQGENITLMCIVIGNEVVNFEWTYPRKEVMWGQA GVGGGARNGWISGLQADFSPAPPDLGGLPNLLLQSGRLVEPVTDFLLDMPYHIRSILHIP SAELEDSGTYTCNVTESVNDHQDEKAINITVVESGYVRLLGEVGTLQFAELHRSRTLQVV FEAYPPPTVLWFKDNRTLGDSSAGEIALSTRNVSETRYVSELTLVRVKVAEAGHYTMRAF HEDAEVQLSFQLQINVPVRVLELSESHPDSGEQTVRCRGRGMPQPNIIWSACRDLKRCPR ELPPTLLGNSSEEESQLETNVTYWEEEQEFEVVSTLRLQHVDRPLSVRCTLRNAVGQDTQ EVIVVPHSLPFKVVVISAILALVVLTIISLIILIMLWQKKPRYEIRWKVIESVSSDGHEY IYVDPMQLPYDSTWELPRDQLVLGRTLGSGAFGQVVEATAHGLSHSQATMKVAVKMLKST ARSSEKQALMSELKIMSHLGPHLNVVNLLGACTKGGPIYIITEYCRYGDLVDYLHRNKHT FLQHHSDKRRPPSAELYSNALPVGLPLPSHVSLTGESDGGYMDMSKDESVDYVPMLDMKG DVKYADIESSNYMAPYDNYVPSAPERTCRATLINESPVLSYMDLVGFSYQVANGMEFLAS KNVRVVLVGQSGGCGECVHRDLAARNVLICEGKLVKICDFGLARDIMRDSNYISKGSTFL PLKWMAPESIFNSLYTTLSDVWSFGILLWEIFTLGGTPYPELPMNEQFYNAIKRGYRMAQ PAHASDEIYEIMQKCWEEKFEIRPPFSQLVLLLERLLGEGYKKARLPGFHGLRSPLDTSS VLYTAVQPNEGDNDYIIPLPDPKPEVADEGPLEGSPSLASSTLNEVNTSSTISCDSPLEP QDEPEPEPQLELQARIEAGVLCDRQLLEPRFQVQVSHDQAPGRRRADRVSKSVRARSEEK VEKREEEEEEEERKRRELRPGPKQHLHDQAQIQGVEK >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_5|3714_bp atgccgtcacctcccaaagctcatgctctgcgacccctgtcctctggcgtccttcctgcc aatgctgcaggcagactcttggcatacttggctcctggtggtctcttgggacctgaggac accatgcggcttccgggtgcgatgccagctctggccctcaaaggcgagctgctgttgctg tctctcctgttacttctggaaccacagatctctcagggcctggtcgtcacacccccgggg 
ccagagcttgtcctcaatgtctccagcaccttcgttctgacctgctcgggttcagctccg gtggtgtgggaacggatgtcccaggagcccccacaggaaatggccaaggcccaggatggc accttctccagcgtgctcacactgaccaacctcactgggctagacacgggagaatacttt tgcacccacaatgactcccgtggactggagaccgatgagcggaaacggctctacatcttt gtgccagatcccaccgtgggcttcctccctaatgatgccgaggaactattcatctttctc acggaaataactgagatcaccattccatgccgagtaacagacccacagctggtggtgaca ctgcacgagaagaaaggggacgttgcactgcctgtcccctatgatcaccaacgtggcttt tctggtatctttgaggacagaagctacatctgcaaaaccaccattggggacagggaggtg gattctgatgcctactatgtctacagactccaggtgtcatccatcaacgtctctgtgaac gcagtgcagactgtggtccgccagggtgagaacatcaccctcatgtgcattgtgatcggg aatgaggtggtcaacttcgagtggacatacccccgcaaagaagtaatgtggggccaggca ggggtcggaggaggggccaggaacgggtggatatctggcttgcaggctgatttctccccg gcccctcctgatttggggggcctgcccaacctgttgctgcagagtgggcggctggtggag ccggtgactgacttcctcttggatatgccttaccacatccgctccatcctgcacatcccc agtgccgagttagaagactcggggacctacacctgcaatgtgacggagagtgtgaatgac catcaggatgaaaaggccatcaacatcaccgtggttgagagcggctacgtgcggctcctg ggagaggtgggcacactacaatttgctgagctgcatcggagccggacactgcaggtagtg ttcgaggcctacccaccgcccactgtcctgtggttcaaagacaaccgcaccctgggcgac tccagcgctggcgaaatcgccctgtccacgcgcaacgtgtcggagacccggtatgtgtca gagctgacactggttcgcgtgaaggtggcagaggctggccactacaccatgcgggccttc catgaggatgctgaggtccagctctccttccagctacagatcaatgtccctgtccgagtg ctggagctaagtgagagccaccctgacagtggggaacagacagtccgctgtcgtggccgg ggcatgccccagccgaacatcatctggtctgcctgcagagacctcaaaaggtgtccacgt gagctgccgcccacgctgctggggaacagttccgaagaggagagccagctggagactaac gtgacgtactgggaggaggagcaggagtttgaggtggtgagcacactgcgtctgcagcac gtggatcggccactgtcggtgcgctgcacgctgcgcaacgctgtgggccaggacacgcag gaggtcatcgtggtgccacactccttgccctttaaggtggtggtgatctcagccatcctg gccctggtggtgctcaccatcatctcccttatcatcctcatcatgctttggcagaagaag ccacgttacgagatccgatggaaggtgattgagtctgtgagctctgacggccatgagtac atctacgtggaccccatgcagctgccctatgactccacgtgggagctgccgcgggaccag cttgtgctgggacgcaccctcggctctggggcctttgggcaggtggtggaggccacggct catggcctgagccattctcaggccacgatgaaagtggccgtcaagatgcttaaatccaca 
gcccgcagcagtgagaagcaagcccttatgtcggagctgaagatcatgagtcaccttggg ccccacctgaacgtggtcaacctgttgggggcctgcaccaaaggaggacccatctatatc atcactgagtactgccgctacggagacctggtggactacctgcaccgcaacaaacacacc ttcctgcagcaccactccgacaagcgccgcccgcccagcgcggagctctacagcaatgct ctgcccgttgggctccccctgcccagccatgtgtccttgaccggggagagcgacggtggc tacatggacatgagcaaggacgagtcggtggactatgtgcccatgctggacatgaaagga gacgtcaaatatgcagacatcgagtcctccaactacatggccccttacgataactacgtt ccctctgcccctgagaggacctgccgagcaactttgatcaacgagtctccagtgctaagc tacatggacctcgtgggcttcagctaccaggtggccaatggcatggagtttctggcctcc aagaacgtacgtgtggtgttggtggggcagagtgggggctgtggggagtgcgtccacaga gacctggcggctaggaacgtgctcatctgtgaaggcaagctggtcaagatctgtgacttt ggcctggctcgagacatcatgcgggactcgaattacatctccaaaggcagcacctttttg cctttaaagtggatggctccggagagcatcttcaacagcctctacaccaccctgagcgac gtgtggtccttcgggatcctgctctgggagatcttcaccttgggtggcaccccttaccca gagctgcccatgaacgagcagttctacaatgccatcaaacggggttaccgcatggcccag cctgcccatgcctccgacgagatctatgagatcatgcagaagtgctgggaagagaagttt gagattcggccccccttctcccagctggtgctgcttctcgagagactgttgggcgaaggt tacaaaaaggcccgcttgcctgggttccatggcctccgatctcccctggacaccagctcc gtcctctatactgccgtgcagcccaatgagggtgacaacgactatatcatccccctgcct gaccccaaacccgaggttgctgacgagggcccactggagggttcccccagcctagccagc tccaccctgaatgaagtcaacacctcctcaaccatctcctgtgacagccccctggagccc caggacgaaccagagccagagccccagcttgagctccaggcccggattgaggccggcgtg ctctgtgacaggcagctgctagagcccagattccaggtccaggtgagtcatgatcaggcc ccaggtaggagaagggcagacagagtgtccaaaagcgtgagagcacgaagtgaggagaag gtggagaagagagaagaggaagaggaagaggaagagaggaagcggagggaactgcggcca gggccaaagcagcacctgcacgaccaagcccagattcaaggggtggagaaatag >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_6|169_aa MEILPSDWELPGEHELCGCVSLKFLMYKTEVKTVTASGVAASQIRRHLQLQTPSPQMAKG IILEGHIEWAYHPCSHYCTHRATEFQRVKKAVPGDTEFTSKHRGNHKLIISNCDECDERD KENNELGGWEFLFRIISGIRALLRQGNSTASPPAVPRSASRWRPQPLEG >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_6|510_bp atggagattcttccctctgactgggagcttcccggggagcacgagttatgtggctgcgtg 
agcctcaagttcctcatgtataaaacggaggtaaaaacagtcactgcctctggggttgct gcttctcagatcagaagacatctccagttacagactcctagccctcagatggcaaagggc attattttagaagggcatatagagtgggcttaccacccctgctcccactactgcacccac agggcaactgaattccagagggtgaagaaagctgtcccaggtgacacagagttcacaagc aaacaccgaggaaaccacaaattaataatatcaaattgtgatgagtgtgatgaaagagac aaagagaataatgagctgggtggatgggagttcttgttcaggataatcagtggcatccgg gcattactgagacaaggcaacagtacggcgagtcctccagcagtacccaggagtgccagc aggtggcggccccagcccctggaggggtag >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_7|473_aa MYVGYVLDKDSPVYPGPARPASLGLGPQAYGPPAPPPAPPQYPDFSSYSHVEPAPAPPTA WGAPFPAPKDDWAAAYGPGPAAPAASPASLAFGPPPDFSPVPAPPGPGPGLLAQPLGGPG TPSSPGAQRPTPYEWMRRSVAAGGGGGSAPFCPTSEPPAFESGQTGQLETTHVDEGTKLG QIVDLDEEDRAADDAKVALEEEGQGGAQSGAKAWAATPLSLSWGQTWLKSEAIPCYAGQR TEETQGKAAQEHLLPPILHKAIWLTESPVAVLPSCVGLVEGDRRQNGGGIQTYLISDHRV ESLLKMQIPPPGFLTNWSGMPRTCVLHKLLENGCCAATSGSCWFKVVGAGLQNRALGKTR TKDKYRVVYTDHQRLELEKEFHYSRYITIRRKSELAANLGLTERQVKIWFQNRRAKERKV NKKKQQQQQPPQPPMAHDITATPAGPSLGGLCPSNTSLLATSSPMPVKEEFLP >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_7|1422_bp atgtatgtgggctatgtgctggacaaggattcgcccgtgtaccccggcccagccaggcca gccagcctcggcctgggcccgcaagcctacggccccccggccccgcccccggcgcccccg cagtaccccgacttctccagctactctcacgtggagccggcccccgcgcccccgacggcc tggggggcgcccttccctgcgcccaaggacgactgggccgccgcctacggcccgggcccc gcggcccctgccgccagcccagcttcgctggcattcgggccccctccagactttagcccg gtgccggcgccccctgggcccggcccgggcctcctggcgcagcccctcgggggcccgggc acaccgtcctcgcccggagcgcagaggccgacgccctacgagtggatgcggcgcagcgtg gcggccggaggcggcggtggcagcgcccccttttgccccacttctgagcctccagcattc gagagtggacaaactggacagttagagaccacacatgtagacgagggaaccaagctgggt cagattgtggaccttgatgaagaagatagggcagcagatgatgcaaaggtggccctggag gaggagggccaaggaggggcccaatctggagccaaggcctgggcagccactcccttatcc ctttcctggggccaaacctggctgaagtccgaggctatcccgtgctatgctgggcagcgg actgaggaaacccagggaaaagctgcccaggaacatctgctgcctcccattctccacaaa gccatttggctcacggagagcccagtggctgtcctgccaagctgcgtgggcttggtggag ggggacaggcgtcagaatgggggcggcattcaaacctatctgatatctgaccacagagtg 
gagagtttgttaaaaatgcagattccacccccaggctttctgactaactggtcagggatg cccagaacctgtgttcttcacaagctcctggagaatggctgctgtgcagccacatctgga agctgctggtttaaagttgttggagctggccttcagaatagggctctgggtaagactcgg accaaggacaagtaccgcgtggtctacaccgaccaccaacgcctggagctggagaaggag tttcattacagccgttacatcacaatccggcggaaatcagagctggctgccaatctgggg ctcactgaacggcaggtgaagatctggttccaaaaccggcgggcaaaggagcgcaaagtg aacaagaagaaacagcagcagcaacagcccccacagccgccgatggcccacgacatcacg gccaccccagccgggccatccctggggggcctgtgtcccagcaacaccagcctcctggcc acctcctctccaatgcctgtgaaagaggagtttctgccatag >gi568815593f:149994302_150195045|GENSCAN_predicted_peptide_8|200_aa MTHSFPGFIPNAQCMTSLLWDSFHSSACGGARIPPPSVRQLSVWVSMRAQQCTLPQPRAL RRDRQGIRSALPALHARSRQTAAPASVPAPAGAREPRGQRRSGQRTISRALALCAPGQLS PGHPLSKMKKLQGAHLRKPVTPDLLMTPSDQGDVDLDVDFAAHRGNWTGKLDFLLSCIGY CVGLGNVWRFPYRAYTNGGX >gi568815593f:149994302_150195045|GENSCAN_predicted_CDS_8|600_bp atgacccacagctttcctggcttcatccccaacgcccagtgcatgacttcactcctctgg gacagtttccacagctctgcgtgcgggggcgcgcgcatccccccgccgtccgtccgtcag ctgtctgtctgggtgtctatgcgggcgcagcagtgcacccttccccagcctcgggcgctg cgcagggacagacaaggcattcgcagcgccctgcccgcgctccacgcccgcagccgccag acggcagcgcctgcgtccgtgcccgccccagccggtgcgcgggagccgcgggggcaaagg cgcagtggccagcggaccatctctcgtgccctcgctctctgcgctccggggcagctgagc cccggccacccgctctccaagatgaagaagctccagggagctcacctccgcaagcctgtc accccagacctgctgatgacccccagtgaccagggcgatgtcgacctggatgtggacttt gctgcacaccgggggaactggacaggcaagctggacttcctgctgtcctgcattggctac tgtgtaggcctggggaatgtctggcgcttcccctatcgagcgtacaccaatggaggagnn