GENSCAN 1.0    Date run: 6-Nov-116    Time: 21:03:02

Sequence gi568815594r:6223305_6481846 : 258542 bp : 52.76% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3622   3699   78  1  0   90   90    15 0.168   1.06
 1.02 Intr +   4713   4835  123  0  0   70   98    63 0.273   5.71
 1.03 Intr +   6669   6842  174  0  0    9   56   194 0.747   7.87
 1.04 Intr +   6944   7085  142  2  1   38   77    68 0.814   1.56
 1.05 Intr +   7138   7257  120  0  0   52   81    63 0.629   3.19
 1.06 Intr +   9556   9662  107  0  2  131   91   103 0.985  14.41
 1.07 Intr +   9862  10142  281  1  2   67   41    70 0.335  -2.14
 1.08 Term +  12368  12477  110  0  2  100   45   100 0.822   5.87
 1.09 PlyA +  13689  13694    6                               1.05

 2.00 Prom +  14126  14165   40                              -1.31
 2.01 Init +  18178  18267   90  0  0   57   95    55 0.065   3.66
 2.02 Intr +  23798  23869   72  1  0  104  111    27 0.074   6.70
 2.03 Term +  27362  27463  102  1  0    6   38   156 0.347   0.98
 2.04 PlyA +  27659  27664    6                               1.05

 3.00 Prom +  32713  32752   40                              -3.01
 3.01 Init +  40062  40118   57  2  0   72  100    62 0.017   5.12
 3.02 Intr +  46552  46710  159  0  0   74  117    48 0.029   7.00
 3.03 Intr +  48407  48522  116  1  2   83   62    94 0.018   6.05
 3.04 Intr +  48808  49011  204  1  0   86   89    35 0.117   2.24
 3.05 Intr +  54194  54383  190  2  1   95  107    63 0.445   9.01
 3.06 Intr +  55158  55173   16  2  1   97  101    11 0.065  -1.10
 3.07 Intr +  59489  59683  195  0  0   93   80    39 0.349   3.51
 3.08 Intr +  61595  62023  429  0  0   68   22   179 0.115   3.56
 3.09 Intr +  63789  63871   83  1  2   86   78    90 0.848   7.65
 3.10 Intr +  65683  65827  145  0  1   51   90   257 0.999  22.57
 3.11 Intr +  67893  68063  171  1  0   68   66   392 0.999  35.43
 3.12 Intr +  68613  68875  263  1  2  129   34   202 0.634  16.64
 3.13 Intr +  71664  71885  222  2  0   50  113   394 0.259  36.95
 3.14 Term +  77353  79164 1812  0  0  118   55  3766 0.828 362.35
 3.15 PlyA +  79936  79941    6                               1.05

 4.00 Prom +  82201  82240   40                              -4.01
 4.01 Init +  82438  82477   40  0  1   88   80    18 0.505   1.31
 4.02 Intr +  83133  83203   71  1  2   97  103    51 0.918   6.89
 4.03 Term +  84039  84383  345  2  0   58   32   190 0.481   5.25
 4.04 PlyA +  84741  84746    6                              -0.45

 5.12 PlyA -  85366  85361    6                               1.05
 5.11 Term -  88506  88297  210  0  0   30   45   166 0.361   4.12
 5.10 Intr -  94329  94198  132  0  0   77   40    89 0.048   4.35
 5.09 Intr -  94528  94496   33  1  0  120   39    35 0.152   0.70
 5.08 Intr -  94726  94643   84  1  0   25   70    82 0.139   0.61
 5.07 Intr -  96075  95869  207  0  0   35   -5   202 0.218   5.40
 5.06 Intr - 100289 100019  271  1  1  124   92   620 0.968  64.08
 5.05 Intr - 103960 103844  117  0  0  119   89    13 0.941   4.58
 5.04 Intr - 106049 105958   92  2  2  110  115   171 0.999  21.29
 5.03 Intr - 106411 106262  150  1  0   97   32    88 0.877   4.97
 5.02 Intr - 110427 110258  170  1  2   86   80   291 0.968  28.28
 5.01 Init - 110902 110830   73  1  1   96   40    36 0.308   0.78
 5.00 Prom - 110944 110905   40                              -3.41

 6.03 PlyA - 111789 111784    6                               1.05
 6.02 Term - 118392 118322   71  1  2  121   39    59 0.072   2.70
 6.01 Init - 124875 124542  334  0  1   86   94   284 0.247  26.27
 6.00 Prom - 126557 126518   40                              -5.61

 7.00 Prom + 126632 126671   40                              -0.61
 7.01 Init + 128384 128416   33  1  0   41  107    15 0.456  -1.48
 7.02 Term + 130025 130738  714  1  0   45   46   574 0.822  42.83
 7.03 PlyA + 131707 131712    6                               1.05

 8.00 Prom + 134678 134717   40                              -5.21
 8.01 Init + 136454 136503   50  1  2   98   86    42 0.530   4.14
 8.02 Intr + 141755 141929  175  2  1   44   72   127 0.314   7.16
 8.03 Term + 142175 142228   54  1  0   88   48    19 0.230  -4.25
 8.04 PlyA + 142577 142582    6                               1.05

 9.09 PlyA - 142888 142883    6                               1.05
 9.08 Term - 143307 143236   72  0  0  118   48     5 0.173  -2.30
 9.07 Intr - 144985 144893   93  1  0   47   66    72 0.153   1.56
 9.06 Intr - 149396 149160  237  2  0  142   52   389 0.957  39.04
 9.05 Intr - 152627 152515  113  0  2   89   99   128 0.971  14.60
 9.04 Intr - 155268 155103  166  0  1   87   89   228 0.789  22.85
 9.03 Intr - 157902 157693  210  0  0  105   64   191 0.810  18.03
 9.02 Intr - 159897 159740  158  1  2  136   81   126 0.867  16.94
 9.01 Init - 167213 167099  115  2  1   88   60   130 0.973   8.53
 9.00 Prom - 169662 169623   40                              -5.31

10.00 Prom + 169776 169815   40                              -7.89
10.01 Init + 170257 170414  158  0  2   96   75    93 0.666   6.18
10.02 Intr + 170741 170943  203  2  2   90   89    28 0.388   2.55
10.03 Intr + 172033 172148  116  2  2  -21   86    85 0.545  -1.93
10.04 Intr + 172601 172793  193  1  1   92   96   143 0.908  15.09
10.05 Term + 178111 178280  170  2  2   -5   37   187 0.115   2.66
10.06 PlyA + 183040 183045    6                               1.05

11.00 Prom + 185637 185676   40                              -5.11
11.01 Init + 187047 187095   49  2  1   97   72    56 0.115   4.30
11.02 Intr + 191814 191981  168  1  0   37   77   108 0.447   5.03
11.03 Intr + 192166 192321  156  2  0   66  -20   132 0.153   0.69
11.04 Intr + 195020 195214  195  0  0   40   66   100 0.177   2.91
11.05 Intr + 197294 197425  132  0  0   83   63    45 0.694   2.62
11.06 Intr + 197645 197818  174  0  0   40   80    65 0.421   1.33
11.07 Term + 198228 198319   92  1  2   29   40   142 0.624   1.68
11.08 PlyA + 200302 200307    6                              -0.45

12.05 PlyA - 200383 200378    6                              -0.45
12.04 Term - 204685 204630   56  2  2   67   50    49 0.155  -2.99
12.03 Intr - 205175 205040  136  2  1   58   89    53 0.302   3.05
12.02 Intr - 205512 205416   97  2  1  120   68    11 0.294   2.81
12.01 Init - 211407 211361   47  0  2   93   71    20 0.323   0.91
12.00 Prom - 212326 212287   40                               0.29

13.00 Prom + 213384 213423   40                              -1.71
13.01 Init + 215722 215737   16  0  1   63  100    17 0.246   0.82
13.02 Intr + 218314 218406   93  2  0  104   61    46 0.044   3.93
13.03 Intr + 221074 221227  154  2  1   75   44    77 0.241   1.64
13.04 Intr + 224512 224590   79  1  1   79   42    82 0.049   2.75
13.05 Intr + 230194 230385  192  0  0   56   63    79 0.056   2.41
13.06 Term + 230485 230535   51  0  0   91   41    47 0.046  -1.98
13.07 PlyA + 231362 231367    6                               1.05

14.06 PlyA - 233021 233016    6                               1.05
14.05 Term - 237912 237888   25  2  1  126   50    39 0.132   1.88
14.04 Intr - 245407 245257  151  2  1   55  110    97 0.168   8.33
14.03 Intr - 248018 247866  153  0  0   13   63    96 0.049   0.16
14.02 Intr - 248945 248856   90  0  0  -23   92   151 0.171   4.86
14.01 Init - 249010 248971   40  1  1  103   74    80 0.722   6.40
14.00 Prom - 253505 253466   40                              -2.61

15.03 PlyA - 253521 253516    6                               1.05
15.02 Term - 255573 255422  152  1  2   34   32   154 0.361   2.78
15.01 Init - 257676 257583   94  0  1   97   80    29 0.859   3.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  22095  21990  106  2  1   95   41    98 0.822   3.98
S.002 Term -  46233  46040  194  1  2   84   47   117 0.966   5.11
S.003 Intr -  46499  46416   84  0  0   95   74   102 0.958   9.79

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_1|378_aa
XKGKGAGLKARRETPRGTLLFRVPTTGYMIFLNEQRSQLRATHPDLPFTEIMKMLAVQWA
QLSQDKKGRKYAGKALGVTPPPSALTPVALIEEPKQLDIQSLPFQPILDRLKIKDKRQLV
PSYLGGMHTVTEGLAGRQVLLQQVQGGAWESAFLTCPWAMAAAAPGPTLTMLQPSISAVG
FSQMAMGPHVQGRASKLSIYVNPRRQLSTTTAMGDELNDLYCGTSRQLFSSLCNKKEPAA
KAAPAAPLSSPGPPALGHAGHGRQLLLERSLTLFHLYELLKYPRASPGLWPPRVWMPHSL
TCGTFWVPMIMMRFCAHVASGLLRPPFNQVFAHLSPAQGRHPGENGIIVLTSRAAVGFDV
PHVTVSNRVAHSGCPWLR

>gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_1|1137_bp
nnaaaaggaaagggggctggcctaaaggcaagaagagaaaccccccgagggaccttgctg
ttccgtgttcctacgacagggtacatgatattcctgaatgagcagagaagtcagctgagg
gccacacaccccgatctgcctttcacagaaatcatgaagatgctggctgttcagtgggcc
cagctgtctcaggataaaaagggcagaaagtacgctgggaaagcccttggtgttactcca
cctccttcagcgctcacaccagtcgccttgattgaagagccaaagcagttggacatccag
tctcttccttttcagcccatcctggaccggttaaagattaaagataagaggcagcttgtt
ccctcctacctgggaggcatgcacacggtcaccgagggtcttgctggtaggcaggttctg
ctccagcaggttcagggtggggcctgggagtctgcatttctgacatgcccatgggccatg
gctgctgctgctccagggcccacactgactatgctgcagcccagcatctctgccgtgggc
ttttctcagatggccatggggccacatgtgcagggcagagctagcaaactgagcatctat
gtaaacccacgcagacagctttccaccaccaccgccatgggagatgaactcaatgacctc
tactgcgggacctccaggcagctcttcagctccctgtgcaacaagaaggaacctgctgca
aaggcagcacctgcagcgcctctcagttcgccaggcccgcctgctctgggccacgcagga
cacggacgtcagcttctcctggagcgctcgctcactttattccatctctacgagctcctc
aagtatcccagagcatccccaggcctgtggccacctcgggtctggatgccgcattctctg
acctgtggcacattctgggtgcccatgattatgatgaggttctgtgcccatgtggcctca
gggctcttgagaccacccttcaaccaggtctttgctcacctgtctcctgcccagggacgc
cacccaggtgaaaatgggatcatcgtgctgacctcaagggcagctgtaggattcgatgtt
cctcatgtcacggtctctaaccgtgtggcacattctgggtgcccatggttacgatga
>gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_2|87_aa MTPVIPNPAPWVPGPELGTGQRNAEMKEPLVELFLLRMQRIEVAADPYGFATANCALSTS RIDITWELVTNAESRARPRPIEKESAF >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_2|264_bp atgactccggtcatccctaacccagccccatgggtaccaggccctgagctgggcactggg cagaggaatgcagagatgaaggagcccctggttgaattatttcttctgaggatgcaaaga attgaggttgctgcagacccatatggatttgccactgctaactgtgctctaagcaccagc cgcatcgacatcacctgggaactggttacaaatgccgaatctagagctcgccccagacct attgagaaagaatcagcattttaa >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_3|1353_aa MLAAVMVSLHPGQAGPISKKAALAGSSAASADCSPAAADLPFAPRSAAPEQLFCRPRGPR ASQRRVRPTHGPDPRLSLDDPQAPPWSRSGPGCTQPCWFFSCHFPLGYRVSGVRAPFCAV FVGVCLGVNPSHLDEAAPMLALDKTSLGQGGPAQCLAQDGPTLSPHLLPIQARPGAGASP RQHRSPRRVPDSMPQPRWSRRGAKGPEHPDPRLALALVLETQRPPLNPRPSIPGAGKEQT APKPQTHEYLLNQGVELGSSAAQPGRCCVLDYARISEGERAAAGPICSGFLLESPSAPCV LSSVVVEAGGETGSVSAVPLGIVSLLLSPQPETTLGGWQGDGSKEGELSLMDSGSTKVWG LWGGWVSKRAALRDLMPSLSSVLEDPGKEAEPETLCKSCVQGELHPGRVTSRERGVQEEG HSGRGTFREESSRDGDIQGEGRPARGASREGYIQGGPTKGDMEIPFEEVLERAKAGDPKA QTEVGKHYLQLAGDTDEELNSCTAVDWLVLAAKQGRREAVKLLRRCLADRRGITSENERE VRQLSSETDLERAVRKAALVMYWKLNPKKKKQVAVAELLENVGQVNEHDGGAQPGPVPKS LQKQRRMLERLVSSECECSPCPVSPMPPSLHLQGDLSFLCDSILACPISPVPPSLRLQGD LSFLCDPILALLGSQAPIALCEGGSGAAVWGAHAVFSHASAKNYIALDDFVEITKKYAKG VIPSSLFLQDDEDDDELAGKSPEDLPLRLKVVKYPLHAIMEIKEYLIDMASRAGMHWLST IIPTHHINALIFFFIVSNLTIDFFAFFIPLVIFYLSFISMVICTLKVFQDSKAWENFRTL TDLLLRFEPNLDVEQAEVNFGWNHLEPYAHFLLSVFFVIFSFPIASKDCIPCSELAVITG FFTVTSYLSLSTHAEPYTRRALATEVTAGLLSLLPSMPLNWPYLKVLGQTFITVPVGHLV VLNVSVPCLLYVYLLYLFFRMAQLRNFKGTYCYLVPYLVCFMWCELSVVILLESTGLGLL RASIGYFLFLFALPILVAGLALVGVLQFARWFTSLELTKIAVTVAVCSVPLLLRWWTKAS FSVVGMVKSLTRSSMVKLILVWLTAIVLFCWFYVYRSEGMKVYNSTLTWQQYGALCGPRA WKETNMARTQILCSHLEGHRVTWTGRFKYVRVTDIDNSAESAINMLPFFIGDWMRCLYGE AYPACSPGNTSTAEEELCRLKLLAKHPCHIKKFDRYKFEITVGMPFSSGADGSRSREEDD VTKDIVLRASSEFKSVLLSLRQGSLIEFSTILEGRLGSKWPVFELKAISCLNCMAQLSPT RRHVKIEHDWRSTVHGAVKFAFDFFFFPFLSAA >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_3|4062_bp 
atgctggctgctgtcatggtctccctgcatccagggcaggctgggcccatttcgaagaag gccgcgctagccggctcttcagcagcgagtgcagattgctcccccgcggccgcagatctc ccgtttgcgccgcgttcagctgctcccgaacaacttttctgccggcccagaggccccagg gcgtcgcagcgccgcgtgcggcccactcacgggccggaccctcgcctctccctggacgac ccccaggctcctccctggtcccgatccgggcctggctgcacgcagccctgctggttcttc agctgccacttccctctgggctatcgggtcagtggagtgagggcccccttctgtgccgtc tttgtgggtgtctgtttaggtgttaacccttcccatctagatgaagctgctccaatgctg gccctggacaagacttctctagggcaggggggcccagcacagtgcttggcacaagatgga cctaccctcagtccacacctgctgcccatccaggccaggcctggagcaggtgccagcccc cgccagcaccgcagccccaggcgcgttcccgactcaatgccacagcctcgttggagcagg agaggagcgaaaggccccgagcacccggaccccaggctggccctggccctggtgttagag acgcagcggcccccgctgaaccccaggcccagcataccaggagccgggaaagagcagacg gcaccgaagccccagacccatgaatatctgctcaatcagggagttgagcttggcagttca gctgctcagccagggaggtgctgtgtcctggactatgctaggatttcagaaggagagagg gcagctgcaggccctatttgcagtggcttcctcctggaatccccgtctgctccctgtgta ctgtccagcgtcgtagtggaagctggtggcgagacgggctctgtgtctgcagttcctcta gggatagtgagtctgctgctgagcccgcagccagaaaccacgctggggggctggcagggt gacggcagcaaggaaggagaactgtccttgatggattcagggagcaccaaggtctggggg ctctggggagggtgggtgtccaaaagggctgccctgcgggacctgatgccatctctgagc agcgtccttgaggaccccggcaaggaagctgagcccgaaactctgtgcaagagttgcgtc cagggagagttacatccagggagagttacgtccagggagaggggcgtccaggaagagggg cattcagggagagggacattcagggaggagtcttccagggacggggacatccagggagag gggcgtccagcgagaggagcctctagggaggggtacatccagggagggcctacaaaggga gacatggaaatcccctttgaagaagtcctggagagggccaaggccggggaccccaaggca cagactgaggtggggaagcactacctgcagttggccggcgacacggatgaagaactcaac agctgcaccgctgtggactggctggtcctcgccgcgaagcagggccgtcgcgaggctgtg aagctgcttcgccggtgcttggcggacagaagaggcatcacgtccgagaacgaacgggag gtgaggcagctctcctccgagaccgacctggagagggccgtgcgcaaggcagccctggtc atgtactggaagctcaaccccaagaagaagaagcaggtggccgtggcggagctgctggag aatgtcggccaggtcaacgagcacgatggaggggcgcagccaggccccgtgcccaagtcc ctgcagaagcagaggcgcatgctggagcgcctggtcagcagcgagtgtgagtgcagcccc tgccccgtctcacccatgcctcccagcctgcacctgcagggcgacctctccttcctgtgc 
gactccatcctggcctgccctatctcacccgtgcctcccagcctgcgcctgcagggcgac ctctccttcctgtgcgaccccatcctggccctgctaggatctcaggcgcccattgctctg tgtgagggtggcagtggggctgcagtgtggggcgcccatgctgttttctctcatgcttca gccaagaactacatcgcgctggatgactttgtggagatcactaagaagtacgccaagggc gtcatccccagcagcctgttcctgcaggacgacgaagatgatgacgagctggcggggaag agccctgaggacctgccactgcgtctgaaggtggtcaagtaccccctgcacgccatcatg gagatcaaggagtacctgattgacatggcctccagggcaggcatgcactggctgtccacc atcatccccacgcaccacatcaacgcgctcatcttcttcttcatcgtcagcaacctcacc atcgacttcttcgccttcttcatcccgctggtcatcttctacctgtccttcatctccatg gtgatctgcaccctcaaggtgttccaggacagcaaggcctgggagaacttccgcaccctc accgacctgctgctgcgcttcgagcccaacctggatgtggagcaggccgaggtcaacttc ggctggaaccacctggagccctatgcccatttcctgctctctgtcttcttcgtcatcttc tccttccccatcgccagcaaggactgcatcccctgctcggagctggctgtcatcaccggc ttctttaccgtgaccagctacctgagcctgagcacccatgcagagccctacacgcgcagg gccctggccaccgaggtcaccgccggcctgctatcgctgctgccctccatgcccttgaat tggccctacctgaaggtccttggccagaccttcatcaccgtgcctgtcggccacctggtc gtcctcaacgtcagcgtcccgtgcctgctctatgtctacctgctctatctcttcttccgc atggcacagctgaggaatttcaagggcacctactgctaccttgtgccctacctggtgtgc ttcatgtggtgtgagctctccgtggtcatcctgctggagtccaccggcctggggctgctc cgcgcctccatcggctacttcctcttcctctttgccctccccatcctggtggccggcctg gccctggtgggcgtgctgcagttcgcccggtggttcacgtctctggagctcaccaagatc gcagtcaccgtggcggtctgtagtgtgcccctgctgttgcgctggtggaccaaggccagc ttctctgtggtggggatggtgaagtccctgacgcggagctccatggtcaagctcatcctg gtgtggctcacggccatcgtgctgttctgctggttctatgtgtaccgctcagagggcatg aaggtctacaactccacactgacctggcagcagtatggtgcgctgtgcgggccacgcgcc tggaaggagaccaacatggcgcgcacccagatcctctgcagccacctggagggccacagg gtcacgtggaccggccgcttcaagtacgtccgcgtgactgacatcgacaacagcgccgag tctgccatcaacatgctcccgttcttcatcggcgactggatgcgctgcctctacggcgag gcctaccctgcctgcagccctggcaacacctccacggccgaggaggagctctgtcgcctt aagctgctggccaagcacccctgccacatcaagaagttcgaccgctacaagtttgagatt accgtgggcatgccattcagcagcggcgctgacggctcgcgcagccgcgaggaggacgac gtcaccaaggacatcgtgctgcgggccagcagcgagttcaagagcgtgctgctcagcctg 
cgccagggcagcctcatcgagttcagcaccatcctggagggccgcctgggcagcaagtgg cctgtcttcgagctcaaggccatcagctgcctcaactgcatggcccagctctcacccacc aggcggcacgtgaagatcgagcacgactggcgcagcaccgtgcatggcgccgtgaagttc gccttcgacttctttttcttcccattcctgtcggcggcctga >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_4|151_aa MDQVPVSEYSTMAACLSLSEAHDVICVGPPRNSNSTQDRSVRPVEYGRCDEMSLLRFAYK RLWLSVWLLSLGSITLGEEAMPRAAPWGSLHLKDLEPQANCHMVHLETGMAATAHTRLRL RGDTQPEPPGESTPGFPTPTEYKMTNVALGC >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_4|456_bp atggaccaggtaccagtcagtgaatatagcacaatggcagcttgtctgtccctctctgag gctcatgatgtcatctgcgtgggccccccgcgtaattctaacagcacccaggataggtct gtgcgaccagtagaatatggcagatgtgacgagatgtcacttctgaggtttgcttacaaa aggctgtggctttcagtttggctgctctctctcggatccatcactctgggtgaggaagcc atgccgcgagcagccccgtggggaagcctgcacctcaaggacctggagcctcaagccaac tgccacatggtgcacttggaaacagggatggctgccacggcccacactcgcctgcggctc cggggagacacccagccagaacctcctggtgaatccactcctggattccctacccccaca gagtataagatgacgaatgtggctttaggctgttaa >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_5|512_aa MNRDNYWPKVPGPVSARATSSAQSVFEEPEDPSNRSFFSEIISSVSDVKFSHSGRYMLTR DYLTVKVWDLNMEARPIETYQLRGQPCAWSWAPDKMSRAVLGCVTVEVSRVPGRGPPDKM SRAVLGCVTVGVHDYLRSKLCSLYENDCIFDKFECAWNGSDSWLCSARLEPLSLFASRNG WGALGGYYDSTCSEPFTGDLSVIMTGAYNNFFRMFDRNTKRDVTLEASRESSKPRAVLKP RRVCVGGKRRRDDISVDSLDFTKKILHTAWHPAENIIAIAATNNLYIFQDKPTLTNCQLW DLADVIKVMDLKMEFSLDYPCGTHLITQVRESVGPFLVVLETAVATEKGSARHDMKTRPA SPKVLITKLYPGPSGYLVGPQASEGDILPRPPGTGTDTDARESKKGSKQRERELDRGLED GKQEGTWGSGNSLETERAEPKPARKEAKMLPERGPNPDPKRGLLDLVEERIQGKSMEKSE SKFIRKVKEYKNGYSMDRAARGLLVAHFYGYF >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_5|1539_bp atgaacagagataactactggcccaaggtcccagggccagttagtgccagagccacaagc agtgcccagtctgtctttgaagagcctgaggaccccagtaaccgctcattcttctcggaa atcatctcctccgtgtccgacgtgaagttcagccacagcggccgctacatgctcacccgg gactaccttacagtcaaggtctgggacctgaacatggaggcaagacccatagagacctac cagctcagaggtcagccgtgtgcctggtcgtgggcccccgataagatgagcagggctgta ttgggctgtgtcacggtggaggtcagccgtgtgcctggtcgtgggccccccgataagatg 
agcagggctgtattgggctgtgtcacggtcggggtccatgactaccttcggagcaagctc tgttccctgtacgagaacgactgcattttcgacaagtttgaatgtgcctggaacgggagc gacagctggctctgctctgcccggctggaacctctgtccttgtttgcctcccgcaatggg tggggtgccctggggggctactatgactcaacctgttctgagcccttcactggggacctc agcgtcatcatgaccggggcctacaacaacttcttccgcatgttcgatcggaacaccaag cgggacgtgaccctggaggcctcgagggaaagcagcaagccccgggctgtgctcaagcca cggcgcgtgtgcgtggggggcaagcgccggcgtgatgacatcagtgtggacagcttggac ttcaccaagaagatcctgcacacggcctggcacccggctgagaacatcattgccatcgcc gccaccaacaacctgtacatcttccaggacaagccaacattgaccaattgccagctctgg gatcttgcagatgtgattaaggtcatggacctgaagatggaatttagcctggattatcca tgtgggacccatctaatcacccaagtccgtgaaagtgtaggaccgttcctggttgtattg gagacagctgtagcaacagaaaaaggctcagcaaggcatgacatgaagactcgaccagcg agccccaaggtcttaattactaaactgtatcctggaccttctggctacctggtggggcct caggcctcggaaggtgacattctgcccaggccccctggcactggcacggacacagatgct cgagagtcaaagaagggatcaaaacaaagagaacgggagctggaccgtggccttgaagat gggaagcaagaaggcacatgggggagtggaaacagcttagagactgagagagcagagccc aagcccgcgagaaaggaagcaaagatgttgccggaaaggggtcccaatccagaccccaag agagggctcttggatctcgtggaagaaagaattcagggcaagtccatggagaaaagtgaa agcaagtttattaggaaagtaaaggaatacaagaatggctactccatggacagagcagcc cgagggctgctggttgcccatttttatggttatttctga >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_6|134_aa MQVCSSGGRILRVSGTERQLRREGGRFCLGRGVRNLREGGALRAPFTAPSLTALPPDIVD IKPANMEDLTEVITASEFHPHHCNLFVYSSSKGSLRLCDMRAAALCDKHSKHPCDDIGPI QIIQGHLRTSRSFT >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_6|405_bp atgcaggtctgcagctcagggggacgcattttgagggtctcaggaacagagaggcagctc agacgcgagggtggacggttttgcctcgggagaggtgtcagaaacctccgggaaggcgga gcattgagggcgcccttcactgccccttccttgaccgcactgcccccagacatcgtggac atcaagccggccaacatggaggaccttacggaggtgatcacagcatctgagttccatccg caccactgcaacctcttcgtctacagcagcagcaagggctccctgcggctctgcgacatg cgggcagctgccctgtgtgacaagcattccaagcacccttgtgatgacattgggcccatc cagattatccagggtcacctccgtacctcaaggtccttcacttaa >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_7|248_aa MRQALHNRVPTHPNTDSPPTLTAPQPTAPTLTAPPHRQPSTPTAPPHQQPPTPDSPQHPT 
APPHQQPPTPTAPHTDSPTHQQPPHSDSPPHRQPPQTDSPTLTALHTDRPPTPTAPNTQQ PLTPDSPPTLTTPNTDSPPHRQPHTPTAPHTDSPQHRQPPTPTAPHTNSAPTPTAPNTDS PQHRQPPHTDSPPTPTAPNTDSPQHRQPPHTDSPPHQQPPHRSFSRTPISSSPALGHQAR HQGASSSF >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_7|747_bp atgcggcaggccctccacaacagggtccccacgcaccccaacaccgacagccccccaaca ctgacagccccccaaccaacagcccccacactgacagcccccccacaccgacagccctcc acaccgacagcccccccacaccaacagcccccaacacccgacagcccccaacacccaaca gcccccccacaccaacaaccccccacaccgacagccccccacaccgacagccccacacac caacagcccccccacagtgacagccccccacaccgacagcccccccaaaccgacagcccc acactgacagcccttcacaccgacagaccccccacaccaacagcccccaacacgcaacag cccctaacacccgacagcccccccacactgacaacccccaacaccgacagccccccacac cgacagccccacacaccaacagccccccacaccgacagcccccaacaccgacagccccca acaccgacagccccacacaccaacagcgcccccacaccgacagcccccaacaccgacagc ccccaacaccgacagccgccccacaccgacagcccccccacaccgacagcccccaacacc gacagcccccaacaccgacagcccccccacaccgacagccccccacaccaacagccccca caccgctccttctctagaactcccatttcctcatccccagccctggggcaccaagccagg catcagggggcttcttcctcattctaa >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_8|92_aa MAAKLHQSAPRSSSPVRCLAWIITSFGQQTTHTTVLAGPGRDPVCFPSTKSNAQHREDSR PMLLNQPMNAQNRGQLLVKETHSHGIKGVQKD >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_8|279_bp atggcagctaagctgcaccagtctgccccccggtcgtcctctcctgtaaggtgcttggct tggatcatcacctcttttgggcagcagacaacccatacaactgtactcgcaggccctggc agagaccctgtctgcttccctagcaccaagtccaatgctcaacacagagaagactctcgg ccaatgctgctgaatcaaccaatgaatgcacagaaccgtgggcagctcctggtaaaggag acacactcccatgggataaagggggtgcagaaagactga >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_9|387_aa MGRGLAAALFRQLLLAGKGAWGWAAFYLPVAVPTLSVLGASGPACAGPAPGTARGVDGNP QYLSPFASLFAACRIAVLQAKAKHRDKPRIKGCGASGSGELAIAVALLSTRERRVLTATP SRLAVPLPADIISTVEFNHTGELLATGDKGGRVVIFQREPESKNAPHSQGEYDVYSTFQS HEPEFDYLKSLEIEEKINKIKWLPQQNAAHSLLSTNDKTIKLWKITERDKRPEGYNLKDE EGKLKDLSTVTSLQVPVLKPMDLMVEVSPRRIFANGHTYHINSISVNSDCETYMSADDLR INLWHLAITDRSFSILHCGLASASHGQSSFRWVPRLLLAAGLRSVEGKPGVWFLPSEDVD TELMAGQSPSKPAAPWLPPVTLSVLLL 
>gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_9|1164_bp atggggaggggcctggctgcagcccttttcaggcagctcttgttggctgggaagggagcc tggggatgggccgccttctacctgccggtggctgtgcccactctgagtgtgcttggggcc agtgggccagcctgcgcaggtcctgctcctggtaccgccagaggcgttgatggcaacccc cagtacctaagcccctttgcatctctgtttgcggcttgtcgaattgccgttctgcaggcc aaagcaaaacacagggacaagcccagaatcaagggctgtggggcctcggggagcggggag ctcgccattgctgtggcactgttgagcacccgggagaggcgggttctgacagccacccct tcccgtcttgctgttcccctcccagctgacatcatctctaccgttgagttcaaccacacg ggagagctgctggccacaggtgacaagggcggccgggtcgtcatcttccagcgggaacca gagagtaaaaatgcgccccacagccagggcgaatacgacgtgtacagcactttccagagc cacgagccggagtttgactatctcaagagcctggagatagaggagaagatcaacaagatc aagtggctcccacagcagaacgccgcccactcactcctgtccaccaacgataaaactatc aaattatggaagattaccgaacgagataaaaggcccgaaggatacaacctgaaggatgaa gaggggaaacttaaggacctgtccacggtgacgtcactgcaggtgccagtgctgaagccc atggatctgatggtggaggtgagccctcggaggatctttgccaatggccacacctaccac atcaactccatctccgtcaacagtgactgcgagacctacatgtcggcggatgacctgcgc atcaacctctggcacctggccatcaccgacaggagcttcagtatccttcactgtggcctg gccagtgcctcccacgggcagagtagcttccgttgggtgccacgcctgctgctggctgca ggcctcaggtcggtggagggtaagccaggggtgtggtttttgccaagtgaggatgtagac acggagctgatggctggtcagagcccctctaagcctgcggctccttggctgcccccggtc acactgagtgttttgttactgtga >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_10|279_aa MAPASNPCLHPALLLQEPDPAWKATAPPMGSAQGTGIGEARGCGCQFLPGQARLRKAPPP VTWASLLKTKLAKALPRRTGPELLGPRICILTGFPVVCQGLRTTGKGETGKAPQWGCLIL KLMPGNGNSETDPAGMSRPSQTSLTNEGWVCSKDHPVSRGIKAYGPPAGTDVLLEPPGSK QGVALEMFLSWLHAGHPSGTWVTGELTQEHYTQCVRTLAAALAVVMHMGYDGNIRDAKVT HPREFPDEQDAAIQPSRDQGQGQDQGLKEQLGAKHSTKT >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_10|840_bp atggccccggcctccaacccgtgtctgcacccagctctgctactgcaggagccagaccca gcgtggaaggccactgcccctcccatgggcagtgcacaagggacagggatcggggaggcc aggggctgtggctgccagtttctccctggccaagccaggctccggaaagccccaccccca gtcacctgggcttctctcctaaagacgaagcttgccaaggctctgcccagacggacagga ccagaattgctggggcctagaatctgcattctgacaggttttcctgtggtgtgccaaggt 
ttgagaaccactggtaaaggggagactggtaaagcacctcagtggggctgcctgattctc aagcttatgccaggcaacgggaattcagaaacggaccccgcgggcatgtccaggccctcc cagaccagtctgaccaatgagggttgggtatgcagcaaggaccacccggtgtccaggggc attaaggcttatgggccccctgcaggcacagatgttctgcttgagccaccaggcagcaaa caaggagtggctttggagatgttcctgtcctggttacacgccggccaccccagtgggacc tgggtcaccggggaactcacacaggagcattacacacagtgtgtgcgaaccctcgcggca gccctggcagttgttatgcacatgggctacgatggcaacatccgagatgctaaagtcact catcccagagagtttccagatgaacaagatgcagctattcagccttccagagaccaaggg caagggcaagaccaaggcctgaaagagcagctgggggccaaacacagcaccaagacctaa >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_11|321_aa MDKGDSRPLTRGVVATMHAPRPHAGGPARAFSLGGEYRGAFKGIKLASASMSTVTAGLGA QSPSETTNGTSQGASSQGKQAFSEPTLIQAEGNLSCEGGTVTQRTALTHSSFQIPEQPPG KAWPSRTCYALLERKPKPSKVKTLIKGHTAAATQQSQNSSLGLAPTPRSYHQLARLNILL LFSKLLHLPGLEREPLLVKLYLRDTNCHPGNYGLGPPTNSTTSHRAKMEKLMPSRPQKGS SQELGQLVLLDVPEEGFVEEAVLCVHPSSYLGCPQKWRGHTFLSVSPKTHAGTLRAAKMA AVKTKPTDLAGMLTGRWTTRR >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_11|966_bp atggacaaaggggactctcgacccctgactcggggggtggtggccacgatgcatgcgcct cgtccgcatgctggaggacctgccagggccttctctcttggaggtgaatatcggggggct ttcaaagggatcaaactggccagcgccagcatgtccactgttacagccggcctgggggct cagagcccctctgaaactaccaatggaacatcccaaggtgcctccagtcagggcaagcag gctttctcagaacccacgctgatccaggccgaggggaacctcagctgtgaggggggcacg gtcactcagaggacggccctcacccatagctcattccagataccagagcagccaccagga aaggcttggccaagcaggacttgctatgccctcctggagaggaagccaaagcccagcaag gttaagacactcatcaagggccacacagctgctgccacgcagcaaagccagaattcaagt ctgggtctagctccaactccacgctcctatcaccagctcgcacgcttaaacatcctcctc ctcttctctaaactcctgcatctcccaggtctggaaagggaaccactgttggttaagctc tatctgcgggacacgaactgtcatccaggaaattatgggcttggtcctccaactaactca acaacctctcacagagcaaaaatggaaaagctgatgcccagcaggcctcagaagggcagc agccaagagcttgggcagctggttctcctcgatgtgcctgaggagggcttcgtggaggaa gcggtgctttgtgtgcacccttcttcctatttggggtgtccacagaagtggagaggccac acctttctctctgtttcccccaaaacacatgcaggcactctgcgggcagccaaaatggcg gccgtgaaaacgaagcccacagacctggctggaatgctcacgggcagatggaccacaaga cgataa 
>gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_12|111_aa MAMDFTSNPCLMEVHKVLFLSLSNAAKYCPVCTAVFGDRDTQQGVQPLNHLSSRTVLYPG LKAGGPGHSHQQSAFIDLAGPWKLKACPALASCARSLIDKTGITVPTSLGC >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_12|336_bp atggctatggatttcacttctaatccttgtctgatggaagtacacaaggttctgtttctg tctctgtcaaatgcagccaaatactgcccagtttgcacagctgtttttggagacagagac actcagcaaggggtccagcccctgaaccacctgagctcccggactgtgctatatccaggc ctaaaggctggtggaccaggacacagtcaccagcaatcagctttcatcgacttggctgga ccttggaaacttaaggcttgccctgccctggcttcttgcgctcgttccctcattgacaaa acaggaataacagtgcccacctccttgggctgttag >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_13|194_aa MWKKPRHSQGSTLIHAQIRGLCSSHDCPSEGQRSLKVFLTTPKPSRYLSHLTDKKTEAPD PLDIKGGMRTTGNSKYTLPLSGVLAQPSYGWSNSSASTSLREPQPIEVASSFPTGQTHGE LSLNGRAASCASIIFTENGPPAPVRARGVHRVWAQTTGAVMVSHKTASARLNKQDDGEWP WQEGKHSPTEGHVG >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_13|585_bp atgtggaagaaacccagacacagccagggctctaccctcatccatgcccagatccgaggc ctctgctcctctcatgactgccccagtgaggggcaaaggagccttaaagtcttcctgaca accccaaagcctagtcgttacttgtcccatttaacagataagaagactgaggccccagat cccctggatattaagggaggaatgaggactactggaaattccaaatacactctgccactc tccggggtcctcgcacagcccagctacggctggagtaacagcagcgcaagcacatccctg cgggaaccacaacccatcgaggtcgccagctcctttcccacagggcagacacatggtgag ctctcactgaatggcagggcggcttcctgtgcatccatcatattcaccgagaatggtccc cccgcaccagtgcgggcacgtggtgtgcaccgggtctgggcccagaccaccggggctgta atggtttctcataaaactgcttctgcaaggctgaacaaacaagatgatggggagtggccc tggcaagaaggcaagcacagccccaccgagggccacgttggctga >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_14|152_aa MARAPGATSADCIEHRARPSMGEDTDTRKINHSFLRDHSYVTEGRARGVTPMEDGDREVW ARGRLCWSPARVRGLSAANPRAGGPAAEVCADSPAPFAKHILPWAAVMTKTQELSQGEVV QADQVDPGSDWVFLGSNSHSHVPDSVHTCTVL >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_14|459_bp atggcccgcgcgcccggcgcgacctctgcggattgcatcgagcaccgggcacggccttca atgggcgaggacacggacacgcggaaaattaaccacagcttcctgcgggaccacagctat gtgactgaaggcagggcccgcggagtgaccccgatggaggatggggaccgggaggtctgg 
gctcggggccgcctgtgctggagccctgcccgagtgcggggactgtcagccgctaaccca cgggctggcggcccggccgcagaagtgtgcgcggattccccggcaccatttgccaagcac atcctgccctgggcagcagtgatgaccaagacacaggagctcagccaaggggaggtggtt caggctgaccaggtagatcctggcagtgactgggtctttctggggagcaacagtcactct catgtgcctgacagtgtccacacgtgcacggtgctgtga >gi568815594r:6223305_6481846|GENSCAN_predicted_peptide_15|81_aa MGNGKRQAIHRRGNSNGQVERFPNTRNQEMQTSHVQLHQPGSREEDNRGQSCSPLEMHTQ QECEIALKPLRFGGSFTAVVF >gi568815594r:6223305_6481846|GENSCAN_predicted_CDS_15|246_bp atgggcaatgggaaaagacaggcgattcatcgaagaggaaattcaaatggccaagtggaa agattcccaaacactaggaatcaggaaatgcaaaccagccatgtgcagctccatcagccg gggtcccgggaagaagataacaggggtcagagctgtagcccactcgagatgcacacgcag caggagtgcgaaatagccttgaagccgctgcgttttggagggtcattcacggctgtggtc ttctag